Language-Guided Grasping in Clutter: Foundation-Model-Driven Target Selection and Task Execution Verification