no code implementations • 5 Mar 2024 • Chun-Peng Chang, Shaoxiang Wang, Alain Pagani, Didier Stricker
3D visual grounding involves matching natural language descriptions with their corresponding objects in 3D spaces.