Isn't this a solved problem? Cisco's video conferencing systems already do this: their cameras swivel to point at the active speaker.
I think this is different. Cisco describes its approach as audio triangulation: "Audio triangulation - The microphone array behind the fabric panel that is position behind the camera pictured above is able to accurately locate voices within the room. The microphones are only used for audio triangulation."
The robot is different because it uses binaural hearing, which relies on just two "microphones" placed like the ears on our head. If it's done accurately, it should also account for how the sound is shaped on its way in by the pinnae and the other contours of the ear.
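To make the two-microphone idea a bit more concrete, here is a rough sketch of the simplest binaural cue: the time difference between the two ears. This is not the robot's actual method and not Cisco's either. The ear spacing, sample rate, and the function names `best_lag` and `azimuth_degrees` are all assumptions made up for illustration, and it ignores the pinna filtering mentioned above entirely.

```python
import math
import random

SPEED_OF_SOUND = 343.0  # m/s, approximate value at room temperature
EAR_SPACING = 0.18      # m, assumed distance between the two microphones
SAMPLE_RATE = 48_000    # Hz, assumed sample rate


def best_lag(left, right, max_lag):
    """Return the lag in samples that maximises the cross-correlation.

    A positive lag means the right channel is a delayed copy of the left,
    i.e. the sound reached the left microphone first.
    """
    n = len(left)
    best, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        # Skip the edges so that i + lag always stays inside the signal.
        for i in range(max_lag, n - max_lag):
            score += left[i] * right[i + lag]
        if score > best_score:
            best, best_score = lag, score
    return best


def azimuth_degrees(lag):
    """Convert a lag to an angle using the far-field model ITD = d*sin(a)/c.

    Positive angles point toward the left microphone (it heard the sound first).
    """
    itd = lag / SAMPLE_RATE
    s = max(-1.0, min(1.0, itd * SPEED_OF_SOUND / EAR_SPACING))
    return math.degrees(math.asin(s))


# Synthetic test signal: white noise that reaches the right ear 12 samples late.
random.seed(0)
left = [random.uniform(-1.0, 1.0) for _ in range(2000)]
delay = 12
right = [0.0] * delay + left[:-delay]

# Largest delay that is physically possible for this microphone spacing.
max_lag = math.ceil(EAR_SPACING / SPEED_OF_SOUND * SAMPLE_RATE)
lag = best_lag(left, right, max_lag)
print(lag, round(azimuth_degrees(lag), 1))
```

The time difference alone only gives a left/right angle and cannot tell front from back or up from down. That ambiguity is what the shape of the ear helps resolve, and it is why a two-microphone setup is a harder problem than a larger microphone array.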
emphasis on the estimating