WatchThis: A Wearable Point-and-Ask Interface Powered by Vision-Language Models and XIAO ESP32S3 Sense
MIT Media Lab researchers Cathy Mengying Fang, Patrick Chwalek, Quincy Kuang, and Pattie Maes have
MIT Media Lab researchers Cathy Mengying Fang, Patrick Chwalek, Quincy Kuang, and Pattie Maes have
The next significant breakthrough in modern AI came with the advent of GPT(Generative Pre-trained Transformer).
For complex computer vision tasks, we usually require machines not only to interpret complex visual