7th European Industrial Engineering and Operations Management Conference

Smart Robot Manipulation using GPT-4o Vision

Ryan Tan, Harry Yang, Sean Jiao, Liuyang Shan, Chenxi Tao & Roger Jiao
Publisher: IEOM Society International
0 Paper Citations
1 Views
1 Downloads
Track: High School STEM Competition
Abstract

This paper explores the application of a robotic arm powered by artificial intelligence (AI) and deep learning to enhance the precision and efficiency of object manipulation tasks. Leveraging OpenAI’s GPT-4o image recognition model along with OpenCV, we implement smart computer vision techniques to accurately recognize, locate, and manipulate targeted objects. By automating control and minimizing human intervention, our approach has the potential for improving traditional kinematics-based methods. The integration of AI enables robots to learn from vision data, adapt to various scenarios, and improve over time. Our findings demonstrate the feasibility of utilizing AI and deep learning to achieve high accuracy and efficiency in robot manipulation, with profound implications for manufacturing automation and beyond.

Published in: 7th European Industrial Engineering and Operations Management Conference, Augsburg (Greater Munich), Germany

Publisher: IEOM Society International
Date of Conference: July 16-18, 2024

ISBN: 979-8-3507-1737-2
ISSN/E-ISSN: 2169-8767