--
You received this message because you are subscribed to the Google Groups "RSSC-List" group.
To unsubscribe from this group and stop receiving emails from it, send an email to rssc-list+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/rssc-list/B8430D5E-1E6E-4AF6-98DC-40EBDEE35AB7%40gmail.com.
Maybe you are asking too much of Chat GPT. 🙂
From: Alan Downing <downi...@gmail.com>
Sent: Thursday, June 13, 2024 9:53 PM
To: Gmail <thomas...@gmail.com>
Cc: RSSC-List <rssc...@googlegroups.com>
Subject: Re: [RSSC-List] Guiding a robot 🦾 arm with generative AI 🧠 - ChatGPT4o
I tried a chess board with pieces on it and ChatGPT4o seemed to hallucinate the pieces and their positions.
Alan
On Thu, Jun 13, 2024 at 9:05 PM Gmail <thomas...@gmail.com> wrote:
Today we held our weekly VIG SIG meeting. One of the discussion topics was “guiding a robot arm with a large language model”. It was noted that with GPT 3.5, one was not able to get a spacial location of an object.
After the meeting, I attempted to have GPT-4o give me the location of a red cup. It was quite successful! Although this was a simplistic use case, it does indicate that the newest version can locate objects in 3-D space. And if I can locate an object in 3-D, it might be able to grasp/move/hit that object in 3-D. One thing to note though, is that each response took several seconds. Obviously, this length of a delay wouldn’t work for many use cases.
See the images below.
<image3.jpeg><image4.jpeg><image5.jpeg>
To view this discussion on the web visit https://groups.google.com/d/msgid/rssc-list/EF8956DA-E5E0-450E-B088-D29E5361C7F4%40gmail.com.
On Jun 25, 2024, at 3:15 PM, Carl <cfsu...@gmail.com> wrote:
Do you know if the spacial results you showed with GPT-4o are part of the LLM, or did they add a secondary system for that, like arithmetic calculators?
On Thu, Jun 13, 2024 at 9:05 PM Gmail <thomas...@gmail.com> wrote:
Today we held our weekly VIG SIG meeting. One of the discussion topics was “guiding a robot arm with a large language model”. It was noted that with GPT 3.5, one was not able to get a spacial location of an object.
After the meeting, I attempted to have GPT-4o give me the location of a red cup. It was quite successful! Although this was a simplistic use case, it does indicate that the newest version can locate objects in 3-D space. And if I can locate an object in 3-D, it might be able to grasp/move/hit that object in 3-D. One thing to note though, is that each response took several seconds. Obviously, this length of a delay wouldn’t work for many use cases.
See the images below.
<image3.jpeg><image4.jpeg><image5.jpeg>
--
You received this message because you are subscribed to the Google Groups "RSSC-List" group.
To unsubscribe from this group and stop receiving emails from it, send an email to rssc-list+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/rssc-list/416c69f6-282a-429f-be79-b257908ce07bn%40googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/rssc-list/BE60CDE9-0562-4D4E-9568-F054F338DE84%40gmail.com.