Interact → Click can’t target ChatGPT “Voice conversation” button (Android 15 / One UI 7)

41 views
Skip to first unread message

Alex

unread,
Oct 11, 2025, 4:45:47 AM (4 days ago) Oct 11
to Automate for Android

Device/OS: Samsung Galaxy A16 5G · One UI 7.0 · Android 15
Target app: ChatGPT (com.openai.chatgpt)

Goal: In a flow, open a specific Custom GPT (via URL) and then tap the in-chat Voice conversation button (full voice in/out), not the keyboard dictation mic.

Flow (works except the click):

  1. Resolve activity + App start to open the GPT share link.

  2. App foreground wait for com.openai.chatgpt.

  3. Delay ≈ 600 ms.

  4. Interact → Click the Voice conversation button (bottom-right).

What I tried for the Interact block:

  • Record interactions didn’t catch the tap, so I used Record → Search (🔍) and got this absolute XPath:

/android.widget.FrameLayout[1]/android.widget.LinearLayout[1]/android.widget.FrameLayout[1]/android.widget.LinearLayout[@android:id='@com.openai.chatgpt:id/action_bar_root']/android.widget.FrameLayout[@android:id='@android:id/content']/androidx.compose.ui.platform.ComposeView[1]/android.view.View[1]/android.view.View[1]/android.view.View[1]/android.view.View[1]/android.view.View[3]/android.widget.EditText[1]/android.view.View[3]/android.view.View[1]/android.view.View[1]/android.widget.Button[1]

Questions for the community:

  1. Does anyone have a robust XPath  that matches the ChatGPT Voice conversation control on recent builds?

  2. Fallback idea: a relative XPath for “rightmost clickable in bottom bar” that uses layout attributes (e.g., @android:clickable='true' with a bottom-zone filter)

 I have attached the ChatGPT window screenshot with the Voice conversation button (bottom-right) circled in red
Screenshot.jpg

Henrik "The Developer" Lindqvist

unread,
Oct 11, 2025, 8:37:05 PM (3 days ago) Oct 11
to Automate for Android
Use the "Inspect user interface" button to export and look at the layout XML, maybe that can give you some hint of a better XPath. If it doesn't include any sensitive information then post it here.

Alex

unread,
Oct 12, 2025, 1:16:25 AM (3 days ago) Oct 12
to Automate for Android
I used the 'Inspect User Interface' to export the XML layout, out of which I construct the following xpath(s):

To start Voice (only if currently off):
(//*[@android:contentDescription='Start a voice conversation']
   /ancestor::*[@android:clickable='true'][1])

To end Voice (only if currently on):

(//*[@android:contentDescription='End voice conversation']
   /ancestor::*[@android:clickable='true'][1])

And setting "Proceed = When UI element appeared"
Reply all
Reply to author
Forward
0 new messages