chat-gpt-voice

Under ~~100~~ 200 lines of plain HTML/CSS/JS code creates a voice bot using ChatGPT API and can also write its own code and push it to GitHub using GitHub REST API.

I wrote it primarily so I could easily ping ChatGPT on my mobile device using my voice instead of typing (Added PWA support using manifest.json)

I then thought it would be neat if it could write it's own code just like OpenAI Codex playground but also optionally publish to github instead of JSBin. I looked into publishing to JSBin but I didn't see a way to POST the data there without a paid subscription although it may be possible with other online JS editors. I did add a code editor using CodeMirror and it seems to work pretty well.

Employs Web Speech API's interfaces for speech recognition and text synthesis, enabling speech-to-text and text-to-speech capabilities.

Simply tap/click on the body of the page to activate speech recognition

Live demo:

https://raw.githack.com/jay23606/chat-gpt-voice/main/chat.html (or tinyurl.com/gpt-voice)

I've also added a form of "tabs" using iframes and a way to toggle to vanilla-chatgpt

https://raw.githack.com/jay23606/chat-gpt-voice/main/tabs.html (or tinyurl.com/gpt-voice-tabs)

Added example of using Whisper endpoint for speech-to-text:

https://raw.githack.com/jay23606/chat-gpt-voice/main/record.html

Requires entering in an API key from openai.com which is stored in localStorage

To get api key(s):

https://platform.openai.com/account/api-keys

https://github.com/settings/tokens

A checkbox is included to enable or disable STT and TTS whose state is also stored in localStorage

Screenshots:

You can now instruct it to write runnable and editable JS apps and it can also push it to GitHub Pages and provide a link (requires GitHub API PAT with repo access-also stored in localStorage):

Added CodeMirror for the code editor after exploring options:

I've also added fieldset elements that act as accordians for the "ChatGPT Response", "Code Editor", and "Output Web Application" sections and have included error information in red for the latter so one does not have to go hunting in devtools to determine the cause of a script error. I've added a button that will then send errors if there are any back to ChatGPT for a resolution.

If you look under the html folder in the gh-pages branch you will find some apps this app has written

If you click the YouTube checkbox, you can now embed five YouTube videos based on the prompt and I've also added a dropdown to allow a larger variety of tab destinations

Inspiration:

https://github.com/casualwriter/vanilla-chatgpt

https://mdn.github.io/dom-examples/web-speech-api/speech-color-changer/

Speech recognition does not work in Firefox but should be possible to get working using the whisper example

Name		Name	Last commit message	Last commit date
Latest commit History 340 Commits
html		html
README.md		README.md
chat.html		chat.html
github-test.html		github-test.html
github.js		github.js
icon.png		icon.png
icon2.png		icon2.png
loading.gif		loading.gif
manifest.json		manifest.json
nilla.html		nilla.html
openai-test.html		openai-test.html
openai.js		openai.js
record.html		record.html
screenshot.png		screenshot.png
screenshot2.png		screenshot2.png
screenshot3.png		screenshot3.png
tabs.html		tabs.html
youtube.css		youtube.css
youtube.js		youtube.js

jay23606/chat-gpt-voice

Folders and files

Latest commit

History

Repository files navigation

chat-gpt-voice

About

Resources

Stars

Watchers

Forks

Languages