XO Brain Ai
https://xobrain.ai/
https://devpost.com/software/xo-mf458w
My hackathon entry building an External Brain Ai.
I plan to push forward beyond the hackathon and keep working on this.
Devpost
XO Brain Ai
XO Brain is your AI external brain. Effortlessly capturing and organizing your thoughts, notes, and tasks in real-time. XO turns your chaotic stream of consciousness into structured notes.
5 Replies
Cool project
@Michael Rea Nice project, are you splitting the chunks while recording and sending it to something like whisper?
I tried to use whisper at first but knowing when to split the audio was tricky and latency was too high. I might still use whisper for a final Passover at the end to correct potential transcription errors. I can then use the word level timestamps to have a playable audio snippet per chunk that's in the note.
I'm using deepgram to do the splitting while recording, it easier waits for a small gap in your speech to then send to my API for processing. The ms gap in speech still needs fine tuning but I've leaned towards higher so you get bigger chunks sent through.
Deepgram Docs
Transcription features - Endpointing - Deepgram Docs
Endpointing returns transcripts when pauses in speech are detected.
Great thanks, i am also building something similar for learning purposes only where i am sending the audio directly to whisper to generate transcription and then using mistral on together ai to generate data out of transcript but the turn around time is huge as whisper is too slow.