Commit 22d22a5
Sentient v2 (#57)
List of commit messages for all commits that are being merged:
* feat: advanced voice mode sandbox script.
expanded implementation details
1. insanely fast whisper for STT with Google Speech Recognition as a fallback
2. WebRTCVAD for voice activity detection. the mic will only stream to the backend when voice activity is detected
3. Sesame CSM for TTS - incorporated SOTA TTS for life-like speech capabilities
4. integrated with existing chat backend - ensures seamless integration with existing agent and memory pipelines
next steps - move VAD logic to Electron frontend, setup electron app with proper frontend for voice chats, modify sandboxed script and convert it to module that we can import into the main app server for usage, add websocket to main app server for voice streaming from electron.
* feat[Gsuite]: added gslides functions
* feat (gsuite): removed token
* feat [Gsuite]: gslides improvements
* feat: voice mode switched transcription to faster whisper, added transcription test script
* feat: orpheus TTS quantized with faster-whisper for STT, real-time voice conversations
* feat[Gsuite]: integrated gdocs functions
* feat[Gsuite]:integrated gsheet functions
* feat: voice mode (WIP)
* fix (optimization): removed model switching feature
* fix (optimization): removed redundant code
* chore (optimization): dev prod toggle in index
* fix: voice mode debuggings (still not working)
* fix (optimization): migrated graph loading logic to server
* feat (optimization): added notifications tab
* fix (optimization): duplicate context issue in intent
* fix (optimization): resolved notifications db issue
* fix (optimization): update progress, data sources for intent
* feat: advanced voice mode gradio sandboxing (DONE) faster-whisper + llama 3.2 3b + orpheus TTS with FastRTC
* fix: trying voice mode fastapi backend with websocket instead of webrtc
* fix (optimization): resolved get graph data issue
* chore (optimization): added print statements
* chore (optimization): renamed endpoints, removed redundant files
* feat: voice mode
FINALLY COMPLETED
completed voice mode integration with existing backend - user messages are saved to the chatdb, actions are added to the task queue and memory functions also work as expected. all low-latency on RTX A5000.
transcription uses faster-whisper base on CPU for now. can be moved to GPU with a larger model for better accuracy
TTS uses a 4 bit quant of orpheus
RTC functionality is supported by FastRTC
* fix: datetime fix in chat history retrieval
* fix (optimization): switched to phi4-mini, changed unified classification prompt, resolved minor backend issues
* fix (optimization): resolved app lifecycle events
* fix (optimization): npm package updates, switched to 4bit orpheus, removed unnecessary fallback in index, updated requirements (freeze versions)
* fix (optimization): memory ops
* feat (optimizations): frontend changes to voice mode components
* fix (optimization): memory crud
* feat (optimization): frontend optimizations - updating visualizations, call control buttons, etc
* feat (optimization): task page optimization, fixing webrtc client functionality, added microphone switching logic to the chat page
* fix (optimization): updated chat system prompt
* fix (optimizations): frontend fixes on tasks page
* fix (optimization): resolved task approval
* fix (optimization): fixing react children render error on task page and fixing timestamp logic
* feat (optimization): memory page frontend
* fix (optimization): agents testing
* feat (optimizations): notifications page frontend
* fix (optimization): updated prompts, resolved timestamp issue
* feat (optimizations): settings page frontend and memory type switcher minor changes
* fix[memory] : import format issue
* fix (optimization): fixed memory internet agent icons not appearing on messages
* fix (optimization): removed model selector from onboarding, dual memory fixes for retention days logic
* fix (optimization): resolved agentic message
* fix (optimization): mock mode for settings
* fix (optimization): renamed folders, researched on prompting techniques
* fix (optimization): renamed paths according to new folder structure
* fix[optimization]: prompts optimized
Co-authored-by: Abhijeet Suryawanshi <108229267+abhijeetsuryawanshi12@users.noreply.github.com>
* fix (optimization): testing scrapers
* fix[optimization] : test linkedin scraper
* fix[optimization]: replaced linkedin scraper
* fix[optimization]: minor changes
---------
Co-authored-by: abhijeetsuryawanshi12 <abhijeetsuryawanshi128@gmail.com>
Co-authored-by: itsskofficial <itsskofficial03@gmail.com>
Co-authored-by: Abhijeet Suryawanshi <108229267+abhijeetsuryawanshi12@users.noreply.github.com>1 parent 9ac37df commit 22d22a5
190 files changed
Lines changed: 24320 additions & 14568 deletions
File tree
- .vscode
- src
- client
- app
- chat
- integrations
- load
- memory
- notifications
- personality-test
- settings
- tasks
- update
- components
- agents
- icons
- voice-test
- ui
- voice-visualization
- hooks
- main
- public
- audio
- images
- scripts
- styles
- utils
- interface
- app
- chat
- memory
- settings
- tasks
- update
- components
- main
- utils
- model
- agents
- app
- chat
- common
- context
- scraper
- server
- agents
- app
- auth
- chat
- common
- context
- memory
- scraper
- tests
- utils
- voice
- utils
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
205 | 205 | | |
206 | 206 | | |
207 | 207 | | |
208 | | - | |
| 208 | + | |
209 | 209 | | |
210 | 210 | | |
211 | 211 | | |
| |||
229 | 229 | | |
230 | 230 | | |
231 | 231 | | |
| 232 | + | |
232 | 233 | | |
233 | 234 | | |
234 | 235 | | |
| |||
239 | 240 | | |
240 | 241 | | |
241 | 242 | | |
242 | | - | |
| 243 | + | |
243 | 244 | | |
244 | 245 | | |
| 246 | + | |
245 | 247 | | |
246 | | - | |
| 248 | + | |
| 249 | + | |
| 250 | + | |
This file was deleted.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
112 | 112 | | |
113 | 113 | | |
114 | 114 | | |
115 | | - | |
| 115 | + | |
116 | 116 | | |
117 | 117 | | |
118 | 118 | | |
| |||
File renamed without changes.
File renamed without changes.
File renamed without changes.
0 commit comments