brad/LLM_Engineering_OLD

Files

Mike 7e3ddf460d Added a multimodal chatbot interface project input to community contributions

2025-07-06 17:59:51 +01:00

1.1 KiB

Raw Blame History

🧠 Agentic Voice/Text Support Chatbot

A multimodal chatbot interface with support for text and voice input, multiple large language models (LLMs), and context memory persistence — all in a single Gradio-based GUI.

🚀 Features

🔄 Multi-LLM switching: Dynamically switch between OpenAI, Anthropic Claude, and Meta LLaMA (via Ollama)
🎤 Voice input: Use your microphone with live speech-to-text transcription
💬 Contextual memory: Maintain chat history even when switching models
🧪 Prototype-ready: Built with Gradio for rapid GUI testing and development

🛠️ Technologies Used

Gradio – GUI interface
OpenAI API
Anthropic Claude API
Ollama – Local LLaMA inference
speech_recognition – Voice-to-text
sounddevice, numpy – Audio recording
.env – Environment variable management

You’ll also need:

API keys for OpenAI and Claude
Ollama installed locally to run LLaMA models
A .env file with the necessary API keys