Monadic Chat is a locally hosted web application for creating and using intelligent chatbots. By providing GPT and other LLMs with a Linux environment on Docker, it enables them to perform advanced tasks that require external tools. It supports voice interaction, image and video recognition and generation, and AI-to-AI chat, making it useful not only for everyday AI use but also for developing and researching AI-powered applications.
Available for Mac, Windows, and Linux (Debian/Ubuntu) with easy-to-use installers.
- Documentation (English/Japanese)
- Installation
Monadic Chat is an AI framework grounded in the real world. The term "grounding" here has two meanings.
Typically, discourse involves a context and a purpose that are referenced and updated as the conversation progresses. Just as in human-to-human conversation, maintaining and referencing context is useful, and sometimes essential, in conversations with AI agents. By defining the format and structure of this meta-information in advance, conversations with AI agents can be made more purposeful. This process, in which users and AI agents advance a discourse while sharing a common background, is the first meaning of "grounding."
Human users can employ various tools to achieve their goals, but in many cases AI agents cannot. Monadic Chat enables AI agents to execute tasks using external tools by providing them with a freely accessible Linux environment, allowing them to support users more effectively. Because this environment runs in Docker containers, it does not affect the host system. This is the second meaning of "grounding."
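As a rough illustration of this second sense of grounding, the sketch below shows how a tool call from an AI agent could be relayed to a shell command running inside a Docker container rather than on the host. The container name and the helper method are hypothetical examples for illustration, not Monadic Chat's actual API.

```ruby
require "open3"

# Hypothetical helper: run a shell command inside a dedicated Linux
# container so the host system is never touched. The container name
# "monadic-chat-python-container" is an assumption for illustration.
def run_in_container(command, container: "monadic-chat-python-container")
  stdout, stderr, status = Open3.capture3(
    "docker", "exec", container, "bash", "-c", command
  )
  status.success? ? stdout : "Error: #{stderr}"
end

# An AI agent's "run this code" request becomes a container-side call:
puts run_in_container("python3 -c 'print(2 ** 10)'")
```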
- Use of AI assistants via various web and local APIs
- Easy Docker environment setup using a GUI app with Electron
- Synchronized folder for syncing local files with files inside Docker containers
- User-added apps and containers functionality
- Support for both Human/AI chat and AI/AI chat
- Chat functionality utilizing multiple AI models
- Provision of a Linux environment to AI agents
- Tools available to LLMs via Docker containers
- Linux (+ apt)
- Ruby (+ gem)
- Python (+ pip)
- PGVector (+ PostgreSQL)
- Selenium (+ Chrome/Chromium)
- Use of LLMs via online and local APIs
- Each container can be managed via SSH
- Integration with Jupyter Notebook
- Export/import chat data
- Edit chat data (add, delete, edit)
- Specify the number of messages to send to the API as context size
- Set roles for messages (user, assistant, system)
- Generate and import/export text embeddings from PDFs
- Logging of code execution and tool/function use for debugging
- Text-to-speech for AI assistant responses (OpenAI or ElevenLabs)
- Speech recognition using the Whisper API (+ display of p-values)
- Automatic language detection for text-to-speech
- Choose the language and voice for text-to-speech
- Interactive conversation with AI agents using speech recognition and text-to-speech
- Save AI assistant's spoken responses as MP3 audio files
- Image generation using DALL·E 3 API
- Recognition and description of uploaded images
- Upload and recognition of multiple images
- Recognition and description of uploaded video content and audio
- Specify and edit API parameters and system prompts
- Extend functionality using the Ruby programming language
- Extend functionality using the Python programming language
- Web search capabilities using the Tavily API
- Perform web scraping using Selenium
- Add custom Docker containers
- Web APIs from various providers
- Ollama in the local Docker environment (a minimal local API call is sketched after this list)
- Llama
- Phi
- Mistral
- Gemma
- DeepSeek
- AI-to-AI chat functionality
- In addition to the main response from the AI assistant, the (invisible) state of the conversation can be managed by obtaining additional responses and updating values within a predefined JSON object (see the sketch just below)
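To make the last point concrete, here is a minimal sketch of what such a predefined JSON object might look like: the visible reply travels in one field, while invisible context values are updated turn by turn. The field names are illustrative assumptions, not the exact schema Monadic Chat uses.

```ruby
require "json"

# Illustrative conversation state: "message" is shown to the user, while
# "context" carries invisible values that the AI updates on every turn.
state = {
  "message" => "Sure. I have added the task to your list.",
  "context" => {
    "topic"      => "task planning",
    "turn_count" => 3,
    "open_tasks" => ["draft report", "book flight"]
  }
}

# After each exchange, the model returns an updated object of the same
# shape, so the shared background travels along with the conversation.
puts JSON.pretty_generate(state)
```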
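For the Ollama option listed above, a model served locally can be reached over Ollama's standard HTTP API. The sketch below assumes Ollama is listening on its default port 11434 and that a model named "llama3" has already been pulled; adjust both to your setup.

```ruby
require "net/http"
require "json"

# Minimal request to a locally running Ollama server (default port 11434).
# The model name "llama3" is an assumption; use whichever model you pulled.
uri = URI("http://localhost:11434/api/generate")
payload = { model: "llama3", prompt: "Say hello in one sentence.", stream: false }

response = Net::HTTP.post(uri, payload.to_json, "Content-Type" => "application/json")
puts JSON.parse(response.body)["response"]
```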
Yoichiro HASEBE
[email protected]
This software is available as open source under the terms of the MIT License.