What is Embedchain?
Embedchain is an Open Source Framework for personalizing LLM responses. It makes it easy to create and deploy personalized AI apps. At its core, Embedchain follows the design principle of being "Conventional but Configurable" to serve both software engineers and machine learning engineers.
Embedchain streamlines the creation of personalized LLM applications, offering a seamless process for managing various types of unstructured data. It efficiently segments data into manageable chunks, generates relevant embeddings, and stores them in a vector database for optimized retrieval. With a suite of diverse APIs, it enables users to extract contextual information, find precise answers, or engage in interactive chat conversations, all tailored to their own data.
🔧 Quick install
Python API
pip install embedchain
✨ Live demo
Checkout the Chat with PDF live demo we created using Embedchain. You can find the source code here.
🔍 Usage
For example, you can create an Elon Musk bot using the following code:
import os
from embedchain import App
os.environ["OPENAI_API_KEY"] = "<YOUR_API_KEY>"
app = App()
app.add("https://en.wikipedia.org/wiki/Elon_Musk")
app.add("https://www.forbes.com/profile/elon-musk")
app.query("How many companies does Elon Musk run and name those?")
You can also try it in your browser with Google Colab:

📖 Documentation
Comprehensive guides and API documentation are available to help you get the most out of Embedchain:
🤝 Schedule a 1-on-1 Session
Book a 1-on-1 Session with the founders, to discuss any issues, provide feedback, or explore how we can improve Embedchain for you.
🌐 Contributing
Contributions are welcome! Please check out the issues on the repository, and feel free to open a pull request.
For more information, please see the contributing guidelines.
For more reference, please go through Development Guide and Documentation Guide.
Anonymous Telemetry
We collect anonymous usage metrics to enhance our package's quality and user experience. This includes data like feature usage frequency and system info, but never personal details. The data helps us prioritize improvements and ensure compatibility. If you wish to opt-out, set the environment variable EC_TELEMETRY=false
. We prioritize data security and don't share this data externally.
Citation
If you utilize this repository, please consider citing it with:
@misc{embedchain,
author = {Taranjeet Singh, Deshraj Yadav},
title = {Embedchain: The Open Source RAG Framework},
year = {2023},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/embedchain/embedchain}},
}