
Security News
Engineering with AI Podcast: The Promise of AI-First Development
Socket CTO Ahmad Nassri shares practical AI coding techniques, tools, and team workflows, plus what still feels noisy and why shipping remains human-led.
llama-stack
Advanced tools
Quick Start | Documentation | Colab Notebook | Discord
To try Llama Stack locally, run:
curl -LsSf https://github.com/llamastack/llama-stack/raw/main/scripts/install.sh | bash
Llama Stack standardizes the core building blocks that simplify AI application development. It codifies best practices across the Llama ecosystem. More specifically, it provides
By reducing friction and complexity, Llama Stack empowers developers to focus on what they do best: building transformative generative AI applications.
Here is a list of the various API providers and available distributions that can help developers get started easily with Llama Stack. Please checkout for full list
| API Provider Builder | Environments | Agents | Inference | VectorIO | Safety | Telemetry | Post Training | Eval | DatasetIO |
|---|---|---|---|---|---|---|---|---|---|
| Meta Reference | Single Node | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| SambaNova | Hosted | ✅ | ✅ | ||||||
| Cerebras | Hosted | ✅ | |||||||
| Fireworks | Hosted | ✅ | ✅ | ✅ | |||||
| AWS Bedrock | Hosted | ✅ | ✅ | ||||||
| Together | Hosted | ✅ | ✅ | ✅ | |||||
| Groq | Hosted | ✅ | |||||||
| Ollama | Single Node | ✅ | |||||||
| TGI | Hosted/Single Node | ✅ | |||||||
| NVIDIA NIM | Hosted/Single Node | ✅ | ✅ | ||||||
| ChromaDB | Hosted/Single Node | ✅ | |||||||
| Milvus | Hosted/Single Node | ✅ | |||||||
| Qdrant | Hosted/Single Node | ✅ | |||||||
| Weaviate | Hosted/Single Node | ✅ | |||||||
| SQLite-vec | Single Node | ✅ | |||||||
| PG Vector | Single Node | ✅ | |||||||
| PyTorch ExecuTorch | On-device iOS | ✅ | ✅ | ||||||
| vLLM | Single Node | ✅ | |||||||
| OpenAI | Hosted | ✅ | |||||||
| Anthropic | Hosted | ✅ | |||||||
| Gemini | Hosted | ✅ | |||||||
| WatsonX | Hosted | ✅ | |||||||
| HuggingFace | Single Node | ✅ | ✅ | ||||||
| TorchTune | Single Node | ✅ | |||||||
| NVIDIA NEMO | Hosted | ✅ | ✅ | ✅ | ✅ | ✅ | |||
| NVIDIA | Hosted | ✅ | ✅ | ✅ |
Note: Additional providers are available through external packages. See External Providers documentation.
A Llama Stack Distribution (or "distro") is a pre-configured bundle of provider implementations for each API component. Distributions make it easy to get started with a specific deployment scenario - you can begin with a local development setup (eg. ollama) and seamlessly transition to production (eg. Fireworks) without changing your application code. Here are some of the distributions we support:
| Distribution | Llama Stack Docker | Start This Distribution |
|---|---|---|
| Starter Distribution | llamastack/distribution-starter | Guide |
| Meta Reference | llamastack/distribution-meta-reference-gpu | Guide |
| PostgreSQL | llamastack/distribution-postgres-demo |
Please checkout our Documentation page for more details.
llama CLI to work with Llama models (download, study prompts), and building/starting a Llama Stack distribution.llama-stack-client CLI, which allows you to query information about the distribution.| Language | Client SDK | Package |
|---|---|---|
| Python | llama-stack-client-python | |
| Swift | llama-stack-client-swift | |
| Typescript | llama-stack-client-typescript | |
| Kotlin | llama-stack-client-kotlin |
Check out our client SDKs for connecting to a Llama Stack server in your preferred language, you can choose from python, typescript, swift, and kotlin programming languages to quickly build your applications.
You can find more example scripts with client SDKs to talk with the Llama Stack server in our llama-stack-apps repo.
Thanks to all of our amazing contributors!
FAQs
Llama Stack
We found that llama-stack demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 5 open source maintainers collaborating on the project.
Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Security News
Socket CTO Ahmad Nassri shares practical AI coding techniques, tools, and team workflows, plus what still feels noisy and why shipping remains human-led.

Research
/Security News
A five-month operation turned 27 npm packages into durable hosting for browser-run lures that mimic document-sharing portals and Microsoft sign-in, targeting 25 organizations across manufacturing, industrial automation, plastics, and healthcare for credential theft.

Research
Fake “Phantom Shuttle” VPN Chrome extensions (active since 2017) hijack proxy auth to intercept traffic and continuously exfiltrate user credentials to attacker infrastructure.