Element Labs, Inc.

LM Studio

Discover, download, and run open models on your own computer, no command line needed.

Running open models from a polished desktop GUI

Visit Site Read the Docs

GitHub Stars

—

Contributors

—

Release Downloads

—

Latest Version

—

Maintained by: Element Labs, Inc.
First released: May 2023
Last commit: —
Pricing: Free
License: Proprietary

Runs on This Stack

The engines this app runs on and the models it ships with, linked into the rest of the research stack.

Runs on These Engines

Overview

LM Studio is a desktop application for discovering, downloading, and running open large language models locally on your own hardware. Developed by Element Labs, Inc. (first released 2023), it combines a polished chat GUI with a model manager and an OpenAI-compatible API server into a single installable app. It competes directly with tools like Ollama, Jan, GPT4All, and Msty, but distinguishes itself with a fully graphical interface that requires zero terminal usage and a free-for-commercial-use license.

The app is built for three overlapping audiences: developers who want a local backend while building or testing, privacy-conscious users who need offline AI, and non-technical team members who need to compare models in a chat window without writing code. LM Studio runs on macOS, Windows, and Linux, and supports both CPU-only and GPU-accelerated inference on Apple Silicon (via MLX and llama.cpp), NVIDIA GPUs, and AMD GPUs.

What makes LM Studio noteworthy is its balance of accessibility and developer utility. You can install it, download a model, and start chatting in under five minutes. The same instance can then expose that model over an OpenAI-compatible REST API on localhost, letting you point existing tools or scripts at it with no API key or cloud dependency. It also supports MCP (Model Context Protocol) for agentic workflows, local RAG via document attachment, and a headless CLI (llmster) for server deployments. As of 2026, it is one of the most downloaded local LLM desktop apps, with a large Discord community and active development.

What You Can Do With It

Download and Chat With Open Models

The core experience is a built-in model browser that lets you search, filter, and download models directly from Hugging Face. You can filter by parameter size, task type, tool-use capability, and whether the model fits within your device’s available RAM. Once downloaded, a model loads into the chat interface where you can adjust inference parameters (temperature, context length, top-p, etc.) and converse in a clean, multi-turn chat window. There is no terminal, no pip install, and no Python environment to manage.

Serve Models via an OpenAI-Compatible API

LM Studio’s local server exposes your loaded model through OpenAI-style endpoints (/v1/chat/completions, /v1/completions, /v1/embeddings) over localhost or your network. Any client that works with OpenAI can be pointed at http://localhost:1234/v1 (default port) with no API key. This makes it a drop-in replacement during development for teams that want free, offline inference while building.

Chat With Documents (Offline RAG)

You can attach files (PDFs, text files, code files, etc.) to a chat and ask questions about their contents. The retrieval happens entirely on your machine, so no data leaves your device. This is useful for summarizing internal documentation, analyzing logs, or querying research papers without cloud upload.

MCP Client for Agentic Workflows

LM Studio acts as an MCP client, meaning you can connect it to external MCP servers (tools, data sources, APIs) and let your local model use them. This enables agent-like behavior such as searching the web, querying a database, or running shell commands — all coordinated by a local LLM through the chat interface.

Headless Mode and SDKs

Beyond the desktop GUI, LM Studio offers llmster, a headless version installable via a one-liner on Linux, macOS, or Windows. You also get official Python and JavaScript SDKs for programmatic control, plus a CLI tool (lms) for scripting model downloads, server control, and daemon management.

Platforms, Pricing, and Requirements

Platform	Support
macOS	Intel and Apple Silicon (M-series)
Windows	x64 and ARM64
Linux	x64

Pricing: The desktop app is free for both personal and commercial use. Element Labs also offers an Enterprise tier (pricing on request) which likely includes priority support, SLAs, and custom deployment assistance. There are no feature gates on the free version — you get the full chat, server, MCP, and RAG capabilities.

Hardware requirements:

Apple Silicon (M1/M2/M3/M4): Recommended. Supports both llama.cpp (GGUF) and Apple MLX runtimes. MLX often provides faster inference on unified memory.
NVIDIA GPU: Works via CUDA (llama.cpp backend). Models above 7B parameters may need >8GB VRAM or offloading to RAM.
AMD GPU: Supported via ROCm on Linux, and experimental support on Windows.
CPU-only: Works fine for smaller models (1B–8B parameters) with acceptable speed for chat. Large models (20B+) will be slow.

Open-source status: The desktop app is closed source. Only the CLI (lms), SDKs, and headless llmster are released as open source. This is a limitation for teams that require full code transparency.

Key Features and Capabilities

Desktop Chat Interface

The main selling point: a real graphical application with a model browser, chat history, parameter controls, and a clean UI. No command line required for day-to-day use. You can switch between models mid-conversation, manage multiple sessions, and export chats.

OpenAI-Compatible Local Server

Exposes a fully compatible /v1/chat/completions endpoint. Drops in for tools like LangChain, LlamaIndex, Open Interpreter, and custom scripts. Supports streaming, JSON mode, function calling (if the model supports it), and embedding endpoints.

Chat With Documents

Local RAG with no external vector database or cloud API. LM Studio handles document parsing and chunking internally. File types supported include .txt, .pdf, .md, .py, and more. All processing stays on-device.

MCP Support

Connect MCP servers from the LM Studio interface. This allows local models to call external tools (web search, file system access, database queries) in a structured, permission-gated way. Useful for building autonomous agents on a single machine.

Additional Capabilities

Apple MLX runtime: Native performance on M-series Macs, often faster than llama.cpp for certain models.
lmstudio-hub: A platform for publishing and discovering models curated by the LM Studio team.
LM Link: Route local AI workloads across devices on your network (e.g., run inference on a powerful desktop while chatting from a laptop).
Manual model imports: Drop GGUF or MLX model files into the app’s folder if you want to bypass the browser.

Real-World Use Cases

Private, Offline AI

Run models on an air-gapped machine with no internet connection. No data ever leaves the device. This is the primary use case for legal, medical, or defense teams handling sensitive information.

Local Backend During Development

Instead of burning through free API credits or managing cloud keys, point your existing OpenAI client at LM Studio. It works for prototyping, unit testing, and building features that depend on LLM output without network latency or cost.

Non-Developers Evaluating Models

Let product managers, domain experts, or stakeholders try a model before the engineering team invests in integration. LM Studio’s chat interface is intuitive enough for non-technical users, and model downloads are one-click.

Running Agents Locally

Combine MCP servers with a capable local model (e.g., Llama 3.1 8B, DeepSeek R1) to build simple autonomous agents on your own machine. This is cheaper and more private than cloud-based agent frameworks.

CI and Server Deployments (Headless)

Use llmster in CI pipelines to test model inference, or deploy as a lightweight API server on a Linux VM without a desktop environment.

Who should look elsewhere: If you need voice input/output, image generation, or vision models, LM Studio does not support those currently. For very large models (70B+) on consumer hardware, you may find better speed with Ollama’s GPU offloading or cloud solutions like OpenAI/Anthropic.

Getting Started With LM Studio

Download the installer from [lmstudio.ai](https://lmstudio.ai). Choose the version for your OS (macOS .dmg, Windows .exe, Linux .AppImage or .deb). No account required.
Install and launch. The first run presents a model browser. Search for a small model (e.g., Llama 3.2 1B or Phi-3 mini) to start quickly. Click download; the model will appear in your library once downloaded.
Start a chat. Select the model, click “Chat”, and begin typing. Adjust temperature or context length in the side panel.
Optionally, enable the local server. Go to the “Server” tab and toggle it on. Your model is now available at http://localhost:1234/v1.
Explore advanced features: Attach documents for RAG, connect MCP servers, or install the Python/JS SDKs (pip install lmstudio, npm install @lmstudio/sdk).

Documentation: Full docs at [lmstudio.ai/docs](https://lmstudio.ai/docs). Community support on Discord (link on the website).

How It Compares

LM Studio vs. Ollama

Ollama is a lightweight CLI-based tool for running local models. It is open source, supports a wide range of backends, and integrates well with Docker and development workflows. However, it lacks a graphical chat interface out of the box. LM Studio provides a polished GUI for model discovery and chatting, making it more accessible to non-CLI users. Ollama is better for headless, scripted deployments; LM Studio is better for interactive use and for teams that need a GUI.

LM Studio vs. Jan

Jan is another open-source desktop app for local LLMs, with a similar philosophy but an MIT license (entirely open source). Jan offers a chat interface, model management, and an OpenAI-compatible server, but its ecosystem is smaller. LM Studio has more mature SDKs, MCP support, and frequent updates. If full open-source transparency is a requirement, choose Jan. If you need a more polished, feature-rich experience with SDKs and headless deployment, LM Studio is the stronger pick.

LM Studio vs. GPT4All

GPT4All (by Nomic AI) is free and open source, but its focus is on consumer-friendly offline chat with a curated set of relatively small models. It is simpler and less customizable. LM Studio supports larger models, multiple runtimes, MCP, and developer APIs. GPT4All wins on simplicity for end users who just want a private chatbot. LM Studio wins on developer utility and model variety.

Bottom line: LM Studio is the best choice for technical teams that need a free, local, GUI-driven app with a clear path from chat to API server to agentic workflows. Its limits are hardware-bound and its source is not fully open, but for most practitioners that tradeoff is acceptable.

Strengths

A real graphical app, so no terminal or Python setup is required.
Free for both personal and commercial use.
Built-in model browser and chat make it easy to get started.
Local server exposes an OpenAI-compatible API when you are ready to build.

Trade-offs

The desktop app is closed source. Only the CLI and SDKs are open.
No voice or image generation.
Speed is bound by your own hardware, with no cloud offload.

Key Features

What the app gives you out of the box, in plain language.

macOS
Windows
Linux
Apple Silicon
NVIDIA GPU
AMD GPU
CPU-Only
Local Models
Cloud Models
MCP Support
Agent Mode
Voice In / Out
Image Generation
Chat With Docs
OpenAI-Compatible Server
One-Click Install

Desktop chat interface
Download and chat with open models through a clean app, with nothing to set up beyond installing.
OpenAI-compatible local server
Expose your local models through OpenAI-style endpoints over localhost or your network.
Chat with your documents
Attach files to a chat and ask questions about them, fully offline.

Where It Shines

The jobs this app is best suited for.

Private, offline AI
Run models with no data leaving the device, for sensitive or air-gapped work.
A local backend during development
Point an existing OpenAI client at LM Studio for free, key-less inference while you build.
Trying models without the terminal
Let anyone on the team compare open models in a chat window before writing code.

Pricing

Free

Free for personal and commercial use. Enterprise tier available.

Side-by-Side

Compare LM Studio With Another App

Add a second or third app and see stars, downloads, platforms, and capabilities lined up next to each other.

Open the Comparator

Related Apps

Close alternatives worth a look before you decide.

Unsloth Studio

A no-code local web app for fine-tuning and running open models on your own hardware.

No-code local fine-tuning and chat

uv pip install unsloth && unsloth studio

Stars

—

Downloads

—

OtherOpen SourcemacOSWindowsLinux

Odysseus

PewDiePie's open-source, self-hosted AI workspace that runs on your own hardware.

Self-hosting your whole AI workflow in one app

git clone github.com/pewdiepie-archdaemon/odysseus && docker compose up -d --build

Stars

—

Downloads

—

AGPL 3.0Open SourcemacOSWindowsLinux

Frequently Asked Questions

What Is a Desktop AI App?

A desktop AI app is a program you install on your computer to chat with, code with, run agents on, or fine-tune AI models. It sits on top of the inference engines and models that do the real work and gives you a friendly interface instead of a command line.

Is LM Studio free to use?

LM Studio is offered under a Free model. Check the pricing section on this page and the app’s own site for the latest details, since tiers and limits change over time.

What platforms does LM Studio run on?

LM Studio is maintained by Element Labs, Inc.. See the capabilities section above for the exact list of platforms it supports, along with whether it runs models locally, connects to cloud APIs, or both.

Free Monthly Report

The AI Build Report

The state of AI models, API prices, and what to run where. New every month, free.

Element Labs, Inc.

LM Studio

Discover, download, and run open models on your own computer, no command line needed.

Running open models from a polished desktop GUI

Visit Site Read the Docs

GitHub Stars

—

Contributors

—

Release Downloads

—

Latest Version

—

Maintained by: Element Labs, Inc.
First released: May 2023
Last commit: —
Pricing: Free
License: Proprietary

Runs on This Stack

The engines this app runs on and the models it ships with, linked into the rest of the research stack.

Runs on These Engines

Overview

What You Can Do With It

Download and Chat With Open Models

Serve Models via an OpenAI-Compatible API

Chat With Documents (Offline RAG)

MCP Client for Agentic Workflows

Headless Mode and SDKs

Platforms, Pricing, and Requirements

Platform	Support
macOS	Intel and Apple Silicon (M-series)
Windows	x64 and ARM64
Linux	x64

Hardware requirements:

Apple Silicon (M1/M2/M3/M4): Recommended. Supports both llama.cpp (GGUF) and Apple MLX runtimes. MLX often provides faster inference on unified memory.
NVIDIA GPU: Works via CUDA (llama.cpp backend). Models above 7B parameters may need >8GB VRAM or offloading to RAM.
AMD GPU: Supported via ROCm on Linux, and experimental support on Windows.
CPU-only: Works fine for smaller models (1B–8B parameters) with acceptable speed for chat. Large models (20B+) will be slow.

Key Features and Capabilities

Desktop Chat Interface

OpenAI-Compatible Local Server

Chat With Documents

MCP Support

Additional Capabilities

Apple MLX runtime: Native performance on M-series Macs, often faster than llama.cpp for certain models.
lmstudio-hub: A platform for publishing and discovering models curated by the LM Studio team.
LM Link: Route local AI workloads across devices on your network (e.g., run inference on a powerful desktop while chatting from a laptop).
Manual model imports: Drop GGUF or MLX model files into the app’s folder if you want to bypass the browser.

Real-World Use Cases

Private, Offline AI

Run models on an air-gapped machine with no internet connection. No data ever leaves the device. This is the primary use case for legal, medical, or defense teams handling sensitive information.

Local Backend During Development

Non-Developers Evaluating Models

Running Agents Locally

CI and Server Deployments (Headless)

Use llmster in CI pipelines to test model inference, or deploy as a lightweight API server on a Linux VM without a desktop environment.

Getting Started With LM Studio

Download the installer from [lmstudio.ai](https://lmstudio.ai). Choose the version for your OS (macOS .dmg, Windows .exe, Linux .AppImage or .deb). No account required.
Install and launch. The first run presents a model browser. Search for a small model (e.g., Llama 3.2 1B or Phi-3 mini) to start quickly. Click download; the model will appear in your library once downloaded.
Start a chat. Select the model, click “Chat”, and begin typing. Adjust temperature or context length in the side panel.
Optionally, enable the local server. Go to the “Server” tab and toggle it on. Your model is now available at http://localhost:1234/v1.
Explore advanced features: Attach documents for RAG, connect MCP servers, or install the Python/JS SDKs (pip install lmstudio, npm install @lmstudio/sdk).

Documentation: Full docs at [lmstudio.ai/docs](https://lmstudio.ai/docs). Community support on Discord (link on the website).

How It Compares

LM Studio vs. Ollama

LM Studio vs. Jan

LM Studio vs. GPT4All

Strengths

A real graphical app, so no terminal or Python setup is required.
Free for both personal and commercial use.
Built-in model browser and chat make it easy to get started.
Local server exposes an OpenAI-compatible API when you are ready to build.

Trade-offs

The desktop app is closed source. Only the CLI and SDKs are open.
No voice or image generation.
Speed is bound by your own hardware, with no cloud offload.

Key Features

What the app gives you out of the box, in plain language.

macOS
Windows
Linux
Apple Silicon
NVIDIA GPU
AMD GPU
CPU-Only
Local Models
Cloud Models
MCP Support
Agent Mode
Voice In / Out
Image Generation
Chat With Docs
OpenAI-Compatible Server
One-Click Install

Desktop chat interface
Download and chat with open models through a clean app, with nothing to set up beyond installing.
OpenAI-compatible local server
Expose your local models through OpenAI-style endpoints over localhost or your network.
Chat with your documents
Attach files to a chat and ask questions about them, fully offline.

Where It Shines

The jobs this app is best suited for.

Private, offline AI
Run models with no data leaving the device, for sensitive or air-gapped work.
A local backend during development
Point an existing OpenAI client at LM Studio for free, key-less inference while you build.
Trying models without the terminal
Let anyone on the team compare open models in a chat window before writing code.

Pricing

Free

Free for personal and commercial use. Enterprise tier available.

Side-by-Side

Compare LM Studio With Another App

Add a second or third app and see stars, downloads, platforms, and capabilities lined up next to each other.

Open the Comparator

Related Apps

Close alternatives worth a look before you decide.

Unsloth Studio

A no-code local web app for fine-tuning and running open models on your own hardware.

No-code local fine-tuning and chat

uv pip install unsloth && unsloth studio

Stars

—

Downloads

—

OtherOpen SourcemacOSWindowsLinux

Odysseus

PewDiePie's open-source, self-hosted AI workspace that runs on your own hardware.

Self-hosting your whole AI workflow in one app

git clone github.com/pewdiepie-archdaemon/odysseus && docker compose up -d --build

Stars

—

Downloads

—

AGPL 3.0Open SourcemacOSWindowsLinux

Frequently Asked Questions

What Is a Desktop AI App?

Is LM Studio free to use?

LM Studio is offered under a Free model. Check the pricing section on this page and the app’s own site for the latest details, since tiers and limits change over time.

What platforms does LM Studio run on?

Free Monthly Report

The AI Build Report

The state of AI models, API prices, and what to run where. New every month, free.

LM Studio

Runs on This Stack

Runs on These Engines

Overview

Overview

What You Can Do With It

Download and Chat With Open Models

Serve Models via an OpenAI-Compatible API

Chat With Documents (Offline RAG)

MCP Client for Agentic Workflows

Headless Mode and SDKs

Platforms, Pricing, and Requirements

Key Features and Capabilities

Desktop Chat Interface

OpenAI-Compatible Local Server

Chat With Documents

MCP Support

Additional Capabilities

Real-World Use Cases

Private, Offline AI

Local Backend During Development

Non-Developers Evaluating Models

Running Agents Locally

CI and Server Deployments (Headless)

Getting Started With LM Studio

How It Compares

LM Studio vs. Ollama

LM Studio vs. Jan

LM Studio vs. GPT4All

Strengths

Trade-offs

Key Features

Desktop chat interface

OpenAI-compatible local server

Chat with your documents

Where It Shines

Private, offline AI

A local backend during development

Trying models without the terminal

Pricing

Compare LM Studio With Another App

Related Apps

Unsloth Studio

Odysseus

Frequently Asked Questions

What Is a Desktop AI App?

Is LM Studio free to use?

What platforms does LM Studio run on?

The AI Build Report

LM Studio

Runs on This Stack

Runs on These Engines

Overview

Overview

What You Can Do With It

Download and Chat With Open Models

Serve Models via an OpenAI-Compatible API

Chat With Documents (Offline RAG)

MCP Client for Agentic Workflows

Headless Mode and SDKs

Platforms, Pricing, and Requirements

Key Features and Capabilities

Desktop Chat Interface

OpenAI-Compatible Local Server

Chat With Documents

MCP Support

Additional Capabilities

Real-World Use Cases

Private, Offline AI

Local Backend During Development

Non-Developers Evaluating Models

Running Agents Locally

CI and Server Deployments (Headless)

Getting Started With LM Studio

How It Compares

LM Studio vs. Ollama

LM Studio vs. Jan

LM Studio vs. GPT4All

Strengths

Trade-offs