
Figure AI's second-generation humanoid robot featuring six RGB cameras, advanced dexterous hands, and OpenAI-powered speech interaction. Currently deployed in a limited pilot at BMW's Spartanburg plant.
The Figure 02 Humanoid Robot is Figure AI’s second-generation autonomous platform, designed specifically for high-dexterity labor and complex human-robot interaction. Unlike stationary industrial arms or specialized mobile bases, Figure 02 is a full-scale bipedal humanoid (~170 cm) built to operate in environments designed for humans. For AI engineers and researchers, this is not just a robotics platform; it is a mobile, embodied AI edge node.
Positioned in the premium enterprise tier, Figure 02 represents a significant leap from the Figure 01 prototype. It is designed to compete directly with high-end platforms like the Tesla Optimus (Gen 2) and the Boston Dynamics Atlas (Electric). While many humanoid projects remain in laboratory phases, Figure 02 is already seeing real-world deployment through a limited pilot at BMW’s Spartanburg manufacturing plant, where it is being tested for data collection and assembly line integration. For those evaluating the best humanoid robots for running AI models locally, Figure 02 stands out due to its tight integration with OpenAI’s speech-to-speech models and its sophisticated vision-language model (VLM) pipeline.
The Figure 02 Humanoid Robot for AI applications is built around a centralized compute architecture that handles both low-level motor control and high-level cognitive tasks. While Figure AI has not publicly disclosed the exact silicon (e.g., NVIDIA Jetson Thor or custom ASICs), the platform’s performance is optimized for real-time inference of Vision-Language-Action (VLA) models.
The robot features a 6x RGB camera array providing a 360-degree spherical view of the environment. Unlike simple object detection, the onboard AI must process these high-resolution feeds simultaneously to maintain spatial awareness and perform depth estimation. This requires substantial memory bandwidth to avoid latency in the perception-action loop.
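To get a feel for why memory bandwidth matters here, a back-of-envelope estimate of the raw pixel throughput from a six-camera rig is useful. The resolution, frame rate, and bit depth below are illustrative assumptions; Figure AI has not published the camera specifications.

```python
# Back-of-envelope estimate of raw sensor throughput for a six-camera rig.
# Resolution, frame rate, and bytes-per-pixel are ASSUMPTIONS for
# illustration only; Figure AI has not disclosed the actual camera specs.

def camera_bandwidth_mb_s(num_cameras: int, width: int, height: int,
                          bytes_per_pixel: int, fps: int) -> float:
    """Uncompressed throughput in MB/s the perception stack must ingest."""
    frame_bytes = width * height * bytes_per_pixel
    return num_cameras * frame_bytes * fps / 1e6

# Hypothetical 1280x800 RGB feeds at 30 fps across six cameras:
total = camera_bandwidth_mb_s(num_cameras=6, width=1280, height=800,
                              bytes_per_pixel=3, fps=30)
print(f"{total:.0f} MB/s of raw pixels before any inference runs")  # 553 MB/s
```

Even at these modest assumed settings, the perception stack must move over half a gigabyte of pixels per second before a single model forward pass happens, which is why the perception-action loop is bandwidth-bound as much as compute-bound.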
For practitioners evaluating how much memory the Figure 02 Humanoid Robot can dedicate to large language models, the system is designed to run multimodal models that fuse visual input with text-based reasoning.
While enterprise humanoid specs are often proprietary, the Figure 02 is engineered to handle the throughput required for real-time tactile sensing and 16-degree-of-freedom (DOF) hand manipulation. This necessitates a compute stack capable of processing high-frequency sensor data alongside high-level LLM reasoning.
The core value proposition of the Figure 02 Humanoid Robot for AI development is its ability to execute embodied AI tasks. This involves running a stack of models ranging from small, high-speed controllers to large, reasoning-heavy models.
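The shape of such a stack can be sketched as two loops running at very different rates: a fast low-level controller executes every tick, while a slow, reasoning-heavy planner updates the goal only occasionally. All names, rates, and structure below are illustrative assumptions, not Figure AI's actual architecture.

```python
# Minimal sketch of a tiered embodied-AI stack: a high-frequency motor
# policy runs every tick, while a slow VLM/LLM planner re-plans rarely.
# Everything here is a HYPOTHETICAL stand-in, not Figure AI's design.

def slow_planner(observation: str) -> str:
    """Stand-in for a large reasoning model emitting a high-level goal."""
    return f"plan-for:{observation}"

def fast_controller(goal: str, tick: int) -> str:
    """Stand-in for a high-frequency policy tracking the current goal."""
    return f"motor-command({goal}, t={tick})"

def run_stack(ticks: int, planner_period: int = 50) -> list[str]:
    goal = slow_planner("initial-scene")
    commands = []
    for t in range(ticks):
        if t and t % planner_period == 0:  # re-plan at a much lower rate
            goal = slow_planner(f"scene-at-{t}")
        commands.append(fast_controller(goal, t))
    return commands

cmds = run_stack(100)  # 100 control ticks, planner invoked at t=0 and t=50
```

The design point this illustrates: the expensive model sits off the critical path, so motor commands keep flowing at the control rate even while the planner "thinks".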
The Figure 02 is designed to run a combination of local and cloud-based models, with mission-critical autonomy handled by the on-board hardware.
While exact Figure 02 Humanoid Robot tokens-per-second benchmarks depend on the specific quantization used, the hardware is designed to ensure that "thinking" time does not exceed "acting" time. For a 7B parameter model, practitioners can expect inference speeds that exceed human speaking rates (approx. 15-20 tokens/sec), ensuring seamless interaction. For heavier reasoning tasks, the system can offload to more powerful remote servers, though the trend in Figure AI humanoid robots for AI development is toward increasing "on-edge" autonomy.
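The claim that 15-20 tokens/sec "exceeds human speaking rates" is easy to sanity-check. The words-per-minute figure and tokens-per-word ratio below are common ballpark assumptions, not measured Figure 02 numbers.

```python
# Sanity check: does ~15 tok/s actually outpace conversational speech?
# 150 words/min and ~1.3 tokens/word are ballpark ASSUMPTIONS commonly
# used for English text, not Figure 02 measurements.

def speech_rate_tokens_per_sec(words_per_min: float = 150.0,
                               tokens_per_word: float = 1.3) -> float:
    """Approximate token rate of spoken English."""
    return words_per_min * tokens_per_word / 60.0

speech = speech_rate_tokens_per_sec()   # 3.25 tokens/sec
inference = 15.0                        # low end of the quoted range
headroom = inference / speech
print(f"speech ≈ {speech:.2f} tok/s, inference headroom ≈ {headroom:.1f}x")
```

Even at the low end of the quoted range, generation runs roughly 4-5x faster than speech needs to be produced, leaving budget for first-token latency and any on-device vision workload running concurrently.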
The Figure 02 is not a consumer toy; it is a high-end development and industrial tool.
As demonstrated in the BMW pilot, the primary use case is the automation of complex, non-repetitive tasks in manufacturing. AI engineers can use the platform to train models on tactile feedback and precision manipulation, such as inserting parts or managing logistics in a warehouse.
For teams building the best hardware for local AI agents 2025, Figure 02 provides the ultimate "body" for an agent. It allows researchers to move beyond digital-only agents into physical agency. Developers can test how an LLM-based agent navigates physical constraints, handles unexpected obstacles, and communicates with human coworkers.
Because the Figure 02 can run models locally, it is suitable for environments with restricted connectivity (e.g., secure research labs or remote industrial sites). This makes it a primary candidate for projects requiring hardware for running 70B parameter models (via off-board compute) or mid-sized models (on-board) without data leaving the facility.
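The routing trade-off described above, mid-sized models on-board, 70B-class models on a facility server, and no cloud fallback in restricted environments, can be sketched as a simple decision function. The threshold and return strings are illustrative assumptions, not a published Figure AI policy.

```python
# Hedged sketch of the on-board vs off-board routing trade-off: the
# parameter threshold and labels are ASSUMPTIONS for illustration only.

def route_model(params_b: float, on_board_limit_b: float = 13.0,
                has_facility_server: bool = True) -> str:
    """Decide where a model of `params_b` billion parameters should run."""
    if params_b <= on_board_limit_b:
        return "on-board"
    if has_facility_server:
        return "off-board (facility network, data never leaves the site)"
    return "unavailable (no cloud fallback in restricted environments)"

print(route_model(7))    # on-board
print(route_model(70))   # off-board (facility network, data never leaves the site)
print(route_model(70, has_facility_server=False))
```

The key property for air-gapped deployments is the last branch: rather than silently falling back to a public cloud endpoint, an oversized request simply fails closed.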
When evaluating the Figure 02, practitioners typically compare it against other high-end humanoid platforms.
For organizations requiring the best AI chip for local deployment within a humanoid form factor, Figure 02 offers a more integrated and "ready-to-work" solution than lower-cost alternatives, provided the budget allows for enterprise-level investment.