← Back to the AI Box Compare Compact, Rack and Rack Pro

Which size for which organisation?

All boxes share the same stack, the same behaviour, the same sovereignty guarantees. What changes: capacity, supported models, price.

★ Most popular

LMbox Compact

For small firms of 5-15 people, no server room needed

Soundproofed desktop chassis, plugs into a standard wall outlet. Built for law firms, confidential SMBs, or branch offices without a dedicated server room.

Concurrent users up to 10

Avg throughput ≈ 22 tok/s

Time-to-first-token ≈ 800 ms

Power · noise 180 W · 25 dB

22 000 €

+ 6 000 € / yr

Request a quote

LMbox Rack

For 30-80 person firms or 50-150 person mid-market with a server room

Standard 2U datacenter chassis. Same CPU/RAM as LMbox Compact, with doubled storage (RAID10), redundant hot-swap PSU and 10 GbE - for IT teams with a server room.

Concurrent users up to 25

Avg throughput ≈ 22 tok/s

Time-to-first-token ≈ 800 ms

Power · noise 220 W · 55 dB

25 000 €

+ 6 000 € / yr

Request a quote

LMbox Rack Pro

For 100-300 person mid-market, multi-BU, heavy RAG - GPU-accelerated

2U rack chassis with doubled RAM, doubled storage, standard NVIDIA RTX A4000 GPU and 25 GbE networking. For mid-market multi-BU deployments with dozens of concurrent agents, massive RAG ingestion, and Gemma 4 MTP speedup.

Concurrent users up to 100

Avg throughput ≈ 80 tok/s

Time-to-first-token ≈ 320 ms

Power · noise 380 W · 58 dB

52 000 €

+ 12 000 € / yr

Request a quote

Full specifications

Attribute	LMbox Compact ★	LMbox Rack	LMbox Rack Pro
Hardware
Form factor	Desktop soundproofed · 21 × 39 × 46 cm (Fractal Define 7 Mini)	2U rack 19" · 8,7 × 43,7 × 64,8 cm (Supermicro CSE-825 class)	2U rack 19" + GPU NVIDIA · 8,7 × 43,7 × 64,8 cm
Processor	AMD Ryzen 9 7950X · 16 cores / 32 threads · Zen 4 · AVX-512	AMD Ryzen 9 7950X · 16 cores / 32 threads · Zen 4 · AVX-512	AMD Ryzen 9 7950X · 16 cores / 32 threads · Zen 4 · AVX-512
DDR5 ECC RAM	128 Go	128 Go	256 Go
Storage	2 To	4 To	8 To
AI accelerator	CPU inference (AVX-512 + AMX optimisations llama.cpp)	CPU inference (AVX-512 + AMX optimisations llama.cpp)	NVIDIA RTX A4000 · 16 GB VRAM · 140 W TDP · single-slot low-profile
Network	2 × 2,5 GbE · BMC IPMI/Redfish dédié	2 × 10 GbE + 2 × 1 GbE management · BMC IPMI/Redfish dédié	2 × 25 GbE + 2 × 1 GbE management · BMC IPMI/Redfish dédié
Physical
Dimensions (h × w × d)	21 × 39 × 46 cm (mid-tower compact micro-ATX soundproofed)	8,7 × 43,7 × 64,8 cm (2U rack 19", alim redondante)	8,7 × 43,7 × 64,8 cm (2U rack 19", alim redondante 2× 1000W)
Weight	12.0 kg	18.0 kg	22.0 kg
Power draw	180 W	220 W	380 W
Noise level	25 dB	55 dB	58 dB
Performance
Concurrent users	10 users	25 users	100 users
Avg throughput	≈ 22 tok/s	≈ 22 tok/s	≈ 80 tok/s
Time-to-first-token	≈ 800 ms	≈ 800 ms	≈ 320 ms
Models
Supported models	Mistral Small 4 MoE 119B/6B · Gemma 4 31B + MTP drafter · Qwen 3.6 27B · bge-m3 embedding · Whisper medium	Mistral Small 4 MoE 119B/6B · Gemma 4 31B + MTP drafter · Qwen 3.6 27B · bge-m3 embedding · Whisper medium	Mistral Small 4 MoE 119B/6B · Gemma 4 31B + MTP drafter (GPU-accelerated) · Qwen 3.6 27B · Codestral 22B · bge-m3 (GPU embeddings) · Whisper large-v3
Pricing
CAPEX (one-shot)	22 000 €	25 000 €	52 000 €
Annual support	6 000 € / an	6 000 € / an	12 000 € / an

Decision helper

How to choose?

LMbox S

Title

Body

LMbox M

Title

Body

LMbox L

Title

Body

LMbox XL

Title

Body

Talk to an expert