SLM360
The First AI That Thinks, Learns, and Remembers On-Device
SLM360 is a complete, privacy-first AI system comprising two purpose-built foundation models (SLM360 Nano, a 6.4M-parameter encoder, and SLM360 Base, a 125M-parameter decoder), a hybrid NLU pipeline, and an on-device continual learning framework, all implemented in pure Rust with zero external ML dependencies.
Specifications
| Metric | Value |
|---|---|
| Footprint | 50MB |
| Classification latency | 39ms |
| Banking77 accuracy | 100% |
| SNIPS accuracy | 98% |
| Multi-step reasoning | <100ms |
| Memory per user | ~64KB |
Architecture
1. Tier 1: Pattern matching (<1ms) - regex, exact match, contains, fuzzy
2. Tier 2: MicroTransformer, 85K params (2-5ms) - BPE tokenizer, 1-layer transformer
3. Tier 3: SLM360 Nano, 6.4M params (<5ms) - full encoder with GQA + SwiGLU
4. Tier 4: SLM360 Base, 125M params (<50ms/tok) - causal decoder for generation
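The tiers above form an escalation cascade: each stage runs only when the cheaper one below it is not confident enough. A minimal Rust sketch of that routing logic, with stubbed tier implementations and an illustrative threshold (the real tier internals and threshold values are not documented here):

```rust
#[derive(Debug, Clone, Copy, PartialEq)]
struct Classification {
    intent: &'static str,
    confidence: f32,
}

/// Illustrative escalation threshold, not the shipped value.
const TIER_CONFIDENCE: f32 = 0.85;

/// Tier 1: regex / exact / contains / fuzzy matching (<1ms). Stubbed.
fn tier1_patterns(query: &str) -> Option<Classification> {
    if query.contains("balance") {
        return Some(Classification { intent: "check_balance", confidence: 1.0 });
    }
    None
}

/// Tier 2: 85K-param MicroTransformer (stubbed with a fixed score).
fn tier2_micro(_query: &str) -> Classification {
    Classification { intent: "unknown", confidence: 0.40 }
}

/// Tier 3: SLM360 Nano, 6.4M-param encoder (stubbed).
fn tier3_nano(_query: &str) -> Classification {
    Classification { intent: "transfer_money", confidence: 0.93 }
}

/// Tier 4: SLM360 Base, 125M-param decoder (stubbed fallback).
fn tier4_base(_query: &str) -> Classification {
    Classification { intent: "generated_fallback", confidence: 0.50 }
}

/// Try each tier in order; stop at the first sufficiently confident answer.
fn classify(query: &str) -> Classification {
    if let Some(hit) = tier1_patterns(query) {
        return hit;
    }
    let c2 = tier2_micro(query);
    if c2.confidence >= TIER_CONFIDENCE {
        return c2;
    }
    let c3 = tier3_nano(query);
    if c3.confidence >= TIER_CONFIDENCE {
        return c3;
    }
    tier4_base(query)
}
```

A query with an exact pattern hit never touches a model, which is how the common case stays under a millisecond while hard queries still reach the 125M decoder.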
Features
- Hybrid classification: pattern matching (<1ms) + semantic embeddings (39ms) with confidence arbitration
- Multi-step reasoning engine with conditional execution, sequences, and automatic rollback in <100ms
- 5-tier SmartMemory: short-term, episodic, semantic, procedural, and meta-learning
- Predictive context engine: anticipates user needs from topic transitions, time patterns, and entity preferences
- Cross-platform deployment: Linux, macOS, Windows, WebAssembly, with iOS and Android planned
- 100% on-device processing. Privacy-first by architecture, not policy
- 50MB total footprint: ONNX model (32MB), pattern engine (8MB), reasoning (4MB), SmartMemory (2MB), cache (3MB), runtime (1MB)
- 196 tests passing with comprehensive coverage across NLU, reasoning, memory, WASM, and async
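The confidence arbitration between the pattern path and the embedding path could look like the following sketch (names and thresholds are illustrative assumptions, not the shipped API): a near-certain pattern hit wins outright, otherwise the higher-scoring route decides, so the 39ms semantic path only settles cases the sub-millisecond pattern path is unsure about.

```rust
/// A candidate intent label with its confidence score.
#[derive(Debug, Clone, PartialEq)]
struct Candidate {
    label: String,
    score: f32,
}

/// Hypothetical arbitration rule between the two classification routes.
fn arbitrate(pattern: Option<Candidate>, semantic: Candidate) -> Candidate {
    match pattern {
        // An exact/regex match is treated as authoritative.
        Some(p) if p.score >= 0.99 => p,
        // Otherwise, whichever route is more confident wins.
        Some(p) if p.score >= semantic.score => p,
        _ => semantic,
    }
}
```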
Benchmarks
| Dataset | Score | Comparison |
|---|---|---|
| Banking77 | 100% | BERT-base: 93.1% |
| SNIPS | 98% | BERT-base: 98.0% |
| Forgetting Rate (with EWC) | <2% | Without EWC: 23% |
| Correction Success Rate | 87% | - |
| Energy per Query | 0.001 Wh | Cloud LLM: 0.42-29 Wh |
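The EWC row refers to Elastic Weight Consolidation, a standard continual-learning regularizer. Its penalty term is `(lambda / 2) * sum_i F_i * (w_i - w*_i)^2`, where `F_i` is the Fisher-information importance of weight `i` and `w*_i` its value after the previous task: important weights are pulled back toward their old values, unimportant ones stay free to learn. A self-contained sketch of that term (the shipped framework's actual API is not documented here):

```rust
/// Elastic Weight Consolidation penalty: (lambda / 2) * sum_i F_i * (w_i - a_i)^2.
/// `anchors` holds the weight values frozen after the previous task and
/// `fisher` their importance estimates; adding this to the new task's loss
/// is what keeps the forgetting rate low.
fn ewc_penalty(weights: &[f32], anchors: &[f32], fisher: &[f32], lambda: f32) -> f32 {
    weights
        .iter()
        .zip(anchors.iter())
        .zip(fisher.iter())
        .map(|((w, a), f)| f * (w - a) * (w - a))
        .sum::<f32>()
        * lambda
        / 2.0
}
```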
Deployment Targets
- Native (ARM/x86) with SIMD auto-detection
- WebAssembly (~300KB gzipped) for browser deployment
- Android via JNI bindings
- iOS via FFI bindings
- Minimal mode (~50KB) for MCU deployment
Models in this Family
SLM360 Nano
6.4M-parameter bidirectional encoder for intent classification and NLU. Sub-5ms latency with INT4 quantization.
6.4M parameters · 4MB INT4 · <5ms latency · 6 layers
SLM360 LensNano
869KB on-device voice + vision AI engine for smart glasses. 1.67M params, sub-10ms latency, 100% offline.
1.67M parameters · 869KB INT4 · <10ms latency · 4 layers
SLM360 Base
125M-parameter causal decoder for autoregressive text generation. Sub-50ms per token with INT4.
125M parameters · 63MB INT4 · <50ms/tok latency · 16 layers
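As a sanity check on the sizes above, INT4 stores 4 bits (half a byte) per parameter, so the raw weight footprint is simply `params / 2` bytes: 125M parameters gives 62.5MB against the listed 63MB, and 6.4M gives 3.2MB against the listed 4MB. The listed files sit at or just above the raw estimate, plausibly due to quantization scale factors, higher-precision layers, and file metadata (an assumption, not a documented breakdown).

```rust
/// Back-of-envelope INT4 weight footprint: 0.5 bytes per parameter,
/// ignoring scale factors, unquantized tensors, and file metadata.
fn int4_bytes(params: u64) -> u64 {
    params / 2
}
```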