Global Architecture
Libro fundamentally rewrites how vectorization occurs by bringing the embedding models to the edge.
The Problem with Traditional Vectors
Historically, developers had to:
- Send text to OpenAI to get an embedding.
- Store that massive float array in Pinecone or Postgres (pgvector).
- Do this on a centralized database, introducing heavy latency.
The Libro Edge Engine
Libro runs specialized, highly optimized ONNX variants of Transformer models directly on the Edge. When you call ingest():
- The text hits our nearest edge node (e.g. Frankfurt, Sydney, Washington).
- It is chunked and vectorized locally in under 15ms.
- It is stored in a distributed edge-replica database.
This architecture results in zero central-database bottleneck, meaning infinite scaling.