Embedding (Vector embedding) — AI Week Radar glossary

An embedding model maps a chunk of input to a fixed-dimensional vector (typically 384, 768, 1536, or 3072 dimensions). The training objective is set up so that semantically similar inputs end up close in the space — "how to fix a leaky tap" and "plumbing repair guide" should be neighbours; "how to fix a leaky tap" and "Mongolian throat singing" should not.

Production stacks usually use a hosted embedding model (OpenAI text-embedding-3, Cohere embed-v3, Voyage AI) or an open one (BGE, E5, GTE) plus a vector database (Pinecone, Weaviate, Qdrant, pgvector, Cloudflare Vectorize). The choice of embedding model matters more than the choice of DB — the DB just stores and searches what the model produced.

Embedding

See also

RAG

Context window