memories embed
Generate embeddings for semantic search.
```bash
memories embed [options]
```

Generate vector embeddings for memories that don't have them yet. These embeddings power semantic search (memories search --semantic).
How it works
The command:
- Finds all memories without embeddings
- Downloads the embedding model if needed (~50 MB for the default model)
- Generates vector embeddings for each memory
- Stores embeddings in the local database
Everything runs locally — no API calls, no data leaves your machine.
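A minimal sketch of that loop, assuming the sentence-transformers Python package and a hypothetical SQLite layout (a memories table with an embedding BLOB column); the CLI's actual storage format may differ:

```python
# Sketch of the embed loop: find rows without vectors, encode locally, store.
# The table/column names are illustrative assumptions, not the CLI's real schema.
import sqlite3
import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")  # downloaded once, then cached locally
db = sqlite3.connect("memories.db")

# 1. Find memories that do not have an embedding yet
rows = db.execute("SELECT id, content FROM memories WHERE embedding IS NULL").fetchall()

# 2. Generate vectors locally — no API calls, nothing leaves the machine
vectors = model.encode([content for _, content in rows])

# 3. Store the vectors back in the local database
for (memory_id, _), vec in zip(rows, vectors):
    db.execute(
        "UPDATE memories SET embedding = ? WHERE id = ?",
        (np.asarray(vec, dtype=np.float32).tobytes(), memory_id),
    )
db.commit()
```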
Options
| Option | Description |
|---|---|
| --all | Regenerate embeddings for all memories, not just missing ones |
| --dry-run | Show what would be embedded without doing it |
Examples
Generate embeddings for memories without them:
```bash
memories embed
```

Output:

```
Embedding 42 memories...
Model: all-MiniLM-L6-v2 (384d, fast)
First run downloads the model. Subsequent runs are faster.
✓ Embedded 42 memories
```

Preview what would be embedded:

```bash
memories embed --dry-run
```

Regenerate all embeddings (after switching models):

```bash
memories embed --all
```

When to run
Automatic
Memories added with memories add automatically get embeddings. You don't need to run memories embed for new memories.
Manual
Run memories embed when:
- After importing memories — If you used memories import or memories ingest
- After syncing from cloud — If memories were synced without embeddings
- After switching models — Run memories embed --all to regenerate
- First-time semantic search — If you have existing memories and want to use --semantic
Embedding models
The default model is all-MiniLM-L6-v2:
- Size: ~50MB download
- Dimensions: 384-dimensional vectors
- Speed: Fast (~10 seconds for 100 memories)
- Quality: Good for most use cases
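If you want to sanity-check those numbers on your machine, a short script against the same model will do it. This is a standalone sketch assuming the sentence-transformers package; it is not part of the CLI itself:

```python
# Verify the default model's output size and rough throughput locally.
# Assumes `pip install sentence-transformers`; not part of the memories CLI.
import time
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")
print(model.get_sentence_embedding_dimension())  # 384

texts = [f"memory number {i}" for i in range(100)]
start = time.time()
model.encode(texts)
print(f"{time.time() - start:.1f}s to embed 100 short texts")
```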
You can switch to higher-quality models for better semantic accuracy:
```bash
# View available models
memories config model

# Switch to a better model
memories config model gte-base
memories embed --all
```

See Embedding Models for detailed comparison and recommendations.
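The reason --all is required after a switch: vectors from different models have different sizes and live in different embedding spaces, so old vectors cannot be compared against queries embedded with the new model. A small illustration, assuming the sentence-transformers package and the usual Hugging Face uploads of these models (gte-base is published as thenlper/gte-base):

```python
# Why old embeddings must be regenerated after a model switch.
# Model IDs below are common Hugging Face uploads — assumptions, not CLI internals.
from sentence_transformers import SentenceTransformer

old_model = SentenceTransformer("all-MiniLM-L6-v2")
new_model = SentenceTransformer("thenlper/gte-base")

text = "Prefers dark roast coffee"
old_vec = old_model.encode(text)
new_vec = new_model.encode(text)

# Different dimensions, different vector spaces: comparing them is meaningless.
print(old_vec.shape, new_vec.shape)  # (384,) (768,)
```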
Storage
Embedding storage depends on the model:
| Model | Dimensions | Storage per memory |
|---|---|---|
| all-MiniLM-L6-v2 | 384 | ~1.5 KB |
| gte-base | 768 | ~3 KB |
| gte-large | 1024 | ~4 KB |
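These figures follow from storing one 32-bit float per dimension (an assumption about the storage format, but the typical choice): 384 × 4 bytes ≈ 1.5 KB, and so on. A quick check:

```python
# Per-memory embedding size, assuming float32 storage (4 bytes per dimension).
BYTES_PER_DIMENSION = 4  # float32

for model_name, dims in [("all-MiniLM-L6-v2", 384), ("gte-base", 768), ("gte-large", 1024)]:
    kb = dims * BYTES_PER_DIMENSION / 1024
    print(f"{model_name}: {dims} dims ≈ {kb:.1f} KB per memory")
```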
Troubleshooting
Model download fails
If the model download fails, check your internet connection and try again:
```bash
memories embed
```

The model is cached at ~/.cache/memories/models/, so partial downloads will resume.
Embeddings not working
If semantic search isn't returning results:
- Check that embeddings exist: memories stats
- Regenerate if needed: memories embed --all
- Try a different model: memories config model gte-base
Large database
If your database is getting large, embeddings are the main contributor. Consider:
- Using a smaller model (all-MiniLM-L6-v2 vs gte-large)
- Using keyword search for simple queries
- Only generating embeddings for important memories