Add comment about batch_size reduction for llama-server stability
All checks were successful
BotServer CI/CD / build (push) Successful in 2m9s

This commit is contained in:
Rodrigo Rodriguez (Pragmatismo) 2026-04-12 09:59:49 -03:00
parent d3673e1f34
commit 694fb91efe

View file

@ -43,7 +43,7 @@ impl Default for EmbeddingConfig {
embedding_model: "BAAI/bge-multilingual-gemma2".to_string(),
embedding_key: None,
dimensions: 2048,
batch_size: 2,
batch_size: 2, // Reduced from 16 to prevent llama-server crash
timeout_seconds: 60,
max_concurrent_requests: 1,
connect_timeout_seconds: 10,