Add comment about batch_size reduction for llama-server stability
All checks were successful
BotServer CI/CD / build (push) Successful in 2m9s
parent d3673e1f34
commit 694fb91efe
1 changed file with 1 addition and 1 deletion
@@ -43,7 +43,7 @@ impl Default for EmbeddingConfig {
     embedding_model: "BAAI/bge-multilingual-gemma2".to_string(),
     embedding_key: None,
     dimensions: 2048,
-    batch_size: 2,
+    batch_size: 2, // Reduced from 16 to prevent llama-server crash
     timeout_seconds: 60,
     max_concurrent_requests: 1,
     connect_timeout_seconds: 10,
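For context, here is a minimal sketch of how a `batch_size` like the one in the diff above might cap the number of texts sent per embedding request. It is an illustration under stated assumptions, not the BotServer implementation: the trimmed-down `EmbeddingConfig`, the `usize` field types, and the `plan_batches` helper are all hypothetical.

```rust
/// Hypothetical, trimmed-down stand-in for the `EmbeddingConfig` in the diff.
/// Only the fields this sketch needs are included, and the `usize` types are
/// an assumption (the real field types are not visible in the diff).
#[derive(Debug, Clone)]
pub struct EmbeddingConfig {
    pub batch_size: usize,
    pub max_concurrent_requests: usize,
}

impl Default for EmbeddingConfig {
    fn default() -> Self {
        Self {
            batch_size: 2, // Reduced from 16 to prevent llama-server crash
            max_concurrent_requests: 1,
        }
    }
}

/// Hypothetical helper: splits `texts` into groups of at most `batch_size`
/// items, one group per embedding request.
/// A `batch_size` of 0 is treated as 1, because `chunks(0)` would panic.
pub fn plan_batches<'a>(texts: &'a [String], config: &EmbeddingConfig) -> Vec<&'a [String]> {
    texts.chunks(config.batch_size.max(1)).collect()
}
```

With the default `batch_size` of 2, five input texts would be split into three requests of 2, 2, and 1 texts. With `max_concurrent_requests` set to 1, those requests would presumably be sent one at a time, which keeps the load on llama-server small at the cost of throughput.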