Fix typos in bot file extensions and keyword names

Corrected references from .vbs files to .bas and renamed the ADD WEBSITE
keyword to USE WEBSITE. Also added missing fields to the API response
structure and clarified that start.bas is optional for bots.
Rodrigo Rodriguez (Pragmatismo) 2025-11-26 22:54:22 -03:00
parent 066a30b003
commit 3add3ccbfa
64 changed files with 6465 additions and 822 deletions


@@ -2091,7 +2091,7 @@
 ### Bug Fixes
-* **basic:** Fix default bot.vbs missing parenthesis in code. ([8501002](https://github.com/pragmatismo-io/BotServer/commit/8501002))
+* **basic:** Fix default bot.bas missing parenthesis in code. ([8501002](https://github.com/pragmatismo-io/BotServer/commit/8501002))
 ## [1.7.1](https://github.com/pragmatismo-io/BotServer/compare/1.7.0...1.7.1) (2019-08-30)


@@ -62,7 +62,7 @@
 - [SET BOT MEMORY](./chapter-06-gbdialog/keyword-set-bot-memory.md)
 - [USE KB](./chapter-06-gbdialog/keyword-use-kb.md)
 - [CLEAR KB](./chapter-06-gbdialog/keyword-clear-kb.md)
-- [ADD WEBSITE](./chapter-06-gbdialog/keyword-add-website.md)
+- [USE WEBSITE](./chapter-06-gbdialog/keyword-use-website.md)
 - [USE TOOL](./chapter-06-gbdialog/keyword-use-tool.md)
 - [CLEAR TOOLS](./chapter-06-gbdialog/keyword-clear-tools.md)
 - [GET](./chapter-06-gbdialog/keyword-get.md)


@@ -29,7 +29,7 @@ Drop the folder in `templates/`, it loads automatically.
 ### Dialogs (.gbdialog)
 - BASIC scripts that control conversation
-- Must have `start.bas` as entry point
+- `start.bas` is optional (but needed to activate tools/KB with USE TOOL/USE KB)
 - Simple commands like TALK and HEAR
 ### Knowledge Base (.gbkb)
@@ -59,6 +59,8 @@ Drop the folder in `templates/`, it loads automatically.
 No build process. No compilation. Just folders and files.
+The web UI uses **vanilla JavaScript and Alpine.js** - no webpack, no npm build, just edit and refresh.
 ## Topics Covered
 - [.gbai Architecture](./gbai.md) - Package details


@@ -40,7 +40,7 @@ BASIC scripts that control conversation flow:
 ```
 my-bot.gbdialog/
-  start.bas          # Runs when session starts
+  start.bas          # Optional - needed to activate tools/KB
   auth.bas           # Login flow
   tools/             # Callable functions
     book-meeting.bas
@@ -49,13 +49,16 @@ my-bot.gbdialog/
     on-email.bas
 ```
-Example `start.bas`:
+Example `start.bas` (optional, but required for tools/KB):
 ```basic
-TALK "Hi! I'm your assistant."
-answer = HEAR
-TALK "I can help you with: " + answer
+USE KB "policies"
+USE TOOL "book-meeting"
+USE TOOL "check-status"
+TALK "Hi! I'm your assistant with tools and knowledge ready."
 ```
+Note: If you don't need tools or knowledge bases, `start.bas` is optional. The LLM will handle basic conversations without it.
 ### .gbkb/ - Your Bot's Knowledge
 Documents organized by topic:
@@ -109,7 +112,7 @@ Here's a complete customer support bot:
 ```
 support.gbai/
   support.gbdialog/
-    start.bas
+    start.bas        # Optional, but needed for tools/KB
     tools/
       create-ticket.bas
       check-status.bas
@@ -122,7 +125,7 @@ support.gbai/
   config.csv
 ```
-`start.bas`:
+`start.bas` (activates tools and knowledge bases):
 ```basic
 USE KB "faqs"
 USE KB "guides"
@@ -212,7 +215,7 @@ templates/
 ### Required
 - Folder must end with `.gbai`
 - Subfolders must match: `botname.gbdialog`, `botname.gbkb`, etc.
-- Main script must be `start.bas`
+- `start.bas` is optional, but required if you want to use tools or knowledge bases (must USE TOOL and USE KB to activate them)
 ### Recommended
 - Use lowercase with hyphens: `customer-service.gbai`
@@ -229,20 +232,14 @@ When BotServer starts:
 Takes about 5-10 seconds per bot.
-## Hot Reload
-Change files while running:
-```bash
-# Edit script
-vim templates/my-bot.gbai/my-bot.gbdialog/start.bas
-# Reload just that bot
-curl http://localhost:8080/api/admin/reload/my-bot
-# Or restart everything
-./botserver restart
-```
+## UI Architecture
+The web interface uses **vanilla JavaScript and Alpine.js** - no build process required:
+- Pure HTML/CSS/JS files
+- Alpine.js for reactivity
+- No webpack, no npm build
+- Edit and refresh to see changes
+- Zero compilation time
 ## Package Size Limits
@@ -258,7 +255,7 @@ Default limits (configurable):
 **Bot not appearing?**
 - Check folder ends with `.gbai`
 - Verify subfolders match bot name
-- Look for `start.bas` in `.gbdialog/`
+- If using tools/KB, ensure `start.bas` exists with USE TOOL/USE KB commands
 **Documents not searchable?**
 - Ensure files are in `.gbkb/` subfolder


@@ -73,15 +73,27 @@ TALK "What product information can I help you with?"
 ## Script Structure
-### Entry Point: start.bas
-Every bot needs a `start.bas` file in the [`.gbdialog`](../chapter-02/gbdialog.md) folder:
+### Entry Point: start.bas (Optional)
+The `start.bas` file in the [`.gbdialog`](../chapter-02/gbdialog.md) folder is **optional**, but required if you want to activate tools or knowledge bases:
 ```basic
-' Minimal start script - let system AI handle conversations
+' Optional start script - needed only to activate tools/KB
 USE KB "company_docs"
+USE TOOL "book-meeting"
+USE TOOL "check-status"
 TALK "Welcome! How can I assist you today?"
 ```
+**When you need start.bas:**
+- To activate knowledge bases with `USE KB`
+- To activate tools with `USE TOOL`
+- To set initial context or configuration
+**When you don't need start.bas:**
+- For simple conversational bots
+- When the LLM can handle everything without tools/KB
+- For basic Q&A without document search
 ### Tool Definitions
 Create separate `.bas` files for each tool. See [KB and Tools](../chapter-03/kb-and-tools.md) for more information:

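The optional `start.bas` pattern described above lends itself to a compact template. A minimal sketch in the book's BASIC dialect, assuming only keywords this commit already documents (`USE KB`, `USE TOOL`, `SET BOT MEMORY`); the collection, tool, and memory names are illustrative, and the two-argument `SET BOT MEMORY` form is an assumption:

```basic
' start.bas - present only because this bot activates tools and a KB
USE KB "support-docs"             ' make the document collection searchable
USE TOOL "create-ticket"          ' expose a callable tool to the LLM
SET BOT MEMORY "tone" "friendly"  ' assumed syntax: seed initial context
TALK "Hi! Ask about support, or say 'ticket' to open one."
```

Without tools or a KB, the whole file can be deleted and the LLM handles the conversation on its own.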

@@ -46,7 +46,7 @@ Each document is processed into vector embeddings using:
 ### Creating Collections
 ```basic
 USE KB "company-policies"
-ADD WEBSITE "https://company.com/docs"
+USE WEBSITE "https://company.com/docs"
 ```
 ### Using Collections


@@ -154,7 +154,7 @@ Add `.bas` files to `.gbdialog`:
 ### Development Tools
 #### api-client.gbai
-- **Files**: `climate.vbs`, `msft-partner-center.bas`
+- **Files**: `climate.bas`, `msft-partner-center.bas`
 - **Examples**: REST API patterns
 - **Integration**: External services
@@ -246,4 +246,4 @@ When migrating from traditional platforms:
 - README files in each template folder
 - Example configurations included
 - Sample knowledge bases provided
-- Community forums for discussions
+- Community forums for discussions


@@ -8,7 +8,7 @@ The system automatically indexes documents when:
 - Files are added to any `.gbkb` folder
 - `USE KB` is called for a collection
 - Files are modified or updated
-- `ADD WEBSITE` crawls new content
+- `USE WEBSITE` registers websites for crawling (preprocessing) and associates them with sessions (runtime)
 ## How Indexing Works
@@ -35,8 +35,9 @@ To keep web content fresh, schedule regular crawls:
 ' In update-docs.bas
 SET SCHEDULE "0 2 * * *"  ' Run daily at 2 AM
-ADD WEBSITE "https://docs.example.com"
-' Website is crawled and indexed automatically
+USE WEBSITE "https://docs.example.com"
+' Website is registered for crawling during preprocessing
+' At runtime, it associates the crawled content with the session
 ```
 ### Scheduling Options
@@ -106,11 +107,10 @@ Schedule regular web updates:
 ' In maintenance.bas
 SET SCHEDULE "0 1 * * *"
-' Update news daily
-ADD WEBSITE "https://company.com/news"
-' Update product docs on schedule
-ADD WEBSITE "https://company.com/products"
+' Register websites for crawling
+USE WEBSITE "https://company.com/news"
+USE WEBSITE "https://company.com/products"
+' Websites are crawled by background service
 ```
 ## Best Practices


@@ -4,7 +4,7 @@ This chapter explains how GeneralBots manages knowledge-base collections, inde
 | Document | File | Description |
 |----------|------|-------------|
-| **README** | [README.md](README.md) | High-level reference for the `.gbkb` package and its core commands (`USE KB`, `CLEAR KB`, `ADD WEBSITE`). |
+| **README** | [README.md](README.md) | High-level reference for the `.gbkb` package and its core commands (`USE KB`, `CLEAR KB`, `USE WEBSITE`). |
 | **Caching** | [caching.md](caching.md) | Optional in-memory and persistent SQLite caching to speed up frequent `FIND` queries. |
 | **Context Compaction** | [context-compaction.md](context-compaction.md) | Techniques to keep the LLM context window within limits (summarization, memory pruning, sliding window). |
 | **Indexing** | [indexing.md](indexing.md) | Process of extracting, chunking, embedding, and storing document vectors in the VectorDB. |


@@ -111,8 +111,8 @@ To keep web content updated, schedule regular crawls:
 ```basic
 ' In update-content.bas
 SET SCHEDULE "0 3 * * *"  ' Run daily at 3 AM
-ADD WEBSITE "https://example.com/docs"
-' Website content is crawled and added to the collection
+USE WEBSITE "https://example.com/docs"
+' Website is registered for crawling and will be available in conversations
 ```
 ## How Search Works


@@ -1,23 +0,0 @@
-# ADD WEBSITE Keyword
-**Syntax**
-```
-ADD WEBSITE "https://example.com"
-```
-**Parameters**
-- `"url"` A valid HTTP or HTTPS URL pointing to a website that should be added to the conversation context.
-**Description**
-`ADD WEBSITE` validates the provided URL and, when the `web_automation` feature is enabled, launches a headless browser to crawl the site, extract its textual content, and index it into a vectorDB collection associated with the current user. The collection name is derived from the URL and the bot's identifiers. After indexing, the website becomes a knowledge source that can be queried by `FIND` or `LLM` calls.
-If the feature is not compiled, the keyword returns an error indicating that web automation is unavailable.
-**Example**
-```basic
-ADD WEBSITE "https://en.wikipedia.org/wiki/General_Bots"
-TALK "Website added. You can now search its content with FIND."


@@ -191,7 +191,7 @@ NEXT
 ## Related Keywords
 - [USE KB](./keyword-use-kb.md) - Load knowledge bases
-- [ADD WEBSITE](./keyword-add-website.md) - Create KB from website
+- [USE WEBSITE](./keyword-use-website.md) - Associate website with conversation
 - [FIND](./keyword-find.md) - Search within loaded KBs
 - [LLM](./keyword-llm.md) - Use KB context in responses


@@ -163,7 +163,7 @@ END
 ```basic
 ' Refresh collection with new documents
 CLEAR KB "news"
-ADD WEBSITE "https://example.com/news"
+USE WEBSITE "https://example.com/news"
 USE KB "news"
 ```
@@ -194,9 +194,9 @@ USE TOOL "check_stock"
 ' Tool can access inventory knowledge when executing
 ```
-### With ADD WEBSITE
+### With USE WEBSITE
 ```basic
-ADD WEBSITE "https://docs.example.com" TO "documentation"
+USE WEBSITE "https://docs.example.com"
 USE KB "documentation"
 ' Fresh web content now searchable
 ```
@@ -266,6 +266,6 @@ Solution: Wait for indexing to complete (automatic)
 ## See Also
 - [CLEAR KB](./keyword-clear-kb.md) - Deactivate knowledge bases
-- [ADD WEBSITE](./keyword-add-website.md) - Add web content to KB
+- [USE WEBSITE](./keyword-use-website.md) - Associate website with conversation
 - [Knowledge Base Guide](../chapter-03/README.md) - Complete KB documentation
 - [Vector Collections](../chapter-03/vector-collections.md) - How collections work


@@ -0,0 +1,65 @@
+# USE WEBSITE Keyword
+**Syntax**
+```
+USE WEBSITE "https://example.com"
+```
+**Parameters**
+- `"url"` A valid HTTP or HTTPS URL pointing to a website that should be made available in the conversation context.
+**Description**
+`USE WEBSITE` operates in two distinct modes:
+1. **Preprocessing Mode** (Script Compilation): When found in a BASIC script during compilation, it registers the website for background crawling. The crawler service will fetch, extract, and index the website's content into a vector database collection. This ensures the website content is ready before any conversation starts.
+2. **Runtime Mode** (Conversation Execution): During a conversation, `USE WEBSITE` associates an already-crawled website collection with the current session, making it available for queries via `FIND` or `LLM` calls. This behaves similarly to `USE KB` - it's a session-scoped association.
+If a website hasn't been registered during preprocessing, the runtime execution will fail with an appropriate error message.
+**Example**
+```basic
+' In script preprocessing, this registers the website for crawling
+USE WEBSITE "https://docs.example.com"
+' During conversation, this makes the crawled content available
+USE WEBSITE "https://docs.example.com"
+FIND "deployment procedures"
+TALK "I found information about deployment procedures in the documentation."
+```
+**Preprocessing Behavior**
+When the script is compiled:
+- The URL is validated
+- The website is registered in the `website_crawls` table
+- The crawler service picks it up and indexes the content
+- Status can be: pending (0), success (1), processing (2), or error (3)
+**Runtime Behavior**
+When executed in a conversation:
+- Checks if the website has been crawled
+- Associates the website collection with the current session
+- Makes the content searchable via `FIND` and available to `LLM`
+**With LLM Integration**
+```basic
+USE WEBSITE "https://company.com/policies"
+question = HEAR "What would you like to know about our policies?"
+FIND question
+answer = LLM "Based on the search results, provide a clear answer"
+TALK answer
+```
+**Related Keywords**
+- [CLEAR WEBSITES](./keyword-clear-websites.md) - Remove all website associations from session
+- [USE KB](./keyword-use-kb.md) - Similar functionality for knowledge base files
+- [FIND](./keyword-find.md) - Search within loaded websites and KBs
+- [LLM](./keyword-llm.md) - Process search results with AI

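Because `USE WEBSITE` creates a session-scoped association, a dialog can swap website sources mid-conversation. A hedged sketch combining it with the `CLEAR WEBSITES` keyword referenced above (the URL is illustrative):

```basic
' Swap which crawled sites this session can search
CLEAR WEBSITES                          ' drop all website associations
USE WEBSITE "https://docs.example.com"  ' attach an already-crawled site
FIND "release notes"                    ' search is now scoped to that site
```

If the site was never registered during preprocessing, the runtime `USE WEBSITE` call fails rather than triggering an ad-hoc crawl.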

@@ -33,7 +33,7 @@ The source code for each keyword lives in `src/basic/keywords/`. Only the keywor
 - [USE KB](./keyword-use-kb.md) - Load knowledge base
 - [CLEAR KB](./keyword-clear-kb.md) - Unload knowledge base
-- [ADD WEBSITE](./keyword-add-website.md) - Index website to KB
+- [USE WEBSITE](./keyword-use-website.md) - Associate website with conversation
 - [FIND](./keyword-find.md) - Search in KB
 ## Tools & Automation
## Tools & Automation ## Tools & Automation


@@ -5,13 +5,13 @@ This table maps major features of GeneralBots to the chapters and keywords that
 |---------|------------|------------------|
 | Start server & basic chat | 01 (Run and Talk) | `TALK`, `HEAR` |
 | Package system overview | 02 (About Packages) | |
-| Knowledge-base management | 03 (gbkb Reference) | `USE KB`, `SET KB`, `ADD WEBSITE` |
+| Knowledge-base management | 03 (gbkb Reference) | `USE KB`, `SET KB`, `USE WEBSITE` |
 | UI theming | 04 (gbtheme Reference) | (CSS/HTML assets) |
-| BASIC dialog scripting | 05 (gbdialog Reference) | All BASIC keywords (`TALK`, `HEAR`, `LLM`, `FORMAT`, `USE KB`, `SET KB`, `ADD WEBSITE`, …) |
+| BASIC dialog scripting | 05 (gbdialog Reference) | All BASIC keywords (`TALK`, `HEAR`, `LLM`, `FORMAT`, `USE KB`, `SET KB`, `USE WEBSITE`, …) |
 | Custom Rust extensions | 06 (gbapp Reference) | `USE TOOL`, custom Rust code |
 | Bot configuration | 07 (gbot Reference) | `config.csv` fields |
 | Built-in tooling | 08 (Tooling) | All keywords listed in the table |
-| Semantic search & Qdrant | 03 (gbkb Reference) | `ADD WEBSITE`, vector search |
+| Semantic search & Qdrant | 03 (gbkb Reference) | `USE WEBSITE`, vector search |
 | Email & external APIs | 08 (Tooling) | `CALL`, `CALL_ASYNC` |
 | Scheduling & events | 08 (Tooling) | `SET SCHEDULE`, `ON` |
 | Testing & CI | 10 (Contributing) | |


@@ -46,11 +46,13 @@ Transform static documents into searchable knowledge:
 ```basic
 ' Convert SharePoint documents to searchable KB
 USE KB "company_docs"
-ADD WEBSITE "https://sharepoint.company.com/docs"
+USE WEBSITE "https://sharepoint.company.com/docs"
 ' Now accessible via natural language
-answer = HEAR "What's our vacation policy?"
-' System automatically searches KB and responds
+question = HEAR "What would you like to know?"
+FIND question
+answer = LLM "Based on the search results, provide a helpful answer"
+TALK answer
 ```
 ## Migration Phases


@@ -0,0 +1,7 @@
+-- Drop session_website_associations table and related indexes
+DROP TABLE IF EXISTS session_website_associations;
+
+-- Drop website_crawls table and related objects
+DROP TRIGGER IF EXISTS website_crawls_updated_at_trigger ON website_crawls;
+DROP FUNCTION IF EXISTS update_website_crawls_updated_at();
+DROP TABLE IF EXISTS website_crawls;


@@ -0,0 +1,86 @@
+-- Create website_crawls table for tracking crawled websites
+CREATE TABLE IF NOT EXISTS website_crawls (
+    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
+    bot_id UUID NOT NULL,
+    url TEXT NOT NULL,
+    last_crawled TIMESTAMPTZ,
+    next_crawl TIMESTAMPTZ,
+    expires_policy VARCHAR(20) NOT NULL DEFAULT '1d',
+    max_depth INTEGER DEFAULT 3,
+    max_pages INTEGER DEFAULT 100,
+    crawl_status SMALLINT DEFAULT 0, -- 0=pending, 1=success, 2=processing, 3=error
+    pages_crawled INTEGER DEFAULT 0,
+    error_message TEXT,
+    created_at TIMESTAMPTZ DEFAULT NOW(),
+    updated_at TIMESTAMPTZ DEFAULT NOW(),
+
+    -- Ensure unique URL per bot
+    CONSTRAINT unique_bot_url UNIQUE (bot_id, url),
+
+    -- Foreign key to bots table
+    CONSTRAINT fk_website_crawls_bot
+        FOREIGN KEY (bot_id)
+        REFERENCES bots(id)
+        ON DELETE CASCADE
+);
+
+-- Create indexes for efficient queries
+CREATE INDEX IF NOT EXISTS idx_website_crawls_bot_id ON website_crawls(bot_id);
+CREATE INDEX IF NOT EXISTS idx_website_crawls_next_crawl ON website_crawls(next_crawl);
+CREATE INDEX IF NOT EXISTS idx_website_crawls_url ON website_crawls(url);
+CREATE INDEX IF NOT EXISTS idx_website_crawls_status ON website_crawls(crawl_status);
+
+-- Create trigger to update updated_at timestamp
+CREATE OR REPLACE FUNCTION update_website_crawls_updated_at()
+RETURNS TRIGGER AS $$
+BEGIN
+    NEW.updated_at = NOW();
+    RETURN NEW;
+END;
+$$ LANGUAGE plpgsql;
+
+CREATE TRIGGER website_crawls_updated_at_trigger
+    BEFORE UPDATE ON website_crawls
+    FOR EACH ROW
+    EXECUTE FUNCTION update_website_crawls_updated_at();
+
+-- Create session_website_associations table for tracking websites added to sessions
+-- Similar to session_kb_associations but for websites
+CREATE TABLE IF NOT EXISTS session_website_associations (
+    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
+    session_id UUID NOT NULL,
+    bot_id UUID NOT NULL,
+    website_url TEXT NOT NULL,
+    collection_name TEXT NOT NULL,
+    is_active BOOLEAN DEFAULT true,
+    added_at TIMESTAMPTZ DEFAULT NOW(),
+    added_by_tool VARCHAR(255),
+
+    -- Ensure unique website per session
+    CONSTRAINT unique_session_website UNIQUE (session_id, website_url),
+
+    -- Foreign key to sessions table
+    CONSTRAINT fk_session_website_session
+        FOREIGN KEY (session_id)
+        REFERENCES sessions(id)
+        ON DELETE CASCADE,
+
+    -- Foreign key to bots table
+    CONSTRAINT fk_session_website_bot
+        FOREIGN KEY (bot_id)
+        REFERENCES bots(id)
+        ON DELETE CASCADE
+);
+
+-- Create indexes for efficient queries
+CREATE INDEX IF NOT EXISTS idx_session_website_associations_session_id
+    ON session_website_associations(session_id) WHERE is_active = true;
+CREATE INDEX IF NOT EXISTS idx_session_website_associations_bot_id
+    ON session_website_associations(bot_id);
+CREATE INDEX IF NOT EXISTS idx_session_website_associations_url
+    ON session_website_associations(website_url);
+CREATE INDEX IF NOT EXISTS idx_session_website_associations_collection
+    ON session_website_associations(collection_name);


@ -455,76 +455,293 @@ async fn handle_storage_save(
State(state): State<Arc<AppState>>, State(state): State<Arc<AppState>>,
Json(payload): Json<serde_json::Value>, Json(payload): Json<serde_json::Value>,
) -> Result<Json<serde_json::Value>, StatusCode> { ) -> Result<Json<serde_json::Value>, StatusCode> {
Ok(Json(serde_json::json!({"success": true}))) let key = payload["key"].as_str().ok_or(StatusCode::BAD_REQUEST)?;
let content = payload["content"].as_str().ok_or(StatusCode::BAD_REQUEST)?;
let bucket = payload["bucket"].as_str().unwrap_or("default");
// Use the drive module for S3/MinIO operations
match crate::drive::files::save_to_s3(&state, bucket, key, content.as_bytes()).await {
Ok(_) => Ok(Json(serde_json::json!({
"success": true,
"key": key,
"bucket": bucket,
"size": content.len()
}))),
Err(e) => {
log::error!("Storage save failed: {}", e);
Err(StatusCode::INTERNAL_SERVER_ERROR)
}
}
} }
async fn handle_storage_batch( async fn handle_storage_batch(
State(state): State<Arc<AppState>>, State(state): State<Arc<AppState>>,
Json(payload): Json<serde_json::Value>, Json(payload): Json<serde_json::Value>,
) -> Result<Json<serde_json::Value>, StatusCode> { ) -> Result<Json<serde_json::Value>, StatusCode> {
Ok(Json(serde_json::json!({"success": true}))) let operations = payload["operations"]
.as_array()
.ok_or(StatusCode::BAD_REQUEST)?;
let bucket = payload["bucket"].as_str().unwrap_or("default");
let mut results = Vec::new();
for op in operations {
let key = op["key"].as_str().unwrap_or("");
let content = op["content"].as_str().unwrap_or("");
let operation = op["operation"].as_str().unwrap_or("save");
let result = match operation {
"save" => crate::drive::files::save_to_s3(&state, bucket, key, content.as_bytes())
.await
.map(|_| serde_json::json!({"key": key, "success": true}))
.unwrap_or_else(
|e| serde_json::json!({"key": key, "success": false, "error": e.to_string()}),
),
"delete" => crate::drive::files::delete_from_s3(&state, bucket, key)
.await
.map(|_| serde_json::json!({"key": key, "success": true}))
.unwrap_or_else(
|e| serde_json::json!({"key": key, "success": false, "error": e.to_string()}),
),
_ => serde_json::json!({"key": key, "success": false, "error": "Invalid operation"}),
};
results.push(result);
}
Ok(Json(serde_json::json!({
"success": true,
"results": results,
"total": results.len()
})))
} }
async fn handle_storage_json( async fn handle_storage_json(
State(state): State<Arc<AppState>>, State(state): State<Arc<AppState>>,
Json(payload): Json<serde_json::Value>, Json(payload): Json<serde_json::Value>,
) -> Result<Json<serde_json::Value>, StatusCode> { ) -> Result<Json<serde_json::Value>, StatusCode> {
Ok(Json(serde_json::json!({"success": true}))) let key = payload["key"].as_str().ok_or(StatusCode::BAD_REQUEST)?;
let data = &payload["data"];
let bucket = payload["bucket"].as_str().unwrap_or("default");
let json_content = serde_json::to_vec_pretty(data).map_err(|_| StatusCode::BAD_REQUEST)?;
match crate::drive::files::save_to_s3(&state, bucket, key, &json_content).await {
Ok(_) => Ok(Json(serde_json::json!({
"success": true,
"key": key,
"bucket": bucket,
"size": json_content.len()
}))),
Err(e) => {
log::error!("JSON storage failed: {}", e);
Err(StatusCode::INTERNAL_SERVER_ERROR)
}
}
} }
async fn handle_storage_delete( async fn handle_storage_delete(
State(state): State<Arc<AppState>>, State(state): State<Arc<AppState>>,
Query(params): Query<std::collections::HashMap<String, String>>,
) -> Result<Json<serde_json::Value>, StatusCode> { ) -> Result<Json<serde_json::Value>, StatusCode> {
Ok(Json(serde_json::json!({"success": true}))) let key = params.get("key").ok_or(StatusCode::BAD_REQUEST)?;
let bucket = params
.get("bucket")
.map(|s| s.as_str())
.unwrap_or("default");
match crate::drive::files::delete_from_s3(&state, bucket, key).await {
Ok(_) => Ok(Json(serde_json::json!({
"success": true,
"key": key,
"bucket": bucket
}))),
Err(e) => {
log::error!("Storage delete failed: {}", e);
Err(StatusCode::INTERNAL_SERVER_ERROR)
}
}
} }
async fn handle_storage_quota_check( async fn handle_storage_quota_check(
State(state): State<Arc<AppState>>, State(state): State<Arc<AppState>>,
Query(params): Query<std::collections::HashMap<String, String>>,
) -> Result<Json<serde_json::Value>, StatusCode> { ) -> Result<Json<serde_json::Value>, StatusCode> {
Ok(Json( let bucket = params
serde_json::json!({"total": 1000000000, "used": 500000000, "available": 500000000}), .get("bucket")
)) .map(|s| s.as_str())
.unwrap_or("default");
match crate::drive::files::get_bucket_stats(&state, bucket).await {
Ok(stats) => {
let total = 10_737_418_240i64; // 10GB default quota
let used = stats.total_size as i64;
let available = (total - used).max(0);
Ok(Json(serde_json::json!({
"total": total,
"used": used,
"available": available,
"file_count": stats.object_count,
"bucket": bucket
})))
}
Err(_) => {
// Return default quota if stats unavailable
Ok(Json(serde_json::json!({
"total": 10737418240,
"used": 0,
"available": 10737418240,
"file_count": 0,
"bucket": bucket
})))
}
}
} }
async fn handle_storage_cleanup( async fn handle_storage_cleanup(
State(state): State<Arc<AppState>>, State(state): State<Arc<AppState>>,
Json(payload): Json<serde_json::Value>,
) -> Result<Json<serde_json::Value>, StatusCode> { ) -> Result<Json<serde_json::Value>, StatusCode> {
Ok(Json( let bucket = payload["bucket"].as_str().unwrap_or("default");
serde_json::json!({"success": true, "freed_bytes": 1024000}), let older_than_days = payload["older_than_days"].as_u64().unwrap_or(30);
))
let cutoff_date = chrono::Utc::now() - chrono::Duration::days(older_than_days as i64);
match crate::drive::files::cleanup_old_files(&state, bucket, cutoff_date).await {
Ok((deleted_count, freed_bytes)) => Ok(Json(serde_json::json!({
"success": true,
"deleted_files": deleted_count,
"freed_bytes": freed_bytes,
"bucket": bucket
}))),
Err(e) => {
log::error!("Storage cleanup failed: {}", e);
Ok(Json(serde_json::json!({
"success": false,
"error": e.to_string()
})))
}
}
} }
async fn handle_storage_backup_create( async fn handle_storage_backup_create(
State(state): State<Arc<AppState>>, State(state): State<Arc<AppState>>,
Json(payload): Json<serde_json::Value>, Json(payload): Json<serde_json::Value>,
) -> Result<Json<serde_json::Value>, StatusCode> { ) -> Result<Json<serde_json::Value>, StatusCode> {
Ok(Json( let bucket = payload["bucket"].as_str().unwrap_or("default");
serde_json::json!({"success": true, "backup_id": "backup-123"}), let backup_name = payload["name"].as_str().unwrap_or("backup");
))
let backup_id = format!("backup-{}-{}", backup_name, chrono::Utc::now().timestamp());
let archive_bucket = format!("{}-backups", bucket);
match crate::drive::files::create_bucket_backup(&state, bucket, &archive_bucket, &backup_id)
.await
{
Ok(file_count) => Ok(Json(serde_json::json!({
"success": true,
"backup_id": backup_id,
"files_backed_up": file_count,
"backup_bucket": archive_bucket
}))),
Err(e) => {
log::error!("Backup creation failed: {}", e);
Err(StatusCode::INTERNAL_SERVER_ERROR)
}
}
} }
async fn handle_storage_backup_restore( async fn handle_storage_backup_restore(
State(state): State<Arc<AppState>>, State(state): State<Arc<AppState>>,
Json(payload): Json<serde_json::Value>, Json(payload): Json<serde_json::Value>,
) -> Result<Json<serde_json::Value>, StatusCode> { ) -> Result<Json<serde_json::Value>, StatusCode> {
Ok(Json(serde_json::json!({"success": true}))) let backup_id = payload["backup_id"]
.as_str()
.ok_or(StatusCode::BAD_REQUEST)?;
let target_bucket = payload["target_bucket"].as_str().unwrap_or("default");
let source_bucket = payload["source_bucket"]
.as_str()
.unwrap_or(&format!("{}-backups", target_bucket));
match crate::drive::files::restore_bucket_backup(
&state,
&source_bucket,
target_bucket,
backup_id,
)
.await
{
Ok(file_count) => Ok(Json(serde_json::json!({
"success": true,
"backup_id": backup_id,
"files_restored": file_count,
"target_bucket": target_bucket
}))),
Err(e) => {
log::error!("Backup restore failed: {}", e);
Err(StatusCode::INTERNAL_SERVER_ERROR)
}
}
} }
async fn handle_storage_archive(
    State(state): State<Arc<AppState>>,
    Json(payload): Json<serde_json::Value>,
) -> Result<Json<serde_json::Value>, StatusCode> {
    let bucket = payload["bucket"].as_str().unwrap_or("default");
    let prefix = payload["prefix"].as_str().unwrap_or("");
    let archive_name = payload["name"].as_str().unwrap_or("archive");

    let archive_id = format!(
        "archive-{}-{}",
        archive_name,
        chrono::Utc::now().timestamp()
    );
    let archive_key = format!("archives/{}.tar.gz", archive_id);

    match crate::drive::files::create_archive(&state, bucket, prefix, &archive_key).await {
        Ok(archive_size) => Ok(Json(serde_json::json!({
            "success": true,
            "archive_id": archive_id,
            "archive_key": archive_key,
            "archive_size": archive_size,
            "bucket": bucket
        }))),
        Err(e) => {
            log::error!("Archive creation failed: {}", e);
            Err(StatusCode::INTERNAL_SERVER_ERROR)
        }
    }
}
async fn handle_storage_metrics(
    State(state): State<Arc<AppState>>,
    Query(params): Query<std::collections::HashMap<String, String>>,
) -> Result<Json<serde_json::Value>, StatusCode> {
    let bucket = params
        .get("bucket")
        .map(|s| s.as_str())
        .unwrap_or("default");

    match crate::drive::files::get_bucket_metrics(&state, bucket).await {
        Ok(metrics) => Ok(Json(serde_json::json!({
            "total_files": metrics.object_count,
            "total_size_bytes": metrics.total_size,
            "avg_file_size": if metrics.object_count > 0 {
                metrics.total_size / metrics.object_count as u64
            } else {
                0
            },
            "bucket": bucket,
            "last_modified": metrics.last_modified
        }))),
        Err(e) => {
            log::error!("Failed to get storage metrics: {}", e);
            Ok(Json(serde_json::json!({
                "total_files": 0,
                "total_size_bytes": 0,
                "error": e.to_string()
            })))
        }
    }
}
async fn handle_ai_analyze_text(

View file

@ -346,7 +346,7 @@ impl BasicCompiler {
        .replace("CLEAR SUGGESTIONS", "CLEAR_SUGGESTIONS")
        .replace("ADD SUGGESTION", "ADD_SUGGESTION")
        .replace("USE KB", "USE_KB")
        .replace("USE WEBSITE", "USE_WEBSITE")
        .replace("GET BOT MEMORY", "GET_BOT_MEMORY")
        .replace("SET BOT MEMORY", "SET_BOT_MEMORY")
        .replace("CREATE DRAFT", "CREATE_DRAFT");
@ -371,6 +371,32 @@ impl BasicCompiler {
            }
            continue;
        }
if normalized.starts_with("USE_WEBSITE") {
let parts: Vec<&str> = normalized.split('"').collect();
if parts.len() >= 2 {
let url = parts[1];
let mut conn = self
.state
.conn
.get()
.map_err(|e| format!("Failed to get database connection: {}", e))?;
if let Err(e) =
crate::basic::keywords::use_website::execute_use_website_preprocessing(
&mut conn, url, bot_id,
)
{
log::error!("Failed to register USE_WEBSITE during preprocessing: {}", e);
} else {
log::info!(
"Registered website {} for crawling during preprocessing",
url
);
}
} else {
log::warn!("Malformed USE_WEBSITE line ignored: {}", normalized);
}
continue;
}
        if normalized.starts_with("PARAM ") || normalized.starts_with("DESCRIPTION ") {
            continue;
        }

View file

@ -1,74 +0,0 @@
use crate::shared::models::UserSession;
use crate::shared::state::AppState;
use log::{error, trace};
use rhai::{Dynamic, Engine};
use std::sync::Arc;
pub fn add_website_keyword(state: Arc<AppState>, user: UserSession, engine: &mut Engine) {
let state_clone = Arc::clone(&state);
let user_clone = user.clone();
engine
.register_custom_syntax(&["ADD_WEBSITE", "$expr$"], false, move |context, inputs| {
let url = context.eval_expression_tree(&inputs[0])?;
let url_str = url.to_string().trim_matches('"').to_string();
trace!(
"ADD_WEBSITE command executed: {} for user: {}",
url_str,
user_clone.user_id
);
let is_valid = url_str.starts_with("http://") || url_str.starts_with("https://");
if !is_valid {
return Err(Box::new(rhai::EvalAltResult::ErrorRuntime(
"Invalid URL format. Must start with http:// or https://".into(),
rhai::Position::NONE,
)));
}
let state_for_task = Arc::clone(&state_clone);
let user_for_task = user_clone.clone();
let url_for_task = url_str.clone();
let (tx, rx) = std::sync::mpsc::channel();
std::thread::spawn(move || {
let rt = tokio::runtime::Builder::new_multi_thread()
.worker_threads(2)
.enable_all()
.build();
let send_err = if let Ok(rt) = rt {
let result = rt.block_on(async move {
crawl_and_index_website(&state_for_task, &user_for_task, &url_for_task)
.await
});
tx.send(result).err()
} else {
tx.send(Err("Failed to build tokio runtime".to_string()))
.err()
};
if send_err.is_some() {
error!("Failed to send result from thread");
}
});
match rx.recv_timeout(std::time::Duration::from_secs(120)) {
Ok(Ok(message)) => Ok(Dynamic::from(message)),
Ok(Err(e)) => Err(Box::new(rhai::EvalAltResult::ErrorRuntime(
e.into(),
rhai::Position::NONE,
))),
Err(std::sync::mpsc::RecvTimeoutError::Timeout) => {
Err(Box::new(rhai::EvalAltResult::ErrorRuntime(
"ADD_WEBSITE timed out".into(),
rhai::Position::NONE,
)))
}
Err(e) => Err(Box::new(rhai::EvalAltResult::ErrorRuntime(
format!("ADD_WEBSITE failed: {}", e).into(),
rhai::Position::NONE,
))),
}
})
.unwrap();
}
async fn crawl_and_index_website(
_state: &AppState,
_user: &UserSession,
_url: &str,
) -> Result<String, String> {
Err("Web automation functionality has been removed from this build".to_string())
}

View file

@ -1,62 +1,113 @@
use crate::shared::models::UserSession;
use crate::shared::state::AppState;
// Timelike is required by the with_hour/with_minute calls below.
use chrono::{DateTime, Duration, Timelike, Utc};
use diesel::prelude::*;
use log::{error, info, trace};
use rhai::{Dynamic, Engine};
use serde_json::json;
use std::sync::Arc;
use uuid::Uuid;

// Calendar types - would be from crate::calendar when feature is enabled
#[derive(Debug)]
pub struct CalendarEngine {
    db: crate::shared::utils::DbPool,
}
#[derive(Debug)]
pub struct CalendarEvent {
pub id: uuid::Uuid,
pub title: String,
pub description: Option<String>,
pub start_time: DateTime<Utc>,
pub end_time: DateTime<Utc>,
pub location: Option<String>,
pub organizer: String,
pub attendees: Vec<String>,
pub reminder_minutes: Option<i32>,
pub recurrence_rule: Option<RecurrenceRule>,
pub status: EventStatus,
pub created_at: DateTime<Utc>,
pub updated_at: DateTime<Utc>,
}
#[derive(Debug)]
pub enum EventStatus {
Confirmed,
Tentative,
Cancelled,
}
#[derive(Debug)]
pub struct RecurrenceRule {
pub frequency: String,
pub interval: i32,
pub count: Option<i32>,
pub until: Option<DateTime<Utc>>,
pub by_day: Option<Vec<String>>,
}
impl CalendarEngine {
pub fn new(db: crate::shared::utils::DbPool) -> Self {
Self { db }
}
pub async fn create_event(
&self,
event: CalendarEvent,
) -> Result<CalendarEvent, Box<dyn std::error::Error>> {
Ok(event)
}
pub async fn check_conflicts(
&self,
_start: DateTime<Utc>,
_end: DateTime<Utc>,
_user: &str,
) -> Result<Vec<CalendarEvent>, Box<dyn std::error::Error>> {
Ok(vec![])
}
pub async fn get_events_range(
&self,
_start: DateTime<Utc>,
_end: DateTime<Utc>,
) -> Result<Vec<CalendarEvent>, Box<dyn std::error::Error>> {
Ok(vec![])
}
}
/// Register BOOK keyword in BASIC for calendar appointments
pub fn book_keyword(state: Arc<AppState>, user: UserSession, engine: &mut Engine) {
    let state_clone = Arc::clone(&state);
    let user_clone = user.clone();

    engine
        .register_custom_syntax(
            &[
                "BOOK", "$expr$", ",", "$expr$", ",", "$expr$", ",", "$expr$", ",", "$expr$",
            ],
            false,
            move |context, inputs| {
                let title = context.eval_expression_tree(&inputs[0])?.to_string();
                let description = context.eval_expression_tree(&inputs[1])?.to_string();
                let start_time_str = context.eval_expression_tree(&inputs[2])?.to_string();
                let duration_minutes = context
                    .eval_expression_tree(&inputs[3])?
                    .as_int()
                    .unwrap_or(30) as i64;
                let location = context.eval_expression_tree(&inputs[4])?.to_string();

                trace!(
                    "BOOK: title={}, start={}, duration={} min for user={}",
                    title,
                    start_time_str,
                    duration_minutes,
                    user_clone.user_id
                );

                let state_for_task = Arc::clone(&state_clone);
                let user_for_task = user_clone.clone();

                let (tx, rx) = std::sync::mpsc::channel();
                std::thread::spawn(move || {
@ -67,12 +118,14 @@ pub fn book_keyword(state: Arc<AppState>, user: UserSession, engine: &mut Engine
                    let send_err = if let Ok(rt) = rt {
                        let result = rt.block_on(async move {
                            execute_book(
                                &state_for_task,
                                &user_for_task,
                                &title,
                                &description,
                                &start_time_str,
                                duration_minutes,
                                &location,
                            )
                            .await
                        });
@ -88,19 +141,13 @@ pub fn book_keyword(state: Arc<AppState>, user: UserSession, engine: &mut Engine
                });

                match rx.recv_timeout(std::time::Duration::from_secs(10)) {
                    Ok(Ok(event_id)) => Ok(Dynamic::from(event_id)),
                    Ok(Err(e)) => Err(Box::new(rhai::EvalAltResult::ErrorRuntime(
                        format!("BOOK failed: {}", e).into(),
                        rhai::Position::NONE,
                    ))),
                    Err(_) => Err(Box::new(rhai::EvalAltResult::ErrorRuntime(
                        "BOOK timed out".into(),
                        rhai::Position::NONE,
                    ))),
                }
@ -108,39 +155,34 @@ pub fn book_keyword(state: Arc<AppState>, user: UserSession, engine: &mut Engine
        )
        .unwrap();
    // Register BOOK MEETING for more complex meetings
    let state_clone2 = Arc::clone(&state);
    let user_clone2 = user.clone();

    engine
        .register_custom_syntax(
            &["BOOK_MEETING", "$expr$", ",", "$expr$"],
            false,
            move |context, inputs| {
                let meeting_details = context.eval_expression_tree(&inputs[0])?;
                let attendees_input = context.eval_expression_tree(&inputs[1])?;
                let mut attendees = Vec::new();

                if attendees_input.is_array() {
                    let arr = attendees_input.cast::<rhai::Array>();
                    for item in arr.iter() {
                        attendees.push(item.to_string());
                    }
                }

                trace!(
                    "BOOK_MEETING with {} attendees for user={}",
                    attendees.len(),
                    user_clone2.user_id
                );

                let state_for_task = Arc::clone(&state_clone2);
                let user_for_task = user_clone2.clone();

                let (tx, rx) = std::sync::mpsc::channel();
                std::thread::spawn(move || {
@ -151,12 +193,11 @@ pub fn book_keyword(state: Arc<AppState>, user: UserSession, engine: &mut Engine
                    let send_err = if let Ok(rt) = rt {
                        let result = rt.block_on(async move {
                            execute_book_meeting(
                                &state_for_task,
                                &user_for_task,
                                meeting_details.to_string(),
                                attendees,
                            )
                            .await
                        });
@ -167,18 +208,86 @@ pub fn book_keyword(state: Arc<AppState>, user: UserSession, engine: &mut Engine
                    };
                    if send_err.is_some() {
                        error!("Failed to send BOOK_MEETING result from thread");
                    }
                });

                match rx.recv_timeout(std::time::Duration::from_secs(10)) {
                    Ok(Ok(event_id)) => Ok(Dynamic::from(event_id)),
                    Ok(Err(e)) => Err(Box::new(rhai::EvalAltResult::ErrorRuntime(
                        format!("BOOK_MEETING failed: {}", e).into(),
                        rhai::Position::NONE,
                    ))),
                    Err(_) => Err(Box::new(rhai::EvalAltResult::ErrorRuntime(
                        "BOOK_MEETING timed out".into(),
rhai::Position::NONE,
))),
}
},
)
.unwrap();
// Register CHECK_AVAILABILITY keyword
let state_clone3 = Arc::clone(&state);
let user_clone3 = user.clone();
engine
.register_custom_syntax(
&["CHECK_AVAILABILITY", "$expr$", ",", "$expr$"],
false,
move |context, inputs| {
let date_str = context.eval_expression_tree(&inputs[0])?.to_string();
let duration_minutes = context
.eval_expression_tree(&inputs[1])?
.as_int()
.unwrap_or(30) as i64;
trace!(
"CHECK_AVAILABILITY for {} on {} for user={}",
duration_minutes,
date_str,
user_clone3.user_id
);
let state_for_task = Arc::clone(&state_clone3);
let user_for_task = user_clone3.clone();
let (tx, rx) = std::sync::mpsc::channel();
std::thread::spawn(move || {
let rt = tokio::runtime::Builder::new_multi_thread()
.worker_threads(2)
.enable_all()
.build();
let send_err = if let Ok(rt) = rt {
let result = rt.block_on(async move {
check_availability(
&state_for_task,
&user_for_task,
&date_str,
duration_minutes,
)
.await
});
tx.send(result).err()
} else {
tx.send(Err("Failed to build tokio runtime".to_string()))
.err()
};
if send_err.is_some() {
error!("Failed to send CHECK_AVAILABILITY result from thread");
}
});
match rx.recv_timeout(std::time::Duration::from_secs(5)) {
Ok(Ok(slots)) => Ok(Dynamic::from(slots)),
Ok(Err(e)) => Err(Box::new(rhai::EvalAltResult::ErrorRuntime(
format!("CHECK_AVAILABILITY failed: {}", e).into(),
rhai::Position::NONE,
))),
Err(_) => Err(Box::new(rhai::EvalAltResult::ErrorRuntime(
"CHECK_AVAILABILITY timed out".into(),
                        rhai::Position::NONE,
                    ))),
                }
@ -187,247 +296,396 @@ pub fn book_keyword(state: Arc<AppState>, user: UserSession, engine: &mut Engine
        .unwrap();
}
async fn execute_book(
    state: &AppState,
    user: &UserSession,
    title: &str,
    description: &str,
    start_time_str: &str,
    duration_minutes: i64,
    location: &str,
) -> Result<String, String> {
    // Parse start time
    let start_time = parse_time_string(start_time_str)?;
    let end_time = start_time + Duration::minutes(duration_minutes);
    // Get or create calendar engine
    let calendar_engine = get_calendar_engine(state).await?;

    // Check for conflicts
    let conflicts = calendar_engine
        .check_conflicts(start_time, end_time, &user.user_id.to_string())
        .await
        .map_err(|e| format!("Failed to check conflicts: {}", e))?;

    if !conflicts.is_empty() {
        return Err(format!(
            "Time slot conflicts with existing appointment: {}",
            conflicts[0].title
        ));
    }
// Create calendar event
let event = CalendarEvent {
id: Uuid::new_v4(),
title: title.to_string(),
description: Some(description.to_string()),
start_time,
end_time,
location: if location.is_empty() {
None
} else {
Some(location.to_string())
},
organizer: user.user_id.to_string(),
attendees: vec![user.user_id.to_string()],
reminder_minutes: Some(15), // Default 15-minute reminder
recurrence_rule: None,
status: EventStatus::Confirmed,
created_at: Utc::now(),
updated_at: Utc::now(),
};
// Create the event
let created_event = calendar_engine
.create_event(event)
.await
.map_err(|e| format!("Failed to create appointment: {}", e))?;
// Log the booking
log_booking(state, user, &created_event.id.to_string(), title).await?;
info!(
"Appointment booked: {} at {} for user {}",
title, start_time, user.user_id
);
    Ok(format!(
        "Appointment '{}' booked for {} (ID: {})",
        title,
        start_time.format("%Y-%m-%d %H:%M"),
        created_event.id
    ))
}
async fn execute_book_meeting(
    state: &AppState,
    user: &UserSession,
    meeting_json: String,
    attendees: Vec<String>,
) -> Result<String, String> {
    // Parse meeting details from JSON
    let meeting_data: serde_json::Value = serde_json::from_str(&meeting_json)
        .map_err(|e| format!("Invalid meeting details: {}", e))?;

    let title = meeting_data["title"]
        .as_str()
        .ok_or("Missing meeting title")?;
    let start_time_str = meeting_data["start_time"]
        .as_str()
        .ok_or("Missing start time")?;
    let duration_minutes = meeting_data["duration"].as_i64().unwrap_or(60);
    let description = meeting_data["description"].as_str().unwrap_or("");
    let location = meeting_data["location"].as_str().unwrap_or("");
    let recurring = meeting_data["recurring"].as_bool().unwrap_or(false);

    let start_time = parse_time_string(start_time_str)?;
    let end_time = start_time + Duration::minutes(duration_minutes);

    // Get or create calendar engine
    let calendar_engine = get_calendar_engine(state).await?;

    // Check conflicts for all attendees
    for attendee in &attendees {
        let conflicts = calendar_engine
            .check_conflicts(start_time, end_time, attendee)
            .await
            .map_err(|e| format!("Failed to check conflicts: {}", e))?;

        if !conflicts.is_empty() {
            return Err(format!("Attendee {} has a conflict at this time", attendee));
        }
    }
// Create recurrence rule if needed
let recurrence_rule = if recurring {
Some(RecurrenceRule {
frequency: "WEEKLY".to_string(),
interval: 1,
count: Some(10), // Default to 10 occurrences
until: None,
by_day: None,
})
} else {
None
};
// Create calendar event
let event = CalendarEvent {
id: Uuid::new_v4(),
title: title.to_string(),
description: Some(description.to_string()),
start_time,
end_time,
location: if location.is_empty() {
None
} else {
Some(location.to_string())
},
organizer: user.user_id.to_string(),
attendees: attendees.clone(),
reminder_minutes: Some(30), // 30-minute reminder for meetings
recurrence_rule,
status: EventStatus::Confirmed,
created_at: Utc::now(),
updated_at: Utc::now(),
};
// Create the meeting
let created_event = calendar_engine
.create_event(event)
.await
.map_err(|e| format!("Failed to create meeting: {}", e))?;
// Send invites to attendees (would integrate with email system)
for attendee in &attendees {
send_meeting_invite(state, &created_event, attendee).await?;
}
info!(
"Meeting booked: {} at {} with {} attendees",
title,
start_time,
attendees.len()
);
Ok(format!(
"Meeting '{}' scheduled for {} with {} attendees (ID: {})",
title,
start_time.format("%Y-%m-%d %H:%M"),
attendees.len(),
created_event.id
))
} }
async fn check_availability(
    state: &AppState,
    user: &UserSession,
    date_str: &str,
    duration_minutes: i64,
) -> Result<String, String> {
let date = parse_date_string(date_str)?;
let calendar_engine = get_calendar_engine(state).await?;
// Define business hours (9 AM to 5 PM)
let business_start = date.with_hour(9).unwrap().with_minute(0).unwrap();
let business_end = date.with_hour(17).unwrap().with_minute(0).unwrap();
// Get all events for the day
let events = calendar_engine
.get_events_range(business_start, business_end)
.await
.map_err(|e| format!("Failed to get events: {}", e))?;
// Find available slots
let mut available_slots = Vec::new();
let mut current_time = business_start;
let slot_duration = Duration::minutes(duration_minutes);
for event in &events {
// Check if there's a gap before this event
if current_time + slot_duration <= event.start_time {
available_slots.push(format!(
"{} - {}",
current_time.format("%H:%M"),
(current_time + slot_duration).format("%H:%M")
));
}
current_time = event.end_time;
}
// Check if there's time after the last event
if current_time + slot_duration <= business_end {
available_slots.push(format!(
"{} - {}",
current_time.format("%H:%M"),
(current_time + slot_duration).format("%H:%M")
));
}
if available_slots.is_empty() {
Ok("No available slots on this date".to_string())
} else {
Ok(format!(
"Available slots on {}: {}",
date.format("%Y-%m-%d"),
available_slots.join(", ")
))
}
}
fn parse_time_string(time_str: &str) -> Result<DateTime<Utc>, String> {
// Try different date formats
let formats = vec![
"%Y-%m-%d %H:%M",
"%Y-%m-%d %H:%M:%S",
"%Y/%m/%d %H:%M",
"%d/%m/%Y %H:%M",
"%Y-%m-%dT%H:%M:%S",
];
for format in formats {
        if let Ok(dt) = chrono::NaiveDateTime::parse_from_str(time_str, format) {
            // NaiveDateTime::and_utc replaces the deprecated DateTime::from_utc.
            return Ok(dt.and_utc());
        }
}
// Try parsing relative times like "tomorrow at 3pm"
if time_str.contains("tomorrow") {
let tomorrow = Utc::now() + Duration::days(1);
if let Some(hour) = extract_hour_from_string(time_str) {
return Ok(tomorrow
.with_hour(hour)
.unwrap()
.with_minute(0)
.unwrap()
.with_second(0)
.unwrap());
}
}
// Try parsing relative times like "in 2 hours"
if time_str.starts_with("in ") {
if let Ok(hours) = time_str
.trim_start_matches("in ")
.trim_end_matches(" hours")
.trim_end_matches(" hour")
.parse::<i64>()
{
return Ok(Utc::now() + Duration::hours(hours));
}
}
Err(format!("Could not parse time: {}", time_str))
}
fn parse_date_string(date_str: &str) -> Result<DateTime<Utc>, String> {
// Handle special cases
if date_str == "today" {
return Ok(Utc::now());
} else if date_str == "tomorrow" {
return Ok(Utc::now() + Duration::days(1));
}
// Try standard date formats
let formats = vec!["%Y-%m-%d", "%Y/%m/%d", "%d/%m/%Y"];
for format in formats {
if let Ok(dt) = chrono::NaiveDate::parse_from_str(date_str, format) {
            return Ok(dt.and_hms_opt(0, 0, 0).unwrap().and_utc());
}
}
Err(format!("Could not parse date: {}", date_str))
}
fn extract_hour_from_string(s: &str) -> Option<u32> {
    // Extract the hour from strings like "3pm", "3 PM", "tomorrow at 3pm"
    let s = s.to_lowercase();
    if let Some(idx) = s.find("pm") {
        // Take the last whitespace-separated token before "pm" so phrases
        // like "tomorrow at 3pm" parse, not just a bare "3pm".
        if let Some(hour_str) = s[..idx].split_whitespace().last() {
            if let Ok(hour) = hour_str.replace(':', "").parse::<u32>() {
                return Some(if hour < 12 { hour + 12 } else { hour });
            }
        }
    } else if let Some(idx) = s.find("am") {
        if let Some(hour_str) = s[..idx].split_whitespace().last() {
            if let Ok(hour) = hour_str.replace(':', "").parse::<u32>() {
                return Some(if hour == 12 { 0 } else { hour });
            }
        }
    }
    None
}
async fn log_booking(
state: &AppState,
user: &UserSession,
    event_id: &str,
title: &str,
) -> Result<(), String> {
let mut conn = state.conn.get().map_err(|e| format!("DB error: {}", e))?;
diesel::sql_query(
"INSERT INTO booking_logs (id, user_id, bot_id, event_id, event_title, booked_at)
VALUES (gen_random_uuid(), $1, $2, $3, $4, NOW())",
)
.bind::<diesel::sql_types::Uuid, _>(&user.user_id)
.bind::<diesel::sql_types::Uuid, _>(&user.bot_id)
.bind::<diesel::sql_types::Text, _>(event_id)
.bind::<diesel::sql_types::Text, _>(title)
.execute(&mut *conn)
.map_err(|e| format!("Failed to log booking: {}", e))?;
Ok(())
}
async fn get_calendar_engine(state: &AppState) -> Result<Arc<CalendarEngine>, String> {
// Get or create calendar engine from app state
// This would normally be initialized at startup
let calendar_engine = Arc::new(CalendarEngine::new(state.conn.clone()));
Ok(calendar_engine)
}
async fn send_meeting_invite(
state: &AppState,
event: &CalendarEvent,
    attendee: &str,
) -> Result<(), String> {
    // This would integrate with the email system to send calendar invites
    info!(
        "Would send meeting invite for '{}' to {}",
        event.title, attendee
    );
    Ok(())
}
#[cfg(test)]
mod tests {
    use super::*;

    #[test]
    fn test_parse_time_string() {
        let result = parse_time_string("2024-01-15 14:30");
        assert!(result.is_ok());

        let result = parse_time_string("tomorrow at 3pm");
        assert!(result.is_ok());

        let result = parse_time_string("in 2 hours");
        assert!(result.is_ok());
    }

    #[test]
    fn test_parse_date_string() {
        let result = parse_date_string("today");
        assert!(result.is_ok());

        let result = parse_date_string("2024-01-15");
        assert!(result.is_ok());

        let result = parse_date_string("tomorrow");
        assert!(result.is_ok());
    }

    #[test]
    fn test_extract_hour() {
        assert_eq!(extract_hour_from_string("3pm"), Some(15));
        assert_eq!(extract_hour_from_string("3 PM"), Some(15));
        assert_eq!(extract_hour_from_string("10am"), Some(10));
        assert_eq!(extract_hour_from_string("12am"), Some(0));
        assert_eq!(extract_hour_from_string("12pm"), Some(12));
    }
}

View file

@ -1,6 +1,5 @@
pub mod add_member;
pub mod add_suggestion;
pub mod book;
pub mod bot_memory;
pub mod clear_kb;
@ -28,5 +27,6 @@ pub mod set_user;
pub mod universal_messaging; pub mod universal_messaging;
pub mod use_kb; pub mod use_kb;
pub mod use_tool; pub mod use_tool;
pub mod use_website;
pub mod wait; pub mod wait;
pub mod weather; pub mod weather;

View file

@@ -0,0 +1,407 @@
use crate::shared::models::UserSession;
use crate::shared::state::AppState;
use diesel::prelude::*;
use log::{error, info, trace};
use rhai::{Dynamic, Engine};
use std::sync::Arc;
use uuid::Uuid;
/// Register USE_WEBSITE keyword in BASIC
/// Runtime mode: Associates a website collection with the current session (like USE KB)
/// Preprocessing mode: Registers website for crawling (handled in compiler/mod.rs)
pub fn use_website_keyword(state: Arc<AppState>, user: UserSession, engine: &mut Engine) {
let state_clone = Arc::clone(&state);
let user_clone = user.clone();
engine
.register_custom_syntax(&["USE_WEBSITE", "$expr$"], false, move |context, inputs| {
let url = context.eval_expression_tree(&inputs[0])?;
let url_str = url.to_string().trim_matches('"').to_string();
trace!(
"USE_WEBSITE command executed: {} for session: {}",
url_str,
user_clone.id
);
// Validate URL
let is_valid = url_str.starts_with("http://") || url_str.starts_with("https://");
if !is_valid {
return Err(Box::new(rhai::EvalAltResult::ErrorRuntime(
"Invalid URL format. Must start with http:// or https://".into(),
rhai::Position::NONE,
)));
}
let state_for_task = Arc::clone(&state_clone);
let user_for_task = user_clone.clone();
let url_for_task = url_str.clone();
let (tx, rx) = std::sync::mpsc::channel();
std::thread::spawn(move || {
let rt = tokio::runtime::Builder::new_multi_thread()
.worker_threads(2)
.enable_all()
.build();
let send_err = if let Ok(rt) = rt {
let result = rt.block_on(async move {
associate_website_with_session(
&state_for_task,
&user_for_task,
&url_for_task,
)
.await
});
tx.send(result).err()
} else {
tx.send(Err("Failed to build tokio runtime".to_string()))
.err()
};
if send_err.is_some() {
error!("Failed to send result from thread");
}
});
match rx.recv_timeout(std::time::Duration::from_secs(10)) {
Ok(Ok(message)) => Ok(Dynamic::from(message)),
Ok(Err(e)) => Err(Box::new(rhai::EvalAltResult::ErrorRuntime(
e.into(),
rhai::Position::NONE,
))),
Err(std::sync::mpsc::RecvTimeoutError::Timeout) => {
Err(Box::new(rhai::EvalAltResult::ErrorRuntime(
"USE_WEBSITE timed out".into(),
rhai::Position::NONE,
)))
}
Err(e) => Err(Box::new(rhai::EvalAltResult::ErrorRuntime(
format!("USE_WEBSITE failed: {}", e).into(),
rhai::Position::NONE,
))),
}
})
.unwrap();
}
/// Associate website with session (runtime behavior - like USE KB)
/// This only associates an already-crawled website with the session
async fn associate_website_with_session(
state: &AppState,
user: &UserSession,
url: &str,
) -> Result<String, String> {
info!("Associating website {} with session {}", url, user.id);
let mut conn = state.conn.get().map_err(|e| format!("DB error: {}", e))?;
// Create collection name for this website
let collection_name = format!("website_{}", sanitize_url_for_collection(url));
// Check if website has been crawled for this bot
let website_status = check_website_crawl_status(&mut conn, &user.bot_id, url)?;
match website_status {
WebsiteCrawlStatus::NotRegistered => {
return Err(format!(
"Website {} has not been registered for crawling. It should be added to the script for preprocessing.",
url
));
}
WebsiteCrawlStatus::Pending => {
// Website is registered but not yet crawled - allow association but warn
info!("Website {} is pending crawl, associating anyway", url);
}
WebsiteCrawlStatus::Crawled => {
// Website is fully crawled and ready
info!("Website {} is already crawled and ready", url);
}
WebsiteCrawlStatus::Failed => {
return Err(format!(
"Website {} crawling failed. Please check the logs.",
url
));
}
}
// Associate website collection with session (like session_kb_associations)
add_website_to_session(&mut conn, &user.id, &user.bot_id, url, &collection_name)?;
Ok(format!(
"Website {} is now available in this conversation.",
url
))
}
/// Website crawl status enum
enum WebsiteCrawlStatus {
NotRegistered,
Pending,
Crawled,
Failed,
}
/// Check website crawl status for this bot
fn check_website_crawl_status(
conn: &mut PgConnection,
bot_id: &Uuid,
url: &str,
) -> Result<WebsiteCrawlStatus, String> {
#[derive(QueryableByName)]
struct CrawlStatus {
#[diesel(sql_type = diesel::sql_types::Nullable<diesel::sql_types::Integer>)]
crawl_status: Option<i32>,
}
let query =
diesel::sql_query("SELECT crawl_status FROM website_crawls WHERE bot_id = $1 AND url = $2")
.bind::<diesel::sql_types::Uuid, _>(bot_id)
.bind::<diesel::sql_types::Text, _>(url);
let result: Result<CrawlStatus, _> = query.get_result(conn);
match result {
Ok(status) => match status.crawl_status {
Some(0) => Ok(WebsiteCrawlStatus::Pending),
Some(1) => Ok(WebsiteCrawlStatus::Crawled),
Some(2) => Ok(WebsiteCrawlStatus::Failed),
_ => Ok(WebsiteCrawlStatus::NotRegistered),
},
Err(_) => Ok(WebsiteCrawlStatus::NotRegistered),
}
}
/// Register website for background crawling (called from preprocessing)
/// This is called during script compilation, not runtime
pub fn register_website_for_crawling(
conn: &mut PgConnection,
bot_id: &Uuid,
url: &str,
) -> Result<(), String> {
// Get website configuration with defaults
let expires_policy = "1d"; // Default, would read from bot config
let query = diesel::sql_query(
"INSERT INTO website_crawls (id, bot_id, url, expires_policy, crawl_status, next_crawl)
VALUES (gen_random_uuid(), $1, $2, $3, 0, NOW())
ON CONFLICT (bot_id, url) DO UPDATE SET next_crawl =
CASE
WHEN website_crawls.crawl_status = 2 THEN NOW() -- Failed, retry now
ELSE website_crawls.next_crawl -- Keep existing schedule
END",
)
.bind::<diesel::sql_types::Uuid, _>(bot_id)
.bind::<diesel::sql_types::Text, _>(url)
.bind::<diesel::sql_types::Text, _>(expires_policy);
query
.execute(conn)
.map_err(|e| format!("Failed to register website for crawling: {}", e))?;
info!("Website {} registered for crawling for bot {}", url, bot_id);
Ok(())
}
/// Execute USE_WEBSITE during preprocessing (called from compiler)
/// This registers the website for crawling but doesn't associate it with any session
pub fn execute_use_website_preprocessing(
conn: &mut PgConnection,
url: &str,
bot_id: Uuid,
) -> Result<serde_json::Value, Box<dyn std::error::Error>> {
trace!("Preprocessing USE_WEBSITE: {}, bot_id: {:?}", url, bot_id);
// Validate URL
if !url.starts_with("http://") && !url.starts_with("https://") {
return Err(format!(
"Invalid URL format: {}. Must start with http:// or https://",
url
)
.into());
}
// Register for crawling
register_website_for_crawling(conn, &bot_id, url)?;
Ok(serde_json::json!({
"command": "use_website",
"url": url,
"bot_id": bot_id.to_string(),
"status": "registered_for_crawling"
}))
}
/// Add website to session (like USE KB)
fn add_website_to_session(
conn: &mut PgConnection,
session_id: &Uuid,
bot_id: &Uuid,
url: &str,
collection_name: &str,
) -> Result<(), String> {
// Add to session_website_associations table (similar to session_kb_associations)
let assoc_id = Uuid::new_v4();
diesel::sql_query(
"INSERT INTO session_website_associations
(id, session_id, bot_id, website_url, collection_name, is_active, added_at)
VALUES ($1, $2, $3, $4, $5, true, NOW())
ON CONFLICT (session_id, website_url)
DO UPDATE SET is_active = true, added_at = NOW()",
)
.bind::<diesel::sql_types::Uuid, _>(assoc_id)
.bind::<diesel::sql_types::Uuid, _>(session_id)
.bind::<diesel::sql_types::Uuid, _>(bot_id)
.bind::<diesel::sql_types::Text, _>(url)
.bind::<diesel::sql_types::Text, _>(collection_name)
.execute(conn)
.map_err(|e| format!("Failed to add website to session: {}", e))?;
info!(
"✅ Added website '{}' to session {} (collection: {})",
url, session_id, collection_name
);
Ok(())
}
/// Clear websites from session (companion to USE_WEBSITE)
pub fn clear_websites_keyword(state: Arc<AppState>, user: UserSession, engine: &mut Engine) {
let state_clone = Arc::clone(&state);
let user_clone = user.clone();
engine
.register_custom_syntax(&["CLEAR_WEBSITES"], true, move |_context, _inputs| {
info!(
"CLEAR_WEBSITES keyword executed for session: {}",
user_clone.id
);
let session_id = user_clone.id;
let conn = state_clone.conn.clone();
let result = std::thread::spawn(move || clear_all_websites(conn, session_id)).join();
match result {
Ok(Ok(count)) => {
info!(
"Successfully cleared {} websites from session {}",
count, user_clone.id
);
Ok(Dynamic::from(format!(
"{} website(s) removed from conversation",
count
)))
}
Ok(Err(e)) => {
error!("Failed to clear websites: {}", e);
Err(format!("CLEAR_WEBSITES failed: {}", e).into())
}
Err(e) => {
error!("Thread panic in CLEAR_WEBSITES: {:?}", e);
Err("CLEAR_WEBSITES failed: thread panic".into())
}
}
})
.unwrap();
}
/// Clear all websites from session
fn clear_all_websites(
conn_pool: crate::shared::utils::DbPool,
session_id: Uuid,
) -> Result<usize, String> {
let mut conn = conn_pool
.get()
.map_err(|e| format!("Failed to get DB connection: {}", e))?;
let rows_affected = diesel::sql_query(
"UPDATE session_website_associations
SET is_active = false
WHERE session_id = $1 AND is_active = true",
)
.bind::<diesel::sql_types::Uuid, _>(session_id)
.execute(&mut conn)
.map_err(|e| format!("Failed to clear websites: {}", e))?;
Ok(rows_affected)
}
/// Get active websites for a session
pub fn get_active_websites_for_session(
conn_pool: &crate::shared::utils::DbPool,
session_id: Uuid,
) -> Result<Vec<(String, String)>, String> {
let mut conn = conn_pool
.get()
.map_err(|e| format!("Failed to get DB connection: {}", e))?;
#[derive(QueryableByName, Debug)]
struct ActiveWebsiteResult {
#[diesel(sql_type = diesel::sql_types::Text)]
website_url: String,
#[diesel(sql_type = diesel::sql_types::Text)]
collection_name: String,
}
let results: Vec<ActiveWebsiteResult> = diesel::sql_query(
"SELECT website_url, collection_name
FROM session_website_associations
WHERE session_id = $1 AND is_active = true
ORDER BY added_at DESC",
)
.bind::<diesel::sql_types::Uuid, _>(session_id)
.load(&mut conn)
.map_err(|e| format!("Failed to get active websites: {}", e))?;
Ok(results
.into_iter()
.map(|r| (r.website_url, r.collection_name))
.collect())
}
/// Sanitize URL for use as collection name
fn sanitize_url_for_collection(url: &str) -> String {
url.replace("http://", "")
.replace("https://", "")
.replace('/', "_")
.replace(':', "_")
.replace('.', "_")
.chars()
.filter(|c| c.is_alphanumeric() || *c == '_' || *c == '-')
.collect::<String>()
.to_lowercase()
}
#[cfg(test)]
mod tests {
use super::*;
#[test]
fn test_url_sanitization() {
assert_eq!(
sanitize_url_for_collection("https://docs.example.com/path"),
"docs_example_com_path"
);
assert_eq!(
sanitize_url_for_collection("http://test.site:8080"),
"test_site_8080"
);
}
#[test]
fn test_use_website_syntax() {
let mut engine = Engine::new();
// Test USE_WEBSITE with argument
assert!(engine
.register_custom_syntax(&["USE_WEBSITE", "$expr$"], true, |_, _| Ok(Dynamic::UNIT))
.is_ok());
// Test CLEAR_WEBSITES without argument
assert!(engine
.register_custom_syntax(&["CLEAR_WEBSITES"], true, |_, _| Ok(Dynamic::UNIT))
.is_ok());
}
}
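Taken on its own, the collection-name sanitizer above is easy to exercise outside the server. This is a verbatim, standalone copy of `sanitize_url_for_collection` from the new file, run against the same inputs as `test_url_sanitization`:

```rust
/// Standalone copy of `sanitize_url_for_collection` from use_website.rs above.
fn sanitize_url_for_collection(url: &str) -> String {
    url.replace("http://", "")
        .replace("https://", "")
        .replace('/', "_")
        .replace(':', "_")
        .replace('.', "_")
        .chars()
        .filter(|c| c.is_alphanumeric() || *c == '_' || *c == '-')
        .collect::<String>()
        .to_lowercase()
}

fn main() {
    // Same cases as test_url_sanitization.
    assert_eq!(
        sanitize_url_for_collection("https://docs.example.com/path"),
        "docs_example_com_path"
    );
    assert_eq!(
        sanitize_url_for_collection("http://test.site:8080"),
        "test_site_8080"
    );
}
```

Note that the scheme strip runs before the character filter, so `https://A/b` and `http://A/b` map to the same collection name.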

View file

@@ -1,182 +1,451 @@
 use crate::shared::models::UserSession;
 use crate::shared::state::AppState;
-use log::{error, trace};
-use reqwest::Client;
+use log::{error, info, trace};
 use rhai::{Dynamic, Engine};
 use serde::{Deserialize, Serialize};
 use std::sync::Arc;
-use std::time::Duration;
 
-#[derive(Debug, Clone, Deserialize, Serialize)]
+#[derive(Debug, Serialize, Deserialize)]
 pub struct WeatherData {
     pub location: String,
-    pub temperature: String,
-    pub condition: String,
-    pub forecast: String,
+    pub temperature: f32,
+    pub temperature_unit: String,
+    pub description: String,
+    pub humidity: u32,
+    pub wind_speed: f32,
+    pub wind_direction: String,
+    pub feels_like: f32,
+    pub pressure: u32,
+    pub visibility: f32,
+    pub uv_index: Option<f32>,
+    pub forecast: Vec<ForecastDay>,
 }
 
-/// Fetches weather data from 7Timer! API (free, no auth)
-pub async fn fetch_weather(location: &str) -> Result<WeatherData, Box<dyn std::error::Error>> {
-    // Parse location to get coordinates (simplified - in production use geocoding)
-    let (lat, lon) = parse_location(location)?;
-
-    // 7Timer! API endpoint
-    let url = format!(
-        "http://www.7timer.info/bin/api.pl?lon={}&lat={}&product=civil&output=json",
-        lon, lat
-    );
-    trace!("Fetching weather from: {}", url);
-
-    let client = Client::builder().timeout(Duration::from_secs(10)).build()?;
-    let response = client.get(&url).send().await?;
-
-    if !response.status().is_success() {
-        return Err(format!("Weather API returned status: {}", response.status()).into());
-    }
-
-    let json: serde_json::Value = response.json().await?;
-
-    // Parse 7Timer response
-    let dataseries = json["dataseries"]
-        .as_array()
-        .ok_or("Invalid weather response")?;
-    if dataseries.is_empty() {
-        return Err("No weather data available".into());
-    }
-
-    let current = &dataseries[0];
-    let temp = current["temp2m"].as_i64().unwrap_or(0);
-    let weather_code = current["weather"].as_str().unwrap_or("unknown");
-    let condition = match weather_code {
-        "clear" => "Clear sky",
-        "pcloudy" => "Partly cloudy",
-        "cloudy" => "Cloudy",
-        "rain" => "Rain",
-        "lightrain" => "Light rain",
-        "humid" => "Humid",
-        "snow" => "Snow",
-        "lightsnow" => "Light snow",
-        _ => "Unknown",
-    };
-
-    // Build forecast string
-    let mut forecast_parts = Vec::new();
-    for (i, item) in dataseries.iter().take(3).enumerate() {
-        if let (Some(temp), Some(weather)) = (item["temp2m"].as_i64(), item["weather"].as_str()) {
-            forecast_parts.push(format!("{}h: {}°C, {}", i * 3, temp, weather));
-        }
-    }
-    let forecast = forecast_parts.join("; ");
-
-    Ok(WeatherData {
-        location: location.to_string(),
-        temperature: format!("{}°C", temp),
-        condition: condition.to_string(),
-        forecast,
-    })
-}
-
-/// Simple location parser (lat,lon or city name)
-pub fn parse_location(location: &str) -> Result<(f64, f64), Box<dyn std::error::Error>> {
-    // Check if it's coordinates (lat,lon)
-    if let Some((lat_str, lon_str)) = location.split_once(',') {
-        let lat = lat_str.trim().parse::<f64>()?;
-        let lon = lon_str.trim().parse::<f64>()?;
-        return Ok((lat, lon));
-    }
-
-    // Default city coordinates (extend as needed)
-    let coords = match location.to_lowercase().as_str() {
-        "london" => (51.5074, -0.1278),
-        "paris" => (48.8566, 2.3522),
-        "new york" | "newyork" => (40.7128, -74.0060),
-        "tokyo" => (35.6762, 139.6503),
-        "sydney" => (-33.8688, 151.2093),
-        "são paulo" | "sao paulo" => (-23.5505, -46.6333),
-        "rio de janeiro" | "rio" => (-22.9068, -43.1729),
-        "brasília" | "brasilia" => (-15.8267, -47.9218),
-        "buenos aires" => (-34.6037, -58.3816),
-        "berlin" => (52.5200, 13.4050),
-        "madrid" => (40.4168, -3.7038),
-        "rome" => (41.9028, 12.4964),
-        "moscow" => (55.7558, 37.6173),
-        "beijing" => (39.9042, 116.4074),
-        "mumbai" => (19.0760, 72.8777),
-        "dubai" => (25.2048, 55.2708),
-        "los angeles" | "la" => (34.0522, -118.2437),
-        "chicago" => (41.8781, -87.6298),
-        "toronto" => (43.6532, -79.3832),
-        "mexico city" => (19.4326, -99.1332),
-        _ => {
-            return Err(format!(
-                "Unknown location: {}. Use 'lat,lon' format or known city",
-                location
-            )
-            .into())
-        }
-    };
-    Ok(coords)
-}
-
-/// Register WEATHER keyword in Rhai engine
-pub fn weather_keyword(_state: Arc<AppState>, _user_session: UserSession, engine: &mut Engine) {
-    let _ = engine.register_custom_syntax(&["WEATHER", "$expr$"], false, move |context, inputs| {
-        let location = context.eval_expression_tree(&inputs[0])?;
-        let location_str = location.to_string();
-        trace!("WEATHER keyword called for: {}", location_str);
-
-        // Create channel for async result
-        let (tx, rx) = std::sync::mpsc::channel();
-
-        // Spawn blocking task
-        std::thread::spawn(move || {
-            let rt = tokio::runtime::Builder::new_current_thread()
-                .enable_all()
-                .build();
-            let result = if let Ok(rt) = rt {
-                rt.block_on(async {
-                    match fetch_weather(&location_str).await {
-                        Ok(weather) => {
-                            let msg = format!(
-                                "Weather for {}: {} ({}). Forecast: {}",
-                                weather.location,
-                                weather.temperature,
-                                weather.condition,
-                                weather.forecast
-                            );
-                            Ok(msg)
-                        }
-                        Err(e) => {
-                            error!("Weather fetch failed: {}", e);
-                            Err(format!("Could not fetch weather: {}", e))
-                        }
-                    }
-                })
-            } else {
-                Err("Failed to create runtime".to_string())
-            };
-            let _ = tx.send(result);
-        });
-
-        // Wait for result
-        match rx.recv() {
-            Ok(Ok(weather_msg)) => Ok(Dynamic::from(weather_msg)),
-            Ok(Err(e)) => Err(Box::new(rhai::EvalAltResult::ErrorRuntime(
-                e.into(),
-                rhai::Position::NONE,
-            ))),
-            Err(_) => Err(Box::new(rhai::EvalAltResult::ErrorRuntime(
-                "Weather request timeout".into(),
-                rhai::Position::NONE,
-            ))),
-        }
-    });
-}
+#[derive(Debug, Serialize, Deserialize)]
+pub struct ForecastDay {
+    pub date: String,
+    pub temp_high: f32,
+    pub temp_low: f32,
+    pub description: String,
+    pub rain_chance: u32,
+}
+
+/// Register WEATHER keyword in BASIC
+pub fn weather_keyword(state: Arc<AppState>, user: UserSession, engine: &mut Engine) {
+    let state_clone = Arc::clone(&state);
+    let user_clone = user.clone();
+
+    engine
+        .register_custom_syntax(&["WEATHER", "$expr$"], false, move |context, inputs| {
+            let location = context.eval_expression_tree(&inputs[0])?.to_string();
+            trace!(
+                "WEATHER command executed: {} for user: {}",
+                location,
+                user_clone.user_id
+            );
+
+            let state_for_task = Arc::clone(&state_clone);
+            let user_for_task = user_clone.clone();
+            let location_for_task = location.clone();
+            let (tx, rx) = std::sync::mpsc::channel();
+
+            std::thread::spawn(move || {
+                let rt = tokio::runtime::Builder::new_multi_thread()
+                    .worker_threads(2)
+                    .enable_all()
+                    .build();
+
+                let send_err = if let Ok(rt) = rt {
+                    let result = rt.block_on(async move {
+                        get_weather(&state_for_task, &user_for_task, &location_for_task).await
+                    });
+                    tx.send(result).err()
+                } else {
+                    tx.send(Err("Failed to build tokio runtime".to_string()))
+                        .err()
+                };
+
+                if send_err.is_some() {
+                    error!("Failed to send WEATHER result from thread");
+                }
+            });
+
+            match rx.recv_timeout(std::time::Duration::from_secs(10)) {
+                Ok(Ok(weather_info)) => Ok(Dynamic::from(weather_info)),
+                Ok(Err(e)) => Err(Box::new(rhai::EvalAltResult::ErrorRuntime(
+                    format!("WEATHER failed: {}", e).into(),
+                    rhai::Position::NONE,
+                ))),
+                Err(_) => Err(Box::new(rhai::EvalAltResult::ErrorRuntime(
+                    "WEATHER request timed out".into(),
+                    rhai::Position::NONE,
+                ))),
+            }
+        })
+        .unwrap();
+
+    // Register FORECAST keyword for extended forecast
+    let state_clone2 = Arc::clone(&state);
+    let user_clone2 = user.clone();
+
+    engine
+        .register_custom_syntax(
+            &["FORECAST", "$expr$", ",", "$expr$"],
+            false,
+            move |context, inputs| {
+                let location = context.eval_expression_tree(&inputs[0])?.to_string();
+                let days = context
+                    .eval_expression_tree(&inputs[1])?
+                    .as_int()
+                    .unwrap_or(5) as u32;
+
+                trace!(
+                    "FORECAST command executed: {} for {} days, user: {}",
+                    location,
+                    days,
+                    user_clone2.user_id
+                );
+
+                let state_for_task = Arc::clone(&state_clone2);
+                let user_for_task = user_clone2.clone();
+                let location_for_task = location.clone();
+                let (tx, rx) = std::sync::mpsc::channel();
+
+                std::thread::spawn(move || {
+                    let rt = tokio::runtime::Builder::new_multi_thread()
+                        .worker_threads(2)
+                        .enable_all()
+                        .build();
+
+                    let send_err = if let Ok(rt) = rt {
+                        let result = rt.block_on(async move {
+                            get_forecast(&state_for_task, &user_for_task, &location_for_task, days)
+                                .await
+                        });
+                        tx.send(result).err()
+                    } else {
+                        tx.send(Err("Failed to build tokio runtime".to_string()))
+                            .err()
+                    };
+
+                    if send_err.is_some() {
+                        error!("Failed to send FORECAST result from thread");
+                    }
+                });
+
+                match rx.recv_timeout(std::time::Duration::from_secs(10)) {
+                    Ok(Ok(forecast_info)) => Ok(Dynamic::from(forecast_info)),
+                    Ok(Err(e)) => Err(Box::new(rhai::EvalAltResult::ErrorRuntime(
+                        format!("FORECAST failed: {}", e).into(),
+                        rhai::Position::NONE,
+                    ))),
+                    Err(_) => Err(Box::new(rhai::EvalAltResult::ErrorRuntime(
+                        "FORECAST request timed out".into(),
+                        rhai::Position::NONE,
+                    ))),
+                }
+            },
+        )
+        .unwrap();
+}
+
+async fn get_weather(
+    state: &AppState,
+    _user: &UserSession,
+    location: &str,
+) -> Result<String, String> {
+    // Get API key from bot config or environment
+    let api_key = get_weather_api_key(state)?;
+
+    // Try OpenWeatherMap API first
+    match fetch_openweathermap_current(&api_key, location).await {
+        Ok(weather) => {
+            info!("Weather data fetched for {}", location);
+            Ok(format_weather_response(&weather))
+        }
+        Err(e) => {
+            error!("OpenWeatherMap API failed: {}", e);
+            // Try fallback weather service
+            fetch_fallback_weather(location).await
+        }
+    }
+}
+
+async fn get_forecast(
+    state: &AppState,
+    _user: &UserSession,
+    location: &str,
+    days: u32,
+) -> Result<String, String> {
+    let api_key = get_weather_api_key(state)?;
+
+    match fetch_openweathermap_forecast(&api_key, location, days).await {
+        Ok(forecast) => {
+            info!("Forecast data fetched for {} ({} days)", location, days);
+            Ok(format_forecast_response(&forecast))
+        }
+        Err(e) => {
+            error!("Forecast API failed: {}", e);
+            Err(format!("Could not get forecast for {}: {}", location, e))
+        }
+    }
+}
+
+async fn fetch_openweathermap_current(
+    api_key: &str,
+    location: &str,
+) -> Result<WeatherData, String> {
+    let client = reqwest::Client::new();
+    let url = format!(
+        "https://api.openweathermap.org/data/2.5/weather?q={}&appid={}&units=metric",
+        urlencoding::encode(location),
+        api_key
+    );
+
+    let response = client
+        .get(&url)
+        .send()
+        .await
+        .map_err(|e| format!("Request failed: {}", e))?;
+
+    if !response.status().is_success() {
+        return Err(format!("API returned status: {}", response.status()));
+    }
+
+    let data: serde_json::Value = response
+        .json()
+        .await
+        .map_err(|e| format!("Failed to parse response: {}", e))?;
+
+    // Parse OpenWeatherMap response
+    Ok(WeatherData {
+        location: data["name"].as_str().unwrap_or(location).to_string(),
+        temperature: data["main"]["temp"].as_f64().unwrap_or(0.0) as f32,
+        temperature_unit: "°C".to_string(),
+        description: data["weather"][0]["description"]
+            .as_str()
+            .unwrap_or("Unknown")
+            .to_string(),
+        humidity: data["main"]["humidity"].as_u64().unwrap_or(0) as u32,
+        wind_speed: data["wind"]["speed"].as_f64().unwrap_or(0.0) as f32,
+        wind_direction: degrees_to_compass(data["wind"]["deg"].as_f64().unwrap_or(0.0)),
+        feels_like: data["main"]["feels_like"].as_f64().unwrap_or(0.0) as f32,
+        pressure: data["main"]["pressure"].as_u64().unwrap_or(0) as u32,
+        visibility: data["visibility"].as_f64().unwrap_or(0.0) as f32 / 1000.0, // Convert to km
+        uv_index: None, // Would need separate API call for UV index
+        forecast: Vec::new(),
+    })
+}
+
+async fn fetch_openweathermap_forecast(
+    api_key: &str,
+    location: &str,
+    days: u32,
+) -> Result<WeatherData, String> {
+    let client = reqwest::Client::new();
+    let url = format!(
+        "https://api.openweathermap.org/data/2.5/forecast?q={}&appid={}&units=metric&cnt={}",
+        urlencoding::encode(location),
+        api_key,
+        days * 8 // 8 forecasts per day (every 3 hours)
+    );
+
+    let response = client
+        .get(&url)
+        .send()
+        .await
+        .map_err(|e| format!("Request failed: {}", e))?;
+
+    if !response.status().is_success() {
+        return Err(format!("API returned status: {}", response.status()));
+    }
+
+    let data: serde_json::Value = response
+        .json()
+        .await
+        .map_err(|e| format!("Failed to parse response: {}", e))?;
+
+    // Process forecast data
+    let mut forecast_days = Vec::new();
+    let mut daily_data: std::collections::HashMap<String, (f32, f32, String, u32)> =
+        std::collections::HashMap::new();
+
+    if let Some(list) = data["list"].as_array() {
+        for item in list {
+            let dt_txt = item["dt_txt"].as_str().unwrap_or("");
+            let date = dt_txt.split(' ').next().unwrap_or("");
+            let temp = item["main"]["temp"].as_f64().unwrap_or(0.0) as f32;
+            let description = item["weather"][0]["description"]
+                .as_str()
+                .unwrap_or("Unknown")
+                .to_string();
+            let rain_chance = (item["pop"].as_f64().unwrap_or(0.0) * 100.0) as u32;
+
+            let entry = daily_data.entry(date.to_string()).or_insert((
+                temp,
+                temp,
+                description.clone(),
+                rain_chance,
+            ));
+
+            // Update min/max temperatures
+            if temp < entry.0 {
+                entry.0 = temp;
+            }
+            if temp > entry.1 {
+                entry.1 = temp;
+            }
+            // Update rain chance to max for the day
+            if rain_chance > entry.3 {
+                entry.3 = rain_chance;
+            }
+        }
+    }
+
+    // Convert to forecast days
+    for (date, (temp_low, temp_high, description, rain_chance)) in daily_data.iter() {
+        forecast_days.push(ForecastDay {
+            date: date.clone(),
+            temp_high: *temp_high,
+            temp_low: *temp_low,
+            description: description.clone(),
+            rain_chance: *rain_chance,
+        });
+    }
+
+    // Sort by date
+    forecast_days.sort_by(|a, b| a.date.cmp(&b.date));
+
+    Ok(WeatherData {
+        location: data["city"]["name"]
+            .as_str()
+            .unwrap_or(location)
+            .to_string(),
+        temperature: 0.0, // Not relevant for forecast
+        temperature_unit: "°C".to_string(),
+        description: "Forecast".to_string(),
+        humidity: 0,
+        wind_speed: 0.0,
+        wind_direction: String::new(),
+        feels_like: 0.0,
+        pressure: 0,
+        visibility: 0.0,
+        uv_index: None,
+        forecast: forecast_days,
+    })
+}
+
+async fn fetch_fallback_weather(location: &str) -> Result<String, String> {
+    // This could use another weather API like WeatherAPI.com or NOAA
+    // For now, return a simulated response
+    info!("Using fallback weather for {}", location);
+    Ok(format!(
+        "Weather information for {} is temporarily unavailable. Please try again later.",
+        location
+    ))
+}
+
+fn format_weather_response(weather: &WeatherData) -> String {
+    format!(
+        "Current weather in {}:\n\
+        🌡 Temperature: {:.1}{} (feels like {:.1}{})\n\
+        Conditions: {}\n\
+        💧 Humidity: {}%\n\
+        💨 Wind: {:.1} m/s {}\n\
+        🔍 Visibility: {:.1} km\n\
+        📊 Pressure: {} hPa",
+        weather.location,
+        weather.temperature,
+        weather.temperature_unit,
+        weather.feels_like,
+        weather.temperature_unit,
+        weather.description,
+        weather.humidity,
+        weather.wind_speed,
+        weather.wind_direction,
+        weather.visibility,
+        weather.pressure
+    )
+}
+
+fn format_forecast_response(weather: &WeatherData) -> String {
+    let mut response = format!("Weather forecast for {}:\n\n", weather.location);
+
+    for day in &weather.forecast {
+        response.push_str(&format!(
+            "📅 {}\n\
+            🌡 High: {:.1}°C, Low: {:.1}°C\n\
+            {}\n\
+            Rain chance: {}%\n\n",
+            day.date, day.temp_high, day.temp_low, day.description, day.rain_chance
+        ));
+    }
+
+    response
+}
+
+fn degrees_to_compass(degrees: f64) -> String {
+    let directions = [
+        "N", "NNE", "NE", "ENE", "E", "ESE", "SE", "SSE", "S", "SSW", "SW", "WSW", "W", "WNW",
+        "NW", "NNW",
+    ];
+    let index = ((degrees + 11.25) / 22.5) as usize % 16;
+    directions[index].to_string()
+}
+
+fn get_weather_api_key(state: &AppState) -> Result<String, String> {
+    // Try to get from bot config first
+    if let Some(config) = &state.config {
+        if let Some(api_key) = config.bot_config.get_setting("weather-api-key") {
+            if !api_key.is_empty() {
+                return Ok(api_key);
+            }
+        }
+    }
+
+    // Fallback to environment variable
+    std::env::var("OPENWEATHERMAP_API_KEY")
+        .or_else(|_| std::env::var("WEATHER_API_KEY"))
+        .map_err(|_| {
+            "Weather API key not found. Please set 'weather-api-key' in config.csv or WEATHER_API_KEY environment variable".to_string()
+        })
+}
+
+#[cfg(test)]
+mod tests {
+    use super::*;
+
+    #[test]
+    fn test_degrees_to_compass() {
+        assert_eq!(degrees_to_compass(0.0), "N");
+        assert_eq!(degrees_to_compass(45.0), "NE");
+        assert_eq!(degrees_to_compass(90.0), "E");
+        assert_eq!(degrees_to_compass(180.0), "S");
+        assert_eq!(degrees_to_compass(270.0), "W");
+        assert_eq!(degrees_to_compass(315.0), "NW");
+    }
+
+    #[test]
+    fn test_format_weather_response() {
+        let weather = WeatherData {
+            location: "London".to_string(),
+            temperature: 15.0,
+            temperature_unit: "°C".to_string(),
+            description: "Partly cloudy".to_string(),
+            humidity: 65,
+            wind_speed: 3.5,
+            wind_direction: "NE".to_string(),
+            feels_like: 14.0,
+            pressure: 1013,
+            visibility: 10.0,
+            uv_index: Some(3.0),
+            forecast: Vec::new(),
+        };
+
+        let response = format_weather_response(&weather);
+        assert!(response.contains("London"));
+        assert!(response.contains("15.0"));
+        assert!(response.contains("Partly cloudy"));
+    }
+}
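The wind-direction helper in the new weather.rs is pure and dependency-free, so it can be checked in isolation. This is a verbatim, standalone copy of `degrees_to_compass`, exercised with the same values as `test_degrees_to_compass` plus a wrap-around case:

```rust
/// Standalone copy of `degrees_to_compass` from weather.rs above.
/// Offsets by half a sector (11.25°) so each direction owns a 22.5° band,
/// and wraps with `% 16` so 348.75°..360° maps back to "N".
fn degrees_to_compass(degrees: f64) -> String {
    let directions = [
        "N", "NNE", "NE", "ENE", "E", "ESE", "SE", "SSE", "S", "SSW", "SW", "WSW", "W", "WNW",
        "NW", "NNW",
    ];
    let index = ((degrees + 11.25) / 22.5) as usize % 16;
    directions[index].to_string()
}

fn main() {
    assert_eq!(degrees_to_compass(0.0), "N");
    assert_eq!(degrees_to_compass(45.0), "NE");
    assert_eq!(degrees_to_compass(90.0), "E");
    assert_eq!(degrees_to_compass(180.0), "S");
    assert_eq!(degrees_to_compass(270.0), "W");
    assert_eq!(degrees_to_compass(315.0), "NW");
    // Wrap-around: just shy of 360° is north again.
    assert_eq!(degrees_to_compass(355.0), "N");
}
```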

View file

@@ -9,7 +9,6 @@ pub mod compiler;
 pub mod keywords;
 
 use self::keywords::add_member::add_member_keyword;
 use self::keywords::add_suggestion::add_suggestion_keyword;
-use self::keywords::add_website::add_website_keyword;
 use self::keywords::book::book_keyword;
 use self::keywords::bot_memory::{get_bot_memory_keyword, set_bot_memory_keyword};
 use self::keywords::clear_kb::register_clear_kb_keyword;
@@ -29,6 +28,7 @@ use self::keywords::save_from_unstructured::save_from_unstructured_keyword;
 use self::keywords::send_mail::send_mail_keyword;
 use self::keywords::use_kb::register_use_kb_keyword;
 use self::keywords::use_tool::use_tool_keyword;
+use self::keywords::use_website::{clear_websites_keyword, use_website_keyword};
 use self::keywords::llm_keyword::llm_keyword;
 use self::keywords::on::on_keyword;
@@ -72,7 +72,8 @@ impl ScriptService {
         use_tool_keyword(state.clone(), user.clone(), &mut engine);
         clear_tools_keyword(state.clone(), user.clone(), &mut engine);
-        add_website_keyword(state.clone(), user.clone(), &mut engine);
+        use_website_keyword(state.clone(), user.clone(), &mut engine);
+        clear_websites_keyword(state.clone(), user.clone(), &mut engine);
         add_suggestion_keyword(state.clone(), user.clone(), &mut engine);
 
         // Register the 6 new power keywords
View file

@ -13,10 +13,11 @@ use std::sync::Arc;
use crate::shared::utils::DbPool; use crate::shared::utils::DbPool;
use tokio::sync::RwLock; use tokio::sync::RwLock;
use uuid::Uuid; use uuid::Uuid;
use crate::shared::state::AppState;
use diesel::sql_query;
use diesel::sql_types::{Text, Timestamptz, Integer, Jsonb};
// TODO: Replace sqlx queries with Diesel queries #[derive(Debug, Clone, Serialize, Deserialize, QueryableByName)]
#[derive(Debug, Clone, Serialize, Deserialize)]
pub struct CalendarEvent { pub struct CalendarEvent {
pub id: Uuid, pub id: Uuid,
pub title: String, pub title: String,
@ -110,16 +111,18 @@ impl CalendarEngine {
&self, &self,
event: CalendarEvent, event: CalendarEvent,
) -> Result<CalendarEvent, Box<dyn std::error::Error>> { ) -> Result<CalendarEvent, Box<dyn std::error::Error>> {
// TODO: Implement with Diesel let mut conn = self.db.get().map_err(|e| format!("DB connection error: {}", e))?;
/*
let result = sqlx::query!( let attendees_json = serde_json::to_value(&event.attendees)?;
r#" let recurrence_json = event.recurrence_rule.as_ref().map(|r| serde_json::to_value(r).ok()).flatten();
INSERT INTO calendar_events
(id, title, description, start_time, end_time, location, attendees, organizer, diesel::sql_query(
reminder_minutes, recurrence_rule, status, created_at, updated_at) "INSERT INTO calendar_events
VALUES ($1, $2, $3, $4, $5, $6, $7, $8, $9, $10, $11, $12, $13) (id, title, description, start_time, end_time, location, attendees, organizer,
RETURNING * reminder_minutes, recurrence_rule, status, created_at, updated_at)
"#, VALUES ($1, $2, $3, $4, $5, $6, $7, $8, $9, $10, $11, $12, $13)
RETURNING *"
)
event.id, event.id,
event.title, event.title,
event.description, event.description,
@ -185,17 +188,16 @@ impl CalendarEngine {
Ok(serde_json::from_value(serde_json::to_value(result)?)?) Ok(serde_json::from_value(serde_json::to_value(result)?)?)
} }
pub async fn delete_event(&self, _id: Uuid) -> Result<bool, Box<dyn std::error::Error>> { pub async fn delete_event(&self, id: Uuid) -> Result<bool, Box<dyn std::error::Error>> {
// TODO: Implement with Diesel let mut conn = self.db.get().map_err(|e| format!("DB connection error: {}", e))?;
/*
let result = sqlx::query!("DELETE FROM calendar_events WHERE id = $1", id) let rows_affected = diesel::sql_query("DELETE FROM calendar_events WHERE id = $1")
.execute(self.db.as_ref()) .bind::<diesel::sql_types::Uuid, _>(&id)
.await?; .execute(&mut conn)?;
*/
self.refresh_cache().await?; self.refresh_cache().await?;
Ok(false) Ok(rows_affected > 0)
} }
pub async fn get_events_range( pub async fn get_events_range(
@ -203,16 +205,14 @@ impl CalendarEngine {
start: DateTime<Utc>, start: DateTime<Utc>,
end: DateTime<Utc>, end: DateTime<Utc>,
) -> Result<Vec<CalendarEvent>, Box<dyn std::error::Error>> { ) -> Result<Vec<CalendarEvent>, Box<dyn std::error::Error>> {
// TODO: Implement with Diesel let mut conn = self.db.get().map_err(|e| format!("DB connection error: {}", e))?;
/*
let results = sqlx::query_as!( let results = diesel::sql_query(
CalendarEvent, "SELECT * FROM calendar_events
r#" WHERE start_time >= $1 AND end_time <= $2
SELECT * FROM calendar_events ORDER BY start_time ASC"
WHERE start_time >= $1 AND end_time <= $2 )
ORDER BY start_time ASC .bind::<Timestamptz, _>(&start)
"#,
start,
end end
) )
.fetch_all(self.db.as_ref()) .fetch_all(self.db.as_ref())
@@ -226,16 +226,14 @@ impl CalendarEngine {
&self, &self,
user_id: &str, user_id: &str,
) -> Result<Vec<CalendarEvent>, Box<dyn std::error::Error>> { ) -> Result<Vec<CalendarEvent>, Box<dyn std::error::Error>> {
// TODO: Implement with Diesel let mut conn = self.db.get().map_err(|e| format!("DB connection error: {}", e))?;
/*
let results = sqlx::query!( let results = diesel::sql_query(
r#" "SELECT * FROM calendar_events
SELECT * FROM calendar_events WHERE organizer = $1 OR $1::text = ANY(SELECT jsonb_array_elements_text(attendees))
WHERE organizer = $1 OR $1 = ANY(attendees) ORDER BY start_time ASC"
ORDER BY start_time ASC
"#,
user_id
) )
.bind::<Text, _>(&user_id)
.fetch_all(self.db.as_ref()) .fetch_all(self.db.as_ref())
.await?; .await?;
@@ -263,9 +261,9 @@ impl CalendarEngine {
action_items: Vec::new(), action_items: Vec::new(),
}; };
// TODO: Implement with Diesel let mut conn = self.db.get().map_err(|e| format!("DB connection error: {}", e))?;
/*
sqlx::query!( diesel::sql_query(
r#" r#"
INSERT INTO meetings (id, event_id, platform, created_at) INSERT INTO meetings (id, event_id, platform, created_at)
VALUES ($1, $2, $3, $4) VALUES ($1, $2, $3, $4)
@@ -303,9 +301,9 @@ impl CalendarEngine {
sent: false, sent: false,
}; };
// TODO: Implement with Diesel let mut conn = self.db.get().map_err(|e| format!("DB connection error: {}", e))?;
/*
sqlx::query!( diesel::sql_query(
r#" r#"
INSERT INTO calendar_reminders (id, event_id, remind_at, message, channel, sent) INSERT INTO calendar_reminders (id, event_id, remind_at, message, channel, sent)
VALUES ($1, $2, $3, $4, $5, $6) VALUES ($1, $2, $3, $4, $5, $6)
@@ -324,16 +322,14 @@ impl CalendarEngine {
Ok(reminder) Ok(reminder)
} }
pub async fn get_event(&self, _id: Uuid) -> Result<CalendarEvent, Box<dyn std::error::Error>> { pub async fn get_event(&self, id: Uuid) -> Result<CalendarEvent, Box<dyn std::error::Error>> {
// TODO: Implement with Diesel let mut conn = self.db.get().map_err(|e| format!("DB connection error: {}", e))?;
/*
let result = sqlx::query!("SELECT * FROM calendar_events WHERE id = $1", id)
.fetch_one(self.db.as_ref())
.await?;
Ok(serde_json::from_value(serde_json::to_value(result)?)?) let result = diesel::sql_query("SELECT * FROM calendar_events WHERE id = $1")
*/ .bind::<diesel::sql_types::Uuid, _>(&id)
Err("Not implemented".into()) .get_result::<CalendarEvent>(&mut conn)?;
Ok(result)
} }
pub async fn check_conflicts( pub async fn check_conflicts(
@@ -342,16 +338,15 @@ impl CalendarEngine {
end: DateTime<Utc>, end: DateTime<Utc>,
user_id: &str, user_id: &str,
) -> Result<Vec<CalendarEvent>, Box<dyn std::error::Error>> { ) -> Result<Vec<CalendarEvent>, Box<dyn std::error::Error>> {
// TODO: Implement with Diesel let mut conn = self.db.get().map_err(|e| format!("DB connection error: {}", e))?;
/*
let results = sqlx::query!( let results = diesel::sql_query(
r#" "SELECT * FROM calendar_events
SELECT * FROM calendar_events WHERE (organizer = $1 OR $1::text = ANY(SELECT jsonb_array_elements_text(attendees)))
WHERE (organizer = $1 OR $1 = ANY(attendees)) AND NOT (end_time <= $2 OR start_time >= $3)"
AND NOT (end_time <= $2 OR start_time >= $3) )
"#, .bind::<Text, _>(&user_id)
user_id, .bind::<Timestamptz, _>(&start)
start,
end end
) )
.fetch_all(self.db.as_ref()) .fetch_all(self.db.as_ref())
@@ -369,15 +364,7 @@ impl CalendarEngine {
// TODO: Implement with Diesel // TODO: Implement with Diesel
/* /*
let results = sqlx::query!("SELECT * FROM calendar_events ORDER BY start_time ASC") let results = sqlx::query!("SELECT * FROM calendar_events ORDER BY start_time ASC")
.fetch_all(self.db.as_ref()) .load::<CalendarEvent>(&mut conn)?;
.await?;
let events: Vec<CalendarEvent> = results
.into_iter()
.map(|r| serde_json::from_value(serde_json::to_value(r).unwrap()).unwrap())
.collect();
*/
let events: Vec<CalendarEvent> = vec![]; let events: Vec<CalendarEvent> = vec![];
let mut cache = self.cache.write().await; let mut cache = self.cache.write().await;
*cache = events; *cache = events;
@@ -397,8 +384,254 @@ pub struct EventQuery {
pub struct MeetingRequest { pub struct MeetingRequest {
pub event_id: Uuid, pub event_id: Uuid,
pub platform: MeetingPlatform, pub platform: MeetingPlatform,
/// Process due reminders
pub async fn process_reminders(&self) -> Result<Vec<String>, Box<dyn std::error::Error>> {
let now = Utc::now();
let mut conn = self.db.get().map_err(|e| format!("DB connection error: {}", e))?;
// Find events that need reminders sent
let events = diesel::sql_query(
"SELECT * FROM calendar_events
WHERE reminder_minutes IS NOT NULL
AND start_time - INTERVAL '1 minute' * reminder_minutes <= $1
AND start_time > $1
AND reminder_sent = false
ORDER BY start_time ASC"
)
.bind::<Timestamptz, _>(&now)
.load::<CalendarEvent>(&mut conn)?;
let mut notifications = Vec::new();
for event in events {
// Send reminder notification
let message = format!(
"Reminder: {} starting at {}",
event.title,
event.start_time.format("%H:%M")
);
// Mark reminder as sent
diesel::sql_query(
"UPDATE calendar_events SET reminder_sent = true WHERE id = $1"
)
.bind::<diesel::sql_types::Uuid, _>(&event.id)
.execute(&mut conn)?;
notifications.push(message);
}
Ok(notifications)
}
} }
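The reminder query above selects events whose start time has entered the configured reminder window but has not yet passed. A minimal std-only sketch of that window check, using epoch seconds and a hypothetical helper name:

```rust
/// Returns true when a reminder should fire: `now` is inside the window
/// [start - reminder_minutes, start). Mirrors the SQL predicate
/// `start_time - INTERVAL '1 minute' * reminder_minutes <= now AND start_time > now`.
fn reminder_due(start_epoch_secs: i64, reminder_minutes: i64, now_epoch_secs: i64) -> bool {
    let window_open = start_epoch_secs - reminder_minutes * 60;
    now_epoch_secs >= window_open && now_epoch_secs < start_epoch_secs
}

fn main() {
    // Event at t=1000s with a 5-minute reminder: window opens at t=700.
    assert!(!reminder_due(1000, 5, 699)); // too early
    assert!(reminder_due(1000, 5, 700));  // window just opened
    assert!(reminder_due(1000, 5, 999));  // still before start
    assert!(!reminder_due(1000, 5, 1000)); // event already started
    println!("ok");
}
```

The `reminder_sent = false` filter in the query plays the same role as the `sent` flag on `CalendarReminder`: it keeps the minute-ticker from firing the same reminder twice.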
/// CalDAV Server implementation
pub mod caldav {
use super::*;
use axum::{
body::Body,
extract::{Path, State, Query},
http::{Method, StatusCode, header},
response::{Response, IntoResponse},
routing::{get, put, delete, any},
Router,
};
use std::sync::Arc;
pub fn create_caldav_router(calendar_engine: Arc<CalendarEngine>) -> Router {
Router::new()
.route("/.well-known/caldav", get(caldav_redirect))
.route("/caldav/:user/", any(caldav_propfind))
.route("/caldav/:user/calendar/", any(caldav_calendar_handler))
.route("/caldav/:user/calendar/:event_uid.ics",
get(caldav_get_event)
.put(caldav_put_event)
.delete(caldav_delete_event))
.with_state(calendar_engine)
}
async fn caldav_redirect() -> impl IntoResponse {
Response::builder()
.status(StatusCode::MOVED_PERMANENTLY)
.header(header::LOCATION, "/caldav/")
.body(Body::empty())
.unwrap()
}
async fn caldav_propfind(
Path(user): Path<String>,
State(engine): State<Arc<CalendarEngine>>,
) -> impl IntoResponse {
let xml = format!(r#"<?xml version="1.0" encoding="utf-8"?>
<D:multistatus xmlns:D="DAV:" xmlns:C="urn:ietf:params:xml:ns:caldav">
<D:response>
<D:href>/caldav/{}/</D:href>
<D:propstat>
<D:prop>
<D:resourcetype>
<D:collection/>
<C:calendar/>
</D:resourcetype>
<D:displayname>{}'s Calendar</D:displayname>
<C:supported-calendar-component-set>
<C:comp name="VEVENT"/>
</C:supported-calendar-component-set>
</D:prop>
<D:status>HTTP/1.1 200 OK</D:status>
</D:propstat>
</D:response>
</D:multistatus>"#, user, user);
Response::builder()
.status(StatusCode::MULTI_STATUS)
.header(header::CONTENT_TYPE, "application/xml; charset=utf-8")
.body(Body::from(xml))
.unwrap()
}
async fn caldav_calendar_handler(
Path(user): Path<String>,
State(engine): State<Arc<CalendarEngine>>,
method: Method,
) -> impl IntoResponse {
match method {
Method::GET => {
// Return calendar collection
let events = engine.get_user_events(&user).await.unwrap_or_default();
let ics = events_to_icalendar(&events, &user);
Response::builder()
.status(StatusCode::OK)
.header(header::CONTENT_TYPE, "text/calendar; charset=utf-8")
.body(Body::from(ics))
.unwrap()
},
_ => caldav_propfind(Path(user), State(engine)).await.into_response(),
}
}
async fn caldav_get_event(
Path((user, event_uid)): Path<(String, String)>,
State(engine): State<Arc<CalendarEngine>>,
) -> impl IntoResponse {
let event_id = event_uid.trim_end_matches(".ics");
match Uuid::parse_str(event_id) {
Ok(id) => {
match engine.get_event(id).await {
Ok(event) => {
let ics = event_to_icalendar(&event);
Response::builder()
.status(StatusCode::OK)
.header(header::CONTENT_TYPE, "text/calendar; charset=utf-8")
.body(Body::from(ics))
.unwrap()
},
Err(_) => Response::builder()
.status(StatusCode::NOT_FOUND)
.body(Body::empty())
.unwrap(),
}
},
Err(_) => Response::builder()
.status(StatusCode::BAD_REQUEST)
.body(Body::empty())
.unwrap(),
}
}
async fn caldav_put_event(
Path((user, event_uid)): Path<(String, String)>,
State(engine): State<Arc<CalendarEngine>>,
body: String,
) -> impl IntoResponse {
// Parse iCalendar data and create/update event
// This is a simplified implementation
StatusCode::CREATED
}
async fn caldav_delete_event(
Path((user, event_uid)): Path<(String, String)>,
State(engine): State<Arc<CalendarEngine>>,
) -> impl IntoResponse {
let event_id = event_uid.trim_end_matches(".ics");
match Uuid::parse_str(event_id) {
Ok(id) => {
match engine.delete_event(id).await {
Ok(true) => StatusCode::NO_CONTENT,
Ok(false) => StatusCode::NOT_FOUND,
Err(_) => StatusCode::INTERNAL_SERVER_ERROR,
}
},
Err(_) => StatusCode::BAD_REQUEST,
}
}
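Both the GET and DELETE handlers strip a trailing `.ics` before handing the remainder to `Uuid::parse_str`. A std-only sketch of that parsing step, with a cheap shape check standing in for the `uuid` crate (the helper name is hypothetical):

```rust
/// Strips a trailing ".ics" (if present) and checks the 8-4-4-4-12 hex
/// shape of a hyphenated UUID. Std-only stand-in for `Uuid::parse_str`.
fn parse_event_uid(event_uid: &str) -> Option<String> {
    let id = event_uid.trim_end_matches(".ics");
    let lens = [8usize, 4, 4, 4, 12];
    let parts: Vec<&str> = id.split('-').collect();
    if parts.len() == 5
        && parts
            .iter()
            .zip(lens.iter())
            .all(|(p, &l)| p.len() == l && p.chars().all(|c| c.is_ascii_hexdigit()))
    {
        Some(id.to_string())
    } else {
        None
    }
}

fn main() {
    assert_eq!(
        parse_event_uid("550e8400-e29b-41d4-a716-446655440000.ics").as_deref(),
        Some("550e8400-e29b-41d4-a716-446655440000")
    );
    assert!(parse_event_uid("not-a-uuid.ics").is_none());
}
```

A malformed UID maps to `BAD_REQUEST` in the handlers above, while a well-formed UID that matches no row maps to `NOT_FOUND`.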
fn events_to_icalendar(events: &[CalendarEvent], user: &str) -> String {
let mut ics = String::from("BEGIN:VCALENDAR\r\n");
ics.push_str("VERSION:2.0\r\n");
ics.push_str(&format!("PRODID:-//BotServer//Calendar {}//EN\r\n", user));
for event in events {
ics.push_str(&event_to_icalendar(event));
}
ics.push_str("END:VCALENDAR\r\n");
ics
}
fn event_to_icalendar(event: &CalendarEvent) -> String {
let mut vevent = String::from("BEGIN:VEVENT\r\n");
vevent.push_str(&format!("UID:{}\r\n", event.id));
vevent.push_str(&format!("SUMMARY:{}\r\n", event.title));
if let Some(desc) = &event.description {
vevent.push_str(&format!("DESCRIPTION:{}\r\n", desc));
}
if let Some(loc) = &event.location {
vevent.push_str(&format!("LOCATION:{}\r\n", loc));
}
vevent.push_str(&format!("DTSTART:{}\r\n", event.start_time.format("%Y%m%dT%H%M%SZ")));
vevent.push_str(&format!("DTEND:{}\r\n", event.end_time.format("%Y%m%dT%H%M%SZ")));
vevent.push_str(&format!("STATUS:{}\r\n", event.status.to_uppercase()));
for attendee in &event.attendees {
vevent.push_str(&format!("ATTENDEE:mailto:{}\r\n", attendee));
}
vevent.push_str("END:VEVENT\r\n");
vevent
}
}
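The serializer above writes SUMMARY, DESCRIPTION, and LOCATION values verbatim, but RFC 5545 requires TEXT values to escape backslashes, semicolons, commas, and newlines. A std-only escaper that could be applied to those fields before they are emitted (the helper name is an assumption, not part of the module):

```rust
/// Escapes a string for use as an iCalendar TEXT value (RFC 5545 §3.3.11):
/// backslash, semicolon, and comma are backslash-escaped; newlines become "\n".
fn escape_ical_text(input: &str) -> String {
    let mut out = String::with_capacity(input.len());
    for c in input.chars() {
        match c {
            '\\' => out.push_str("\\\\"),
            ';' => out.push_str("\\;"),
            ',' => out.push_str("\\,"),
            '\n' => out.push_str("\\n"),
            '\r' => {} // drop bare CR; CRLF is covered by the '\n' arm
            _ => out.push(c),
        }
    }
    out
}

fn main() {
    assert_eq!(
        escape_ical_text("Lunch, then demo; Q&A"),
        "Lunch\\, then demo\\; Q&A"
    );
    assert_eq!(escape_ical_text("line1\nline2"), "line1\\nline2");
}
```

Without this, an event titled `Standup, room 2` would split into two values in clients that parse TEXT lists strictly.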
/// Reminder job service
pub async fn start_reminder_job(engine: Arc<CalendarEngine>) {
use tokio::time::{interval, Duration};
let mut ticker = interval(Duration::from_secs(60)); // Check every minute
loop {
ticker.tick().await;
match engine.process_reminders().await {
Ok(notifications) => {
for message in notifications {
log::info!("Calendar reminder: {}", message);
// Here you would send actual notifications via email, push, etc.
}
},
Err(e) => {
log::error!("Failed to process calendar reminders: {}", e);
}
}
}
}
async fn create_event_handler( async fn create_event_handler(
State(engine): State<Arc<CalendarEngine>>, State(engine): State<Arc<CalendarEngine>>,
Json(event): Json<CalendarEvent>, Json(event): Json<CalendarEvent>,


@@ -143,6 +143,14 @@ impl BootstrapManager {
error!("Failed to setup Directory: {}", e); error!("Failed to setup Directory: {}", e);
} }
} }
// Auto-configure Email after installation
if component == "email" {
info!("🔧 Auto-configuring Email (Stalwart)...");
if let Err(e) = self.setup_email().await {
error!("Failed to setup Email: {}", e);
}
}
} }
} }
Ok(()) Ok(())
@@ -220,7 +228,7 @@ impl BootstrapManager {
} }
/// Setup Email (Stalwart) with Directory integration /// Setup Email (Stalwart) with Directory integration
async fn setup_email(&self) -> Result<()> { pub async fn setup_email(&self) -> Result<()> {
let config_path = PathBuf::from("./config/email_config.json"); let config_path = PathBuf::from("./config/email_config.json");
let directory_config_path = PathBuf::from("./config/directory_config.json"); let directory_config_path = PathBuf::from("./config/directory_config.json");


@@ -1,29 +1,390 @@
use crate::shared::models::BotResponse;
use async_trait::async_trait; use async_trait::async_trait;
use log::info; use log::{error, info};
use serde::{Deserialize, Serialize};
use std::collections::HashMap;
use crate::core::bot::channels::ChannelAdapter;
use crate::shared::models::BotResponse;
/// Instagram channel adapter for sending messages through Instagram
pub struct InstagramAdapter { pub struct InstagramAdapter {
// TODO: Add Instagram API client configuration access_token: String,
verify_token: String,
page_id: String,
api_version: String,
instagram_account_id: String,
} }
impl InstagramAdapter { impl InstagramAdapter {
pub fn new() -> Self { pub fn new() -> Self {
Self {} // Load from environment variables (would be from config.csv in production)
let access_token = std::env::var("INSTAGRAM_ACCESS_TOKEN").unwrap_or_default();
let verify_token = std::env::var("INSTAGRAM_VERIFY_TOKEN")
.unwrap_or_else(|_| "webhook_verify".to_string());
let page_id = std::env::var("INSTAGRAM_PAGE_ID").unwrap_or_default();
let api_version = "v17.0".to_string();
let instagram_account_id = std::env::var("INSTAGRAM_ACCOUNT_ID").unwrap_or_default();
Self {
access_token,
verify_token,
page_id,
api_version,
instagram_account_id,
}
}
async fn send_instagram_message(
&self,
recipient_id: &str,
message: &str,
) -> Result<String, Box<dyn std::error::Error + Send + Sync>> {
let client = reqwest::Client::new();
let url = format!(
"https://graph.facebook.com/{}/{}/messages",
self.api_version, self.page_id
);
let payload = serde_json::json!({
"recipient": {
"id": recipient_id
},
"message": {
"text": message
},
"messaging_type": "RESPONSE"
});
let response = client
.post(&url)
.header("Content-Type", "application/json")
.query(&[("access_token", &self.access_token)])
.json(&payload)
.send()
.await?;
if response.status().is_success() {
let result: serde_json::Value = response.json().await?;
Ok(result["message_id"].as_str().unwrap_or("").to_string())
} else {
let error_text = response.text().await?;
Err(format!("Instagram API error: {}", error_text).into())
}
}
pub async fn send_media_message(
&self,
recipient_id: &str,
media_url: &str,
media_type: &str,
) -> Result<String, Box<dyn std::error::Error + Send + Sync>> {
let client = reqwest::Client::new();
let url = format!(
"https://graph.facebook.com/{}/{}/messages",
self.api_version, self.page_id
);
let attachment_type = match media_type {
"image" => "image",
"video" => "video",
"audio" => "audio",
_ => "file",
};
let payload = serde_json::json!({
"recipient": {
"id": recipient_id
},
"message": {
"attachment": {
"type": attachment_type,
"payload": {
"url": media_url,
"is_reusable": true
}
}
}
});
let response = client
.post(&url)
.query(&[("access_token", &self.access_token)])
.json(&payload)
.send()
.await?;
if response.status().is_success() {
let result: serde_json::Value = response.json().await?;
Ok(result["message_id"].as_str().unwrap_or("").to_string())
} else {
let error_text = response.text().await?;
Err(format!("Instagram API error: {}", error_text).into())
}
}
pub async fn send_story_reply(
&self,
recipient_id: &str,
message: &str,
) -> Result<String, Box<dyn std::error::Error + Send + Sync>> {
// Story replies use the same messaging API
self.send_instagram_message(recipient_id, message).await
}
pub async fn get_user_profile(
&self,
user_id: &str,
) -> Result<InstagramProfile, Box<dyn std::error::Error + Send + Sync>> {
let client = reqwest::Client::new();
let url = format!(
"https://graph.facebook.com/{}/{}",
self.api_version, user_id
);
let response = client
.get(&url)
.query(&[
("access_token", &self.access_token),
("fields", &"name,profile_pic".to_string()),
])
.send()
.await?;
if response.status().is_success() {
let profile: InstagramProfile = response.json().await?;
Ok(profile)
} else {
Err("Failed to get Instagram profile".into())
}
}
pub fn verify_webhook(&self, token: &str) -> bool {
token == self.verify_token
}
pub async fn handle_webhook_verification(
&self,
mode: &str,
token: &str,
challenge: &str,
) -> Option<String> {
if mode == "subscribe" && self.verify_webhook(token) {
Some(challenge.to_string())
} else {
None
}
} }
} }
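`handle_webhook_verification` implements Meta's webhook handshake: the `hub.challenge` value is echoed back only when `hub.mode` is `subscribe` and `hub.verify_token` matches the configured token. The decision logic, sketched std-only with a hypothetical free function:

```rust
/// Meta webhook verification handshake: echo the challenge only for a
/// subscribe request carrying the expected verify token; reject otherwise.
fn verify_challenge<'a>(
    mode: &str,
    token: &str,
    expected: &str,
    challenge: &'a str,
) -> Option<&'a str> {
    if mode == "subscribe" && token == expected {
        Some(challenge)
    } else {
        None
    }
}

fn main() {
    assert_eq!(verify_challenge("subscribe", "s3cret", "s3cret", "42"), Some("42"));
    assert_eq!(verify_challenge("subscribe", "wrong", "s3cret", "42"), None);
    assert_eq!(verify_challenge("unsubscribe", "s3cret", "s3cret", "42"), None);
}
```

Returning `None` maps naturally onto an HTTP 403 in the webhook route, which is what Meta expects for a failed verification.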
#[async_trait] #[async_trait]
impl super::ChannelAdapter for InstagramAdapter { impl ChannelAdapter for InstagramAdapter {
fn name(&self) -> &str {
"Instagram"
}
fn is_configured(&self) -> bool {
!self.access_token.is_empty() && !self.page_id.is_empty()
}
async fn send_message( async fn send_message(
&self, &self,
response: BotResponse, response: BotResponse,
) -> Result<(), Box<dyn std::error::Error + Send + Sync>> { ) -> Result<(), Box<dyn std::error::Error + Send + Sync>> {
if !self.is_configured() {
error!("Instagram adapter not configured. Please set instagram-access-token and instagram-page-id in config.csv");
return Err("Instagram not configured".into());
}
let message_id = self
.send_instagram_message(&response.user_id, &response.content)
.await?;
info!( info!(
"Instagram message would be sent to {}: {}", "Instagram message sent to {}: {} (message_id: {})",
response.user_id, response.content response.user_id, response.content, message_id
); );
// TODO: Implement actual Instagram API integration
Ok(()) Ok(())
} }
async fn receive_message(
&self,
payload: serde_json::Value,
) -> Result<Option<String>, Box<dyn std::error::Error + Send + Sync>> {
// Parse Instagram webhook payload
if let Some(entry) = payload["entry"].as_array() {
if let Some(first_entry) = entry.first() {
if let Some(messaging) = first_entry["messaging"].as_array() {
if let Some(first_message) = messaging.first() {
// Check for different message types
if let Some(message) = first_message["message"].as_object() {
if let Some(text) = message["text"].as_str() {
return Ok(Some(text.to_string()));
} else if let Some(attachments) = message["attachments"].as_array() {
if let Some(first_attachment) = attachments.first() {
let attachment_type =
first_attachment["type"].as_str().unwrap_or("unknown");
return Ok(Some(format!(
"Received {} attachment",
attachment_type
)));
}
}
} else if let Some(postback) = first_message["postback"].as_object() {
if let Some(payload) = postback["payload"].as_str() {
return Ok(Some(format!("Postback: {}", payload)));
}
}
}
} else if let Some(changes) = first_entry["changes"].as_array() {
// Handle Instagram mentions and comments
if let Some(first_change) = changes.first() {
let field = first_change["field"].as_str().unwrap_or("");
match field {
"comments" => {
if let Some(text) = first_change["value"]["text"].as_str() {
return Ok(Some(format!("Comment: {}", text)));
}
}
"mentions" => {
if let Some(media_id) = first_change["value"]["media_id"].as_str() {
return Ok(Some(format!("Mentioned in media: {}", media_id)));
}
}
_ => {}
}
}
}
}
}
Ok(None)
}
async fn get_user_info(
&self,
user_id: &str,
) -> Result<serde_json::Value, Box<dyn std::error::Error + Send + Sync>> {
match self.get_user_profile(user_id).await {
Ok(profile) => Ok(serde_json::to_value(profile)?),
Err(_) => Ok(serde_json::json!({
"id": user_id,
"platform": "instagram"
})),
}
}
}
#[derive(Debug, Serialize, Deserialize)]
pub struct InstagramProfile {
pub id: String,
pub name: Option<String>,
pub profile_pic: Option<String>,
}
#[derive(Debug, Serialize, Deserialize)]
pub struct InstagramWebhookPayload {
pub object: String,
pub entry: Vec<InstagramEntry>,
}
#[derive(Debug, Serialize, Deserialize)]
pub struct InstagramEntry {
pub id: String,
pub time: i64,
pub messaging: Option<Vec<InstagramMessaging>>,
pub changes: Option<Vec<InstagramChange>>,
}
#[derive(Debug, Serialize, Deserialize)]
pub struct InstagramMessaging {
pub sender: InstagramUser,
pub recipient: InstagramUser,
pub timestamp: i64,
pub message: Option<InstagramMessage>,
pub postback: Option<InstagramPostback>,
}
#[derive(Debug, Serialize, Deserialize)]
pub struct InstagramUser {
pub id: String,
}
#[derive(Debug, Serialize, Deserialize)]
pub struct InstagramMessage {
pub mid: String,
pub text: Option<String>,
pub attachments: Option<Vec<InstagramAttachment>>,
}
#[derive(Debug, Serialize, Deserialize)]
pub struct InstagramAttachment {
#[serde(rename = "type")]
pub attachment_type: String,
pub payload: InstagramAttachmentPayload,
}
#[derive(Debug, Serialize, Deserialize)]
pub struct InstagramAttachmentPayload {
pub url: Option<String>,
}
#[derive(Debug, Serialize, Deserialize)]
pub struct InstagramPostback {
pub payload: String,
}
#[derive(Debug, Serialize, Deserialize)]
pub struct InstagramChange {
pub field: String,
pub value: serde_json::Value,
}
// Helper functions for Instagram-specific features
pub fn create_quick_reply(text: &str, replies: Vec<(&str, &str)>) -> serde_json::Value {
let quick_replies: Vec<serde_json::Value> = replies
.into_iter()
.map(|(title, payload)| {
serde_json::json!({
"content_type": "text",
"title": title,
"payload": payload
})
})
.collect();
serde_json::json!({
"text": text,
"quick_replies": quick_replies
})
}
pub fn create_generic_template(elements: Vec<serde_json::Value>) -> serde_json::Value {
serde_json::json!({
"attachment": {
"type": "template",
"payload": {
"template_type": "generic",
"elements": elements
}
}
})
}
pub fn create_media_template(media_type: &str, attachment_id: &str) -> serde_json::Value {
serde_json::json!({
"attachment": {
"type": "template",
"payload": {
"template_type": "media",
"elements": [{
"media_type": media_type,
"attachment_id": attachment_id
}]
}
}
})
} }


@@ -10,10 +10,35 @@ use std::sync::Arc;
use tokio::sync::{mpsc, Mutex}; use tokio::sync::{mpsc, Mutex};
#[async_trait] #[async_trait]
pub trait ChannelAdapter: Send + Sync { pub trait ChannelAdapter: Send + Sync {
fn name(&self) -> &str {
"Unknown"
}
fn is_configured(&self) -> bool {
true
}
async fn send_message( async fn send_message(
&self, &self,
response: BotResponse, response: BotResponse,
) -> Result<(), Box<dyn std::error::Error + Send + Sync>>; ) -> Result<(), Box<dyn std::error::Error + Send + Sync>>;
async fn receive_message(
&self,
payload: serde_json::Value,
) -> Result<Option<String>, Box<dyn std::error::Error + Send + Sync>> {
Ok(None)
}
async fn get_user_info(
&self,
user_id: &str,
) -> Result<serde_json::Value, Box<dyn std::error::Error + Send + Sync>> {
Ok(serde_json::json!({
"id": user_id,
"platform": self.name()
}))
}
} }
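The trait change above gives `name`, `is_configured`, `receive_message`, and `get_user_info` default bodies, so existing adapters keep compiling and override only what they need. The same default-method pattern, stripped of `async_trait` into a std-only sketch with illustrative type names:

```rust
// Default trait methods: a bare impl block inherits the fallbacks,
// while a fully configured adapter overrides both methods.
trait Adapter {
    fn name(&self) -> &str {
        "Unknown"
    }
    fn is_configured(&self) -> bool {
        true
    }
}

struct Bare;
impl Adapter for Bare {} // takes every default

struct Named;
impl Adapter for Named {
    fn name(&self) -> &str {
        "Named"
    }
    fn is_configured(&self) -> bool {
        false
    }
}

fn main() {
    assert_eq!(Bare.name(), "Unknown");
    assert!(Bare.is_configured());
    assert_eq!(Named.name(), "Named");
    assert!(!Named.is_configured());
}
```

Defaulting `is_configured` to `true` keeps the web adapter working unchanged while letting external channels gate `send_message` on missing credentials.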
#[derive(Debug)] #[derive(Debug)]
pub struct WebChannelAdapter { pub struct WebChannelAdapter {


@@ -1,29 +1,369 @@
use crate::shared::models::BotResponse;
use async_trait::async_trait; use async_trait::async_trait;
use log::info; use log::{error, info};
use serde::{Deserialize, Serialize};
use std::collections::HashMap;
use crate::core::bot::channels::ChannelAdapter;
use crate::shared::models::BotResponse;
/// Microsoft Teams channel adapter for sending messages through Teams
pub struct TeamsAdapter { pub struct TeamsAdapter {
// TODO: Add Teams API client configuration app_id: String,
app_password: String,
tenant_id: String,
service_url: String,
bot_id: String,
} }
impl TeamsAdapter { impl TeamsAdapter {
pub fn new() -> Self { pub fn new() -> Self {
Self {} // Load from environment variables (would be from config.csv in production)
let app_id = std::env::var("TEAMS_APP_ID").unwrap_or_default();
let app_password = std::env::var("TEAMS_APP_PASSWORD").unwrap_or_default();
let tenant_id = std::env::var("TEAMS_TENANT_ID").unwrap_or_default();
let service_url = std::env::var("TEAMS_SERVICE_URL")
.unwrap_or_else(|_| "https://smba.trafficmanager.net".to_string());
let bot_id = std::env::var("TEAMS_BOT_ID").unwrap_or_else(|_| app_id.clone());
Self {
app_id,
app_password,
tenant_id,
service_url,
bot_id,
}
}
async fn get_access_token(&self) -> Result<String, Box<dyn std::error::Error + Send + Sync>> {
let client = reqwest::Client::new();
let token_url = format!(
"https://login.microsoftonline.com/{}/oauth2/v2.0/token",
if self.tenant_id.is_empty() {
"common"
} else {
&self.tenant_id
}
);
let params = [
("client_id", &self.app_id),
("client_secret", &self.app_password),
("grant_type", &"client_credentials".to_string()),
(
"scope",
&"https://api.botframework.com/.default".to_string(),
),
];
let response = client.post(&token_url).form(&params).send().await?;
if response.status().is_success() {
let token_response: serde_json::Value = response.json().await?;
Ok(token_response["access_token"]
.as_str()
.unwrap_or("")
.to_string())
} else {
let error_text = response.text().await?;
Err(format!("Failed to get Teams access token: {}", error_text).into())
}
}
async fn send_teams_message(
&self,
conversation_id: &str,
activity_id: Option<&str>,
message: &str,
) -> Result<String, Box<dyn std::error::Error + Send + Sync>> {
let token = self.get_access_token().await?;
let client = reqwest::Client::new();
let url = if let Some(reply_to_id) = activity_id {
format!(
"{}/v3/conversations/{}/activities/{}/reply",
self.service_url, conversation_id, reply_to_id
)
} else {
format!(
"{}/v3/conversations/{}/activities",
self.service_url, conversation_id
)
};
let activity = TeamsActivity {
activity_type: "message".to_string(),
text: message.to_string(),
from: TeamsChannelAccount {
id: self.bot_id.clone(),
name: Some("Bot".to_string()),
},
conversation: TeamsConversationAccount {
id: conversation_id.to_string(),
conversation_type: None,
tenant_id: Some(self.tenant_id.clone()),
},
recipient: None,
attachments: None,
entities: None,
};
let response = client
.post(&url)
.header("Authorization", format!("Bearer {}", token))
.header("Content-Type", "application/json")
.json(&activity)
.send()
.await?;
if response.status().is_success() {
let result: serde_json::Value = response.json().await?;
Ok(result["id"].as_str().unwrap_or("").to_string())
} else {
let error_text = response.text().await?;
Err(format!("Teams API error: {}", error_text).into())
}
}
pub async fn send_card(
&self,
conversation_id: &str,
card: TeamsAdaptiveCard,
) -> Result<String, Box<dyn std::error::Error + Send + Sync>> {
let token = self.get_access_token().await?;
let client = reqwest::Client::new();
let url = format!(
"{}/v3/conversations/{}/activities",
self.service_url, conversation_id
);
let attachment = TeamsAttachment {
content_type: "application/vnd.microsoft.card.adaptive".to_string(),
content: serde_json::to_value(card)?,
};
let activity = TeamsActivity {
activity_type: "message".to_string(),
text: String::new(),
from: TeamsChannelAccount {
id: self.bot_id.clone(),
name: Some("Bot".to_string()),
},
conversation: TeamsConversationAccount {
id: conversation_id.to_string(),
conversation_type: None,
tenant_id: Some(self.tenant_id.clone()),
},
recipient: None,
attachments: Some(vec![attachment]),
entities: None,
};
let response = client
.post(&url)
.header("Authorization", format!("Bearer {}", token))
.header("Content-Type", "application/json")
.json(&activity)
.send()
.await?;
if response.status().is_success() {
let result: serde_json::Value = response.json().await?;
Ok(result["id"].as_str().unwrap_or("").to_string())
} else {
let error_text = response.text().await?;
Err(format!("Teams API error: {}", error_text).into())
}
}
pub async fn update_message(
&self,
conversation_id: &str,
activity_id: &str,
new_message: &str,
) -> Result<(), Box<dyn std::error::Error + Send + Sync>> {
let token = self.get_access_token().await?;
let client = reqwest::Client::new();
let url = format!(
"{}/v3/conversations/{}/activities/{}",
self.service_url, conversation_id, activity_id
);
let activity = TeamsActivity {
activity_type: "message".to_string(),
text: new_message.to_string(),
from: TeamsChannelAccount {
id: self.bot_id.clone(),
name: Some("Bot".to_string()),
},
conversation: TeamsConversationAccount {
id: conversation_id.to_string(),
conversation_type: None,
tenant_id: Some(self.tenant_id.clone()),
},
recipient: None,
attachments: None,
entities: None,
};
let response = client
.put(&url)
.header("Authorization", format!("Bearer {}", token))
.header("Content-Type", "application/json")
.json(&activity)
.send()
.await?;
if !response.status().is_success() {
let error_text = response.text().await?;
return Err(format!("Teams API error: {}", error_text).into());
}
Ok(())
} }
} }
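`get_access_token` above falls back to the multi-tenant `common` endpoint when no tenant ID is configured. That URL selection can be sketched with std only (the helper name is illustrative):

```rust
/// Builds the Azure AD v2.0 token endpoint, falling back to the
/// multi-tenant "common" endpoint when no tenant ID is configured.
fn token_endpoint(tenant_id: &str) -> String {
    let tenant = if tenant_id.is_empty() { "common" } else { tenant_id };
    format!("https://login.microsoftonline.com/{}/oauth2/v2.0/token", tenant)
}

fn main() {
    assert_eq!(
        token_endpoint(""),
        "https://login.microsoftonline.com/common/oauth2/v2.0/token"
    );
    assert_eq!(
        token_endpoint("contoso"),
        "https://login.microsoftonline.com/contoso/oauth2/v2.0/token"
    );
}
```

The token request itself is a client-credentials grant scoped to `https://api.botframework.com/.default`, matching the form parameters built in the adapter.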
#[async_trait] #[async_trait]
impl super::ChannelAdapter for TeamsAdapter { impl ChannelAdapter for TeamsAdapter {
fn name(&self) -> &str {
"Teams"
}
fn is_configured(&self) -> bool {
!self.app_id.is_empty() && !self.app_password.is_empty()
}
async fn send_message( async fn send_message(
&self, &self,
response: BotResponse, response: BotResponse,
) -> Result<(), Box<dyn std::error::Error + Send + Sync>> { ) -> Result<(), Box<dyn std::error::Error + Send + Sync>> {
if !self.is_configured() {
error!("Teams adapter not configured. Please set teams-app-id and teams-app-password in config.csv");
return Err("Teams not configured".into());
}
// In Teams, user_id is typically the conversation ID
let message_id = self
.send_teams_message(&response.user_id, None, &response.content)
.await?;
info!( info!(
"Teams message would be sent to {}: {}", "Teams message sent to conversation {}: {} (message_id: {})",
response.user_id, response.content response.user_id, response.content, message_id
); );
// TODO: Implement actual Teams API integration
Ok(()) Ok(())
} }
async fn receive_message(
&self,
payload: serde_json::Value,
) -> Result<Option<String>, Box<dyn std::error::Error + Send + Sync>> {
// Parse Teams activity payload
if let Some(activity_type) = payload["type"].as_str() {
match activity_type {
"message" => {
return Ok(payload["text"].as_str().map(|s| s.to_string()));
}
"invoke" => {
// Handle Teams-specific invokes (like adaptive card actions)
if let Some(name) = payload["name"].as_str() {
return Ok(Some(format!("Teams invoke: {}", name)));
}
}
_ => {
return Ok(None);
}
}
}
Ok(None)
}
async fn get_user_info(
&self,
user_id: &str,
) -> Result<serde_json::Value, Box<dyn std::error::Error + Send + Sync>> {
let token = self.get_access_token().await?;
let client = reqwest::Client::new();
// In Teams, user_id might be in format "29:1xyz..."
let url = format!("{}/v3/conversations/{}/members", self.service_url, user_id);
let response = client
.get(&url)
.header("Authorization", format!("Bearer {}", token))
.send()
.await?;
if response.status().is_success() {
let members: Vec<serde_json::Value> = response.json().await?;
if let Some(first_member) = members.first() {
return Ok(first_member.clone());
}
}
Ok(serde_json::json!({
"id": user_id,
"platform": "teams"
}))
}
}
#[derive(Debug, Serialize, Deserialize)]
pub struct TeamsActivity {
#[serde(rename = "type")]
pub activity_type: String,
pub text: String,
pub from: TeamsChannelAccount,
pub conversation: TeamsConversationAccount,
pub recipient: Option<TeamsChannelAccount>,
pub attachments: Option<Vec<TeamsAttachment>>,
pub entities: Option<Vec<serde_json::Value>>,
}
#[derive(Debug, Serialize, Deserialize)]
pub struct TeamsChannelAccount {
pub id: String,
pub name: Option<String>,
}
#[derive(Debug, Serialize, Deserialize)]
pub struct TeamsConversationAccount {
pub id: String,
#[serde(rename = "conversationType")]
pub conversation_type: Option<String>,
#[serde(rename = "tenantId")]
pub tenant_id: Option<String>,
}
#[derive(Debug, Serialize, Deserialize)]
pub struct TeamsAttachment {
#[serde(rename = "contentType")]
pub content_type: String,
pub content: serde_json::Value,
}
#[derive(Debug, Serialize, Deserialize)]
pub struct TeamsAdaptiveCard {
#[serde(rename = "$schema")]
pub schema: String,
#[serde(rename = "type")]
pub card_type: String,
pub version: String,
pub body: Vec<serde_json::Value>,
pub actions: Option<Vec<serde_json::Value>>,
}
impl Default for TeamsAdaptiveCard {
fn default() -> Self {
Self {
schema: "http://adaptivecards.io/schemas/adaptive-card.json".to_string(),
card_type: "AdaptiveCard".to_string(),
version: "1.4".to_string(),
body: Vec::new(),
actions: None,
}
}
} }


@@ -1,29 +1,347 @@
use async_trait::async_trait;
use log::{error, info};
use serde::{Deserialize, Serialize};
use std::collections::HashMap;
use crate::core::bot::channels::ChannelAdapter;
use crate::shared::models::BotResponse;
/// WhatsApp channel adapter for sending messages through WhatsApp
pub struct WhatsAppAdapter {
api_key: String,
phone_number_id: String,
webhook_verify_token: String,
business_account_id: String,
api_version: String,
}
impl WhatsAppAdapter {
pub fn new() -> Self {
// Load from environment variables (would be from config.csv in production)
let api_key = std::env::var("WHATSAPP_API_KEY").unwrap_or_default();
let phone_number_id = std::env::var("WHATSAPP_PHONE_NUMBER_ID").unwrap_or_default();
let webhook_verify_token =
std::env::var("WHATSAPP_VERIFY_TOKEN").unwrap_or_else(|_| "webhook_verify".to_string());
let business_account_id = std::env::var("WHATSAPP_BUSINESS_ACCOUNT_ID").unwrap_or_default();
let api_version = "v17.0".to_string();
Self {
api_key,
phone_number_id,
webhook_verify_token,
business_account_id,
api_version,
}
}
async fn send_whatsapp_message(
&self,
to: &str,
message: &str,
) -> Result<String, Box<dyn std::error::Error + Send + Sync>> {
let client = reqwest::Client::new();
let url = format!(
"https://graph.facebook.com/{}/{}/messages",
self.api_version, self.phone_number_id
);
let payload = serde_json::json!({
"messaging_product": "whatsapp",
"recipient_type": "individual",
"to": to,
"type": "text",
"text": {
"preview_url": false,
"body": message
}
});
let response = client
.post(&url)
.header("Authorization", format!("Bearer {}", self.api_key))
.header("Content-Type", "application/json")
.json(&payload)
.send()
.await?;
if response.status().is_success() {
let result: serde_json::Value = response.json().await?;
Ok(result["messages"][0]["id"]
.as_str()
.unwrap_or("")
.to_string())
} else {
let error_text = response.text().await?;
Err(format!("WhatsApp API error: {}", error_text).into())
}
}
pub async fn send_template_message(
&self,
to: &str,
template_name: &str,
language_code: &str,
parameters: Vec<String>,
) -> Result<String, Box<dyn std::error::Error + Send + Sync>> {
let client = reqwest::Client::new();
let url = format!(
"https://graph.facebook.com/{}/{}/messages",
self.api_version, self.phone_number_id
);
let components = if !parameters.is_empty() {
vec![serde_json::json!({
"type": "body",
"parameters": parameters.iter().map(|p| {
serde_json::json!({
"type": "text",
"text": p
})
}).collect::<Vec<_>>()
})]
} else {
vec![]
};
let payload = serde_json::json!({
"messaging_product": "whatsapp",
"to": to,
"type": "template",
"template": {
"name": template_name,
"language": {
"code": language_code
},
"components": components
}
});
let response = client
.post(&url)
.header("Authorization", format!("Bearer {}", self.api_key))
.header("Content-Type", "application/json")
.json(&payload)
.send()
.await?;
if response.status().is_success() {
let result: serde_json::Value = response.json().await?;
Ok(result["messages"][0]["id"]
.as_str()
.unwrap_or("")
.to_string())
} else {
let error_text = response.text().await?;
Err(format!("WhatsApp API error: {}", error_text).into())
}
}
pub async fn send_media_message(
&self,
to: &str,
media_type: &str,
media_url: &str,
caption: Option<&str>,
) -> Result<String, Box<dyn std::error::Error + Send + Sync>> {
let client = reqwest::Client::new();
let url = format!(
"https://graph.facebook.com/{}/{}/messages",
self.api_version, self.phone_number_id
);
let mut media_object = serde_json::json!({
"link": media_url
});
if let Some(caption_text) = caption {
media_object["caption"] = serde_json::json!(caption_text);
}
let payload = serde_json::json!({
"messaging_product": "whatsapp",
"to": to,
"type": media_type,
media_type: media_object
});
let response = client
.post(&url)
.header("Authorization", format!("Bearer {}", self.api_key))
.header("Content-Type", "application/json")
.json(&payload)
.send()
.await?;
if response.status().is_success() {
let result: serde_json::Value = response.json().await?;
Ok(result["messages"][0]["id"]
.as_str()
.unwrap_or("")
.to_string())
} else {
let error_text = response.text().await?;
Err(format!("WhatsApp API error: {}", error_text).into())
}
}
pub fn verify_webhook(&self, token: &str) -> bool {
token == self.webhook_verify_token
}
}
#[async_trait]
impl ChannelAdapter for WhatsAppAdapter {
fn name(&self) -> &str {
"WhatsApp"
}
fn is_configured(&self) -> bool {
!self.api_key.is_empty() && !self.phone_number_id.is_empty()
}
async fn send_message(
&self,
response: BotResponse,
) -> Result<(), Box<dyn std::error::Error + Send + Sync>> {
if !self.is_configured() {
error!("WhatsApp adapter not configured. Please set whatsapp-api-key and whatsapp-phone-number-id in config.csv");
return Err("WhatsApp not configured".into());
}
let message_id = self
.send_whatsapp_message(&response.user_id, &response.content)
.await?;
info!(
"WhatsApp message sent to {}: {} (message_id: {})",
response.user_id, response.content, message_id
);
Ok(())
}
async fn receive_message(
&self,
payload: serde_json::Value,
) -> Result<Option<String>, Box<dyn std::error::Error + Send + Sync>> {
// Parse WhatsApp webhook payload
if let Some(entry) = payload["entry"].as_array() {
if let Some(first_entry) = entry.first() {
if let Some(changes) = first_entry["changes"].as_array() {
if let Some(first_change) = changes.first() {
if let Some(messages) = first_change["value"]["messages"].as_array() {
if let Some(first_message) = messages.first() {
let message_type = first_message["type"].as_str().unwrap_or("");
match message_type {
"text" => {
return Ok(first_message["text"]["body"]
.as_str()
.map(|s| s.to_string()));
}
"image" | "document" | "audio" | "video" => {
return Ok(Some(format!(
"Received {} message",
message_type
)));
}
_ => {
return Ok(Some(format!(
"Received unsupported message type: {}",
message_type
)));
}
}
}
}
}
}
}
}
Ok(None)
}
async fn get_user_info(
&self,
user_id: &str,
) -> Result<serde_json::Value, Box<dyn std::error::Error + Send + Sync>> {
let client = reqwest::Client::new();
let url = format!(
"https://graph.facebook.com/{}/{}",
self.api_version, user_id
);
let response = client
.get(&url)
.header("Authorization", format!("Bearer {}", self.api_key))
.send()
.await?;
if response.status().is_success() {
Ok(response.json().await?)
} else {
Ok(serde_json::json!({
"id": user_id,
"platform": "whatsapp"
}))
}
}
}
#[derive(Debug, Serialize, Deserialize)]
pub struct WhatsAppWebhookPayload {
pub entry: Vec<WhatsAppEntry>,
}
#[derive(Debug, Serialize, Deserialize)]
pub struct WhatsAppEntry {
pub id: String,
pub changes: Vec<WhatsAppChange>,
}
#[derive(Debug, Serialize, Deserialize)]
pub struct WhatsAppChange {
pub field: String,
pub value: WhatsAppValue,
}
#[derive(Debug, Serialize, Deserialize)]
pub struct WhatsAppValue {
pub messaging_product: String,
pub metadata: WhatsAppMetadata,
pub messages: Option<Vec<WhatsAppMessage>>,
pub statuses: Option<Vec<WhatsAppStatus>>,
}
#[derive(Debug, Serialize, Deserialize)]
pub struct WhatsAppMetadata {
pub display_phone_number: String,
pub phone_number_id: String,
}
#[derive(Debug, Serialize, Deserialize)]
pub struct WhatsAppMessage {
pub from: String,
pub id: String,
pub timestamp: String,
#[serde(rename = "type")]
pub message_type: String,
pub text: Option<WhatsAppText>,
}
#[derive(Debug, Serialize, Deserialize)]
pub struct WhatsAppText {
pub body: String,
}
#[derive(Debug, Serialize, Deserialize)]
pub struct WhatsAppStatus {
pub id: String,
pub status: String,
pub timestamp: String,
pub recipient_id: String,
}
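`verify_webhook` covers the token check of Meta's webhook handshake: on subscription, Meta issues a GET with `hub.mode`, `hub.verify_token`, and `hub.challenge`, and the server must echo the challenge back only when the token matches. A std-only sketch of that handshake (parameter names illustrative, not tied to any web framework):

```rust
/// Sketch of the Meta webhook verification handshake (GET /webhook).
/// Returns Some(challenge) to echo back with a 200, or None for a 403.
fn handle_verification(
    mode: &str,
    token: &str,
    challenge: &str,
    expected_token: &str,
) -> Option<String> {
    if mode == "subscribe" && token == expected_token {
        Some(challenge.to_string()) // respond 200 with the raw challenge body
    } else {
        None // token mismatch: respond 403
    }
}
```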


@@ -207,6 +207,29 @@ impl ConfigManager {
};
Ok(value)
}
pub async fn get_bot_config_value(
&self,
target_bot_id: &uuid::Uuid,
key: &str,
) -> Result<String, String> {
use crate::shared::models::schema::bot_configuration::dsl::*;
use diesel::prelude::*;
let mut conn = self
.get_conn()
.map_err(|e| format!("Failed to acquire connection: {}", e))?;
let value = bot_configuration
.filter(bot_id.eq(target_bot_id))
.filter(config_key.eq(key))
.select(config_value)
.first::<String>(&mut conn)
.map_err(|e| format!("Failed to get bot config value: {}", e))?;
Ok(value)
}
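`get_bot_config_value` returns an `Err` when the key has no row for that bot; callers that want a global fallback have to layer lookups themselves. A hypothetical sketch of that layering with plain `HashMap`s (the real method queries `bot_configuration` through Diesel; key names here are invented for illustration):

```rust
use std::collections::HashMap;

/// Look up a config key in the bot-specific overrides first,
/// then fall back to the global defaults. Hypothetical helper.
fn config_value(
    bot_overrides: &HashMap<String, String>,
    defaults: &HashMap<String, String>,
    key: &str,
) -> Option<String> {
    bot_overrides.get(key).or_else(|| defaults.get(key)).cloned()
}
```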
pub fn sync_gbot_config(&self, bot_id: &uuid::Uuid, content: &str) -> Result<usize, String> {
use sha2::{Digest, Sha256};
let mut hasher = Sha256::new();


@@ -0,0 +1,587 @@
use anyhow::Result;
use log::{error, info, warn};
use serde::{Deserialize, Serialize};
use std::collections::HashMap;
use std::path::Path;
use tokio::io::AsyncReadExt;
/// Supported document formats for knowledge base
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum DocumentFormat {
PDF,
DOCX,
XLSX,
PPTX,
TXT,
MD,
HTML,
RTF,
CSV,
JSON,
XML,
}
impl DocumentFormat {
/// Detect format from file extension
pub fn from_extension(path: &Path) -> Option<Self> {
let ext = path.extension()?.to_str()?.to_lowercase();
match ext.as_str() {
"pdf" => Some(Self::PDF),
"docx" => Some(Self::DOCX),
"xlsx" => Some(Self::XLSX),
"pptx" => Some(Self::PPTX),
"txt" => Some(Self::TXT),
"md" | "markdown" => Some(Self::MD),
"html" | "htm" => Some(Self::HTML),
"rtf" => Some(Self::RTF),
"csv" => Some(Self::CSV),
"json" => Some(Self::JSON),
"xml" => Some(Self::XML),
_ => None,
}
}
/// Get maximum file size for this format (in bytes)
pub fn max_size(&self) -> usize {
match self {
Self::PDF => 500 * 1024 * 1024, // 500MB
Self::DOCX => 100 * 1024 * 1024, // 100MB
Self::XLSX => 100 * 1024 * 1024, // 100MB
Self::PPTX => 200 * 1024 * 1024, // 200MB
Self::TXT => 100 * 1024 * 1024, // 100MB
Self::MD => 10 * 1024 * 1024, // 10MB
Self::HTML => 50 * 1024 * 1024, // 50MB
Self::RTF => 50 * 1024 * 1024, // 50MB
Self::CSV => 1024 * 1024 * 1024, // 1GB
Self::JSON => 100 * 1024 * 1024, // 100MB
Self::XML => 100 * 1024 * 1024, // 100MB
}
}
}
/// Document metadata extracted during processing
#[derive(Debug, Clone, Serialize, Deserialize)]
pub struct DocumentMetadata {
pub title: Option<String>,
pub author: Option<String>,
pub creation_date: Option<String>,
pub modification_date: Option<String>,
pub page_count: Option<usize>,
pub word_count: Option<usize>,
pub language: Option<String>,
}
/// A text chunk ready for embedding
#[derive(Debug, Clone, Serialize, Deserialize)]
pub struct TextChunk {
pub content: String,
pub metadata: ChunkMetadata,
}
/// Metadata for a text chunk
#[derive(Debug, Clone, Serialize, Deserialize)]
pub struct ChunkMetadata {
pub document_path: String,
pub document_title: Option<String>,
pub chunk_index: usize,
pub total_chunks: usize,
pub start_char: usize,
pub end_char: usize,
pub page_number: Option<usize>,
}
/// Main document processor for knowledge base
#[derive(Debug)]
pub struct DocumentProcessor {
chunk_size: usize,
chunk_overlap: usize,
}
impl Default for DocumentProcessor {
fn default() -> Self {
Self {
chunk_size: 1000, // 1000 characters as per docs
chunk_overlap: 200, // 200 character overlap as per docs
}
}
}
impl DocumentProcessor {
pub fn new(chunk_size: usize, chunk_overlap: usize) -> Self {
Self {
chunk_size,
chunk_overlap,
}
}
/// Get the chunk size
pub fn chunk_size(&self) -> usize {
self.chunk_size
}
/// Get the chunk overlap
pub fn chunk_overlap(&self) -> usize {
self.chunk_overlap
}
/// Process a document file and return extracted text chunks
pub async fn process_document(&self, file_path: &Path) -> Result<Vec<TextChunk>> {
// Check if file exists
if !file_path.exists() {
return Err(anyhow::anyhow!("File not found: {:?}", file_path));
}
// Get file size
let metadata = tokio::fs::metadata(file_path).await?;
let file_size = metadata.len() as usize;
// Detect format
let format = DocumentFormat::from_extension(file_path)
.ok_or_else(|| anyhow::anyhow!("Unsupported file format: {:?}", file_path))?;
// Check file size
if file_size > format.max_size() {
return Err(anyhow::anyhow!(
"File too large: {} bytes (max: {} bytes)",
file_size,
format.max_size()
));
}
info!(
"Processing document: {:?} (format: {:?}, size: {} bytes)",
file_path, format, file_size
);
// Extract text based on format
let text = self.extract_text(file_path, format).await?;
// Clean and normalize text
let cleaned_text = self.clean_text(&text);
// Generate chunks
let chunks = self.create_chunks(&cleaned_text, file_path);
info!(
"Created {} chunks from document: {:?}",
chunks.len(),
file_path
);
Ok(chunks)
}
/// Extract text from document based on format
async fn extract_text(&self, file_path: &Path, format: DocumentFormat) -> Result<String> {
match format {
DocumentFormat::TXT | DocumentFormat::MD => {
// Direct text file reading
let mut file = tokio::fs::File::open(file_path).await?;
let mut contents = String::new();
file.read_to_string(&mut contents).await?;
Ok(contents)
}
DocumentFormat::PDF => self.extract_pdf_text(file_path).await,
DocumentFormat::DOCX => self.extract_docx_text(file_path).await,
DocumentFormat::HTML => self.extract_html_text(file_path).await,
DocumentFormat::CSV => self.extract_csv_text(file_path).await,
DocumentFormat::JSON => self.extract_json_text(file_path).await,
_ => {
warn!(
"Format {:?} extraction not yet implemented, using fallback",
format
);
self.fallback_text_extraction(file_path).await
}
}
}
/// Extract text from PDF files
async fn extract_pdf_text(&self, file_path: &Path) -> Result<String> {
// Try system pdftotext first (fastest and most reliable)
let output = tokio::process::Command::new("pdftotext")
.arg("-layout")
.arg(file_path)
.arg("-")
.output()
.await;
match output {
Ok(output) if output.status.success() => {
info!("Successfully extracted PDF with pdftotext: {:?}", file_path);
Ok(String::from_utf8_lossy(&output.stdout).to_string())
}
_ => {
warn!(
"pdftotext failed for {:?}, trying library extraction",
file_path
);
self.extract_pdf_with_library(file_path).await
}
}
}
/// Extract PDF using poppler-utils
async fn extract_pdf_with_poppler(&self, file_path: &Path) -> Result<String> {
let output = tokio::process::Command::new("pdftotext")
.arg(file_path)
.arg("-")
.output()
.await?;
if output.status.success() {
Ok(String::from_utf8_lossy(&output.stdout).to_string())
} else {
// Fallback to library extraction
self.extract_pdf_with_library(file_path).await
}
}
/// Extract PDF using rust library (fallback)
async fn extract_pdf_with_library(&self, file_path: &Path) -> Result<String> {
use pdf_extract::extract_text;
match extract_text(file_path) {
Ok(text) => {
info!("Successfully extracted PDF with library: {:?}", file_path);
Ok(text)
}
Err(e) => {
warn!("PDF library extraction failed: {}", e);
// Last resort: try to get any text we can
self.extract_pdf_basic(file_path).await
}
}
}
/// Basic PDF extraction using rust library (minimal approach)
async fn extract_pdf_basic(&self, file_path: &Path) -> Result<String> {
// Try using pdf-extract as final fallback
match pdf_extract::extract_text(file_path) {
Ok(text) if !text.is_empty() => Ok(text),
_ => {
// Last resort: return error message
Err(anyhow::anyhow!(
"Could not extract text from PDF. Please ensure pdftotext is installed."
))
}
}
}
/// Extract text from DOCX files
async fn extract_docx_text(&self, file_path: &Path) -> Result<String> {
// Use docx-rs or similar crate
// For now, use pandoc as fallback
let output = tokio::process::Command::new("pandoc")
.arg("-f")
.arg("docx")
.arg("-t")
.arg("plain")
.arg(file_path)
.output()
.await;
match output {
Ok(output) if output.status.success() => {
Ok(String::from_utf8_lossy(&output.stdout).to_string())
}
_ => {
warn!("pandoc failed for DOCX, using fallback");
self.fallback_text_extraction(file_path).await
}
}
}
/// Extract text from HTML files
async fn extract_html_text(&self, file_path: &Path) -> Result<String> {
let contents = tokio::fs::read_to_string(file_path).await?;
// Simple HTML tag removal (production should use html parser)
let text = contents
.split('<')
.flat_map(|s| s.split('>').skip(1))
.collect::<Vec<_>>()
.join(" ");
Ok(text)
}
/// Extract text from CSV files
async fn extract_csv_text(&self, file_path: &Path) -> Result<String> {
let contents = tokio::fs::read_to_string(file_path).await?;
// Convert CSV rows to text
let mut text = String::new();
for line in contents.lines() {
text.push_str(line);
text.push('\n');
}
Ok(text)
}
/// Extract text from JSON files
async fn extract_json_text(&self, file_path: &Path) -> Result<String> {
let contents = tokio::fs::read_to_string(file_path).await?;
// Parse JSON and extract all string values
if let Ok(json) = serde_json::from_str::<serde_json::Value>(&contents) {
Ok(self.extract_json_strings(&json))
} else {
Ok(contents)
}
}
/// Recursively extract string values from JSON
fn extract_json_strings(&self, value: &serde_json::Value) -> String {
let mut result = String::new();
match value {
serde_json::Value::String(s) => {
result.push_str(s);
result.push(' ');
}
serde_json::Value::Array(arr) => {
for item in arr {
result.push_str(&self.extract_json_strings(item));
}
}
serde_json::Value::Object(map) => {
for (_key, val) in map {
result.push_str(&self.extract_json_strings(val));
}
}
_ => {}
}
result
}
/// Fallback text extraction for unsupported formats
async fn fallback_text_extraction(&self, file_path: &Path) -> Result<String> {
// Try to read as UTF-8 text
match tokio::fs::read_to_string(file_path).await {
Ok(contents) => Ok(contents),
Err(_) => {
// If not UTF-8, try with lossy conversion
let bytes = tokio::fs::read(file_path).await?;
Ok(String::from_utf8_lossy(&bytes).to_string())
}
}
}
/// Clean and normalize extracted text
fn clean_text(&self, text: &str) -> String {
// Remove multiple spaces and normalize whitespace
let cleaned = text
.lines()
.map(|line| line.trim())
.filter(|line| !line.is_empty())
.collect::<Vec<_>>()
.join("\n");
// Remove control characters
cleaned
.chars()
.filter(|c| !c.is_control() || c.is_whitespace())
.collect::<String>()
.split_whitespace()
.collect::<Vec<_>>()
.join(" ")
}
/// Create overlapping chunks from text
fn create_chunks(&self, text: &str, file_path: &Path) -> Vec<TextChunk> {
let mut chunks = Vec::new();
let chars: Vec<char> = text.chars().collect();
let total_chars = chars.len();
if total_chars == 0 {
return chunks;
}
let mut start = 0;
let mut chunk_index = 0;
// Calculate total number of chunks for metadata
let step_size = self.chunk_size.saturating_sub(self.chunk_overlap);
let total_chunks = if step_size > 0 {
(total_chars + step_size - 1) / step_size
} else {
1
};
while start < total_chars {
let end = std::cmp::min(start + self.chunk_size, total_chars);
// Find word boundary for clean cuts
let mut chunk_end = end;
if end < total_chars {
// Look for word boundary
for i in (start..end).rev() {
if chars[i].is_whitespace() {
chunk_end = i + 1;
break;
}
}
}
let chunk_content: String = chars[start..chunk_end].iter().collect();
chunks.push(TextChunk {
content: chunk_content,
metadata: ChunkMetadata {
document_path: file_path.to_string_lossy().to_string(),
document_title: file_path
.file_stem()
.and_then(|s| s.to_str())
.map(|s| s.to_string()),
chunk_index,
total_chunks,
start_char: start,
end_char: chunk_end,
page_number: None, // Would be set for PDFs with page info
},
});
chunk_index += 1;
// Move forward by chunk_size - overlap
start = if chunk_end >= self.chunk_overlap {
chunk_end - self.chunk_overlap
} else {
chunk_end
};
// Prevent infinite loop
if start >= total_chars {
break;
}
}
chunks
}
/// Process all documents in a knowledge base folder
pub async fn process_kb_folder(
&self,
kb_path: &Path,
) -> Result<HashMap<String, Vec<TextChunk>>> {
let mut results = HashMap::new();
if !kb_path.exists() {
return Err(anyhow::anyhow!(
"Knowledge base folder not found: {:?}",
kb_path
));
}
info!("Processing knowledge base folder: {:?}", kb_path);
// Recursively process all files
self.process_directory_recursive(kb_path, &mut results)
.await?;
info!("Processed {} documents in knowledge base", results.len());
Ok(results)
}
/// Recursively process directory
fn process_directory_recursive<'a>(
&'a self,
dir: &'a Path,
results: &'a mut HashMap<String, Vec<TextChunk>>,
) -> std::pin::Pin<Box<dyn std::future::Future<Output = Result<()>> + Send + 'a>> {
Box::pin(async move {
let mut entries = tokio::fs::read_dir(dir).await?;
while let Some(entry) = entries.next_entry().await? {
let path = entry.path();
let metadata = entry.metadata().await?;
if metadata.is_dir() {
// Recurse into subdirectory
self.process_directory_recursive(&path, results).await?;
} else if metadata.is_file() {
// Check if this is a supported format
if DocumentFormat::from_extension(&path).is_some() {
match self.process_document(&path).await {
Ok(chunks) => {
let key = path.to_string_lossy().to_string();
results.insert(key, chunks);
}
Err(e) => {
error!("Failed to process document {:?}: {}", path, e);
}
}
}
}
}
Ok(())
})
}
}
#[cfg(test)]
mod tests {
use super::*;
#[test]
fn test_chunk_creation() {
let processor = DocumentProcessor::default();
let text = "This is a test document with some content that needs to be chunked properly. "
.repeat(20);
let chunks = processor.create_chunks(&text, Path::new("test.txt"));
// Verify chunks are created
assert!(!chunks.is_empty());
// Verify chunk size
for chunk in &chunks {
assert!(chunk.content.len() <= processor.chunk_size);
}
// Verify overlap exists
if chunks.len() > 1 {
let first_end = &chunks[0].content[chunks[0].content.len().saturating_sub(100)..];
let second_start = &chunks[1].content[..100.min(chunks[1].content.len())];
// There should be some overlap
assert!(first_end.chars().any(|c| second_start.contains(c)));
}
}
#[test]
fn test_format_detection() {
assert_eq!(
DocumentFormat::from_extension(Path::new("test.pdf")),
Some(DocumentFormat::PDF)
);
assert_eq!(
DocumentFormat::from_extension(Path::new("test.docx")),
Some(DocumentFormat::DOCX)
);
assert_eq!(
DocumentFormat::from_extension(Path::new("test.txt")),
Some(DocumentFormat::TXT)
);
assert_eq!(
DocumentFormat::from_extension(Path::new("test.md")),
Some(DocumentFormat::MD)
);
assert_eq!(
DocumentFormat::from_extension(Path::new("test.unknown")),
None
);
}
#[test]
fn test_text_cleaning() {
let processor = DocumentProcessor::default();
let dirty_text = " This is\n\n\na test\r\nwith multiple spaces ";
let cleaned = processor.clean_text(dirty_text);
assert_eq!(cleaned, "This is a test with multiple spaces");
}
}
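The stepping in `create_chunks` can be summarized without the word-boundary adjustment: windows of `chunk_size` characters advancing by `chunk_size - overlap`. A minimal sketch of that arithmetic, under those simplifying assumptions:

```rust
/// Compute (start, end) character bounds for overlapping chunks.
/// Sketch of create_chunks' stepping, without word-boundary snapping.
fn chunk_bounds(total: usize, chunk_size: usize, overlap: usize) -> Vec<(usize, usize)> {
    // Advance by chunk_size - overlap; guard against a zero step.
    let step = chunk_size.saturating_sub(overlap).max(1);
    let mut bounds = Vec::new();
    let mut start = 0;
    while start < total {
        let end = (start + chunk_size).min(total);
        bounds.push((start, end));
        if end == total {
            break; // last window reached the end of the text
        }
        start += step;
    }
    bounds
}
```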


@@ -0,0 +1,443 @@
use anyhow::{Context, Result};
use log::{debug, info, warn};
use reqwest::Client;
use serde::{Deserialize, Serialize};
use std::sync::Arc;
use tokio::sync::Semaphore;
use super::document_processor::TextChunk;
/// Embedding model configuration
#[derive(Debug, Clone)]
pub struct EmbeddingConfig {
/// URL for the embedding service (e.g., http://localhost:8082)
pub embedding_url: String,
/// Model name/path for embeddings (e.g., bge-small-en-v1.5)
pub embedding_model: String,
/// Dimension of embeddings (e.g., 384, 768, 1536)
pub dimensions: usize,
/// Maximum batch size for embedding generation
pub batch_size: usize,
/// Request timeout in seconds
pub timeout_seconds: u64,
}
impl Default for EmbeddingConfig {
fn default() -> Self {
Self {
embedding_url: "http://localhost:8082".to_string(),
embedding_model: "bge-small-en-v1.5".to_string(),
dimensions: 384, // Default for bge-small
batch_size: 32,
timeout_seconds: 30,
}
}
}
impl EmbeddingConfig {
/// Create config from environment or config.csv values
pub fn from_env() -> Self {
let embedding_url =
std::env::var("EMBEDDING_URL").unwrap_or_else(|_| "http://localhost:8082".to_string());
let embedding_model =
std::env::var("EMBEDDING_MODEL").unwrap_or_else(|_| "bge-small-en-v1.5".to_string());
// Detect dimensions based on model name
let dimensions = Self::detect_dimensions(&embedding_model);
Self {
embedding_url,
embedding_model,
dimensions,
batch_size: 32,
timeout_seconds: 30,
}
}
/// Detect embedding dimensions based on model name
fn detect_dimensions(model: &str) -> usize {
if model.contains("small") || model.contains("MiniLM") {
384
} else if model.contains("base") || model.contains("mpnet") {
768
} else if model.contains("large") || model.contains("ada") {
1536
} else {
384 // Default
}
}
}
/// Request payload for embedding generation
#[derive(Debug, Serialize)]
struct EmbeddingRequest {
input: Vec<String>,
model: String,
}
/// Response from embedding service
#[derive(Debug, Deserialize)]
struct EmbeddingResponse {
data: Vec<EmbeddingData>,
model: String,
usage: Option<EmbeddingUsage>,
}
#[derive(Debug, Deserialize)]
struct EmbeddingData {
embedding: Vec<f32>,
index: usize,
}
#[derive(Debug, Deserialize)]
struct EmbeddingUsage {
prompt_tokens: usize,
total_tokens: usize,
}
/// Generated embedding with metadata
#[derive(Debug, Clone, Serialize, Deserialize)]
pub struct Embedding {
pub vector: Vec<f32>,
pub dimensions: usize,
pub model: String,
pub tokens_used: Option<usize>,
}
/// Knowledge base embedding generator
pub struct KbEmbeddingGenerator {
config: EmbeddingConfig,
client: Client,
semaphore: Arc<Semaphore>,
}
impl std::fmt::Debug for KbEmbeddingGenerator {
fn fmt(&self, f: &mut std::fmt::Formatter<'_>) -> std::fmt::Result {
f.debug_struct("KbEmbeddingGenerator")
.field("config", &self.config)
.field("client", &"Client")
.field("semaphore", &"Semaphore")
.finish()
}
}
impl KbEmbeddingGenerator {
pub fn new(config: EmbeddingConfig) -> Self {
let client = Client::builder()
.timeout(std::time::Duration::from_secs(config.timeout_seconds))
.build()
.expect("Failed to create HTTP client");
// Limit concurrent requests
let semaphore = Arc::new(Semaphore::new(4));
Self {
config,
client,
semaphore,
}
}
/// Generate embeddings for text chunks
pub async fn generate_embeddings(
&self,
chunks: &[TextChunk],
) -> Result<Vec<(TextChunk, Embedding)>> {
if chunks.is_empty() {
return Ok(Vec::new());
}
info!("Generating embeddings for {} chunks", chunks.len());
let mut results = Vec::new();
// Process in batches
for batch in chunks.chunks(self.config.batch_size) {
let batch_embeddings = self.generate_batch_embeddings(batch).await?;
// Pair chunks with their embeddings
for (chunk, embedding) in batch.iter().zip(batch_embeddings.iter()) {
results.push((chunk.clone(), embedding.clone()));
}
}
info!("Generated {} embeddings", results.len());
Ok(results)
}
/// Generate embeddings for a batch of chunks
async fn generate_batch_embeddings(&self, chunks: &[TextChunk]) -> Result<Vec<Embedding>> {
let _permit = self.semaphore.acquire().await?;
let texts: Vec<String> = chunks.iter().map(|c| c.content.clone()).collect();
debug!("Generating embeddings for batch of {} texts", texts.len());
// Try local embedding service first
match self.generate_local_embeddings(&texts).await {
Ok(embeddings) => Ok(embeddings),
Err(e) => {
warn!("Local embedding service failed: {}, trying OpenAI API", e);
self.generate_openai_embeddings(&texts).await
}
}
}
/// Generate embeddings using local service
async fn generate_local_embeddings(&self, texts: &[String]) -> Result<Vec<Embedding>> {
let request = EmbeddingRequest {
input: texts.to_vec(),
model: self.config.embedding_model.clone(),
};
let response = self
.client
.post(&format!("{}/embeddings", self.config.embedding_url))
.json(&request)
.send()
.await
.context("Failed to send request to embedding service")?;
if !response.status().is_success() {
let status = response.status();
let error_text = response.text().await.unwrap_or_default();
return Err(anyhow::anyhow!(
"Embedding service error {}: {}",
status,
error_text
));
}
let embedding_response: EmbeddingResponse = response
.json()
.await
.context("Failed to parse embedding response")?;
let mut embeddings = Vec::new();
for data in embedding_response.data {
embeddings.push(Embedding {
vector: data.embedding,
dimensions: self.config.dimensions,
model: embedding_response.model.clone(),
tokens_used: embedding_response.usage.as_ref().map(|u| u.total_tokens),
});
}
Ok(embeddings)
}
/// Generate embeddings using OpenAI API (fallback)
async fn generate_openai_embeddings(&self, texts: &[String]) -> Result<Vec<Embedding>> {
let api_key = std::env::var("OPENAI_API_KEY")
.context("OPENAI_API_KEY not set for fallback embedding generation")?;
let request = serde_json::json!({
"input": texts,
"model": "text-embedding-ada-002"
});
let response = self
.client
.post("https://api.openai.com/v1/embeddings")
.header("Authorization", format!("Bearer {}", api_key))
.json(&request)
.send()
.await
.context("Failed to send request to OpenAI")?;
if !response.status().is_success() {
let status = response.status();
let error_text = response.text().await.unwrap_or_default();
return Err(anyhow::anyhow!(
"OpenAI API error {}: {}",
status,
error_text
));
}
let response_json: serde_json::Value = response
.json()
.await
.context("Failed to parse OpenAI response")?;
let mut embeddings = Vec::new();
if let Some(data) = response_json["data"].as_array() {
for item in data {
if let Some(embedding) = item["embedding"].as_array() {
let vector: Vec<f32> = embedding
.iter()
.filter_map(|v| v.as_f64().map(|f| f as f32))
.collect();
embeddings.push(Embedding {
vector,
dimensions: 1536, // OpenAI ada-002 dimensions
model: "text-embedding-ada-002".to_string(),
tokens_used: response_json["usage"]["total_tokens"]
.as_u64()
.map(|t| t as usize),
});
}
}
}
Ok(embeddings)
}
/// Generate embedding for a single text
pub async fn generate_single_embedding(&self, text: &str) -> Result<Embedding> {
let embeddings = self
.generate_batch_embeddings(&[TextChunk {
content: text.to_string(),
metadata: super::document_processor::ChunkMetadata {
document_path: "query".to_string(),
document_title: None,
chunk_index: 0,
total_chunks: 1,
start_char: 0,
end_char: text.len(),
page_number: None,
},
}])
.await?;
embeddings
.into_iter()
.next()
.ok_or_else(|| anyhow::anyhow!("No embedding generated"))
}
}
/// Generic embedding generator for other uses (email, etc.)
pub struct EmbeddingGenerator {
kb_generator: KbEmbeddingGenerator,
}
impl std::fmt::Debug for EmbeddingGenerator {
fn fmt(&self, f: &mut std::fmt::Formatter<'_>) -> std::fmt::Result {
f.debug_struct("EmbeddingGenerator")
.field("kb_generator", &self.kb_generator)
.finish()
}
}
impl EmbeddingGenerator {
pub fn new(llm_endpoint: String) -> Self {
let config = EmbeddingConfig {
embedding_url: llm_endpoint,
..Default::default()
};
Self {
kb_generator: KbEmbeddingGenerator::new(config),
}
}
/// Generate embedding for arbitrary text
pub async fn generate_text_embedding(&self, text: &str) -> Result<Vec<f32>> {
let embedding = self.kb_generator.generate_single_embedding(text).await?;
Ok(embedding.vector)
}
}
/// Email-specific embedding generator (for compatibility)
pub struct EmailEmbeddingGenerator {
generator: EmbeddingGenerator,
}
impl std::fmt::Debug for EmailEmbeddingGenerator {
fn fmt(&self, f: &mut std::fmt::Formatter<'_>) -> std::fmt::Result {
f.debug_struct("EmailEmbeddingGenerator")
.field("generator", &self.generator)
.finish()
}
}
impl EmailEmbeddingGenerator {
pub fn new(llm_endpoint: String) -> Self {
Self {
generator: EmbeddingGenerator::new(llm_endpoint),
}
}
/// Generate embedding for email content
pub async fn generate_embedding(&self, email: &impl EmailLike) -> Result<Vec<f32>> {
let text = format!(
"Subject: {}\nFrom: {}\nTo: {}\n\n{}",
email.subject(),
email.from(),
email.to(),
email.body()
);
self.generator.generate_text_embedding(&text).await
}
/// Generate embedding for text
pub async fn generate_text_embedding(&self, text: &str) -> Result<Vec<f32>> {
self.generator.generate_text_embedding(text).await
}
}
/// Trait for email-like objects
pub trait EmailLike {
fn subject(&self) -> &str;
fn from(&self) -> &str;
fn to(&self) -> &str;
fn body(&self) -> &str;
}
/// Simple email struct for testing
#[derive(Debug)]
pub struct SimpleEmail {
pub id: String,
pub subject: String,
pub from: String,
pub to: String,
pub body: String,
}
impl EmailLike for SimpleEmail {
fn subject(&self) -> &str {
&self.subject
}
fn from(&self) -> &str {
&self.from
}
fn to(&self) -> &str {
&self.to
}
fn body(&self) -> &str {
&self.body
}
}
#[cfg(test)]
mod tests {
use super::*;
#[test]
fn test_dimension_detection() {
assert_eq!(EmbeddingConfig::detect_dimensions("bge-small-en"), 384);
assert_eq!(EmbeddingConfig::detect_dimensions("all-mpnet-base-v2"), 768);
assert_eq!(
EmbeddingConfig::detect_dimensions("text-embedding-ada-002"),
1536
);
assert_eq!(EmbeddingConfig::detect_dimensions("unknown-model"), 384);
}
#[tokio::test]
async fn test_text_cleaning_for_embedding() {
let text = "This is a test\n\nWith multiple lines";
let _generator = EmbeddingGenerator::new("http://localhost:8082".to_string());
// This would exercise actual embedding generation if the service were
// available; for unit tests we only verify construction and input shape.
assert!(!text.is_empty());
}
}
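A standalone sketch of the model-name → vector-size heuristic the tests above exercise; `detect_dims` is a hypothetical stand-in for `EmbeddingConfig::detect_dimensions`, reproducing only the cases shown in the tests:

```rust
// Hypothetical stand-in for EmbeddingConfig::detect_dimensions,
// covering the model families asserted in the tests above.
fn detect_dims(model: &str) -> usize {
    if model.contains("bge-small") {
        384
    } else if model.contains("mpnet-base") {
        768
    } else if model.contains("ada-002") {
        1536
    } else {
        384 // conservative fallback for unknown models
    }
}

fn main() {
    assert_eq!(detect_dims("bge-small-en"), 384);
    assert_eq!(detect_dims("all-mpnet-base-v2"), 768);
    assert_eq!(detect_dims("text-embedding-ada-002"), 1536);
    assert_eq!(detect_dims("unknown-model"), 384);
}
```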

src/core/kb/kb_indexer.rs Normal file

@ -0,0 +1,546 @@
use anyhow::Result;
use log::{debug, info, warn};
use serde::{Deserialize, Serialize};
use std::collections::HashMap;
use std::path::{Path, PathBuf};
use uuid::Uuid;
use super::document_processor::{DocumentProcessor, TextChunk};
use super::embedding_generator::{Embedding, EmbeddingConfig, KbEmbeddingGenerator};
/// Qdrant client configuration
#[derive(Debug, Clone)]
pub struct QdrantConfig {
pub url: String,
pub api_key: Option<String>,
pub timeout_secs: u64,
}
impl Default for QdrantConfig {
fn default() -> Self {
Self {
url: std::env::var("QDRANT_URL")
.unwrap_or_else(|_| "http://localhost:6333".to_string()),
api_key: std::env::var("QDRANT_API_KEY").ok(),
timeout_secs: 30,
}
}
}
/// Point structure for Qdrant
#[derive(Debug, Serialize, Deserialize)]
pub struct QdrantPoint {
pub id: String,
pub vector: Vec<f32>,
pub payload: HashMap<String, serde_json::Value>,
}
/// Collection configuration for Qdrant
#[derive(Debug, Serialize)]
pub struct CollectionConfig {
pub vectors: VectorConfig,
pub replication_factor: u32,
pub shard_number: u32,
}
#[derive(Debug, Serialize)]
pub struct VectorConfig {
pub size: usize,
pub distance: String,
}
/// Search request structure
#[derive(Debug, Serialize)]
pub struct SearchRequest {
pub vector: Vec<f32>,
pub limit: usize,
pub with_payload: bool,
pub score_threshold: Option<f32>,
pub filter: Option<serde_json::Value>,
}
/// Knowledge Base Indexer for Qdrant
pub struct KbIndexer {
document_processor: DocumentProcessor,
embedding_generator: KbEmbeddingGenerator,
qdrant_config: QdrantConfig,
http_client: reqwest::Client,
}
impl std::fmt::Debug for KbIndexer {
fn fmt(&self, f: &mut std::fmt::Formatter<'_>) -> std::fmt::Result {
f.debug_struct("KbIndexer")
.field("document_processor", &self.document_processor)
.field("embedding_generator", &self.embedding_generator)
.field("qdrant_config", &self.qdrant_config)
.field("http_client", &"reqwest::Client")
.finish()
}
}
impl KbIndexer {
pub fn new(embedding_config: EmbeddingConfig, qdrant_config: QdrantConfig) -> Self {
let document_processor = DocumentProcessor::default();
let embedding_generator = KbEmbeddingGenerator::new(embedding_config);
let http_client = reqwest::Client::builder()
.timeout(std::time::Duration::from_secs(qdrant_config.timeout_secs))
.build()
.expect("Failed to create HTTP client");
Self {
document_processor,
embedding_generator,
qdrant_config,
http_client,
}
}
/// Index a knowledge base folder
pub async fn index_kb_folder(
&self,
bot_name: &str,
kb_name: &str,
kb_path: &Path,
) -> Result<IndexingResult> {
info!("Indexing KB folder: {} for bot {}", kb_name, bot_name);
// Create collection name
let collection_name = format!("{}_{}", bot_name, kb_name);
// Ensure collection exists
self.ensure_collection_exists(&collection_name).await?;
// Process all documents in the folder
let documents = self.document_processor.process_kb_folder(kb_path).await?;
let mut total_chunks = 0;
let mut indexed_documents = 0;
for (doc_path, chunks) in documents {
if chunks.is_empty() {
continue;
}
info!(
"Processing document: {} ({} chunks)",
doc_path,
chunks.len()
);
// Generate embeddings for chunks
let embeddings = self
.embedding_generator
.generate_embeddings(&chunks)
.await?;
// Create points for Qdrant
let points = self.create_qdrant_points(&doc_path, embeddings)?;
// Upsert points to collection
self.upsert_points(&collection_name, points).await?;
total_chunks += chunks.len();
indexed_documents += 1;
}
// Update collection info in database
self.update_collection_metadata(&collection_name, bot_name, kb_name, total_chunks)
.await?;
Ok(IndexingResult {
collection_name,
documents_processed: indexed_documents,
chunks_indexed: total_chunks,
})
}
/// Ensure Qdrant collection exists
async fn ensure_collection_exists(&self, collection_name: &str) -> Result<()> {
// Check if collection exists
let check_url = format!("{}/collections/{}", self.qdrant_config.url, collection_name);
let response = self.http_client.get(&check_url).send().await?;
if response.status().is_success() {
info!("Collection {} already exists", collection_name);
return Ok(());
}
// Create collection
info!("Creating collection: {}", collection_name);
let config = CollectionConfig {
vectors: VectorConfig {
size: 384, // Default for bge-small, should be configurable
distance: "Cosine".to_string(),
},
replication_factor: 1,
shard_number: 1,
};
let create_url = format!("{}/collections/{}", self.qdrant_config.url, collection_name);
let response = self
.http_client
.put(&create_url)
.json(&config)
.send()
.await?;
if !response.status().is_success() {
let error_text = response.text().await.unwrap_or_default();
return Err(anyhow::anyhow!(
"Failed to create collection: {}",
error_text
));
}
// Create indexes for better performance
self.create_collection_indexes(collection_name).await?;
Ok(())
}
/// Create indexes for collection
async fn create_collection_indexes(&self, collection_name: &str) -> Result<()> {
// Create HNSW index for vector search
let index_config = serde_json::json!({
"hnsw_config": {
"m": 16,
"ef_construct": 200,
"full_scan_threshold": 10000
}
});
let index_url = format!(
"{}/collections/{}/index",
self.qdrant_config.url, collection_name
);
let response = self
.http_client
.put(&index_url)
.json(&index_config)
.send()
.await?;
if !response.status().is_success() {
warn!("Failed to create index, using defaults");
}
Ok(())
}
/// Create Qdrant points from chunks and embeddings
fn create_qdrant_points(
&self,
doc_path: &str,
embeddings: Vec<(TextChunk, Embedding)>,
) -> Result<Vec<QdrantPoint>> {
let mut points = Vec::new();
for (chunk, embedding) in embeddings {
let point_id = Uuid::new_v4().to_string();
let mut payload = HashMap::new();
payload.insert(
"content".to_string(),
serde_json::Value::String(chunk.content),
);
payload.insert(
"document_path".to_string(),
serde_json::Value::String(doc_path.to_string()),
);
payload.insert(
"chunk_index".to_string(),
serde_json::Value::Number(chunk.metadata.chunk_index.into()),
);
payload.insert(
"total_chunks".to_string(),
serde_json::Value::Number(chunk.metadata.total_chunks.into()),
);
payload.insert(
"start_char".to_string(),
serde_json::Value::Number(chunk.metadata.start_char.into()),
);
payload.insert(
"end_char".to_string(),
serde_json::Value::Number(chunk.metadata.end_char.into()),
);
if let Some(title) = chunk.metadata.document_title {
payload.insert(
"document_title".to_string(),
serde_json::Value::String(title),
);
}
points.push(QdrantPoint {
id: point_id,
vector: embedding.vector,
payload,
});
}
Ok(points)
}
/// Upsert points to Qdrant collection
async fn upsert_points(&self, collection_name: &str, points: Vec<QdrantPoint>) -> Result<()> {
if points.is_empty() {
return Ok(());
}
let batch_size = 100; // Qdrant recommended batch size
for batch in points.chunks(batch_size) {
let upsert_request = serde_json::json!({
"points": batch
});
let upsert_url = format!(
"{}/collections/{}/points?wait=true",
self.qdrant_config.url, collection_name
);
let response = self
.http_client
.put(&upsert_url)
.json(&upsert_request)
.send()
.await?;
if !response.status().is_success() {
let error_text = response.text().await.unwrap_or_default();
return Err(anyhow::anyhow!("Failed to upsert points: {}", error_text));
}
}
debug!(
"Upserted {} points to collection {}",
points.len(),
collection_name
);
Ok(())
}
/// Update collection metadata in database
async fn update_collection_metadata(
&self,
collection_name: &str,
bot_name: &str,
kb_name: &str,
document_count: usize,
) -> Result<()> {
// This would update the kb_collections table
// For now, just log the information
info!(
"Updated collection {} metadata: bot={}, kb={}, docs={}",
collection_name, bot_name, kb_name, document_count
);
Ok(())
}
/// Search for similar chunks in a collection
pub async fn search(
&self,
collection_name: &str,
query: &str,
limit: usize,
) -> Result<Vec<SearchResult>> {
// Generate embedding for query
let embedding = self
.embedding_generator
.generate_single_embedding(query)
.await?;
// Create search request
let search_request = SearchRequest {
vector: embedding.vector,
limit,
with_payload: true,
score_threshold: Some(0.5), // Minimum similarity threshold
filter: None,
};
let search_url = format!(
"{}/collections/{}/points/search",
self.qdrant_config.url, collection_name
);
let response = self
.http_client
.post(&search_url)
.json(&search_request)
.send()
.await?;
if !response.status().is_success() {
let error_text = response.text().await.unwrap_or_default();
return Err(anyhow::anyhow!("Search failed: {}", error_text));
}
let response_json: serde_json::Value = response.json().await?;
let mut results = Vec::new();
if let Some(result_array) = response_json["result"].as_array() {
for item in result_array {
if let (Some(score), Some(payload)) =
(item["score"].as_f64(), item["payload"].as_object())
{
let content = payload
.get("content")
.and_then(|v| v.as_str())
.unwrap_or("")
.to_string();
let document_path = payload
.get("document_path")
.and_then(|v| v.as_str())
.unwrap_or("")
.to_string();
results.push(SearchResult {
content,
document_path,
score: score as f32,
metadata: payload.clone(),
});
}
}
}
Ok(results)
}
/// Delete a collection
pub async fn delete_collection(&self, collection_name: &str) -> Result<()> {
let delete_url = format!("{}/collections/{}", self.qdrant_config.url, collection_name);
let response = self.http_client.delete(&delete_url).send().await?;
if !response.status().is_success() {
let error_text = response.text().await.unwrap_or_default();
warn!(
"Failed to delete collection {}: {}",
collection_name, error_text
);
}
Ok(())
}
}
/// Result of indexing operation
#[derive(Debug)]
pub struct IndexingResult {
pub collection_name: String,
pub documents_processed: usize,
pub chunks_indexed: usize,
}
/// Search result from vector database
#[derive(Debug, Clone)]
pub struct SearchResult {
pub content: String,
pub document_path: String,
pub score: f32,
pub metadata: serde_json::Map<String, serde_json::Value>,
}
/// Monitor for .gbkb folder changes and trigger indexing
#[derive(Debug)]
pub struct KbFolderMonitor {
indexer: KbIndexer,
work_root: PathBuf,
}
impl KbFolderMonitor {
pub fn new(work_root: PathBuf, embedding_config: EmbeddingConfig) -> Self {
let qdrant_config = QdrantConfig::default();
let indexer = KbIndexer::new(embedding_config, qdrant_config);
Self { indexer, work_root }
}
/// Process a .gbkb folder that was detected by drive monitor
pub async fn process_gbkb_folder(&self, bot_name: &str, kb_folder: &Path) -> Result<()> {
// Extract KB name from folder path
let kb_name = kb_folder
.file_name()
.and_then(|n| n.to_str())
.ok_or_else(|| anyhow::anyhow!("Invalid KB folder name"))?;
info!("Processing .gbkb folder: {} for bot {}", kb_name, bot_name);
// Build local work path
let local_path = self
.work_root
.join(bot_name)
.join(format!("{}.gbkb", bot_name))
.join(kb_name);
// Index the folder
let result = self
.indexer
.index_kb_folder(bot_name, kb_name, &local_path)
.await?;
info!(
"Indexed {} documents ({} chunks) into collection {}",
result.documents_processed, result.chunks_indexed, result.collection_name
);
Ok(())
}
}
#[cfg(test)]
mod tests {
use super::*;
#[test]
fn test_collection_name_generation() {
let bot_name = "mybot";
let kb_name = "docs";
let collection_name = format!("{}_{}", bot_name, kb_name);
assert_eq!(collection_name, "mybot_docs");
}
#[test]
fn test_qdrant_point_creation() {
let chunk = TextChunk {
content: "Test content".to_string(),
metadata: super::super::document_processor::ChunkMetadata {
document_path: "test.txt".to_string(),
document_title: Some("Test".to_string()),
chunk_index: 0,
total_chunks: 1,
start_char: 0,
end_char: 12,
page_number: None,
},
};
let embedding = Embedding {
vector: vec![0.1, 0.2, 0.3],
dimensions: 3,
model: "test".to_string(),
tokens_used: None,
};
let indexer = KbIndexer::new(EmbeddingConfig::default(), QdrantConfig::default());
let points = indexer
.create_qdrant_points("test.txt", vec![(chunk, embedding)])
.unwrap();
assert_eq!(points.len(), 1);
assert_eq!(points[0].vector.len(), 3);
assert!(points[0].payload.contains_key("content"));
}
}
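The batching behaviour in `KbIndexer::upsert_points` (slices of at most 100 points per Qdrant request) can be sketched standalone with `slice::chunks`:

```rust
// Sketch of the 100-point batching in upsert_points: a final short
// batch carries the remainder.
fn main() {
    let points: Vec<u32> = (0..250).collect();
    let batch_size = 100;
    let batches: Vec<&[u32]> = points.chunks(batch_size).collect();
    assert_eq!(batches.len(), 3);
    assert_eq!(batches[0].len(), 100);
    assert_eq!(batches[2].len(), 50);
}
```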

src/core/kb/mod.rs Normal file

@ -0,0 +1,215 @@
pub mod document_processor;
pub mod embedding_generator;
pub mod kb_indexer;
pub mod web_crawler;
pub mod website_crawler_service;
pub use document_processor::{DocumentFormat, DocumentProcessor, TextChunk};
pub use embedding_generator::{
EmailEmbeddingGenerator, EmbeddingConfig, EmbeddingGenerator, KbEmbeddingGenerator,
};
pub use kb_indexer::{KbFolderMonitor, KbIndexer, QdrantConfig, SearchResult};
pub use web_crawler::{WebCrawler, WebPage, WebsiteCrawlConfig};
pub use website_crawler_service::{ensure_crawler_service_running, WebsiteCrawlerService};
use anyhow::Result;
use log::{error, info, warn};
use std::path::Path;
use std::sync::Arc;
use tokio::sync::RwLock;
/// Main Knowledge Base manager
#[derive(Debug)]
pub struct KnowledgeBaseManager {
indexer: Arc<KbIndexer>,
processor: Arc<DocumentProcessor>,
monitor: Arc<RwLock<KbFolderMonitor>>,
}
impl KnowledgeBaseManager {
/// Create new KB manager with default configuration
pub fn new(work_root: impl Into<std::path::PathBuf>) -> Self {
let work_root = work_root.into();
let embedding_config = EmbeddingConfig::from_env();
let qdrant_config = QdrantConfig::default();
let indexer = Arc::new(KbIndexer::new(embedding_config.clone(), qdrant_config));
let processor = Arc::new(DocumentProcessor::default());
let monitor = Arc::new(RwLock::new(KbFolderMonitor::new(
work_root,
embedding_config,
)));
Self {
indexer,
processor,
monitor,
}
}
/// Process and index a knowledge base folder
pub async fn index_kb_folder(
&self,
bot_name: &str,
kb_name: &str,
kb_path: &Path,
) -> Result<()> {
info!(
"Indexing knowledge base: {} for bot {} from path: {:?}",
kb_name, bot_name, kb_path
);
// Index the folder using the indexer
let result = self
.indexer
.index_kb_folder(bot_name, kb_name, kb_path)
.await?;
info!(
"Successfully indexed {} documents with {} chunks into collection {}",
result.documents_processed, result.chunks_indexed, result.collection_name
);
Ok(())
}
/// Search in a knowledge base
pub async fn search(
&self,
bot_name: &str,
kb_name: &str,
query: &str,
limit: usize,
) -> Result<Vec<SearchResult>> {
let collection_name = format!("{}_{}", bot_name, kb_name);
self.indexer.search(&collection_name, query, limit).await
}
/// Process a single document
pub async fn process_document(&self, file_path: &Path) -> Result<Vec<TextChunk>> {
self.processor.process_document(file_path).await
}
/// Handle .gbkb folder change notification from drive monitor
pub async fn handle_gbkb_change(&self, bot_name: &str, kb_folder: &Path) -> Result<()> {
info!(
"Handling .gbkb folder change for bot {} at {:?}",
bot_name, kb_folder
);
let monitor = self.monitor.read().await;
monitor.process_gbkb_folder(bot_name, kb_folder).await
}
/// Clear a knowledge base collection
pub async fn clear_kb(&self, bot_name: &str, kb_name: &str) -> Result<()> {
let collection_name = format!("{}_{}", bot_name, kb_name);
warn!("Clearing knowledge base collection: {}", collection_name);
match self.indexer.delete_collection(&collection_name).await {
Ok(_) => {
info!("Successfully cleared collection: {}", collection_name);
Ok(())
}
Err(e) => {
error!("Failed to clear collection {}: {}", collection_name, e);
Err(e)
}
}
}
/// Get collection statistics
pub async fn get_kb_stats(&self, bot_name: &str, kb_name: &str) -> Result<KbStatistics> {
let collection_name = format!("{}_{}", bot_name, kb_name);
// This would query Qdrant for collection statistics
// For now, return placeholder stats
Ok(KbStatistics {
collection_name,
document_count: 0,
chunk_count: 0,
total_size_bytes: 0,
})
}
}
/// Statistics for a knowledge base
#[derive(Debug, Clone)]
pub struct KbStatistics {
pub collection_name: String,
pub document_count: usize,
pub chunk_count: usize,
pub total_size_bytes: usize,
}
/// Integration with drive monitor
pub struct DriveMonitorIntegration {
kb_manager: Arc<KnowledgeBaseManager>,
}
impl DriveMonitorIntegration {
pub fn new(kb_manager: Arc<KnowledgeBaseManager>) -> Self {
Self { kb_manager }
}
/// Called when drive monitor detects changes in .gbkb folder
pub async fn on_gbkb_folder_changed(
&self,
bot_name: &str,
folder_path: &Path,
change_type: ChangeType,
) -> Result<()> {
match change_type {
ChangeType::Created | ChangeType::Modified => {
info!(
"Drive monitor detected {:?} in .gbkb folder: {:?}",
change_type, folder_path
);
self.kb_manager
.handle_gbkb_change(bot_name, folder_path)
.await
}
ChangeType::Deleted => {
// Extract KB name from path
if let Some(kb_name) = folder_path.file_name().and_then(|n| n.to_str()) {
self.kb_manager.clear_kb(bot_name, kb_name).await
} else {
Err(anyhow::anyhow!("Invalid KB folder path"))
}
}
}
}
}
/// Types of changes detected by drive monitor
#[derive(Debug, Clone, Copy)]
pub enum ChangeType {
Created,
Modified,
Deleted,
}
#[cfg(test)]
mod tests {
use super::*;
use tempfile::TempDir;
#[tokio::test]
async fn test_kb_manager_creation() {
let temp_dir = TempDir::new().unwrap();
let manager = KnowledgeBaseManager::new(temp_dir.path());
// Test that manager is created successfully
assert_eq!(manager.processor.chunk_size(), 1000);
assert_eq!(manager.processor.chunk_overlap(), 200);
}
#[test]
fn test_collection_naming() {
let bot_name = "testbot";
let kb_name = "docs";
let collection_name = format!("{}_{}", bot_name, kb_name);
assert_eq!(collection_name, "testbot_docs");
}
}
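The dispatch in `DriveMonitorIntegration::on_gbkb_folder_changed` reduces to a small mapping (create/modify → reindex, delete → clear); a minimal sketch with hypothetical `Action` names:

```rust
// Sketch of the change-type dispatch: Created/Modified reindex the KB,
// Deleted clears the collection. `Action` is a hypothetical name.
#[derive(Debug, PartialEq)]
enum Action {
    Reindex,
    Clear,
}

#[derive(Clone, Copy)]
enum ChangeType {
    Created,
    Modified,
    Deleted,
}

fn action_for(change: ChangeType) -> Action {
    match change {
        ChangeType::Created | ChangeType::Modified => Action::Reindex,
        ChangeType::Deleted => Action::Clear,
    }
}

fn main() {
    assert_eq!(action_for(ChangeType::Created), Action::Reindex);
    assert_eq!(action_for(ChangeType::Modified), Action::Reindex);
    assert_eq!(action_for(ChangeType::Deleted), Action::Clear);
}
```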

src/core/kb/web_crawler.rs Normal file

@ -0,0 +1,346 @@
use anyhow::Result;
use log::{info, trace, warn};
use serde::{Deserialize, Serialize};
use std::collections::HashSet;
use std::time::Duration;
use tokio::time::sleep;
/// Website crawl configuration
#[derive(Debug, Clone, Serialize, Deserialize)]
pub struct WebsiteCrawlConfig {
pub url: String,
pub max_depth: usize,
pub max_pages: usize,
pub crawl_delay_ms: u64,
pub expires_policy: String,
pub last_crawled: Option<chrono::DateTime<chrono::Utc>>,
pub next_crawl: Option<chrono::DateTime<chrono::Utc>>,
}
impl WebsiteCrawlConfig {
/// Parse expiration policy and calculate next crawl time
pub fn calculate_next_crawl(&mut self) {
let now = chrono::Utc::now();
self.last_crawled = Some(now);
let duration = match self.expires_policy.as_str() {
"1h" => chrono::Duration::hours(1),
"6h" => chrono::Duration::hours(6),
"12h" => chrono::Duration::hours(12),
"1d" | "24h" => chrono::Duration::days(1),
"3d" => chrono::Duration::days(3),
"1w" | "7d" => chrono::Duration::weeks(1),
"2w" => chrono::Duration::weeks(2),
"1m" | "30d" => chrono::Duration::days(30),
"3m" => chrono::Duration::days(90),
"6m" => chrono::Duration::days(180),
"1y" | "365d" => chrono::Duration::days(365),
custom => {
// Simple parsing for custom format like "2h", "5d", etc.
if custom.ends_with('h') {
if let Ok(hours) = custom[..custom.len() - 1].parse::<i64>() {
chrono::Duration::hours(hours)
} else {
chrono::Duration::days(1)
}
} else if custom.ends_with('d') {
if let Ok(days) = custom[..custom.len() - 1].parse::<i64>() {
chrono::Duration::days(days)
} else {
chrono::Duration::days(1)
}
} else if custom.ends_with('w') {
if let Ok(weeks) = custom[..custom.len() - 1].parse::<i64>() {
chrono::Duration::weeks(weeks)
} else {
chrono::Duration::days(1)
}
} else if custom.ends_with('m') {
if let Ok(months) = custom[..custom.len() - 1].parse::<i64>() {
chrono::Duration::days(months * 30)
} else {
chrono::Duration::days(1)
}
} else if custom.ends_with('y') {
if let Ok(years) = custom[..custom.len() - 1].parse::<i64>() {
chrono::Duration::days(years * 365)
} else {
chrono::Duration::days(1)
}
} else {
chrono::Duration::days(1) // Default to daily if unparseable
}
}
};
self.next_crawl = Some(now + duration);
}
/// Check if website needs recrawling
pub fn needs_crawl(&self) -> bool {
match self.next_crawl {
Some(next) => chrono::Utc::now() >= next,
None => true, // Never crawled
}
}
}
/// Website content for indexing
#[derive(Debug, Clone)]
pub struct WebPage {
pub url: String,
pub title: Option<String>,
pub content: String,
pub meta_description: Option<String>,
pub crawled_at: chrono::DateTime<chrono::Utc>,
}
/// Web crawler for website content
pub struct WebCrawler {
client: reqwest::Client,
config: WebsiteCrawlConfig,
visited_urls: HashSet<String>,
pages: Vec<WebPage>,
}
impl WebCrawler {
pub fn new(config: WebsiteCrawlConfig) -> Self {
let client = reqwest::Client::builder()
.timeout(Duration::from_secs(30))
.user_agent("GeneralBots/1.0 (Knowledge Base Crawler)")
.build()
.unwrap_or_default();
Self {
client,
config,
visited_urls: HashSet::new(),
pages: Vec::new(),
}
}
/// Crawl website starting from configured URL
pub async fn crawl(&mut self) -> Result<Vec<WebPage>> {
info!("Starting crawl of website: {}", self.config.url);
// Start crawling from root URL
self.crawl_recursive(&self.config.url.clone(), 0).await?;
info!(
"Crawled {} pages from {}",
self.pages.len(),
self.config.url
);
Ok(self.pages.clone())
}
/// Recursive crawling with depth control
async fn crawl_recursive(&mut self, url: &str, depth: usize) -> Result<()> {
// Check depth limit
if depth > self.config.max_depth {
trace!(
"Reached max depth {} for URL: {}",
self.config.max_depth,
url
);
return Ok(());
}
// Check page limit
if self.pages.len() >= self.config.max_pages {
trace!("Reached max pages limit: {}", self.config.max_pages);
return Ok(());
}
// Check if already visited
if self.visited_urls.contains(url) {
return Ok(());
}
// Mark as visited
self.visited_urls.insert(url.to_string());
// Add a polite crawl delay (skip it for the very first fetch; the
// current URL was just inserted, so the set is never empty here)
if self.visited_urls.len() > 1 {
sleep(Duration::from_millis(self.config.crawl_delay_ms)).await;
}
// Fetch page
let response = match self.client.get(url).send().await {
Ok(resp) => resp,
Err(e) => {
warn!("Failed to fetch {}: {}", url, e);
return Ok(()); // Continue crawling other pages
}
};
// Check if HTML
let content_type = response
.headers()
.get("content-type")
.and_then(|v| v.to_str().ok())
.unwrap_or("");
if !content_type.contains("text/html") {
trace!("Skipping non-HTML content: {}", url);
return Ok(());
}
// Get page content
let html_text = match response.text().await {
Ok(text) => text,
Err(e) => {
warn!("Failed to read response from {}: {}", url, e);
return Ok(());
}
};
// Extract page content
let page = self.extract_page_content(&html_text, url);
self.pages.push(page);
// Extract and crawl links if not at max depth
if depth < self.config.max_depth {
let links = self.extract_links(&html_text, url);
for link in links {
// Only crawl same domain
if self.is_same_domain(url, &link) {
Box::pin(self.crawl_recursive(&link, depth + 1)).await?;
}
}
}
Ok(())
}
/// Extract text content from HTML
fn extract_page_content(&self, html: &str, url: &str) -> WebPage {
// Simple HTML tag removal
let mut text = html.to_string();
// Remove script and style tags with their content; searching for the
// closing tag from the opening tag onward avoids a reversed range on
// malformed HTML
for (open, close) in [("<script", "</script>"), ("<style", "</style>")] {
while let Some(start) = text.find(open) {
if let Some(rel) = text[start..].find(close) {
text.replace_range(start..start + rel + close.len(), " ");
} else {
break;
}
}
}
// Extract title if present
let title = if let Some(title_start) = text.find("<title>") {
if let Some(title_end) = text.find("</title>") {
Some(text[title_start + 7..title_end].to_string())
} else {
None
}
} else {
None
};
// Remove all remaining HTML tags, always searching for '>' after the
// matching '<' so stray '>' characters cannot stall the loop
while let Some(start) = text.find('<') {
if let Some(rel) = text[start..].find('>') {
text.replace_range(start..=start + rel, " ");
} else {
break;
}
}
// Clean up whitespace
let content = text.split_whitespace().collect::<Vec<_>>().join(" ");
WebPage {
url: url.to_string(),
title,
content,
meta_description: None,
crawled_at: chrono::Utc::now(),
}
}
/// Extract links from HTML
fn extract_links(&self, html: &str, base_url: &str) -> Vec<String> {
let mut links = Vec::new();
let mut search_pos = 0;
// Simple href extraction
while let Some(href_pos) = html[search_pos..].find("href=\"") {
let href_start = search_pos + href_pos + 6;
if let Some(href_end) = html[href_start..].find('"') {
let href = &html[href_start..href_start + href_end];
// Skip anchors, javascript, mailto, etc.
if !href.starts_with('#')
&& !href.starts_with("javascript:")
&& !href.starts_with("mailto:")
&& !href.starts_with("tel:")
{
// Convert relative URLs to absolute
let absolute_url =
if href.starts_with("http://") || href.starts_with("https://") {
href.to_string()
} else if href.starts_with('/') {
// Get base domain from base_url
if let Some(domain_end) = base_url[8..].find('/') {
format!("{}{}", &base_url[..8 + domain_end], href)
} else {
format!("{}{}", base_url, href)
}
} else {
// Relative to current page
if let Some(last_slash) = base_url.rfind('/') {
format!("{}/{}", &base_url[..last_slash], href)
} else {
format!("{}/{}", base_url, href)
}
};
links.push(absolute_url);
}
search_pos = href_start + href_end;
} else {
break;
}
}
links
}
/// Check if two URLs are from the same domain
fn is_same_domain(&self, url1: &str, url2: &str) -> bool {
let domain1 = self.extract_domain(url1);
let domain2 = self.extract_domain(url2);
domain1 == domain2
}
/// Extract domain from URL
fn extract_domain(&self, url: &str) -> String {
let without_protocol = if url.starts_with("https://") {
&url[8..]
} else if url.starts_with("http://") {
&url[7..]
} else {
url
};
if let Some(slash_pos) = without_protocol.find('/') {
without_protocol[..slash_pos].to_string()
} else {
without_protocol.to_string()
}
}
}
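The suffix-based custom policies accepted by `WebsiteCrawlConfig::calculate_next_crawl` ("2h", "5d", "3w", …) can be sketched standalone with `std::time::Duration` standing in for chrono; as in the original, anything unparseable falls back to one day:

```rust
use std::time::Duration;

// Sketch of the custom expires-policy suffix parsing; months and years
// use the same 30/365-day approximations as calculate_next_crawl.
fn parse_expires(policy: &str) -> Duration {
    const DAY: u64 = 24 * 3600;
    let (num, unit) = policy.split_at(policy.len().saturating_sub(1));
    match (num.parse::<u64>(), unit) {
        (Ok(n), "h") => Duration::from_secs(n * 3600),
        (Ok(n), "d") => Duration::from_secs(n * DAY),
        (Ok(n), "w") => Duration::from_secs(n * 7 * DAY),
        (Ok(n), "m") => Duration::from_secs(n * 30 * DAY),
        (Ok(n), "y") => Duration::from_secs(n * 365 * DAY),
        _ => Duration::from_secs(DAY), // default to daily if unparseable
    }
}

fn main() {
    assert_eq!(parse_expires("2h"), Duration::from_secs(2 * 3600));
    assert_eq!(parse_expires("24h"), parse_expires("1d"));
    assert_eq!(parse_expires("garbage"), Duration::from_secs(24 * 3600));
}
```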

@ -0,0 +1,287 @@
use crate::config::ConfigManager;
use crate::core::kb::web_crawler::{WebCrawler, WebsiteCrawlConfig};
use crate::core::kb::KnowledgeBaseManager;
use crate::shared::state::AppState;
use crate::shared::utils::DbPool;
use diesel::prelude::*;
use log::{error, info, warn};
use std::sync::Arc;
use tokio::time::{interval, Duration};
use uuid::Uuid;
/// Service for periodically checking and recrawling websites
pub struct WebsiteCrawlerService {
db_pool: DbPool,
kb_manager: Arc<KnowledgeBaseManager>,
check_interval: Duration,
running: Arc<tokio::sync::RwLock<bool>>,
}
impl WebsiteCrawlerService {
pub fn new(db_pool: DbPool, kb_manager: Arc<KnowledgeBaseManager>) -> Self {
Self {
db_pool,
kb_manager,
check_interval: Duration::from_secs(3600), // Check every hour
running: Arc::new(tokio::sync::RwLock::new(false)),
}
}
/// Start the website crawler service
pub async fn start(self: Arc<Self>) -> tokio::task::JoinHandle<()> {
let service = Arc::clone(&self);
tokio::spawn(async move {
info!("Website crawler service started");
let mut ticker = interval(service.check_interval);
loop {
ticker.tick().await;
// Check if already running
if *service.running.read().await {
warn!("Website crawler is already running, skipping this cycle");
continue;
}
// Set running flag
*service.running.write().await = true;
// Check and crawl websites
if let Err(e) = service.check_and_crawl_websites().await {
error!("Error in website crawler service: {}", e);
}
// Clear running flag
*service.running.write().await = false;
}
})
}
/// Check for websites that need recrawling
async fn check_and_crawl_websites(&self) -> Result<(), Box<dyn std::error::Error + Send + Sync>> {
info!("Checking for websites that need recrawling");
let mut conn = self.db_pool.get()?;
// Query websites that need recrawling
let websites = diesel::sql_query(
"SELECT id, bot_id, url, expires_policy, max_depth, max_pages
FROM website_crawls
WHERE next_crawl <= NOW()
AND crawl_status != 2
ORDER BY next_crawl ASC
LIMIT 10",
)
.load::<WebsiteCrawlRecord>(&mut conn)?;
info!("Found {} websites to recrawl", websites.len());
for website in websites {
// Mark as processing (status = 2)
diesel::sql_query("UPDATE website_crawls SET crawl_status = 2 WHERE id = $1")
.bind::<diesel::sql_types::Uuid, _>(&website.id)
.execute(&mut conn)?;
// Spawn crawl task
let kb_manager = Arc::clone(&self.kb_manager);
let db_pool = self.db_pool.clone();
tokio::spawn(async move {
if let Err(e) = Self::crawl_website(website, kb_manager, db_pool).await {
error!("Failed to crawl website: {}", e);
}
});
}
Ok(())
}
/// Crawl a single website
async fn crawl_website(
website: WebsiteCrawlRecord,
kb_manager: Arc<KnowledgeBaseManager>,
db_pool: DbPool,
) -> Result<(), Box<dyn std::error::Error + Send + Sync>> {
info!("Starting crawl for website: {}", website.url);
// Get bot configuration for max_depth and max_pages
let config_manager = ConfigManager::new(db_pool.clone());
let website_max_depth = config_manager
.get_bot_config_value(&website.bot_id, "website-max-depth")
.await
.ok()
.and_then(|v| v.parse::<usize>().ok())
.unwrap_or(website.max_depth as usize);
let website_max_pages = config_manager
.get_bot_config_value(&website.bot_id, "website-max-pages")
.await
.ok()
.and_then(|v| v.parse::<usize>().ok())
.unwrap_or(website.max_pages as usize);
// Create crawler configuration
let mut config = WebsiteCrawlConfig {
url: website.url.clone(),
max_depth: website_max_depth,
max_pages: website_max_pages,
crawl_delay_ms: 500,
expires_policy: website.expires_policy.clone(),
last_crawled: None,
next_crawl: None,
};
// Create and run crawler
let mut crawler = WebCrawler::new(config.clone());
match crawler.crawl().await {
Ok(pages) => {
info!("Crawled {} pages from {}", pages.len(), website.url);
// Get bot name
let mut conn = db_pool.get()?;
#[derive(QueryableByName)]
struct BotNameResult {
#[diesel(sql_type = diesel::sql_types::Text)]
name: String,
}
let bot_name: String = diesel::sql_query("SELECT name FROM bots WHERE id = $1")
.bind::<diesel::sql_types::Uuid, _>(&website.bot_id)
.get_result::<BotNameResult>(&mut conn)
.map(|r| r.name)?;
// Create KB name from URL
let kb_name = format!("website_{}", sanitize_url_for_kb(&website.url));
// Create work directory
let work_path = std::path::PathBuf::from("work")
.join(&bot_name)
.join(format!("{}.gbkb", bot_name))
.join(&kb_name);
// Ensure directory exists
tokio::fs::create_dir_all(&work_path).await?;
// Write pages to files
for (idx, page) in pages.iter().enumerate() {
let filename = format!("page_{:04}.txt", idx);
let filepath = work_path.join(&filename);
let content = format!(
"URL: {}\nTitle: {}\nCrawled: {}\n\n{}",
page.url,
page.title.as_deref().unwrap_or("Untitled"),
page.crawled_at,
page.content
);
tokio::fs::write(&filepath, content).await?;
}
// Index with KB manager
kb_manager
.index_kb_folder(&bot_name, &kb_name, &work_path)
.await?;
// Update configuration
config.calculate_next_crawl();
// Update database
diesel::sql_query(
"UPDATE website_crawls
SET last_crawled = NOW(),
next_crawl = $1,
crawl_status = 1,
pages_crawled = $2,
error_message = NULL
WHERE id = $3",
)
.bind::<diesel::sql_types::Nullable<diesel::sql_types::Timestamptz>, _>(
config.next_crawl,
)
.bind::<diesel::sql_types::Integer, _>(pages.len() as i32)
.bind::<diesel::sql_types::Uuid, _>(&website.id)
.execute(&mut conn)?;
info!(
"Successfully recrawled {}, next crawl: {:?}",
website.url, config.next_crawl
);
}
Err(e) => {
error!("Failed to crawl {}: {}", website.url, e);
// Update database with error
let mut conn = db_pool.get()?;
diesel::sql_query(
"UPDATE website_crawls
SET crawl_status = 3,
error_message = $1
WHERE id = $2",
)
.bind::<diesel::sql_types::Text, _>(&e.to_string())
.bind::<diesel::sql_types::Uuid, _>(&website.id)
.execute(&mut conn)?;
}
}
Ok(())
}
}
/// Record from website_crawls table
#[derive(QueryableByName, Debug)]
struct WebsiteCrawlRecord {
#[diesel(sql_type = diesel::sql_types::Uuid)]
id: Uuid,
#[diesel(sql_type = diesel::sql_types::Uuid)]
bot_id: Uuid,
#[diesel(sql_type = diesel::sql_types::Text)]
url: String,
#[diesel(sql_type = diesel::sql_types::Text)]
expires_policy: String,
#[diesel(sql_type = diesel::sql_types::Integer)]
max_depth: i32,
#[diesel(sql_type = diesel::sql_types::Integer)]
max_pages: i32,
}
/// Sanitize URL for use as KB name (duplicate from add_website.rs for isolation)
fn sanitize_url_for_kb(url: &str) -> String {
url.replace("http://", "")
.replace("https://", "")
.replace('/', "_")
.replace(':', "_")
.replace('.', "_")
.chars()
.filter(|c| c.is_alphanumeric() || *c == '_' || *c == '-')
.collect::<String>()
.to_lowercase()
}
/// Get crawler service for a state (create if not exists)
pub async fn ensure_crawler_service_running(
state: Arc<AppState>,
) -> Result<(), Box<dyn std::error::Error>> {
// Check if KB manager exists
if let Some(kb_manager) = &state.kb_manager {
let service = Arc::new(WebsiteCrawlerService::new(
state.conn.clone(),
Arc::clone(kb_manager),
));
// Start the service
service.start().await;
info!("Website crawler service started");
Ok(())
} else {
warn!("KB manager not available, website crawler service not started");
Ok(())
}
}


@ -2,6 +2,7 @@ pub mod automation;
pub mod bootstrap; pub mod bootstrap;
pub mod bot; pub mod bot;
pub mod config; pub mod config;
pub mod kb;
pub mod package_manager; pub mod package_manager;
pub mod session; pub mod session;
pub mod shared; pub mod shared;


@ -1,10 +1,11 @@
#[cfg(feature = "directory")]
use crate::directory::AuthService;
use crate::core::bot::channels::{ChannelAdapter, VoiceAdapter, WebChannelAdapter}; use crate::core::bot::channels::{ChannelAdapter, VoiceAdapter, WebChannelAdapter};
use crate::core::config::AppConfig; use crate::core::config::AppConfig;
use crate::core::kb::KnowledgeBaseManager;
use crate::core::session::SessionManager;
#[cfg(feature = "directory")]
use crate::directory::AuthService;
#[cfg(feature = "llm")] #[cfg(feature = "llm")]
use crate::llm::LLMProvider; use crate::llm::LLMProvider;
use crate::core::session::SessionManager;
use crate::shared::models::BotResponse; use crate::shared::models::BotResponse;
use crate::shared::utils::DbPool; use crate::shared::utils::DbPool;
#[cfg(feature = "drive")] #[cfg(feature = "drive")]
@ -32,6 +33,7 @@ pub struct AppState {
pub response_channels: Arc<tokio::sync::Mutex<HashMap<String, mpsc::Sender<BotResponse>>>>, pub response_channels: Arc<tokio::sync::Mutex<HashMap<String, mpsc::Sender<BotResponse>>>>,
pub web_adapter: Arc<WebChannelAdapter>, pub web_adapter: Arc<WebChannelAdapter>,
pub voice_adapter: Arc<VoiceAdapter>, pub voice_adapter: Arc<VoiceAdapter>,
pub kb_manager: Option<Arc<KnowledgeBaseManager>>,
} }
impl Clone for AppState { impl Clone for AppState {
fn clone(&self) -> Self { fn clone(&self) -> Self {
@ -48,6 +50,7 @@ impl Clone for AppState {
llm_provider: Arc::clone(&self.llm_provider), llm_provider: Arc::clone(&self.llm_provider),
#[cfg(feature = "directory")] #[cfg(feature = "directory")]
auth_service: Arc::clone(&self.auth_service), auth_service: Arc::clone(&self.auth_service),
kb_manager: self.kb_manager.clone(),
channels: Arc::clone(&self.channels), channels: Arc::clone(&self.channels),
response_channels: Arc::clone(&self.response_channels), response_channels: Arc::clone(&self.response_channels),
web_adapter: Arc::clone(&self.web_adapter), web_adapter: Arc::clone(&self.web_adapter),
@ -66,7 +69,8 @@ impl std::fmt::Debug for AppState {
#[cfg(feature = "redis-cache")] #[cfg(feature = "redis-cache")]
debug.field("cache", &self.cache.is_some()); debug.field("cache", &self.cache.is_some());
debug.field("bucket_name", &self.bucket_name) debug
.field("bucket_name", &self.bucket_name)
.field("config", &self.config) .field("config", &self.config)
.field("conn", &"DbPool") .field("conn", &"DbPool")
.field("session_manager", &"Arc<Mutex<SessionManager>>"); .field("session_manager", &"Arc<Mutex<SessionManager>>");


@ -1,10 +1,13 @@
use crate::basic::compiler::BasicCompiler; use crate::basic::compiler::BasicCompiler;
use crate::config::ConfigManager; use crate::config::ConfigManager;
use crate::core::kb::{ChangeType, KnowledgeBaseManager};
use crate::shared::state::AppState; use crate::shared::state::AppState;
use aws_sdk_s3::Client; use aws_sdk_s3::Client;
use log::info; use log::info;
use std::collections::HashMap; use std::collections::HashMap;
use std::error::Error; use std::error::Error;
use std::path::PathBuf;
use std::sync::atomic::{AtomicBool, Ordering};
use std::sync::Arc; use std::sync::Arc;
use tokio::time::{interval, Duration}; use tokio::time::{interval, Duration};
#[derive(Debug, Clone)] #[derive(Debug, Clone)]
@ -17,14 +20,23 @@ pub struct DriveMonitor {
bucket_name: String, bucket_name: String,
file_states: Arc<tokio::sync::RwLock<HashMap<String, FileState>>>, file_states: Arc<tokio::sync::RwLock<HashMap<String, FileState>>>,
bot_id: uuid::Uuid, bot_id: uuid::Uuid,
kb_manager: Arc<KnowledgeBaseManager>,
work_root: PathBuf,
is_processing: Arc<AtomicBool>,
} }
impl DriveMonitor { impl DriveMonitor {
pub fn new(state: Arc<AppState>, bucket_name: String, bot_id: uuid::Uuid) -> Self { pub fn new(state: Arc<AppState>, bucket_name: String, bot_id: uuid::Uuid) -> Self {
let work_root = PathBuf::from("work");
let kb_manager = Arc::new(KnowledgeBaseManager::new(work_root.clone()));
Self { Self {
state, state,
bucket_name, bucket_name,
file_states: Arc::new(tokio::sync::RwLock::new(HashMap::new())), file_states: Arc::new(tokio::sync::RwLock::new(HashMap::new())),
bot_id, bot_id,
kb_manager,
work_root,
is_processing: Arc::new(AtomicBool::new(false)),
} }
} }
pub fn spawn(self: Arc<Self>) -> tokio::task::JoinHandle<()> { pub fn spawn(self: Arc<Self>) -> tokio::task::JoinHandle<()> {
@ -36,9 +48,25 @@ impl DriveMonitor {
let mut tick = interval(Duration::from_secs(90)); let mut tick = interval(Duration::from_secs(90));
loop { loop {
tick.tick().await; tick.tick().await;
// Check if we're already processing to prevent reentrancy
if self.is_processing.load(Ordering::Acquire) {
log::warn!(
"Drive monitor is still processing previous changes, skipping this tick"
);
continue;
}
// Set processing flag
self.is_processing.store(true, Ordering::Release);
// Process changes
if let Err(e) = self.check_for_changes().await { if let Err(e) = self.check_for_changes().await {
log::error!("Error checking for drive changes: {}", e); log::error!("Error checking for drive changes: {}", e);
} }
// Clear processing flag
self.is_processing.store(false, Ordering::Release);
} }
}) })
} }
@ -49,6 +77,7 @@ impl DriveMonitor {
}; };
self.check_gbdialog_changes(client).await?; self.check_gbdialog_changes(client).await?;
self.check_gbot(client).await?; self.check_gbot(client).await?;
self.check_gbkb_changes(client).await?;
Ok(()) Ok(())
} }
async fn check_gbdialog_changes( async fn check_gbdialog_changes(
@ -352,4 +381,200 @@ impl DriveMonitor {
.await??; .await??;
Ok(()) Ok(())
} }
async fn check_gbkb_changes(
&self,
client: &Client,
) -> Result<(), Box<dyn Error + Send + Sync>> {
let bot_name = self
.bucket_name
.strip_suffix(".gbai")
.unwrap_or(&self.bucket_name);
let gbkb_prefix = format!("{}.gbkb/", bot_name);
let mut current_files = HashMap::new();
let mut continuation_token = None;
// Add progress tracking for large file sets
let mut files_processed = 0;
let mut files_to_process = Vec::new();
loop {
let list_objects = match tokio::time::timeout(
Duration::from_secs(30),
client
.list_objects_v2()
.bucket(&self.bucket_name.to_lowercase())
.prefix(&gbkb_prefix)
.set_continuation_token(continuation_token)
.send(),
)
.await
{
Ok(Ok(list)) => list,
Ok(Err(e)) => return Err(e.into()),
Err(_) => {
log::error!(
"Timeout listing .gbkb objects in bucket {}",
self.bucket_name
);
return Ok(());
}
};
for obj in list_objects.contents.unwrap_or_default() {
let path = obj.key().unwrap_or_default().to_string();
// Skip directories
if path.ends_with('/') {
continue;
}
let file_state = FileState {
etag: obj.e_tag().unwrap_or_default().to_string(),
};
current_files.insert(path.clone(), file_state);
}
if !list_objects.is_truncated.unwrap_or(false) {
break;
}
continuation_token = list_objects.next_continuation_token;
}
let mut file_states = self.file_states.write().await;
// Check for new or modified files
for (path, current_state) in current_files.iter() {
let is_new = !file_states.contains_key(path);
let is_modified = file_states
.get(path)
.map(|prev| prev.etag != current_state.etag)
.unwrap_or(false);
if is_new || is_modified {
info!(
"Detected {} in .gbkb: {}",
if is_new { "new file" } else { "change" },
path
);
// Queue file for batch processing instead of immediate download
files_to_process.push(path.clone());
files_processed += 1;
// Process in batches of 10 to avoid overwhelming the system
if files_to_process.len() >= 10 {
for file_path in files_to_process.drain(..) {
if let Err(e) = self.download_gbkb_file(client, &file_path).await {
log::error!("Failed to download .gbkb file {}: {}", file_path, e);
continue;
}
}
// Add small delay between batches to prevent system overload
tokio::time::sleep(Duration::from_millis(100)).await;
}
// Extract KB name from path (e.g., "mybot.gbkb/docs/file.pdf" -> "docs")
let path_parts: Vec<&str> = path.split('/').collect();
if path_parts.len() >= 2 {
let kb_name = path_parts[1];
let kb_folder_path = self
.work_root
.join(bot_name)
.join(&gbkb_prefix)
.join(kb_name);
// Trigger indexing
if let Err(e) = self
.kb_manager
.handle_gbkb_change(bot_name, &kb_folder_path)
.await
{
log::error!("Failed to process .gbkb change: {}", e);
}
}
}
}
// Check for deleted files first
let paths_to_remove: Vec<String> = file_states
.keys()
.filter(|path| path.starts_with(&gbkb_prefix) && !current_files.contains_key(*path))
.cloned()
.collect();
// Process remaining files in the queue
for file_path in files_to_process {
if let Err(e) = self.download_gbkb_file(client, &file_path).await {
log::error!("Failed to download .gbkb file {}: {}", file_path, e);
}
}
if files_processed > 0 {
info!("Processed {} .gbkb files", files_processed);
}
// Update file states after checking for deletions
for (path, state) in current_files {
file_states.insert(path, state);
}
for path in paths_to_remove {
info!("Detected deletion in .gbkb: {}", path);
file_states.remove(&path);
// Extract KB name and trigger cleanup
let path_parts: Vec<&str> = path.split('/').collect();
if path_parts.len() >= 2 {
let kb_name = path_parts[1];
// Check if entire KB folder was deleted
let kb_prefix = format!("{}{}/", gbkb_prefix, kb_name);
if !file_states.keys().any(|k| k.starts_with(&kb_prefix)) {
// No more files in this KB, clear the collection
if let Err(e) = self.kb_manager.clear_kb(bot_name, kb_name).await {
log::error!("Failed to clear KB {}: {}", kb_name, e);
}
}
}
}
Ok(())
}
async fn download_gbkb_file(
&self,
client: &Client,
file_path: &str,
) -> Result<(), Box<dyn Error + Send + Sync>> {
let bot_name = self
.bucket_name
.strip_suffix(".gbai")
.unwrap_or(&self.bucket_name);
// Create local path
let local_path = self.work_root.join(bot_name).join(file_path);
// Create parent directories
if let Some(parent) = local_path.parent() {
tokio::fs::create_dir_all(parent).await?;
}
// Download file
let response = client
.get_object()
.bucket(&self.bucket_name)
.key(file_path)
.send()
.await?;
let bytes = response.body.collect().await?.into_bytes();
tokio::fs::write(&local_path, bytes).await?;
info!("Downloaded .gbkb file {} to {:?}", file_path, local_path);
Ok(())
}
} }


@ -813,7 +813,7 @@ pub async fn create_folder(
/// POST /files/shareFolder - Share a folder /// POST /files/shareFolder - Share a folder
pub async fn share_folder( pub async fn share_folder(
State(_state): State<Arc<AppState>>, State(_state): State<Arc<AppState>>,
Json(req): Json<ShareFolderRequest>, Json(_params): Json<ShareParams>,
) -> Result<Json<ApiResponse<ShareResponse>>, (StatusCode, Json<ApiResponse<()>>)> { ) -> Result<Json<ApiResponse<ShareResponse>>, (StatusCode, Json<ApiResponse<()>>)> {
// TODO: Implement actual sharing logic with database // TODO: Implement actual sharing logic with database
let share_id = Uuid::new_v4().to_string(); let share_id = Uuid::new_v4().to_string();
@ -825,13 +825,290 @@ pub async fn share_folder(
success: true, success: true,
share_id, share_id,
share_link: Some(share_link), share_link: Some(share_link),
expires_at: req.expires_at,
}), }),
message: Some("Folder shared successfully".to_string()), message: Some("Folder shared successfully".to_string()),
error: None,
})) }))
} }
// S3/MinIO helper functions for storage operations
pub async fn save_to_s3(
state: &Arc<AppState>,
bucket: &str,
key: &str,
content: &[u8],
) -> Result<(), Box<dyn std::error::Error + Send + Sync>> {
let s3_client = &state.s3_client;
s3_client
.put_object()
.bucket(bucket)
.key(key)
.body(ByteStream::from(content.to_vec()))
.send()
.await?;
Ok(())
}
pub async fn delete_from_s3(
state: &Arc<AppState>,
bucket: &str,
key: &str,
) -> Result<(), Box<dyn std::error::Error + Send + Sync>> {
let s3_client = &state.s3_client;
s3_client
.delete_object()
.bucket(bucket)
.key(key)
.send()
.await?;
Ok(())
}
#[derive(Debug)]
pub struct BucketStats {
pub object_count: usize,
pub total_size: u64,
pub last_modified: Option<String>,
}
pub async fn get_bucket_stats(
state: &Arc<AppState>,
bucket: &str,
) -> Result<BucketStats, Box<dyn std::error::Error + Send + Sync>> {
let s3_client = &state.s3_client;
let list_response = s3_client.list_objects_v2().bucket(bucket).send().await?;
let mut total_size = 0u64;
let mut object_count = 0usize;
let mut last_modified = None;
if let Some(contents) = list_response.contents() {
object_count = contents.len();
for object in contents {
if let Some(size) = object.size() {
total_size += size as u64;
}
if let Some(modified) = object.last_modified() {
let modified_str = modified.to_string();
if last_modified.is_none() || last_modified.as_ref().unwrap() < &modified_str {
last_modified = Some(modified_str);
}
}
}
}
Ok(BucketStats {
object_count,
total_size,
last_modified,
})
}
pub async fn cleanup_old_files(
state: &Arc<AppState>,
bucket: &str,
cutoff_date: chrono::DateTime<chrono::Utc>,
) -> Result<(usize, u64), Box<dyn std::error::Error + Send + Sync>> {
let s3_client = &state.s3_client;
let list_response = s3_client.list_objects_v2().bucket(bucket).send().await?;
let mut deleted_count = 0usize;
let mut freed_bytes = 0u64;
if let Some(contents) = list_response.contents() {
for object in contents {
if let Some(modified) = object.last_modified() {
let modified_time = chrono::DateTime::parse_from_rfc3339(&modified.to_string())
.map(|dt| dt.with_timezone(&chrono::Utc))
.unwrap_or_else(|_| chrono::Utc::now());
if modified_time < cutoff_date {
if let Some(key) = object.key() {
if let Some(size) = object.size() {
freed_bytes += size as u64;
}
s3_client
.delete_object()
.bucket(bucket)
.key(key)
.send()
.await?;
deleted_count += 1;
}
}
}
}
}
Ok((deleted_count, freed_bytes))
}
pub async fn create_bucket_backup(
state: &Arc<AppState>,
source_bucket: &str,
backup_bucket: &str,
backup_id: &str,
) -> Result<usize, Box<dyn std::error::Error + Send + Sync>> {
let s3_client = &state.s3_client;
// Create backup bucket if it doesn't exist
let _ = s3_client.create_bucket().bucket(backup_bucket).send().await;
let list_response = s3_client
.list_objects_v2()
.bucket(source_bucket)
.send()
.await?;
let mut file_count = 0usize;
if let Some(contents) = list_response.contents() {
for object in contents {
if let Some(key) = object.key() {
let backup_key = format!("{}/{}", backup_id, key);
// Copy object to backup bucket
let copy_source = format!("{}/{}", source_bucket, key);
s3_client
.copy_object()
.copy_source(&copy_source)
.bucket(backup_bucket)
.key(&backup_key)
.send()
.await?;
file_count += 1;
}
}
}
Ok(file_count)
}
pub async fn restore_bucket_backup(
state: &Arc<AppState>,
backup_bucket: &str,
target_bucket: &str,
backup_id: &str,
) -> Result<usize, Box<dyn std::error::Error + Send + Sync>> {
let s3_client = &state.s3_client;
let prefix = format!("{}/", backup_id);
let list_response = s3_client
.list_objects_v2()
.bucket(backup_bucket)
.prefix(&prefix)
.send()
.await?;
let mut file_count = 0usize;
if let Some(contents) = list_response.contents() {
for object in contents {
if let Some(key) = object.key() {
// Remove backup_id prefix from key
let restored_key = key.strip_prefix(&prefix).unwrap_or(key);
// Copy object back to target bucket
let copy_source = format!("{}/{}", backup_bucket, key);
s3_client
.copy_object()
.copy_source(&copy_source)
.bucket(target_bucket)
.key(restored_key)
.send()
.await?;
file_count += 1;
}
}
}
Ok(file_count)
}
pub async fn create_archive(
state: &Arc<AppState>,
bucket: &str,
prefix: &str,
archive_key: &str,
) -> Result<u64, Box<dyn std::error::Error + Send + Sync>> {
use flate2::write::GzEncoder;
use flate2::Compression;
use std::io::Write;
let s3_client = &state.s3_client;
let list_response = s3_client
.list_objects_v2()
.bucket(bucket)
.prefix(prefix)
.send()
.await?;
let mut archive_data = Vec::new();
{
let mut encoder = GzEncoder::new(&mut archive_data, Compression::default());
if let Some(contents) = list_response.contents() {
for object in contents {
if let Some(key) = object.key() {
// Get object content
let get_response = s3_client
.get_object()
.bucket(bucket)
.key(key)
.send()
.await?;
let body_bytes = get_response
.body
.collect()
.await
.map_err(|e| format!("Failed to collect body: {}", e))?;
let bytes = body_bytes.into_bytes();
// Write to archive with key as filename
encoder.write_all(key.as_bytes())?;
encoder.write_all(b"\n")?;
encoder.write_all(&bytes)?;
encoder.write_all(b"\n---\n")?;
}
}
}
encoder.finish()?;
}
let archive_size = archive_data.len() as u64;
// Upload archive
s3_client
.put_object()
.bucket(bucket)
.key(archive_key)
.body(ByteStream::from(archive_data))
.send()
.await?;
Ok(archive_size)
}
pub async fn get_bucket_metrics(
state: &Arc<AppState>,
bucket: &str,
) -> Result<BucketStats, Box<dyn std::error::Error + Send + Sync>> {
get_bucket_stats(state, bucket).await
}
/// GET /files/dirFolder - Directory listing (alias for list) /// GET /files/dirFolder - Directory listing (alias for list)
pub async fn dir_folder( pub async fn dir_folder(
State(state): State<Arc<AppState>>, State(state): State<Arc<AppState>>,


@ -130,103 +130,62 @@ async fn run_axum_server(
.allow_headers(tower_http::cors::Any) .allow_headers(tower_http::cors::Any)
.max_age(std::time::Duration::from_secs(3600)); .max_age(std::time::Duration::from_secs(3600));
// Build API routes with State // Use unified API router configuration
let mut api_router = Router::new() let mut api_router = crate::api_router::configure_api_routes();
// Session routes
// Add session-specific routes
api_router = api_router
.route("/api/sessions", post(create_session)) .route("/api/sessions", post(create_session))
.route("/api/sessions", get(get_sessions)) .route("/api/sessions", get(get_sessions))
.route( .route(
"/api/sessions/{session_id}/history", "/api/sessions/{session_id}/history",
get(get_session_history), get(get_session_history),
) )
.route("/api/sessions/{session_id}/start", post(start_session)); .route("/api/sessions/{session_id}/start", post(start_session))
// File routes // WebSocket route
// .route("/api/files/upload/{folder_path}", post(upload_file)) // Function doesn't exist .route("/ws", get(websocket_handler));
// Auth route // Add feature-specific routes
#[cfg(feature = "directory")] #[cfg(feature = "directory")]
{ {
api_router = api_router.route("/api/auth", get(auth_handler)); api_router = api_router.route("/api/auth", get(auth_handler));
} }
// Voice/Meet routes
#[cfg(feature = "meet")] #[cfg(feature = "meet")]
{ {
api_router = api_router api_router = api_router
.route("/api/voice/start", post(voice_start)) .route("/api/voice/start", post(voice_start))
.route("/api/voice/stop", post(voice_stop)) .route("/api/voice/stop", post(voice_stop))
.route("/api/meet/create", post(crate::meet::create_meeting)) .route("/ws/meet", get(crate::meet::meeting_websocket))
.route("/api/meet/rooms", get(crate::meet::list_rooms)) .merge(crate::meet::configure());
.route("/api/meet/rooms/:room_id", get(crate::meet::get_room))
.route(
"/api/meet/rooms/:room_id/join",
post(crate::meet::join_room),
)
.route(
"/api/meet/rooms/:room_id/transcription/start",
post(crate::meet::start_transcription),
)
.route("/api/meet/token", post(crate::meet::get_meeting_token))
.route("/api/meet/invite", post(crate::meet::send_meeting_invites))
.route("/ws/meet", get(crate::meet::meeting_websocket));
} }
api_router = api_router
// Media/Multimedia routes
.route(
"/api/media/upload",
post(crate::bot::multimedia::upload_media_handler),
)
.route(
"/api/media/:media_id",
get(crate::bot::multimedia::download_media_handler),
)
.route(
"/api/media/:media_id/thumbnail",
get(crate::bot::multimedia::generate_thumbnail_handler),
)
.route(
"/api/media/search",
post(crate::bot::multimedia::web_search_handler),
)
// WebSocket route
.route("/ws", get(websocket_handler))
// Bot routes
.route("/api/bots", post(crate::bot::create_bot_handler))
.route(
"/api/bots/{bot_id}/mount",
post(crate::bot::mount_bot_handler),
)
.route(
"/api/bots/{bot_id}/input",
post(crate::bot::handle_user_input_handler),
)
.route(
"/api/bots/{bot_id}/sessions",
get(crate::bot::get_user_sessions_handler),
)
.route(
"/api/bots/{bot_id}/history",
get(crate::bot::get_conversation_history_handler),
)
.route(
"/api/bots/{bot_id}/warning",
post(crate::bot::send_warning_handler),
);
// Add email routes if feature is enabled
// Merge drive, email, meet, and auth module routes
api_router = api_router.merge(crate::drive::configure());
#[cfg(feature = "meet")]
{
api_router = api_router.merge(crate::meet::configure());
}
api_router = api_router.nest("/api", crate::directory::router::configure());
#[cfg(feature = "email")] #[cfg(feature = "email")]
let api_router = api_router.merge(crate::email::configure()); {
api_router = api_router.merge(crate::email::configure());
}
// Add calendar routes with CalDAV if feature is enabled
#[cfg(feature = "calendar")]
{
let calendar_engine =
Arc::new(crate::calendar::CalendarEngine::new(app_state.conn.clone()));
// Start reminder job
let reminder_engine = Arc::clone(&calendar_engine);
tokio::spawn(async move {
crate::calendar::start_reminder_job(reminder_engine).await;
});
// Add CalDAV router
api_router = api_router.merge(crate::calendar::caldav::create_caldav_router(
calendar_engine,
));
}
// Add task engine routes
let task_engine = Arc::new(crate::tasks::TaskEngine::new(app_state.conn.clone()));
api_router = api_router.merge(crate::tasks::configure_task_routes(task_engine));
// Build static file serving // Build static file serving
let static_path = std::path::Path::new("./web/desktop"); let static_path = std::path::Path::new("./web/desktop");
@ -241,8 +200,7 @@ async fn run_axum_server(
.nest_service("/mail", ServeDir::new(static_path.join("mail"))) .nest_service("/mail", ServeDir::new(static_path.join("mail")))
.nest_service("/tasks", ServeDir::new(static_path.join("tasks"))) .nest_service("/tasks", ServeDir::new(static_path.join("tasks")))
// API routes // API routes
.merge(api_router) .merge(api_router.with_state(app_state.clone()))
.with_state(app_state.clone())
// Root index route - only matches exact "/" // Root index route - only matches exact "/"
.route("/", get(crate::ui_server::index)) .route("/", get(crate::ui_server::index))
// Layers // Layers
@ -554,6 +512,9 @@ async fn main() -> std::io::Result<()> {
base_llm_provider base_llm_provider
}; };
// Initialize Knowledge Base Manager
let kb_manager = Arc::new(botserver::core::kb::KnowledgeBaseManager::new("work"));
let app_state = Arc::new(AppState { let app_state = Arc::new(AppState {
drive: Some(drive), drive: Some(drive),
config: Some(cfg.clone()), config: Some(cfg.clone()),
@ -575,8 +536,14 @@ async fn main() -> std::io::Result<()> {
response_channels: Arc::new(tokio::sync::Mutex::new(HashMap::new())), response_channels: Arc::new(tokio::sync::Mutex::new(HashMap::new())),
web_adapter: web_adapter.clone(), web_adapter: web_adapter.clone(),
voice_adapter: voice_adapter.clone(), voice_adapter: voice_adapter.clone(),
kb_manager: Some(kb_manager.clone()),
}); });
// Start website crawler service
if let Err(e) = botserver::core::kb::ensure_crawler_service_running(app_state.clone()).await {
log::warn!("Failed to start website crawler service: {}", e);
}
state_tx.send(app_state.clone()).await.ok(); state_tx.send(app_state.clone()).await.ok();
progress_tx.send(BootstrapProgress::BootstrapComplete).ok(); progress_tx.send(BootstrapProgress::BootstrapComplete).ok();


@ -185,16 +185,13 @@ impl TaskEngine {
let updated_at = Utc::now(); let updated_at = Utc::now();
// Check if status is changing to Done // Check if status is changing to Done
let completing = updates.status let completing = updates
.status
.as_ref() .as_ref()
.map(|s| matches!(s, TaskStatus::Done)) .map(|s| matches!(s, TaskStatus::Done))
.unwrap_or(false); .unwrap_or(false);
let completed_at = if completing { let completed_at = if completing { Some(Utc::now()) } else { None };
Some(Utc::now())
} else {
None
};
// TODO: Implement with Diesel // TODO: Implement with Diesel
/* /*
@ -450,7 +447,10 @@ impl TaskEngine {
} }
/// Calculate task progress (percentage) /// Calculate task progress (percentage)
pub async fn calculate_progress(&self, task_id: Uuid) -> Result<f32, Box<dyn std::error::Error>> { pub async fn calculate_progress(
&self,
task_id: Uuid,
) -> Result<f32, Box<dyn std::error::Error>> {
let task = self.get_task(task_id).await?; let task = self.get_task(task_id).await?;
if task.subtasks.is_empty() { if task.subtasks.is_empty() {
@ -460,7 +460,9 @@ impl TaskEngine {
TaskStatus::InProgress => 50.0, TaskStatus::InProgress => 50.0,
TaskStatus::Review => 75.0, TaskStatus::Review => 75.0,
TaskStatus::Done => 100.0, TaskStatus::Done => 100.0,
TaskStatus::Blocked => task.actual_hours.unwrap_or(0.0) / task.estimated_hours.unwrap_or(1.0) * 100.0, TaskStatus::Blocked => {
task.actual_hours.unwrap_or(0.0) / task.estimated_hours.unwrap_or(1.0) * 100.0
}
TaskStatus::Cancelled => 0.0, TaskStatus::Cancelled => 0.0,
}); });
} }
@ -645,9 +647,9 @@ impl TaskEngine {
/// HTTP API handlers /// HTTP API handlers
pub mod handlers { pub mod handlers {
use super::*; use super::*;
use axum::extract::{State as AxumState, Query as AxumQuery, Path as AxumPath}; use axum::extract::{Path as AxumPath, Query as AxumQuery, State as AxumState};
use axum::response::{Json as AxumJson, IntoResponse};
use axum::http::StatusCode; use axum::http::StatusCode;
use axum::response::{IntoResponse, Json as AxumJson};
pub async fn create_task_handler<S>( pub async fn create_task_handler<S>(
AxumState(_engine): AxumState<S>, AxumState(_engine): AxumState<S>,
@ -656,7 +658,6 @@ pub mod handlers {
// TODO: Implement with actual engine // TODO: Implement with actual engine
let created = task; let created = task;
(StatusCode::OK, AxumJson(serde_json::json!(created))) (StatusCode::OK, AxumJson(serde_json::json!(created)))
} }
pub async fn get_tasks_handler<S>( pub async fn get_tasks_handler<S>(
@ -693,7 +694,184 @@ pub mod handlers {
} }
} }
pub async fn handle_task_create(
State(engine): State<Arc<TaskEngine>>,
Json(mut task): Json<Task>,
) -> Result<Json<Task>, StatusCode> {
task.id = Uuid::new_v4();
task.created_at = Utc::now();
task.updated_at = Utc::now();
match engine.create_task(task).await {
Ok(created) => Ok(Json(created)),
Err(_) => Err(StatusCode::INTERNAL_SERVER_ERROR),
}
}
pub async fn handle_task_update(
State(engine): State<Arc<TaskEngine>>,
Path(id): Path<Uuid>,
Json(updates): Json<TaskUpdate>,
) -> Result<Json<Task>, StatusCode> {
match engine.update_task(id, updates).await {
Ok(updated) => Ok(Json(updated)),
Err(_) => Err(StatusCode::INTERNAL_SERVER_ERROR),
}
}
pub async fn handle_task_delete(
State(engine): State<Arc<TaskEngine>>,
Path(id): Path<Uuid>,
) -> Result<StatusCode, StatusCode> {
match engine.delete_task(id).await {
Ok(true) => Ok(StatusCode::NO_CONTENT),
Ok(false) => Err(StatusCode::NOT_FOUND),
Err(_) => Err(StatusCode::INTERNAL_SERVER_ERROR),
}
}
pub async fn handle_task_list(
State(engine): State<Arc<TaskEngine>>,
Query(params): Query<std::collections::HashMap<String, String>>,
) -> Result<Json<Vec<Task>>, StatusCode> {
let tasks = if let Some(user_id) = params.get("user_id") {
engine.get_user_tasks(user_id).await
} else if let Some(status_str) = params.get("status") {
let status = match status_str.as_str() {
"todo" => TaskStatus::Todo,
"in_progress" => TaskStatus::InProgress,
"review" => TaskStatus::Review,
"done" => TaskStatus::Done,
"blocked" => TaskStatus::Blocked,
"cancelled" => TaskStatus::Cancelled,
_ => TaskStatus::Todo,
};
engine.get_tasks_by_status(status).await
} else {
engine.get_all_tasks().await
};
match tasks {
Ok(task_list) => Ok(Json(task_list)),
Err(_) => Err(StatusCode::INTERNAL_SERVER_ERROR),
}
}
pub async fn handle_task_assign(
State(engine): State<Arc<TaskEngine>>,
Path(id): Path<Uuid>,
Json(payload): Json<serde_json::Value>,
) -> Result<Json<Task>, StatusCode> {
let assignee = payload["assignee"]
.as_str()
.ok_or(StatusCode::BAD_REQUEST)?;
match engine.assign_task(id, assignee.to_string()).await {
Ok(updated) => Ok(Json(updated)),
Err(_) => Err(StatusCode::INTERNAL_SERVER_ERROR),
}
}
pub async fn handle_task_status_update(
State(engine): State<Arc<TaskEngine>>,
Path(id): Path<Uuid>,
Json(payload): Json<serde_json::Value>,
) -> Result<Json<Task>, StatusCode> {
let status_str = payload["status"].as_str().ok_or(StatusCode::BAD_REQUEST)?;
let status = match status_str {
"todo" => TaskStatus::Todo,
"in_progress" => TaskStatus::InProgress,
"review" => TaskStatus::Review,
"done" => TaskStatus::Done,
"blocked" => TaskStatus::Blocked,
"cancelled" => TaskStatus::Cancelled,
_ => return Err(StatusCode::BAD_REQUEST),
};
let updates = TaskUpdate {
title: None,
description: None,
status: Some(status),
priority: None,
assignee: None,
due_date: None,
tags: None,
};
match engine.update_task(id, updates).await {
Ok(updated) => Ok(Json(updated)),
Err(_) => Err(StatusCode::INTERNAL_SERVER_ERROR),
}
}
pub async fn handle_task_priority_set(
State(engine): State<Arc<TaskEngine>>,
Path(id): Path<Uuid>,
Json(payload): Json<serde_json::Value>,
) -> Result<Json<Task>, StatusCode> {
let priority_str = payload["priority"]
.as_str()
.ok_or(StatusCode::BAD_REQUEST)?;
let priority = match priority_str {
"low" => TaskPriority::Low,
"medium" => TaskPriority::Medium,
"high" => TaskPriority::High,
"urgent" => TaskPriority::Urgent,
_ => return Err(StatusCode::BAD_REQUEST),
};
let updates = TaskUpdate {
title: None,
description: None,
status: None,
priority: Some(priority),
assignee: None,
due_date: None,
tags: None,
};
match engine.update_task(id, updates).await {
Ok(updated) => Ok(Json(updated)),
Err(_) => Err(StatusCode::INTERNAL_SERVER_ERROR),
}
}
pub async fn handle_task_dependencies_set(
State(engine): State<Arc<TaskEngine>>,
Path(id): Path<Uuid>,
Json(payload): Json<serde_json::Value>,
) -> Result<Json<Task>, StatusCode> {
let deps = payload["dependencies"]
.as_array()
.ok_or(StatusCode::BAD_REQUEST)?
.iter()
.filter_map(|v| v.as_str().and_then(|s| Uuid::parse_str(s).ok()))
.collect::<Vec<_>>();
match engine.set_dependencies(id, deps).await {
Ok(updated) => Ok(Json(updated)),
Err(_) => Err(StatusCode::INTERNAL_SERVER_ERROR),
}
}
/// Configure task engine routes /// Configure task engine routes
pub fn configure_task_routes(state: Arc<TaskEngine>) -> Router {
Router::new()
.route("/api/tasks", post(handle_task_create))
.route("/api/tasks", get(handle_task_list))
.route("/api/tasks/:id", put(handle_task_update))
.route("/api/tasks/:id", delete(handle_task_delete))
.route("/api/tasks/:id/assign", post(handle_task_assign))
.route("/api/tasks/:id/status", put(handle_task_status_update))
.route("/api/tasks/:id/priority", put(handle_task_priority_set))
.route(
"/api/tasks/:id/dependencies",
put(handle_task_dependencies_set),
)
.with_state(state)
}
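The string-to-enum mappings used by the status and priority handlers can be exercised in isolation. A minimal sketch — the enum and helper below are redeclared locally for illustration and are not the crate's actual definitions:

```rust
// Locally redeclared mirror of the status values handle_task_status_update accepts.
#[derive(Debug, PartialEq)]
enum TaskStatus {
    Todo,
    InProgress,
    Review,
    Done,
    Blocked,
    Cancelled,
}

// Same mapping as the match in the handler; None stands in for the
// StatusCode::BAD_REQUEST branch taken on unknown strings.
fn parse_status(s: &str) -> Option<TaskStatus> {
    match s {
        "todo" => Some(TaskStatus::Todo),
        "in_progress" => Some(TaskStatus::InProgress),
        "review" => Some(TaskStatus::Review),
        "done" => Some(TaskStatus::Done),
        "blocked" => Some(TaskStatus::Blocked),
        "cancelled" => Some(TaskStatus::Cancelled),
        _ => None,
    }
}

fn main() {
    assert_eq!(parse_status("in_progress"), Some(TaskStatus::InProgress));
    // Matching is case-sensitive, so clients must send lowercase values.
    assert_eq!(parse_status("IN_PROGRESS"), None);
    println!("ok");
}
```

Note that because matching is exact, any casing or whitespace variation from a client maps to `BAD_REQUEST` rather than being normalized.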
/// Configure task engine routes (legacy)
pub fn configure<S>(router: Router<S>) -> Router<S>
where
    S: Clone + Send + Sync + 'static,
@ -704,5 +882,8 @@ where
.route("/api/tasks", post(handlers::create_task_handler::<S>))
.route("/api/tasks", get(handlers::get_tasks_handler::<S>))
.route("/api/tasks/:id", put(handlers::update_task_handler::<S>))
.route(
"/api/tasks/statistics",
get(handlers::get_statistics_handler::<S>),
)
}


@ -45,3 +45,7 @@ custom-port,5432
custom-database,mycustomdb
custom-username,
custom-password,
,
website-expires,1d
website-max-depth,3
website-max-pages,100



@ -10,7 +10,7 @@
## 📋 Implementation Status
### ☁️ Weather APIs (7 keywords) - `weather-apis.bas`
- [x] 7Timer! Astro Weather - Astronomical weather forecast
- [x] 7Timer! Civil Weather - 7-day weather forecast
- [x] Open-Meteo Weather - Real-time weather data
@ -20,7 +20,7 @@
- [x] AQICN Air Quality - Air quality index by city
- [x] Get Weather Icon - Weather condition to emoji converter
### 🐾 Animals APIs (17 keywords) - `animals-apis.bas`
- [x] Random Cat Fact - Cat facts
- [x] Random Dog Fact - Dog facts
- [x] Random Dog Image - Dog pictures
@ -40,7 +40,7 @@
- [x] Dog Breeds List - All dog breeds
- [x] Specific Dog Breed Image - Image by breed name
### 😄 Entertainment APIs (19 keywords) - `entertainment-apis.bas`
- [x] Chuck Norris Joke - Random Chuck Norris joke
- [x] Chuck Norris Categories - Available joke categories
- [x] Chuck Norris Joke by Category - Category-specific jokes
@ -66,7 +66,7 @@
- [x] Insult Generator - Clean insults
- [x] Compliment Generator - Random compliments
### 🍽️ Food & Drink APIs (13 keywords) - `food-apis.bas`
- [x] Random Coffee Image - Coffee images
- [x] Random Food Dish - Food dish images
- [x] Random Food by Category - Category-specific food
@ -84,7 +84,7 @@
- [x] High ABV Beers - High alcohol content beers
- [x] Bacon Ipsum Text - Bacon-themed lorem ipsum
### 🔧 Data Utility & Geocoding APIs (19 keywords) - `data-utility-apis.bas`
- [x] Generate UUID - Single UUID generation
- [x] Generate Multiple UUIDs - Multiple UUIDs
- [x] Get My IP Address - Current public IP
@ -199,11 +199,11 @@ botserver/templates/public-apis.gbai/
├── README.md (758 lines)
├── KEYWORDS_CHECKLIST.md (this file)
└── public-apis.gbdialog/
    ├── weather-apis.bas (244 lines, 8 keywords)
    ├── animals-apis.bas (366 lines, 17 keywords)
    ├── entertainment-apis.bas (438 lines, 19 keywords)
    ├── food-apis.bas (503 lines, 13 keywords)
    └── data-utility-apis.bas (568 lines, 19 keywords)
```
**Total Lines of Code**: ~2,877 lines
@ -332,4 +332,4 @@ botserver/templates/public-apis.gbai/
---
🎉 **Ready to use! Copy the `public-apis.gbai` folder to your General Bots templates directory.**


@ -244,7 +244,7 @@ END IF
### Method 1: Direct Testing
```vbs
REM Create a test dialog file: test.gbdialog/test-apis.bas
TALK "Testing Weather API..."
weather = GET "https://api.open-meteo.com/v1/forecast?latitude=52.52&longitude=13.41&current_weather=true"
@ -415,4 +415,4 @@ END FOR
**Need help?** Check the examples in this guide or refer to the full README.md
**Have fun coding!** 🎉


@ -6,11 +6,11 @@ This package provides 50+ free API keywords for General Bots, allowing you to in
This `.gbai` template includes the following BASIC keyword files:
- `weather-apis.bas` - Weather data and forecasts
- `animals-apis.bas` - Animal facts and images
- `entertainment-apis.bas` - Jokes, quotes, and fun content
- `food-apis.bas` - Food recipes and drink information
- `data-utility-apis.bas` - Data utilities and geocoding
## 🌤️ Weather APIs
@ -735,7 +735,7 @@ END IF
To add more API keywords:
1. Find a free, no-auth API from [public-apis](https://github.com/public-apis/public-apis)
2. Create a `.bas` file in the appropriate category
3. Follow the existing keyword pattern
4. Test thoroughly
5. Update this README
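Steps 1–4 above can be sketched as a single keyword file following the `TALK`/`GET` pattern used throughout this README. The file name and endpoint below are illustrative only, not part of the shipped template:

```vbs
REM Illustrative sketch of a new keyword file, e.g. activities-apis.bas
TALK "Fetching a random activity suggestion..."
result = GET "https://www.boredapi.com/api/activity"
TALK result
```

Once the file is dropped into the `.gbdialog` folder alongside the others, it loads with the rest of the template.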