fix(memory): file-lock read-modify-write to prevent concurrent data loss

Two concurrent gateway sessions calling memory add/replace/remove simultaneously could both read the old state, apply their changes independently, and write — the last writer silently drops the first writer's entry. Fix: wrap each mutation in a file lock (fcntl.flock on a .lock file). Under the lock, re-read entries from disk to get the latest state, apply the mutation, then write. This ensures concurrent writers serialize properly. The lock uses a separate .lock file since the memory file itself is atomically replaced via os.replace() (can't flock a replaced file). Readers remain lock-free since atomic rename ensures they always see a complete file.
Merge pull request #1724 from NousResearch/fix/model-metadata-fuzzy-match
2026-03-17 04:19:11 -07:00 · 2026-03-17 04:13:56 -07:00 · 2026-03-17 04:13:54 -07:00 · 2026-03-17 04:13:51 -07:00 · 2026-03-17 04:13:49 -07:00 · 2026-03-17 04:13:46 -07:00
10 changed files with 182 additions and 100 deletions
--- a/agent/anthropic_adapter.py
+++ b/agent/anthropic_adapter.py
@@ -1053,7 +1053,8 @@ def build_anthropic_kwargs(
        elif tool_choice == "required":
            kwargs["tool_choice"] = {"type": "any"}
        elif tool_choice == "none":
-            pass  # Don't send tool_choice — Anthropic will use tools if needed
+            # Anthropic has no tool_choice "none" — omit tools entirely to prevent use
+            kwargs.pop("tools", None)
        elif isinstance(tool_choice, str):
            # Specific tool name
            kwargs["tool_choice"] = {"type": "tool", "name": tool_choice}
--- a/agent/auxiliary_client.py
+++ b/agent/auxiliary_client.py
@@ -706,6 +706,8 @@ def _resolve_forced_provider(forced: str) -> Tuple[Optional[OpenAI], Optional[st

 def _resolve_auto() -> Tuple[Optional[OpenAI], Optional[str]]:
    """Full auto-detection chain: OpenRouter → Nous → custom → Codex → API-key → None."""
+    global auxiliary_is_nous
+    auxiliary_is_nous = False  # Reset — _try_nous() will set True if it wins
    for try_fn in (_try_openrouter, _try_nous, _try_custom_endpoint,
                   _try_codex, _resolve_api_key_provider):
        client, model = try_fn()
--- a/agent/context_compressor.py
+++ b/agent/context_compressor.py
@@ -313,7 +313,19 @@ Write only the summary body. Do not include any preamble or prefix; the system w

        if summary:
            last_head_role = messages[compress_start - 1].get("role", "user") if compress_start > 0 else "user"
-            summary_role = "user" if last_head_role in ("assistant", "tool") else "assistant"
+            first_tail_role = messages[compress_end].get("role", "user") if compress_end < n_messages else "user"
+            # Pick a role that avoids consecutive same-role with both neighbors.
+            # Priority: avoid colliding with head (already committed), then tail.
+            if last_head_role in ("assistant", "tool"):
+                summary_role = "user"
+            else:
+                summary_role = "assistant"
+            # If the chosen role collides with the tail AND flipping wouldn't
+            # collide with the head, flip it.
+            if summary_role == first_tail_role:
+                flipped = "assistant" if summary_role == "user" else "user"
+                if flipped != last_head_role:
+                    summary_role = flipped
            compressed.append({"role": summary_role, "content": summary})
        else:
            if not self.quiet_mode:
--- a/agent/model_metadata.py
+++ b/agent/model_metadata.py
@@ -266,8 +266,10 @@ def get_model_context_length(model: str, base_url: str = "") -> int:
    if model in metadata:
        return metadata[model].get("context_length", 128000)

-    # 3. Hardcoded defaults (fuzzy match)
-    for default_model, length in DEFAULT_CONTEXT_LENGTHS.items():
+    # 3. Hardcoded defaults (fuzzy match — longest key first for specificity)
+    for default_model, length in sorted(
+        DEFAULT_CONTEXT_LENGTHS.items(), key=lambda x: len(x[0]), reverse=True
+    ):
        if default_model in model or model in default_model:
            return length

--- a/cron/jobs.py
+++ b/cron/jobs.py
@@ -5,6 +5,7 @@ Jobs are stored in ~/.hermes/cron/jobs.json
 Output is saved to ~/.hermes/cron/output/{job_id}/{timestamp}.md
 """

+import copy
 import json
 import logging
 import tempfile
@@ -539,8 +540,8 @@ def get_due_jobs() -> List[Dict[str, Any]]:
    immediately.  This prevents a burst of missed jobs on gateway restart.
    """
    now = _hermes_now()
-    jobs = [_apply_skill_fields(j) for j in load_jobs()]
-    raw_jobs = load_jobs()  # For saving updates
+    raw_jobs = load_jobs()
+    jobs = [_apply_skill_fields(j) for j in copy.deepcopy(raw_jobs)]
    due = []
    needs_save = False

--- a/gateway/platforms/matrix.py
+++ b/gateway/platforms/matrix.py
@@ -220,6 +220,7 @@ class MatrixAdapter(BasePlatformAdapter):

        # Start the sync loop.
        self._sync_task = asyncio.create_task(self._sync_loop())
+        self._mark_connected()
        return True

    async def disconnect(self) -> None:
--- a/gateway/platforms/mattermost.py
+++ b/gateway/platforms/mattermost.py
@@ -222,6 +222,7 @@ class MattermostAdapter(BasePlatformAdapter):

        # Start WebSocket in background.
        self._ws_task = asyncio.create_task(self._ws_loop())
+        self._mark_connected()
        return True

    async def disconnect(self) -> None:
--- a/run_agent.py
+++ b/run_agent.py
@@ -4884,6 +4884,7 @@ class AIAgent:
        codex_ack_continuations = 0
        length_continue_retries = 0
        truncated_response_prefix = ""
+        compression_attempts = 0
        
        # Clear any stale interrupt state at start
        self.clear_interrupt()
@@ -5029,7 +5030,6 @@ class AIAgent:
            api_start_time = time.time()
            retry_count = 0
            max_retries = 3
-            compression_attempts = 0
            max_compression_attempts = 3
            codex_auth_retry_attempted = False
            anthropic_auth_retry_attempted = False
@@ -5971,19 +5971,32 @@ class AIAgent:
                            # Don't add anything to messages, just retry the API call
                            continue
                        else:
-                            # Instead of returning partial, inject a helpful message and let model recover
-                            self._vprint(f"{self.log_prefix}⚠️  Injecting recovery message for invalid JSON...")
+                            # Instead of returning partial, inject tool error results so the model can recover.
+                            # Using tool results (not user messages) preserves role alternation.
+                            self._vprint(f"{self.log_prefix}⚠️  Injecting recovery tool results for invalid JSON...")
                            self._invalid_json_retries = 0  # Reset for next attempt
                            
-                            # Add a user message explaining the issue
-                            recovery_msg = (
-                                f"Your tool call to '{tool_name}' had invalid JSON arguments. "
-                                f"Error: {error_msg}. "
-                                f"For tools with no required parameters, use an empty object: {{}}. "
-                                f"Please either retry the tool call with valid JSON, or respond without using that tool."
-                            )
-                            recovery_dict = {"role": "user", "content": recovery_msg}
-                            messages.append(recovery_dict)
+                            # Append the assistant message with its (broken) tool_calls
+                            recovery_assistant = self._build_assistant_message(assistant_message, finish_reason)
+                            messages.append(recovery_assistant)
+                            
+                            # Respond with tool error results for each tool call
+                            invalid_names = {name for name, _ in invalid_json_args}
+                            for tc in assistant_message.tool_calls:
+                                if tc.function.name in invalid_names:
+                                    err = next(e for n, e in invalid_json_args if n == tc.function.name)
+                                    tool_result = (
+                                        f"Error: Invalid JSON arguments. {err}. "
+                                        f"For tools with no required parameters, use an empty object: {{}}. "
+                                        f"Please retry with valid JSON."
+                                    )
+                                else:
+                                    tool_result = "Skipped: other tool call in this response had invalid JSON."
+                                messages.append({
+                                    "role": "tool",
+                                    "tool_call_id": tc.id,
+                                    "content": tool_result,
+                                })
                            continue
                    
                    # Reset retry counter on successful JSON validation
@@ -6169,6 +6182,8 @@ class AIAgent:

                    if truncated_response_prefix:
                        final_response = truncated_response_prefix + final_response
+                        truncated_response_prefix = ""
+                        length_continue_retries = 0
                    
                    # Strip <think> blocks from user-facing response (keep raw in messages for trajectory)
                    final_response = self._strip_think_blocks(final_response).strip()
@@ -6220,10 +6235,11 @@ class AIAgent:
                
                if not pending_handled:
                    # Error happened before tool processing (e.g. response parsing).
-                    # Use a user-role message so the model can see what went wrong
-                    # without confusing the API with a fabricated assistant turn.
+                    # Choose role to avoid consecutive same-role messages.
+                    last_role = messages[-1].get("role") if messages else None
+                    err_role = "assistant" if last_role == "user" else "user"
                    sys_err_msg = {
-                        "role": "user",
+                        "role": err_role,
                        "content": f"[System error during processing: {error_msg}]",
                    }
                    messages.append(sys_err_msg)
--- a/tools/browser_tool.py
+++ b/tools/browser_tool.py
@@ -555,6 +555,11 @@ def _get_session_info(task_id: Optional[str] = None) -> Dict[str, str]:
            session_info = provider.create_session(task_id)
    
    with _cleanup_lock:
+        # Double-check: another thread may have created a session while we
+        # were doing the network call. Use the existing one to avoid leaking
+        # orphan cloud sessions.
+        if task_id in _active_sessions:
+            return _active_sessions[task_id]
        _active_sessions[task_id] = session_info
    
    return session_info
--- a/tools/memory_tool.py
+++ b/tools/memory_tool.py
@@ -23,11 +23,13 @@ Design:
 - Frozen snapshot pattern: system prompt is stable, tool responses show live state
 """

+import fcntl
 import json
 import logging
 import os
 import re
 import tempfile
+from contextlib import contextmanager
 from pathlib import Path
 from typing import Dict, Any, List, Optional

@@ -120,14 +122,43 @@ class MemoryStore:
            "user": self._render_block("user", self.user_entries),
        }

+    @staticmethod
+    @contextmanager
+    def _file_lock(path: Path):
+        """Acquire an exclusive file lock for read-modify-write safety.
+
+        Uses a separate .lock file so the memory file itself can still be
+        atomically replaced via os.replace().
+        """
+        lock_path = path.with_suffix(path.suffix + ".lock")
+        lock_path.parent.mkdir(parents=True, exist_ok=True)
+        fd = open(lock_path, "w")
+        try:
+            fcntl.flock(fd, fcntl.LOCK_EX)
+            yield
+        finally:
+            fcntl.flock(fd, fcntl.LOCK_UN)
+            fd.close()
+
+    @staticmethod
+    def _path_for(target: str) -> Path:
+        if target == "user":
+            return MEMORY_DIR / "USER.md"
+        return MEMORY_DIR / "MEMORY.md"
+
+    def _reload_target(self, target: str):
+        """Re-read entries from disk into in-memory state.
+
+        Called under file lock to get the latest state before mutating.
+        """
+        fresh = self._read_file(self._path_for(target))
+        fresh = list(dict.fromkeys(fresh))  # deduplicate
+        self._set_entries(target, fresh)
+
    def save_to_disk(self, target: str):
        """Persist entries to the appropriate file. Called after every mutation."""
        MEMORY_DIR.mkdir(parents=True, exist_ok=True)
-
-        if target == "memory":
-            self._write_file(MEMORY_DIR / "MEMORY.md", self.memory_entries)
-        elif target == "user":
-            self._write_file(MEMORY_DIR / "USER.md", self.user_entries)
+        self._write_file(self._path_for(target), self._entries_for(target))

    def _entries_for(self, target: str) -> List[str]:
        if target == "user":
@@ -162,33 +193,37 @@ class MemoryStore:
        if scan_error:
            return {"success": False, "error": scan_error}

-        entries = self._entries_for(target)
-        limit = self._char_limit(target)
+        with self._file_lock(self._path_for(target)):
+            # Re-read from disk under lock to pick up writes from other sessions
+            self._reload_target(target)

-        # Reject exact duplicates
-        if content in entries:
-            return self._success_response(target, "Entry already exists (no duplicate added).")
+            entries = self._entries_for(target)
+            limit = self._char_limit(target)

-        # Calculate what the new total would be
-        new_entries = entries + [content]
-        new_total = len(ENTRY_DELIMITER.join(new_entries))
+            # Reject exact duplicates
+            if content in entries:
+                return self._success_response(target, "Entry already exists (no duplicate added).")

-        if new_total > limit:
-            current = self._char_count(target)
-            return {
-                "success": False,
-                "error": (
-                    f"Memory at {current:,}/{limit:,} chars. "
-                    f"Adding this entry ({len(content)} chars) would exceed the limit. "
-                    f"Replace or remove existing entries first."
-                ),
-                "current_entries": entries,
-                "usage": f"{current:,}/{limit:,}",
-            }
+            # Calculate what the new total would be
+            new_entries = entries + [content]
+            new_total = len(ENTRY_DELIMITER.join(new_entries))

-        entries.append(content)
-        self._set_entries(target, entries)
-        self.save_to_disk(target)
+            if new_total > limit:
+                current = self._char_count(target)
+                return {
+                    "success": False,
+                    "error": (
+                        f"Memory at {current:,}/{limit:,} chars. "
+                        f"Adding this entry ({len(content)} chars) would exceed the limit. "
+                        f"Replace or remove existing entries first."
+                    ),
+                    "current_entries": entries,
+                    "usage": f"{current:,}/{limit:,}",
+                }
+
+            entries.append(content)
+            self._set_entries(target, entries)
+            self.save_to_disk(target)

        return self._success_response(target, "Entry added.")

@@ -206,44 +241,47 @@ class MemoryStore:
        if scan_error:
            return {"success": False, "error": scan_error}

-        entries = self._entries_for(target)
-        matches = [(i, e) for i, e in enumerate(entries) if old_text in e]
+        with self._file_lock(self._path_for(target)):
+            self._reload_target(target)

-        if len(matches) == 0:
-            return {"success": False, "error": f"No entry matched '{old_text}'."}
+            entries = self._entries_for(target)
+            matches = [(i, e) for i, e in enumerate(entries) if old_text in e]

-        if len(matches) > 1:
-            # If all matches are identical (exact duplicates), operate on the first one
-            unique_texts = set(e for _, e in matches)
-            if len(unique_texts) > 1:
-                previews = [e[:80] + ("..." if len(e) > 80 else "") for _, e in matches]
+            if len(matches) == 0:
+                return {"success": False, "error": f"No entry matched '{old_text}'."}
+
+            if len(matches) > 1:
+                # If all matches are identical (exact duplicates), operate on the first one
+                unique_texts = set(e for _, e in matches)
+                if len(unique_texts) > 1:
+                    previews = [e[:80] + ("..." if len(e) > 80 else "") for _, e in matches]
+                    return {
+                        "success": False,
+                        "error": f"Multiple entries matched '{old_text}'. Be more specific.",
+                        "matches": previews,
+                    }
+                # All identical -- safe to replace just the first
+
+            idx = matches[0][0]
+            limit = self._char_limit(target)
+
+            # Check that replacement doesn't blow the budget
+            test_entries = entries.copy()
+            test_entries[idx] = new_content
+            new_total = len(ENTRY_DELIMITER.join(test_entries))
+
+            if new_total > limit:
                return {
                    "success": False,
-                    "error": f"Multiple entries matched '{old_text}'. Be more specific.",
-                    "matches": previews,
+                    "error": (
+                        f"Replacement would put memory at {new_total:,}/{limit:,} chars. "
+                        f"Shorten the new content or remove other entries first."
+                    ),
                }
-            # All identical -- safe to replace just the first

-        idx = matches[0][0]
-        limit = self._char_limit(target)
-
-        # Check that replacement doesn't blow the budget
-        test_entries = entries.copy()
-        test_entries[idx] = new_content
-        new_total = len(ENTRY_DELIMITER.join(test_entries))
-
-        if new_total > limit:
-            return {
-                "success": False,
-                "error": (
-                    f"Replacement would put memory at {new_total:,}/{limit:,} chars. "
-                    f"Shorten the new content or remove other entries first."
-                ),
-            }
-
-        entries[idx] = new_content
-        self._set_entries(target, entries)
-        self.save_to_disk(target)
+            entries[idx] = new_content
+            self._set_entries(target, entries)
+            self.save_to_disk(target)

        return self._success_response(target, "Entry replaced.")

@@ -253,28 +291,31 @@ class MemoryStore:
        if not old_text:
            return {"success": False, "error": "old_text cannot be empty."}

-        entries = self._entries_for(target)
-        matches = [(i, e) for i, e in enumerate(entries) if old_text in e]
+        with self._file_lock(self._path_for(target)):
+            self._reload_target(target)

-        if len(matches) == 0:
-            return {"success": False, "error": f"No entry matched '{old_text}'."}
+            entries = self._entries_for(target)
+            matches = [(i, e) for i, e in enumerate(entries) if old_text in e]

-        if len(matches) > 1:
-            # If all matches are identical (exact duplicates), remove the first one
-            unique_texts = set(e for _, e in matches)
-            if len(unique_texts) > 1:
-                previews = [e[:80] + ("..." if len(e) > 80 else "") for _, e in matches]
-                return {
-                    "success": False,
-                    "error": f"Multiple entries matched '{old_text}'. Be more specific.",
-                    "matches": previews,
-                }
-            # All identical -- safe to remove just the first
+            if len(matches) == 0:
+                return {"success": False, "error": f"No entry matched '{old_text}'."}

-        idx = matches[0][0]
-        entries.pop(idx)
-        self._set_entries(target, entries)
-        self.save_to_disk(target)
+            if len(matches) > 1:
+                # If all matches are identical (exact duplicates), remove the first one
+                unique_texts = set(e for _, e in matches)
+                if len(unique_texts) > 1:
+                    previews = [e[:80] + ("..." if len(e) > 80 else "") for _, e in matches]
+                    return {
+                        "success": False,
+                        "error": f"Multiple entries matched '{old_text}'. Be more specific.",
+                        "matches": previews,
+                    }
+                # All identical -- safe to remove just the first
+
+            idx = matches[0][0]
+            entries.pop(idx)
+            self._set_entries(target, entries)
+            self.save_to_disk(target)

        return self._success_response(target, "Entry removed.")
Author	SHA1	Message	Date
teknium1	d81de2f3d8	fix(memory): file-lock read-modify-write to prevent concurrent data loss Two concurrent gateway sessions calling memory add/replace/remove simultaneously could both read the old state, apply their changes independently, and write — the last writer silently drops the first writer's entry. Fix: wrap each mutation in a file lock (fcntl.flock on a .lock file). Under the lock, re-read entries from disk to get the latest state, apply the mutation, then write. This ensures concurrent writers serialize properly. The lock uses a separate .lock file since the memory file itself is atomically replaced via os.replace() (can't flock a replaced file). Readers remain lock-free since atomic rename ensures they always see a complete file.	2026-03-17 04:19:11 -07:00
Teknium	0878e5f4a8	Merge pull request #1724 from NousResearch/fix/model-metadata-fuzzy-match fix(metadata): fuzzy context length match can return wrong model's value	2026-03-17 04:13:56 -07:00
Teknium	72bcec0ce5	Merge pull request #1723 from NousResearch/fix/compression-attempts-persist fix(core): compression_attempts resets each iteration — allows unlimited compressions	2026-03-17 04:13:54 -07:00
Teknium	d604b9622c	Merge pull request #1722 from NousResearch/fix/run-agent-role-violations fix(core): message role alternation violations in JSON recovery and error handler	2026-03-17 04:13:51 -07:00
Teknium	cf0dd777c8	Merge pull request #1721 from NousResearch/fix/browser-session-race fix(browser): race condition in session creation orphans cloud sessions	2026-03-17 04:13:49 -07:00
Teknium	ec272ca8be	Merge pull request #1720 from NousResearch/fix/compressor-consecutive-role-violation fix(compressor): summary role can violate consecutive-role constraint	2026-03-17 04:13:46 -07:00
Teknium	99a44d87dc	Merge pull request #1718 from NousResearch/fix/messaging-toolset-missing fix(toolsets): add missing 'messaging' toolset — can't enable/disable send_message	2026-03-17 04:13:44 -07:00
Teknium	16f38abd25	Merge pull request #1717 from NousResearch/fix/length-continue-retries-reset fix(core): length_continue_retries never resets — later truncations get fewer retries	2026-03-17 04:13:41 -07:00
Teknium	cac3c4d45f	Merge pull request #1716 from NousResearch/fix/cron-double-load-jobs fix(cron): get_due_jobs reads jobs.json twice — race condition	2026-03-17 04:13:39 -07:00
Teknium	4167e2e294	Merge pull request #1714 from NousResearch/fix/anthropic-tool-choice-none fix(anthropic): tool_choice 'none' still allows tool calls	2026-03-17 04:13:36 -07:00
Teknium	6ddb9ee3e3	Merge pull request #1713 from NousResearch/fix/auxiliary-is-nous-reset fix(aux): auxiliary_is_nous flag never resets — leaks Nous tags to other providers	2026-03-17 04:13:33 -07:00
Teknium	05aefeddc7	Merge pull request #1711 from NousResearch/fix/matrix-mattermost-mark-connected fix(gateway): Matrix and Mattermost never report as connected	2026-03-17 04:13:31 -07:00
teknium1	9db75fcfc2	fix(metadata): fuzzy context length match prefers longest key The fuzzy match for model context lengths iterated dict insertion order. Shorter model names (e.g. 'gpt-5') could match before more specific ones (e.g. 'gpt-5.4-pro'), returning the wrong context length. Sort by key length descending so more specific model names always match first.	2026-03-17 04:12:08 -07:00
teknium1	1264275cc3	fix(core): compression_attempts counter resets each loop iteration compression_attempts was initialized inside the outer while loop, resetting to 0 on every iteration. Since compression triggers a 'continue' back to the top of the loop, the counter never accumulated past 1 — effectively allowing unlimited compression attempts. Move initialization before the outer while loop so the cap of 3 applies across the entire run_conversation() call.	2026-03-17 04:11:32 -07:00
teknium1	cd6dc4ef7e	fix(core): message role violations in JSON recovery and error handler Two edge cases could inject messages that violate role alternation: 1. Invalid JSON recovery (line ~5985): After 3 retries of invalid JSON tool args, a user-role recovery message was injected. But the assistant's tool_calls were never appended, so the sequence could become user → user. Fix: append the assistant message with its tool_calls, then respond with proper tool-role error results. 2. System error handler (line ~6238): Always injected a user-role error message, which creates consecutive user messages if the last message was already user. Fix: dynamically choose the role based on the last message to maintain alternation.	2026-03-17 04:10:41 -07:00
teknium1	8cd4a96686	fix(browser): race condition in session creation can orphan cloud sessions Two concurrent threads (e.g. parallel subagents) could both pass the 'task_id in _active_sessions' check, both create cloud sessions via network calls, and then one would overwrite the other — leaking the first cloud session. Add double-check after the lock is re-acquired: if another thread already created a session while we were doing the network call, use the existing one instead of orphaning it.	2026-03-17 04:09:16 -07:00
teknium1	344f3771cb	fix(compressor): summary role can create consecutive same-role messages The summary message role was determined only by the last head message, ignoring the first tail message. This could create consecutive user messages (rejected by Anthropic) when the tail started with 'user'. Now checks both neighbors. Priority: avoid colliding with the head (already committed). If the chosen role also collides with the tail, flip it — but only if flipping wouldn't re-collide with the head.	2026-03-17 04:08:37 -07:00
teknium1	24282dceb1	fix(core): reset length_continue_retries after successful continuation length_continue_retries and truncated_response_prefix were initialized once before the outer loop and never reset after a successful continuation. If a conversation hit length truncation once (counter=1), succeeded on continuation, did more tool calls, then hit length again, the counter started at 1 instead of 0 — reducing available retries from 3 to 2. The stale truncated_response_prefix would also leak into the next response. Reset both after the prefix is consumed on a successful final response.	2026-03-17 04:05:20 -07:00
teknium1	1f0bb8742f	fix(cron): get_due_jobs read jobs.json twice creating race window get_due_jobs() called load_jobs() twice: once for filtering (with _apply_skill_fields) and once for saving updates. Between the two reads, another process could modify jobs.json, causing the filtering and saving to operate on different versions. Fix: load once, deepcopy for the skill-applied working list.	2026-03-17 04:03:42 -07:00
teknium1	0de75505f3	fix(anthropic): tool_choice 'none' still allowed tool calls When tool_choice was 'none', the code did 'pass' — no tool_choice was sent but tools were still included in the request. Anthropic defaults to 'auto' when tools are present, so the model could still call tools despite the caller requesting 'none'. Fix: omit tools entirely from the request when tool_choice is 'none', which is the only way to prevent tool use with the Anthropic API.	2026-03-17 04:02:49 -07:00
teknium1	e5a244ad5d	fix(aux): reset auxiliary_is_nous flag on each resolution attempt The module-level auxiliary_is_nous was set to True by _try_nous() and never reset. In long-running gateway processes, once Nous was resolved as auxiliary provider, the flag stayed True forever — even if subsequent resolutions chose a different provider (e.g. OpenRouter). This caused Nous product tags to be sent to non-Nous providers. Reset the flag at the start of _resolve_auto() so only the winning provider's flag persists.	2026-03-17 04:02:15 -07:00
teknium1	b111f2a779	fix(gateway): Matrix and Mattermost never report as connected Neither adapter called _mark_connected() after successful connect(), so _running stayed False, runtime status never showed 'connected', and /status reported them as offline even while actively processing messages. Add _mark_connected() calls matching the pattern used by Telegram and DingTalk adapters.	2026-03-17 04:01:02 -07:00