106 changed files with 185 additions and 22532 deletions
--- a/CLAUDE.md
+++ b/CLAUDE.md
@ -1,233 +0,0 @@
-# CLAUDE.md
-
-This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
-
-## Project Overview
-
-AI Robot Core is a multi-tenant AI service platform providing intelligent chat, RAG (Retrieval-Augmented Generation) knowledge base, and LLM integration capabilities. The system consists of a Python FastAPI backend (ai-service) and a Vue.js admin frontend (ai-service-admin).
-
-## Architecture
-
-### Service Components
-
- **ai-service**: FastAPI backend (port 8080 internal, 8182 external)
-  - Multi-tenant isolation via `X-Tenant-Id` header
-  - SSE streaming support via `Accept: text/event-stream`
-  - RAG-powered responses with confidence scoring
-  - Intent-driven script flows and metadata governance
-
- **ai-service-admin**: Vue 3 + Element Plus frontend (port 80 internal, 8183 external)
-  - Admin UI for knowledge base, LLM config, RAG experiments
-  - Nginx reverse proxy to backend at `/api/*`
-
- **postgres**: PostgreSQL 15 database (port 5432)
-  - Chat sessions, messages, knowledge base metadata
-  - Multi-tenant data isolation
-
- **qdrant**: Vector database (port 6333)
-  - Document embeddings and vector search
-  - Collections prefixed with `kb_`
-
- **ollama**: Local embedding model service (port 11434)
-  - Default: `nomic-embed-text` (768 dimensions)
-  - Recommended: `toshk0/nomic-embed-text-v2-moe:Q6_K`
-
-### Key Architectural Patterns
-
-**Multi-Tenancy**: All data is scoped by `tenant_id`. The `TenantContextMiddleware` extracts tenant from `X-Tenant-Id` header and stores in request state. Database queries must filter by tenant_id.
-
-**Service Layer Organization**:
- `app/services/llm/`: LLM provider adapters (OpenAI, DeepSeek, Ollama)
- `app/services/embedding/`: Embedding providers (OpenAI, Ollama, Nomic)
- `app/services/retrieval/`: Vector search and indexing
- `app/services/document/`: Document parsers (PDF, Word, Excel, Text)
- `app/services/flow/`: Intent-driven script flow engine
- `app/services/guardrail/`: Input/output filtering and safety
- `app/services/monitoring/`: Dashboard metrics and logging
- `app/services/mid/`: Mid-platform dialogue and session management
-
-**API Structure**:
- `/ai/chat`: Main chat endpoint (supports JSON and SSE streaming)
- `/ai/health`: Health check
- `/admin/*`: Admin endpoints for configuration and management
- `/mid/*`: Mid-platform API for dialogue sessions and messages
-
-**Configuration**: Uses `pydantic-settings` with `AI_SERVICE_` prefix. All settings in `app/core/config.py` can be overridden via environment variables.
-
-**Database**: SQLModel (SQLAlchemy + Pydantic) with async PostgreSQL. Entities in `app/models/entities.py`. Session factory in `app/core/database.py`.
-
-## Development Commands
-
-### Backend (ai-service)
-
-```bash
-cd ai-service
-
-# Install dependencies (development)
-pip install -e ".[dev]"
-
-# Run development server
-uvicorn app.main:app --reload --port 8000
-
-# Run tests
-pytest
-
-# Run specific test
-pytest tests/test_confidence.py -v
-
-# Run tests with coverage
-pytest --cov=app --cov-report=html
-
-# Lint with ruff
-ruff check app/
-ruff format app/
-
-# Type check with mypy
-mypy app/
-
-# Database migrations
-python scripts/migrations/run_migration.py scripts/migrations/005_create_mid_tables.sql
-```
-
-### Frontend (ai-service-admin)
-
-```bash
-cd ai-service-admin
-
-# Install dependencies
-npm install
-
-# Run development server (port 5173)
-npm run dev
-
-# Build for production
-npm run build
-
-# Preview production build
-npm run preview
-```
-
-### Docker Compose
-
-```bash
-# Start all services
-docker compose up -d
-
-# Rebuild and start
-docker compose up -d --build
-
-# View logs
-docker compose logs -f ai-service
-docker compose logs -f ai-service-admin
-
-# Stop all services
-docker compose down
-
-# Stop and remove volumes (clears data)
-docker compose down -v
-
-# Pull embedding model in Ollama container
-docker exec -it ai-ollama ollama pull toshk0/nomic-embed-text-v2-moe:Q6_K
-```
-
-## Important Conventions
-
-### Acceptance Criteria Traceability
-
-All code must reference AC (Acceptance Criteria) codes in docstrings and comments:
- Format: `[AC-AISVC-XX]` for ai-service
- Example: `[AC-AISVC-01] Centralized configuration`
- See `spec/contracting.md` for contract maturity levels (L0-L3)
-
-### OpenAPI Contract Management
-
- Provider API: `openapi.provider.yaml` (APIs this module provides)
- Consumer API: `openapi.deps.yaml` (APIs this module depends on)
- Must declare `info.x-contract-level: L0|L1|L2|L3`
- L2 required before merging to main
- Run contract checks: `scripts/check-openapi-level.sh`, `scripts/check-openapi-diff.sh`
-
-### Multi-Tenant Data Access
-
-Always filter by tenant_id when querying database:
-```python
-from app.core.tenant import get_current_tenant_id
-
-tenant_id = get_current_tenant_id()
-result = await session.exec(
-    select(ChatSession).where(ChatSession.tenant_id == tenant_id)
-)
-```
-
-### SSE Streaming Pattern
-
-Check `Accept` header for streaming mode:
-```python
-from app.core.sse import create_sse_response
-
-if request.headers.get("accept") == "text/event-stream":
-    return create_sse_response(event_generator())
-else:
-    return JSONResponse({"response": "..."})
-```
-
-### Error Handling
-
-Use custom exceptions from `app/core/exceptions.py`:
-```python
-from app.core.exceptions import AIServiceException, ErrorCode
-
-raise AIServiceException(
-    code=ErrorCode.KNOWLEDGE_BASE_NOT_FOUND,
-    message="Knowledge base not found",
-    details={"kb_id": kb_id}
-)
-```
-
-### Configuration Access
-
-```python
-from app.core.config import get_settings
-
-settings = get_settings()  # Cached singleton
-llm_model = settings.llm_model
-```
-
-### Embedding Configuration
-
-Embedding config is persisted to `config/embedding_config.json` and loaded at startup. Use `app/services/embedding/factory.py` to get the configured provider.
-
-## Testing Strategy
-
- Unit tests in `tests/test_*.py`
- Use `pytest-asyncio` for async tests
- Fixtures in `tests/conftest.py`
- Mock external services (LLM, Qdrant) in tests
- Integration tests require running services (use `docker compose up -d`)
-
-## Session Handoff Protocol
-
-For complex multi-phase tasks, use the Session Handoff Protocol (v2.0):
- Progress docs in `docs/progproject]-progress.md`
- Must include: task overview, requirements reference, technical context, next steps
- See `~/.claude/specs/session-handoff-protocol-ai-ref.md` for details
- Stop proactively after 40-50 tool calls or phase completion
-
-## Common Pitfalls
-
-1. **Tenant Isolation**: Never query without tenant_id filter
-2. **Embedding Model Changes**: Switching models requires rebuilding all knowledge bases (vectors are incompatible)
-3. **SSE Streaming**: Must yield events in correct format: `data: {json}\n\n`
-4. **API Key**: Backend auto-generates default key on first startup (check logs)
-5. **Long-Running Commands**: Don't use `--watch` modes in tests/dev servers via bash tool
-6. **Windows Paths**: Use forward slashes in code, even on Windows (Python handles conversion)
-
-## Key Files
-
- [app/main.py](ai-service/app/main.py): FastAPI app entry point
- [app/core/config.py](ai-service/app/core/config.py): Configuration settings
- [app/models/entities.py](ai-service/app/models/entities.py): Database models
- [app/api/chat.py](ai-service/app/api/chat.py): Main chat endpoint
- [app/services/flow/engine.py](ai-service/app/services/flow/engine.py): Intent-driven flow engine
- [docker-compose.yaml](docker-compose.ya orchestration
- [spec/contracting.md](spec/coing.md): Contract maturity rules
--- a/ai-service-admin/src/App.vue
+++ b/ai-service-admin/src/App.vue
@ -54,18 +54,10 @@
            <el-icon><Warning /></el-icon>
            <span>输出护栏</span>
          </router-link>
-          <router-link to="/admin/mid-platform-playground" class="nav-item" :class="{ active: isActive('/admin/mid-platform-playground') }">
-            <el-icon><ChatLineRound /></el-icon>
-            <span>中台联调</span>
-          </router-link>
          <router-link to="/admin/metadata-schemas" class="nav-item" :class="{ active: isActive('/admin/metadata-schemas') }">
            <el-icon><Setting /></el-icon>
            <span>元数据配置</span>
          </router-link>
-          <router-link to="/admin/slot-definitions" class="nav-item" :class="{ active: isActive('/admin/slot-definitions') }">
-            <el-icon><Grid /></el-icon>
-            <span>槽位定义</span>
-          </router-link>
        </nav>
      </div>
      <div class="header-right">
@ -98,7 +90,7 @@ import { ref, onMounted } from 'vue'
 import { useRoute } from 'vue-router'
 import { useTenantStore } from '@/stores/tenant'
 import { getTenantList, type Tenant } from '@/api/tenant'
-import { Odometer, FolderOpened, Cpu, Monitor, Connection, ChatDotSquare, Document, Aim, Share, Warning, Setting, ChatLineRound, Grid } from '@element-plus/icons-vue'
+import { Odometer, FolderOpened, Cpu, Monitor, Connection, ChatDotSquare, Document, Aim, Share, Warning, Setting } from '@element-plus/icons-vue'
 import { ElMessage } from 'element-plus'

 const route = useRoute()
--- a/ai-service-admin/src/api/metadata-schema.ts
+++ b/ai-service-admin/src/api/metadata-schema.ts
@ -5,20 +5,12 @@ import type {
  MetadataFieldUpdateRequest,
  MetadataFieldListResponse,
  MetadataPayload,
-  MetadataScope,
-  FieldRole
+  MetadataScope
 } from '@/types/metadata'

 export const metadataSchemaApi = {
-  list: (status?: 'draft' | 'active' | 'deprecated', fieldRole?: FieldRole) =>
-    request<MetadataFieldListResponse>({ 
-      method: 'GET', 
-      url: '/admin/metadata-schemas', 
-      params: { 
-        ...(status ? { status } : {}),
-        ...(fieldRole ? { field_role: fieldRole } : {})
-      } 
-    }),
+  list: (status?: 'draft' | 'active' | 'deprecated') =>
+    request<MetadataFieldListResponse>({ method: 'GET', url: '/admin/metadata-schemas', params: status ? { status } : {} }),

  get: (id: string) =>
    request<MetadataFieldDefinition>({ method: 'GET', url: `/admin/metadata-schemas/${id}` }),
@ -39,13 +31,6 @@ export const metadataSchemaApi = {
      params: { scope, include_deprecated: includeDeprecated }
    }),

-  getByRole: (role: FieldRole, includeDeprecated = false) =>
-    request<MetadataFieldListResponse>({
-      method: 'GET',
-      url: '/admin/metadata-schemas/by-role',
-      params: { role, include_deprecated: includeDeprecated }
-    }),
-
  validate: (metadata: MetadataPayload, scope?: MetadataScope) =>
    request<{ valid: boolean; errors?: { field_key: string; message: string }[] }>({
      method: 'POST',
--- a/ai-service-admin/src/api/mid-platform.ts
+++ b/ai-service-admin/src/api/mid-platform.ts
@ -1,221 +0,0 @@
-import request from '@/utils/request'
-
-export type MidMode = 'agent' | 'micro_flow' | 'fixed' | 'transfer'
-export type SessionMode = 'BOT_ACTIVE' | 'HUMAN_ACTIVE'
-
-export interface DialogueMessage {
-  role: 'user' | 'assistant' | 'human'
-  content: string
-}
-
-export interface InterruptedSegment {
-  segment_id: string
-  content: string
-}
-
-export interface DialogueRequest {
-  session_id: string
-  user_id?: string
-  user_message: string
-  history: DialogueMessage[]
-  interrupted_segments?: InterruptedSegment[]
-  feature_flags?: {
-    agent_enabled?: boolean
-    rollback_to_legacy?: boolean
-  }
-}
-
-export interface ToolCallTrace {
-  tool_name: string
-  tool_type?: 'internal' | 'mcp'
-  registry_version?: string
-  auth_applied?: boolean
-  duration_ms: number
-  status: 'ok' | 'timeout' | 'error' | 'rejected'
-  error_code?: string
-  args_digest?: string
-  result_digest?: string
-}
-
-export interface DialogueResponse {
-  segments: Array<{
-    segment_id: string
-    text: string
-    delay_after: number
-  }>
-  trace: {
-    mode: MidMode
-    intent?: string
-    request_id?: string
-    generation_id?: string
-    guardrail_triggered?: boolean
-    fallback_reason_code?: string
-    react_iterations?: number
-    timeout_profile?: {
-      per_tool_timeout_ms?: number
-      end_to_end_timeout_ms?: number
-    }
-    high_risk_policy_set?: string[]
-    tools_used?: string[]
-    tool_calls?: ToolCallTrace[]
-    metrics_snapshot?: {
-      task_completion_rate?: number
-      slot_completion_rate?: number
-      wrong_transfer_rate?: number
-      no_recall_rate?: number
-      avg_latency_ms?: number
-    }
-  }
-}
-
-export function respondDialogue(data: DialogueRequest): Promise<DialogueResponse> {
-  return request({
-    url: '/mid/dialogue/respond',
-    method: 'post',
-    data
-  })
-}
-
-export function switchSessionMode(sessionId: string, mode: SessionMode, reason?: string): Promise<{ session_id: string; mode: SessionMode }> {
-  return request({
-    url: `/mid/sessions/${sessionId}/mode`,
-    method: 'post',
-    data: {
-      mode,
-      reason
-    }
-  })
-}
-
-export function reportMessages(data: {
-  session_id: string
-  messages: Array<{
-    role: 'user' | 'assistant' | 'human' | 'system'
-    content: string
-    source: 'bot' | 'human' | 'channel'
-    timestamp: string
-    segment_id?: string
-  }>
-}): Promise<void> {
-  return request({
-    url: '/mid/messages/report',
-    method: 'post',
-    data
-  })
-}
-
-export interface CreateShareRequest {
-  title?: string
-  description?: string
-  expires_in_days?: number
-  max_concurrent_users?: number
-}
-
-export interface ShareResponse {
-  share_token: string
-  share_url: string
-  expires_at: string
-  title?: string
-  description?: string
-  max_concurrent_users: number
-}
-
-export interface ShareListItem {
-  share_token: string
-  share_url: string
-  title?: string
-  description?: string
-  expires_at: string
-  is_active: boolean
-  max_concurrent_users: number
-  current_users: number
-  created_at: string
-}
-
-export interface ShareListResponse {
-  shares: ShareListItem[]
-}
-
-export interface SharedSessionInfo {
-  session_id: string
-  title?: string
-  description?: string
-  expires_at: string
-  max_concurrent_users: number
-  current_users: number
-  history: DialogueMessage[]
-}
-
-export interface CreatePublicShareTokenRequest {
-  tenant_id?: string
-  api_key?: string
-  session_id: string
-  user_id?: string
-  expires_in_minutes?: number
-}
-
-export interface CreatePublicShareTokenResponse {
-  share_token: string
-  share_url: string
-  expires_at: string
-}
-
-export function createShare(sessionId: string, data: CreateShareRequest = {}): Promise<ShareResponse> {
-  return request({
-    url: `/mid/sessions/${sessionId}/share`,
-    method: 'post',
-    data
-  })
-}
-
-export function createPublicShareToken(data: CreatePublicShareTokenRequest): Promise<CreatePublicShareTokenResponse> {
-  return request({
-    url: '/openapi/v1/share/token',
-    method: 'post',
-    data
-  })
-}
-
-export function getSharedSession(shareToken: string): Promise<SharedSessionInfo> {
-  return request({
-    url: `/mid/share/${shareToken}`,
-    method: 'get'
-  })
-}
-
-export function listShares(sessionId: string, includeExpired = false): Promise<ShareListResponse> {
-  return request({
-    url: `/mid/sessions/${sessionId}/shares`,
-    method: 'get',
-    params: { include_expired: includeExpired }
-  })
-}
-
-export function deleteShare(shareToken: string): Promise<{ success: boolean; message: string }> {
-  return request({
-    url: `/mid/share/${shareToken}`,
-    method: 'delete'
-  })
-}
-
-export function joinSharedSession(shareToken: string): Promise<SharedSessionInfo> {
-  return request({
-    url: `/mid/share/${shareToken}/join`,
-    method: 'post'
-  })
-}
-
-export function leaveSharedSession(shareToken: string): Promise<{ success: boolean; current_users: number }> {
-  return request({
-    url: `/mid/share/${shareToken}/leave`,
-    method: 'post'
-  })
-}
-
-export function sendSharedMessage(shareToken: string, userMessage: string): Promise<{ success: boolean; message: string; session_id: string }> {
-  return request({
-    url: `/mid/share/${shareToken}/message`,
-    method: 'post',
-    data: { user_message: userMessage }
-  })
-}
--- a/ai-service-admin/src/api/slot-definition.ts
+++ b/ai-service-admin/src/api/slot-definition.ts
@ -1,53 +0,0 @@
-import request from '@/utils/request'
-import type {
-  SlotDefinition,
-  SlotDefinitionCreateRequest,
-  SlotDefinitionUpdateRequest,
-  RuntimeSlotValue
-} from '@/types/slot-definition'
-import type { FieldRole } from '@/types/metadata'
-
-export const slotDefinitionApi = {
-  list: (required?: boolean) =>
-    request<SlotDefinition[]>({
-      method: 'GET',
-      url: '/admin/slot-definitions',
-      params: required !== undefined ? { required } : {}
-    }),
-
-  get: (id: string) =>
-    request<SlotDefinition>({ method: 'GET', url: `/admin/slot-definitions/${id}` }),
-
-  create: (data: SlotDefinitionCreateRequest) =>
-    request<SlotDefinition>({ method: 'POST', url: '/admin/slot-definitions', data }),
-
-  update: (id: string, data: SlotDefinitionUpdateRequest) =>
-    request<SlotDefinition>({ method: 'PUT', url: `/admin/slot-definitions/${id}`, data }),
-
-  delete: (id: string) =>
-    request({ method: 'DELETE', url: `/admin/slot-definitions/${id}` }),
-
-  getByRole: (role: FieldRole) =>
-    request<SlotDefinition[]>({
-      method: 'GET',
-      url: '/mid/slots/by-role',
-      params: { role }
-    }),
-
-  getSlotValue: (slotKey: string, userId?: string, sessionId?: string) =>
-    request<RuntimeSlotValue>({
-      method: 'GET',
-      url: `/mid/slots/${slotKey}`,
-      params: {
-        ...(userId ? { user_id: userId } : {}),
-        ...(sessionId ? { session_id: sessionId } : {})
-      }
-    })
-}
-
-export type {
-  SlotDefinition,
-  SlotDefinitionCreateRequest,
-  SlotDefinitionUpdateRequest,
-  RuntimeSlotValue
-}
--- a/ai-service-admin/src/components/metadata/FieldRolesSelector.vue
+++ b/ai-service-admin/src/components/metadata/FieldRolesSelector.vue
@ -1,114 +0,0 @@
-<template>
-  <div class="field-roles-selector">
-    <label class="selector-label">
-      字段角色
-      <el-tooltip content="为字段分配角色，工具将按角色精准消费字段" placement="top">
-        <el-icon class="help-icon"><QuestionFilled /></el-icon>
-      </el-tooltip>
-    </label>
-    <el-checkbox-group v-model="selectedRoles" class="roles-checkbox-group">
-      <el-checkbox
-        v-for="role in availableRoles"
-        :key="role.value"
-        :value="role.value"
-        class="role-checkbox"
-      >
-        <div class="role-item">
-          <span class="role-label">{{ role.label }}</span>
-          <el-tooltip :content="role.description" placement="top">
-            <el-icon class="role-help-icon"><QuestionFilled /></el-icon>
-          </el-tooltip>
-        </div>
-      </el-checkbox>
-    </el-checkbox-group>
-  </div>
-</template>
-
-<script setup lang="ts">
-import { computed } from 'vue'
-import { QuestionFilled } from '@element-plus/icons-vue'
-import type { FieldRole } from '@/types/metadata'
-
-const props = defineProps<{
-  modelValue: FieldRole[]
-}>()
-
-const emit = defineEmits<{
-  (e: 'update:modelValue', value: FieldRole[]): void
-}>()
-
-const availableRoles: { value: FieldRole; label: string; description: string }[] = [
-  {
-    value: 'resource_filter',
-    label: '资源过滤',
-    description: '用于 KB 文档检索时的元数据过滤，kb_search_dynamic 工具将消费此类字段'
-  },
-  {
-    value: 'slot',
-    label: '运行时槽位',
-    description: '对话流程中的结构化槽位，用于信息收集，memory_recall 工具将消费此类字段'
-  },
-  {
-    value: 'prompt_var',
-    label: '提示词变量',
-    description: '注入到 LLM Prompt 中的变量，template_engine 将消费此类字段'
-  },
-  {
-    value: 'routing_signal',
-    label: '路由信号',
-    description: '用于意图路由和风险判断的信号，intent_hint/high_risk_check 工具将消费此类字段'
-  }
-]
-
-const selectedRoles = computed({
-  get: () => props.modelValue || [],
-  set: (val: FieldRole[]) => emit('update:modelValue', val)
-})
-</script>
-
-<style scoped lang="scss">
-.field-roles-selector {
-  width: 100%;
-}
-
-.selector-label {
-  display: flex;
-  align-items: center;
-  gap: 4px;
-  font-size: 14px;
-  color: var(--el-text-color-regular);
-  margin-bottom: 8px;
-}
-
-.help-icon {
-  font-size: 14px;
-  color: var(--el-text-color-secondary);
-  cursor: help;
-}
-
-.roles-checkbox-group {
-  display: flex;
-  flex-wrap: wrap;
-  gap: 12px;
-}
-
-.role-checkbox {
-  margin-right: 0 !important;
-}
-
-.role-item {
-  display: flex;
-  align-items: center;
-  gap: 4px;
-}
-
-.role-label {
-  font-size: 14px;
-}
-
-.role-help-icon {
-  font-size: 12px;
-  color: var(--el-text-color-secondary);
-  cursor: help;
-}
-</style>
--- a/ai-service-admin/src/router/index.ts
+++ b/ai-service-admin/src/router/index.ts
@ -5,12 +5,6 @@ const routes: Array<RouteRecordRaw> = [
    path: '/',
    redirect: '/dashboard'
  },
-  {
-    path: '/share/:token',
-    name: 'SharedSession',
-    component: () => import('@/views/share/index.vue'),
-    meta: { title: '共享对话', public: true }
-  },
  {
    path: '/dashboard',
    name: 'Dashboard',
@ -65,12 +59,6 @@ const routes: Array<RouteRecordRaw> = [
    component: () => import('@/views/admin/metadata-schema/index.vue'),
    meta: { title: '元数据模式配置' }
  },
-  {
-    path: '/admin/slot-definitions',
-    name: 'SlotDefinition',
-    component: () => import('@/views/admin/slot-definition/index.vue'),
-    meta: { title: '槽位定义管理' }
-  },
  {
    path: '/admin/intent-rules',
    name: 'IntentRule',
@ -89,12 +77,6 @@ const routes: Array<RouteRecordRaw> = [
    component: () => import('@/views/admin/guardrail/index.vue'),
    meta: { title: '输出护栏管理' }
  },
-  {
-    path: '/admin/mid-platform-playground',
-    name: 'MidPlatformPlayground',
-    component: () => import('@/views/admin/mid-platform-playground/index.vue'),
-    meta: { title: '中台联调工作台' }
-  },
  {
    path: '/admin/decomposition-templates',
    name: 'DecompositionTemplate',
--- a/ai-service-admin/src/types/metadata.ts
+++ b/ai-service-admin/src/types/metadata.ts
@ -1,7 +1,6 @@
 export type MetadataFieldType = 'string' | 'number' | 'boolean' | 'enum' | 'array_enum'
 export type MetadataFieldStatus = 'draft' | 'active' | 'deprecated'
 export type MetadataScope = 'kb_document' | 'intent_rule' | 'script_flow' | 'prompt_template'
-export type FieldRole = 'resource_filter' | 'slot' | 'prompt_var' | 'routing_signal'

 export interface MetadataFieldDefinition {
  id: string
@ -15,7 +14,6 @@ export interface MetadataFieldDefinition {
  scope: MetadataScope[]
  is_filterable: boolean
  is_rank_feature: boolean
-  field_roles: FieldRole[]
  status: MetadataFieldStatus
  created_at?: string
  updated_at?: string
@ -31,7 +29,6 @@ export interface MetadataFieldCreateRequest {
  scope: MetadataScope[]
  is_filterable?: boolean
  is_rank_feature?: boolean
-  field_roles?: FieldRole[]
  status: MetadataFieldStatus
 }

@ -43,7 +40,6 @@ export interface MetadataFieldUpdateRequest {
  scope?: MetadataScope[]
  is_filterable?: boolean
  is_rank_feature?: boolean
-  field_roles?: FieldRole[]
  status?: MetadataFieldStatus
 }

@ -95,17 +91,3 @@ export const STATUS_TAG_MAP: Record<MetadataFieldStatus, '' | 'success' | 'warni
  active: 'success',
  deprecated: 'danger'
 }
-
-export const FIELD_ROLE_OPTIONS: { value: FieldRole; label: string; description: string }[] = [
-  { value: 'resource_filter', label: '资源过滤', description: '用于 KB 文档检索时的元数据过滤' },
-  { value: 'slot', label: '运行时槽位', description: '对话流程中的结构化槽位，用于信息收集' },
-  { value: 'prompt_var', label: '提示词变量', description: '注入到 LLM Prompt 中的变量' },
-  { value: 'routing_signal', label: '路由信号', description: '用于意图路由和风险判断的信号' }
-]
-
-export const FIELD_ROLE_TAG_MAP: Record<FieldRole, '' | 'success' | 'warning' | 'danger' | 'info'> = {
-  resource_filter: '',
-  slot: 'success',
-  prompt_var: 'warning',
-  routing_signal: 'danger'
-}
--- a/ai-service-admin/src/types/prompt-template.ts
+++ b/ai-service-admin/src/types/prompt-template.ts
@ -85,7 +85,6 @@ export const SCENE_OPTIONS = [
  { value: 'summary', label: '摘要场景' },
  { value: 'translation', label: '翻译场景' },
  { value: 'code', label: '代码场景' },
-  { value: 'agent_react', label: 'Agent ReAct 场景' },
  { value: 'custom', label: '自定义场景' }
 ]

@ -99,6 +98,5 @@ export const BUILTIN_VARIABLES: PromptVariable[] = [
  { name: 'user_name', description: '用户名称' },
  { name: 'context', description: '检索上下文' },
  { name: 'query', description: '用户问题' },
-  { name: 'history', description: '对话历史' },
-  { name: 'available_tools', description: '可用工具列表（Agent ReAct 场景专用，自动注入已注册工具的名称、描述和参数）' }
+  { name: 'history', description: '对话历史' }
 ]
--- a/ai-service-admin/src/types/slot-definition.ts
+++ b/ai-service-admin/src/types/slot-definition.ts
@ -1,71 +0,0 @@
-export type SlotType = 'string' | 'number' | 'boolean' | 'enum' | 'array_enum'
-export type ExtractStrategy = 'rule' | 'llm' | 'user_input'
-export type SlotSource = 'user_confirmed' | 'rule_extracted' | 'llm_inferred' | 'default'
-
-export interface SlotDefinition {
-  id: string
-  tenant_id: string
-  slot_key: string
-  type: SlotType
-  required: boolean
-  extract_strategy?: ExtractStrategy
-  validation_rule?: string
-  ask_back_prompt?: string
-  default_value?: string | number | boolean | string[]
-  linked_field_id?: string
-  linked_field?: LinkedField
-  created_at?: string
-  updated_at?: string
-}
-
-export interface LinkedField {
-  id: string
-  field_key: string
-  label: string
-  type: string
-  field_roles: string[]
-}
-
-export interface SlotDefinitionCreateRequest {
-  tenant_id?: string
-  slot_key: string
-  type: SlotType
-  required: boolean
-  extract_strategy?: ExtractStrategy
-  validation_rule?: string
-  ask_back_prompt?: string
-  default_value?: string | number | boolean | string[]
-  linked_field_id?: string
-}
-
-export interface SlotDefinitionUpdateRequest {
-  type?: SlotType
-  required?: boolean
-  extract_strategy?: ExtractStrategy
-  validation_rule?: string
-  ask_back_prompt?: string
-  default_value?: string | number | boolean | string[]
-  linked_field_id?: string
-}
-
-export interface RuntimeSlotValue {
-  key: string
-  value: string | number | boolean | string[] | undefined
-  source: SlotSource
-  confidence: number
-  updated_at?: string
-}
-
-export const SLOT_TYPE_OPTIONS = [
-  { value: 'string', label: '文本' },
-  { value: 'number', label: '数字' },
-  { value: 'boolean', label: '布尔值' },
-  { value: 'enum', label: '单选枚举' },
-  { value: 'array_enum', label: '多选枚举' }
-]
-
-export const EXTRACT_STRATEGY_OPTIONS = [
-  { value: 'rule', label: '规则提取', description: '通过预定义规则从对话中提取' },
-  { value: 'llm', label: 'LLM 推断', description: '通过大语言模型推断槽位值' },
-  { value: 'user_input', label: '用户输入', description: '通过追问提示语让用户主动输入' }
-]
--- a/ai-service-admin/src/views/admin/metadata-schema/index.vue
+++ b/ai-service-admin/src/views/admin/metadata-schema/index.vue
@ -15,14 +15,6 @@
              :value="opt.value"
            />
          </el-select>
-          <el-select v-model="filterRole" placeholder="按角色筛选" clearable style="width: 140px;">
-            <el-option
-              v-for="opt in FIELD_ROLE_OPTIONS"
-              :key="opt.value"
-              :label="opt.label"
-              :value="opt.value"
-            />
-          </el-select>
          <el-button type="primary" @click="handleCreate">
            <el-icon><Plus /></el-icon>
            新建字段
@ -65,22 +57,6 @@
            </div>
          </template>
        </el-table-column>
-        <el-table-column prop="field_roles" label="字段角色" min-width="200">
-          <template #default="{ row }">
-            <div class="role-tags">
-              <el-tag
-                v-for="role in row.field_roles"
-                :key="role"
-                :type="FIELD_ROLE_TAG_MAP[role as FieldRole]"
-                size="small"
-                class="role-tag"
-              >
-                {{ getRoleLabel(role) }}
-              </el-tag>
-              <span v-if="!row.field_roles || row.field_roles.length === 0" class="no-role">-</span>
-            </div>
-          </template>
-        </el-table-column>
        <el-table-column prop="status" label="状态" width="100">
          <template #default="{ row }">
            <el-tag :type="STATUS_TAG_MAP[row.status as MetadataFieldStatus]" size="small">
@ -191,9 +167,6 @@
            </el-checkbox>
          </el-checkbox-group>
        </el-form-item>
-        <el-form-item label="字段角色">
-          <FieldRolesSelector v-model="formData.field_roles" />
-        </el-form-item>
        <el-row :gutter="20">
          <el-col :span="8">
            <el-form-item label="是否必填">
@ -271,26 +244,21 @@ import { ElMessage, ElMessageBox } from 'element-plus'
 import { Plus, Edit, Delete, Switch, ArrowDown } from '@element-plus/icons-vue'
 import type { FormInstance, FormRules } from 'element-plus'
 import { metadataSchemaApi } from '@/api/metadata-schema'
-import FieldRolesSelector from '@/components/metadata/FieldRolesSelector.vue'
 import {
  METADATA_STATUS_OPTIONS,
  METADATA_SCOPE_OPTIONS,
  METADATA_TYPE_OPTIONS,
  STATUS_TAG_MAP,
-  FIELD_ROLE_OPTIONS,
-  FIELD_ROLE_TAG_MAP,
  type MetadataFieldDefinition,
  type MetadataFieldCreateRequest,
  type MetadataFieldUpdateRequest,
  type MetadataFieldStatus,
-  type MetadataScope,
-  type FieldRole
+  type MetadataScope
 } from '@/types/metadata'

 const loading = ref(false)
 const fields = ref<MetadataFieldDefinition[]>([])
 const filterStatus = ref<MetadataFieldStatus | ''>('')
-const filterRole = ref<FieldRole | ''>('')
 const dialogVisible = ref(false)
 const isEdit = ref(false)
 const submitting = ref(false)
@ -308,7 +276,6 @@ const formData = reactive({
  scope: [] as MetadataScope[],
  is_filterable: true,
  is_rank_feature: false,
-  field_roles: [] as FieldRole[],
  status: 'draft' as MetadataFieldStatus
 })

@ -347,14 +314,10 @@ const getScopeLabel = (scope: MetadataScope) => {
  return METADATA_SCOPE_OPTIONS.find(o => o.value === scope)?.label || scope
 }

-const getRoleLabel = (role: FieldRole) => {
-  return FIELD_ROLE_OPTIONS.find(o => o.value === role)?.label || role
-}
-
 const fetchFields = async () => {
  loading.value = true
  try {
-    const res = await metadataSchemaApi.list(filterStatus.value || undefined, filterRole.value || undefined)
+    const res = await metadataSchemaApi.list(filterStatus.value || undefined)
    fields.value = res.items || []
  } catch (error: any) {
    ElMessage.error(error.response?.data?.message || '获取元数据字段失败')
@ -376,7 +339,6 @@ const handleCreate = () => {
    scope: [],
    is_filterable: true,
    is_rank_feature: false,
-    field_roles: [],
    status: 'draft'
  })
  dialogVisible.value = true
@ -395,7 +357,6 @@ const handleEdit = (field: MetadataFieldDefinition) => {
    scope: [...field.scope],
    is_filterable: field.is_filterable,
    is_rank_feature: field.is_rank_feature,
-    field_roles: field.field_roles || [],
    status: field.status
  })
  dialogVisible.value = true
@ -472,7 +433,6 @@ const handleSubmit = async () => {
        scope: formData.scope,
        is_filterable: formData.is_filterable,
        is_rank_feature: formData.is_rank_feature,
-        field_roles: formData.field_roles,
        status: formData.status
      }

@ -506,10 +466,6 @@ watch(filterStatus, () => {
  fetchFields()
 })

-watch(filterRole, () => {
-  fetchFields()
-})
-
 onMounted(() => {
  fetchFields()
 })
@ -573,20 +529,6 @@ onMounted(() => {
  margin: 0;
 }

-.role-tags {
-  display: flex;
-  flex-wrap: wrap;
-  gap: 4px;
-}
-
-.role-tag {
-  margin: 0;
-}
-
-.no-role {
-  color: var(--el-text-color-placeholder);
-}
-
 .field-hint {
  margin-top: 4px;
  font-size: 12px;
--- a/ai-service-admin/src/views/admin/mid-platform-playground/index.vue
+++ b/ai-service-admin/src/views/admin/mid-platform-playground/index.vue
@ -1,516 +0,0 @@
-<template>
-  <div class="playground-page">
-    <el-card shadow="never" class="control-card">
-      <template #header>
-        <div class="header-row">
-          <span>中台联调工作台</span>
-          <el-tag type="info">真人模拟 + 中台真实接口</el-tag>
-        </div>
-      </template>
-
-      <el-form inline>
-        <el-form-item label="Session ID">
-          <el-input v-model="sessionId" style="width: 220px" />
-        </el-form-item>
-        <el-form-item label="User ID">
-          <el-input v-model="userId" style="width: 180px" />
-        </el-form-item>
-        <el-form-item label="Agent 灰度">
-          <el-switch v-model="agentEnabled" />
-        </el-form-item>
-        <el-form-item label="回滚传统链路">
-          <el-switch v-model="rollbackToLegacy" />
-        </el-form-item>
-        <el-form-item label="会话模式">
-          <el-select v-model="sessionMode" style="width: 150px">
-            <el-option label="BOT_ACTIVE" value="BOT_ACTIVE" />
-            <el-option label="HUMAN_ACTIVE" value="HUMAN_ACTIVE" />
-          </el-select>
-        </el-form-item>
-        <el-form-item>
-          <el-button :loading="switchingMode" @click="handleSwitchMode">应用模式</el-button>
-        </el-form-item>
-      </el-form>
-    </el-card>
-
-    <el-row :gutter="16">
-      <el-col :span="16">
-        <el-card shadow="never" class="chat-card">
-          <template #header>
-            <div class="header-row">
-              <span>实时对话</span>
-              <el-space>
-                <el-button size="small" @click="clearConversation">清空</el-button>
-                <el-button size="small" type="warning" :loading="reporting" @click="handleReportMessages">上报消息</el-button>
-                <el-button size="small" type="primary" @click="showShareDialog">分享对话</el-button>
-              </el-space>
-            </div>
-          </template>
-
-          <div class="chat-list">
-            <div
-              v-for="(msg, idx) in conversation"
-              :key="idx"
-              class="chat-item"
-              :class="`role-${msg.role}`"
-            >
-              <div class="meta">
-                <el-tag size="small" :type="tagType(msg.role)">{{ msg.role }}</el-tag>
-                <span class="time">{{ msg.timestamp }}</span>
-                <span v-if="msg.segment_id" class="segment">{{ msg.segment_id }}</span>
-              </div>
-              <div class="content">{{ msg.content }}</div>
-            </div>
-          </div>
-
-          <div class="composer">
-            <el-input
-              v-model="userInput"
-              type="textarea"
-              :rows="3"
-              placeholder="输入用户消息后发送，触发 /mid/dialogue/respond"
-            />
-            <div class="composer-actions">
-              <el-button type="primary" :loading="sending" @click="handleSend">发送</el-button>
-            </div>
-          </div>
-        </el-card>
-      </el-col>
-
-      <el-col :span="8">
-        <el-card shadow="never" class="trace-card">
-          <template #header>
-            <div class="header-row">
-              <span>Trace 观测</span>
-            </div>
-          </template>
-
-          <div v-if="lastTrace">
-            <el-descriptions :column="1" border size="small">
-              <el-descriptions-item label="mode">{{ lastTrace.mode }}</el-descriptions-item>
-              <el-descriptions-item label="intent">{{ lastTrace.intent || '-' }}</el-descriptions-item>
-              <el-descriptions-item label="request_id">{{ lastTrace.request_id || '-' }}</el-descriptions-item>
-              <el-descriptions-item label="generation_id">{{ lastTrace.generation_id || '-' }}</el-descriptions-item>
-              <el-descriptions-item label="fallback_reason_code">{{ lastTrace.fallback_reason_code || '-' }}</el-descriptions-item>
-              <el-descriptions-item label="react_iterations">{{ lastTrace.react_iterations ?? '-' }}</el-descriptions-item>
-              <el-descriptions-item label="guardrail_triggered">{{ lastTrace.guardrail_triggered ?? '-' }}</el-descriptions-item>
-            </el-descriptions>
-
-            <div class="json-panel">
-              <div class="json-title">tool_calls</div>
-              <pre>{{ JSON.stringify(lastTrace.tool_calls || [], null, 2) }}</pre>
-            </div>
-
-            <div class="json-panel">
-              <div class="json-title">metrics_snapshot</div>
-              <pre>{{ JSON.stringify(lastTrace.metrics_snapshot || {}, null, 2) }}</pre>
-            </div>
-          </div>
-          <el-empty v-else description="暂无 Trace" :image-size="90" />
-        </el-card>
-
-        <el-card shadow="never" class="shares-card" style="margin-top: 16px">
-          <template #header>
-            <div class="header-row">
-              <span>历史分享（旧）</span>
-              <el-button size="small" text @click="loadShares">刷新</el-button>
-            </div>
-          </template>
-          <div v-if="shares.length" class="shares-list">
-            <div v-for="share in shares" :key="share.share_token" class="share-item">
-              <div class="share-info">
-                <span class="share-title">{{ share.title || '无标题' }}</span>
-                <el-tag size="small" :type="share.is_active ? 'success' : 'info'">
-                  {{ share.is_active ? '有效' : '已失效' }}
-                </el-tag>
-              </div>
-              <div class="share-meta">
-                <span>在线: {{ share.current_users }}/{{ share.max_concurrent_users }}</span>
-                <span>过期: {{ formatShareExpires(share.expires_at) }}</span>
-              </div>
-              <div class="share-actions">
-                <el-button size="small" text type="primary" @click="copyShareUrl(share.share_url)">复制链接</el-button>
-                <el-button size="small" text type="danger" @click="handleDeleteShare(share.share_token)">删除</el-button>
-              </div>
-            </div>
-          </div>
-          <el-empty v-else description="暂无分享链接" :image-size="60" />
-        </el-card>
-      </el-col>
-    </el-row>
-
-    <el-dialog v-model="shareDialogVisible" title="分享对话（安全链接）" width="500px">
-      <el-form :model="shareForm" label-width="120px">
-        <el-form-item label="有效期">
-          <el-select v-model="shareForm.expires_in_minutes" style="width: 100%">
-            <el-option label="15 分钟" :value="15" />
-            <el-option label="1 小时" :value="60" />
-            <el-option label="6 小时" :value="360" />
-            <el-option label="24 小时" :value="1440" />
-          </el-select>
-        </el-form-item>
-      </el-form>
-      <template #footer>
-        <el-button @click="shareDialogVisible = false">取消</el-button>
-        <el-button type="primary" :loading="creatingShare" @click="handleCreateShare">创建安全分享</el-button>
-      </template>
-    </el-dialog>
-
-    <el-dialog v-model="shareResultVisible" title="分享成功" width="500px">
-      <el-result icon="success" title="安全分享链接已创建" :sub-title="shareResult?.share_url">
-        <template #extra>
-          <el-input :model-value="shareResult?.share_url" readonly>
-            <template #append>
-              <el-button @click="copyShareUrl(shareResult?.share_url || '')">复制</el-button>
-            </template>
-          </el-input>
-        </template>
-      </el-result>
-    </el-dialog>
-  </div>
-</template>
-
-<script setup lang="ts">
-import { ref, onMounted } from 'vue'
-import { ElMessage } from 'element-plus'
-import {
-  respondDialogue,
-  switchSessionMode,
-  reportMessages,
-  createPublicShareToken,
-  listShares,
-  deleteShare,
-  type DialogueMessage,
-  type DialogueResponse,
-  type SessionMode,
-  type CreatePublicShareTokenResponse,
-  type ShareListItem
-} from '@/api/mid-platform'
-
-type ChatRole = 'user' | 'assistant' | 'human' | 'system'
-
-interface ChatItem {
-  role: ChatRole
-  content: string
-  timestamp: string
-  segment_id?: string
-}
-
-const sessionId = ref(`sess_${Date.now()}`)
-const userId = ref(`user_${Date.now()}`)
-const sessionMode = ref<SessionMode>('BOT_ACTIVE')
-const agentEnabled = ref(true)
-const rollbackToLegacy = ref(false)
-
-const sending = ref(false)
-const switchingMode = ref(false)
-const reporting = ref(false)
-
-const userInput = ref('')
-const conversation = ref<ChatItem[]>([])
-const lastTrace = ref<DialogueResponse['trace'] | null>(null)
-
-const shares = ref<ShareListItem[]>([])
-const shareDialogVisible = ref(false)
-const shareResultVisible = ref(false)
-const creatingShare = ref(false)
-const shareResult = ref<CreatePublicShareTokenResponse | null>(null)
-const shareForm = ref({
-  expires_in_minutes: 60
-})
-
-const now = () => new Date().toLocaleTimeString()
-
-const tagType = (role: ChatRole) => {
-  if (role === 'user') return 'info'
-  if (role === 'assistant') return 'success'
-  if (role === 'human') return 'warning'
-  return ''
-}
-
-const toHistory = (): DialogueMessage[] => {
-  return conversation.value
-    .filter((m) => m.role === 'user' || m.role === 'assistant' || m.role === 'human')
-    .map((m) => ({ role: m.role as 'user' | 'assistant' | 'human', content: m.content }))
-}
-
-const handleSend = async () => {
-  const content = userInput.value.trim()
-  if (!content) {
-    ElMessage.warning('请输入用户消息')
-    return
-  }
-
-  conversation.value.push({ role: 'user', content, timestamp: now() })
-  userInput.value = ''
-
-  sending.value = true
-  try {
-    const res = await respondDialogue({
-      session_id: sessionId.value,
-      user_id: userId.value,
-      user_message: content,
-      history: toHistory(),
-      feature_flags: {
-        agent_enabled: agentEnabled.value,
-        rollback_to_legacy: rollbackToLegacy.value
-      }
-    })
-
-    lastTrace.value = res.trace
-    for (const seg of res.segments || []) {
-      conversation.value.push({
-        role: 'assistant',
-        content: seg.text,
-        segment_id: seg.segment_id,
-        timestamp: now()
-      })
-    }
-
-    if (!res.segments?.length) {
-      ElMessage.warning('接口返回了空 segments')
-    }
-  } catch {
-    ElMessage.error('调用中台 respond 接口失败')
-  } finally {
-    sending.value = false
-  }
-}
-
-const handleSwitchMode = async () => {
-  switchingMode.value = true
-  try {
-    await switchSessionMode(sessionId.value, sessionMode.value, 'playground manual switch')
-    ElMessage.success(`会话模式已切换为 ${sessionMode.value}`)
-  } catch {
-    ElMessage.error('切换会话模式失败')
-  } finally {
-    switchingMode.value = false
-  }
-}
-
-const handleReportMessages = async () => {
-  if (!conversation.value.length) {
-    ElMessage.warning('暂无可上报消息')
-    return
-  }
-
-  reporting.value = true
-  try {
-    await reportMessages({
-      session_id: sessionId.value,
-      messages: conversation.value.map((m) => ({
-        role: m.role,
-        content: m.content,
-        source: m.role === 'human' ? 'human' : m.role === 'assistant' ? 'bot' : 'channel',
-        timestamp: new Date().toISOString(),
-        segment_id: m.segment_id
-      }))
-    })
-    ElMessage.success('消息上报成功')
-  } catch {
-    ElMessage.error('消息上报失败')
-  } finally {
-    reporting.value = false
-  }
-}
-
-const clearConversation = () => {
-  conversation.value = []
-  lastTrace.value = null
-}
-
-const showShareDialog = () => {
-  shareForm.value = {
-    expires_in_minutes: 60
-  }
-  shareDialogVisible.value = true
-}
-
-const handleCreateShare = async () => {
-  creatingShare.value = true
-  try {
-    const result = await createPublicShareToken({
-      session_id: sessionId.value,
-      user_id: userId.value,
-      expires_in_minutes: shareForm.value.expires_in_minutes
-    })
-    shareResult.value = result
-    shareDialogVisible.value = false
-    shareResultVisible.value = true
-    ElMessage.success('安全分享链接创建成功')
-  } catch {
-    ElMessage.error('创建安全分享链接失败')
-  } finally {
-    creatingShare.value = false
-  }
-}
-
-const loadShares = async () => {
-  try {
-    const result = await listShares(sessionId.value, true)
-    shares.value = result.shares
-  } catch {
-    // ignore
-  }
-}
-
-const handleDeleteShare = async (shareToken: string) => {
-  try {
-    await deleteShare(shareToken)
-    ElMessage.success('分享链接已删除')
-    await loadShares()
-  } catch {
-    ElMessage.error('删除分享链接失败')
-  }
-}
-
-const copyShareUrl = (url: string) => {
-  navigator.clipboard.writeText(url)
-  ElMessage.success('链接已复制到剪贴板')
-}
-
-const formatShareExpires = (expiresAt: string) => {
-  const expires = new Date(expiresAt)
-  return expires.toLocaleDateString()
-}
-
-onMounted(() => {
-  loadShares()
-})
-</script>
-
-<style scoped>
-.playground-page {
-  padding: 20px;
-}
-
-.control-card,
-.chat-card,
-.trace-card {
-  margin-bottom: 16px;
-}
-
-.header-row {
-  display: flex;
-  align-items: center;
-  justify-content: space-between;
-}
-
-.chat-list {
-  max-height: 520px;
-  overflow: auto;
-  padding: 8px;
-  background: var(--el-fill-color-lighter);
-  border-radius: 8px;
-}
-
-.chat-item {
-  margin-bottom: 10px;
-  padding: 10px;
-  border-radius: 8px;
-  background: #fff;
-}
-
-.chat-item.role-user {
-  border-left: 4px solid var(--el-color-info);
-}
-
-.chat-item.role-assistant {
-  border-left: 4px solid var(--el-color-success);
-}
-
-.chat-item.role-human {
-  border-left: 4px solid var(--el-color-warning);
-}
-
-.meta {
-  display: flex;
-  align-items: center;
-  gap: 8px;
-  margin-bottom: 6px;
-}
-
-.time,
-.segment {
-  font-size: 12px;
-  color: var(--el-text-color-secondary);
-}
-
-.content {
-  white-space: pre-wrap;
-  line-height: 1.5;
-}
-
-.composer {
-  margin-top: 12px;
-}
-
-.composer-actions {
-  margin-top: 8px;
-  display: flex;
-  justify-content: flex-end;
-}
-
-.json-panel {
-  margin-top: 12px;
-}
-
-.json-title {
-  font-size: 12px;
-  color: var(--el-text-color-secondary);
-  margin-bottom: 4px;
-}
-
-pre {
-  background: var(--el-fill-color-lighter);
-  padding: 8px;
-  border-radius: 6px;
-  max-height: 220px;
-  overflow: auto;
-  font-size: 12px;
-}
-
-.shares-card {
-  margin-bottom: 16px;
-}
-
-.shares-list {
-  max-height: 200px;
-  overflow-y: auto;
-}
-
-.share-item {
-  padding: 8px;
-  border-bottom: 1px solid var(--el-border-color-lighter);
-}
-
-.share-item:last-child {
-  border-bottom: none;
-}
-
-.share-info {
-  display: flex;
-  justify-content: space-between;
-  align-items: center;
-  margin-bottom: 4px;
-}
-
-.share-title {
-  font-weight: 500;
-  font-size: 13px;
-}
-
-.share-meta {
-  display: flex;
-  gap: 12px;
-  font-size: 12px;
-  color: var(--el-text-color-secondary);
-  margin-bottom: 4px;
-}
-
-.share-actions {
-  display: flex;
-  gap: 8px;
-}
-</style>
--- a/ai-service-admin/src/views/admin/slot-definition/index.vue
+++ b/ai-service-admin/src/views/admin/slot-definition/index.vue
@ -1,470 +0,0 @@
-<template>
-  <div class="slot-definition-page">
-    <div class="page-header">
-      <div class="header-content">
-        <div class="title-section">
-          <h1 class="page-title">槽位定义管理</h1>
-          <p class="page-desc">配置对话流程中的结构化槽位，用于信息收集和槽位填充。[AC-MRS-07,08]</p>
-        </div>
-        <div class="header-actions">
-          <el-select v-model="filterRequired" placeholder="按必填筛选" clearable style="width: 140px;">
-            <el-option label="必填" :value="true" />
-            <el-option label="可选" :value="false" />
-          </el-select>
-          <el-button type="primary" @click="handleCreate">
-            <el-icon><Plus /></el-icon>
-            新建槽位
-          </el-button>
-        </div>
-      </div>
-    </div>
-
-    <el-card shadow="hover" class="slot-card" v-loading="loading">
-      <el-table :data="slots" stripe style="width: 100%">
-        <el-table-column prop="slot_key" label="槽位标识" min-width="140">
-          <template #default="{ row }">
-            <code class="slot-key">{{ row.slot_key }}</code>
-          </template>
-        </el-table-column>
-        <el-table-column prop="type" label="类型" width="100">
-          <template #default="{ row }">
-            <el-tag size="small" type="info">{{ getTypeLabel(row.type) }}</el-tag>
-          </template>
-        </el-table-column>
-        <el-table-column prop="required" label="必填" width="80">
-          <template #default="{ row }">
-            <el-tag :type="row.required ? 'danger' : 'info'" size="small">
-              {{ row.required ? '必填' : '可选' }}
-            </el-tag>
-          </template>
-        </el-table-column>
-        <el-table-column prop="extract_strategy" label="提取策略" width="120">
-          <template #default="{ row }">
-            <el-tag v-if="row.extract_strategy" size="small">
-              {{ getExtractStrategyLabel(row.extract_strategy) }}
-            </el-tag>
-            <span v-else class="no-value">-</span>
-          </template>
-        </el-table-column>
-        <el-table-column prop="linked_field" label="关联字段" min-width="160">
-          <template #default="{ row }">
-            <div v-if="row.linked_field" class="linked-field">
-              <code class="field-key">{{ row.linked_field.field_key }}</code>
-              <el-tag size="small" type="success" class="linked-tag">已关联</el-tag>
-            </div>
-            <span v-else class="no-value">-</span>
-          </template>
-        </el-table-column>
-        <el-table-column prop="ask_back_prompt" label="追问提示语" min-width="180">
-          <template #default="{ row }">
-            <span v-if="row.ask_back_prompt" class="ask-back-prompt">{{ row.ask_back_prompt }}</span>
-            <span v-else class="no-value">-</span>
-          </template>
-        </el-table-column>
-        <el-table-column label="操作" width="150" fixed="right">
-          <template #default="{ row }">
-            <el-button type="primary" link size="small" @click="handleEdit(row)">
-              <el-icon><Edit /></el-icon>
-              编辑
-            </el-button>
-            <el-button type="danger" link size="small" @click="handleDelete(row)">
-              <el-icon><Delete /></el-icon>
-              删除
-            </el-button>
-          </template>
-        </el-table-column>
-      </el-table>
-      <el-empty v-if="!loading && slots.length === 0" description="暂无槽位定义" />
-    </el-card>
-
-    <el-dialog
-      v-model="dialogVisible"
-      :title="isEdit ? '编辑槽位定义' : '新建槽位定义'"
-      width="650px"
-      :close-on-click-modal="false"
-      destroy-on-close
-    >
-      <el-form :model="formData" :rules="formRules" ref="formRef" label-width="100px">
-        <el-row :gutter="20">
-          <el-col :span="12">
-            <el-form-item label="槽位标识" prop="slot_key">
-              <el-input
-                v-model="formData.slot_key"
-                placeholder="如：grade, subject"
-                :disabled="isEdit"
-              />
-              <div class="field-hint">仅允许小写字母、数字、下划线</div>
-            </el-form-item>
-          </el-col>
-          <el-col :span="12">
-            <el-form-item label="槽位类型" prop="type">
-              <el-select v-model="formData.type" style="width: 100%;">
-                <el-option
-                  v-for="opt in SLOT_TYPE_OPTIONS"
-                  :key="opt.value"
-                  :label="opt.label"
-                  :value="opt.value"
-                />
-              </el-select>
-            </el-form-item>
-          </el-col>
-        </el-row>
-        <el-row :gutter="20">
-          <el-col :span="12">
-            <el-form-item label="是否必填" prop="required">
-              <el-switch v-model="formData.required" />
-            </el-form-item>
-          </el-col>
-          <el-col :span="12">
-            <el-form-item label="提取策略" prop="extract_strategy">
-              <el-select v-model="formData.extract_strategy" style="width: 100%;" clearable placeholder="选择提取策略">
-                <el-option
-                  v-for="opt in EXTRACT_STRATEGY_OPTIONS"
-                  :key="opt.value"
-                  :label="opt.label"
-                  :value="opt.value"
-                >
-                  <div class="extract-option">
-                    <span>{{ opt.label }}</span>
-                    <span class="extract-desc">{{ opt.description }}</span>
-                  </div>
-                </el-option>
-              </el-select>
-            </el-form-item>
-          </el-col>
-        </el-row>
-        <el-form-item label="关联字段" prop="linked_field_id">
-          <el-select
-            v-model="formData.linked_field_id"
-            style="width: 100%;"
-            clearable
-            filterable
-            placeholder="选择关联的元数据字段"
-          >
-            <el-option
-              v-for="field in slotFields"
-              :key="field.id"
-              :label="`${field.label} (${field.field_key})`"
-              :value="field.id"
-            >
-              <div class="field-option">
-                <span class="field-label">{{ field.label }}</span>
-                <code class="field-key-small">{{ field.field_key }}</code>
-              </div>
-            </el-option>
-          </el-select>
-          <div class="field-hint">关联字段后，槽位值可同步到元数据字段</div>
-        </el-form-item>
-        <el-form-item label="校验规则" prop="validation_rule">
-          <el-input
-            v-model="formData.validation_rule"
-            type="textarea"
-            :rows="2"
-            placeholder="正则表达式或 JSON Schema 格式"
-          />
-        </el-form-item>
-        <el-form-item label="追问提示语" prop="ask_back_prompt">
-          <el-input
-            v-model="formData.ask_back_prompt"
-            type="textarea"
-            :rows="2"
-            placeholder="当槽位缺失时，系统将使用此提示语追问用户"
-          />
-        </el-form-item>
-        <el-form-item label="默认值" prop="default_value">
-          <el-input v-model="formData.default_value" placeholder="可选默认值" />
-        </el-form-item>
-      </el-form>
-      <template #footer>
-        <el-button @click="dialogVisible = false">取消</el-button>
-        <el-button type="primary" :loading="submitting" @click="handleSubmit">
-          {{ isEdit ? '保存' : '创建' }}
-        </el-button>
-      </template>
-    </el-dialog>
-  </div>
-</template>
-
-<script setup lang="ts">
-import { ref, reactive, onMounted, watch } from 'vue'
-import { ElMessage, ElMessageBox } from 'element-plus'
-import { Plus, Edit, Delete } from '@element-plus/icons-vue'
-import type { FormInstance, FormRules } from 'element-plus'
-import { slotDefinitionApi } from '@/api/slot-definition'
-import { metadataSchemaApi } from '@/api/metadata-schema'
-import {
-  SLOT_TYPE_OPTIONS,
-  EXTRACT_STRATEGY_OPTIONS,
-  type SlotDefinition,
-  type SlotDefinitionCreateRequest,
-  type SlotDefinitionUpdateRequest,
-  type SlotType,
-  type ExtractStrategy
-} from '@/types/slot-definition'
-import type { MetadataFieldDefinition } from '@/types/metadata'
-
-const loading = ref(false)
-const slots = ref<SlotDefinition[]>([])
-const slotFields = ref<MetadataFieldDefinition[]>([])
-const filterRequired = ref<boolean | undefined>(undefined)
-const dialogVisible = ref(false)
-const isEdit = ref(false)
-const submitting = ref(false)
-const formRef = ref<FormInstance>()
-
-const formData = reactive({
-  id: '',
-  slot_key: '',
-  type: 'string' as SlotType,
-  required: false,
-  extract_strategy: '' as ExtractStrategy | '',
-  validation_rule: '',
-  ask_back_prompt: '',
-  default_value: '',
-  linked_field_id: ''
-})
-
-const formRules: FormRules = {
-  slot_key: [
-    { required: true, message: '请输入槽位标识', trigger: 'blur' },
-    { pattern: /^[a-z][a-z0-9_]*$/, message: '以小写字母开头，仅允许小写字母、数字、下划线', trigger: 'blur' }
-  ],
-  type: [{ required: true, message: '请选择槽位类型', trigger: 'change' }],
-  required: [{ required: true, message: '请选择是否必填', trigger: 'change' }]
-}
-
-const getTypeLabel = (type: SlotType) => {
-  return SLOT_TYPE_OPTIONS.find(o => o.value === type)?.label || type
-}
-
-const getExtractStrategyLabel = (strategy: ExtractStrategy) => {
-  return EXTRACT_STRATEGY_OPTIONS.find(o => o.value === strategy)?.label || strategy
-}
-
-const fetchSlots = async () => {
-  loading.value = true
-  try {
-    const res = await slotDefinitionApi.list(filterRequired.value)
-    slots.value = res || []
-  } catch (error: any) {
-    ElMessage.error(error.response?.data?.message || '获取槽位定义失败')
-  } finally {
-    loading.value = false
-  }
-}
-
-const fetchSlotFields = async () => {
-  try {
-    const res = await metadataSchemaApi.getByRole('slot', true)
-    slotFields.value = res.items || []
-  } catch (error: any) {
-    console.error('获取槽位角色字段失败', error)
-  }
-}
-
-const handleCreate = () => {
-  isEdit.value = false
-  Object.assign(formData, {
-    id: '',
-    slot_key: '',
-    type: 'string',
-    required: false,
-    extract_strategy: '',
-    validation_rule: '',
-    ask_back_prompt: '',
-    default_value: '',
-    linked_field_id: ''
-  })
-  dialogVisible.value = true
-}
-
-const handleEdit = (slot: SlotDefinition) => {
-  isEdit.value = true
-  Object.assign(formData, {
-    id: slot.id,
-    slot_key: slot.slot_key,
-    type: slot.type,
-    required: slot.required,
-    extract_strategy: slot.extract_strategy || '',
-    validation_rule: slot.validation_rule || '',
-    ask_back_prompt: slot.ask_back_prompt || '',
-    default_value: slot.default_value ?? '',
-    linked_field_id: slot.linked_field_id || ''
-  })
-  dialogVisible.value = true
-}
-
-const handleDelete = async (slot: SlotDefinition) => {
-  try {
-    await ElMessageBox.confirm(
-      `确定要删除槽位「${slot.slot_key}」吗？[AC-MRS-16]`,
-      '删除确认',
-      { type: 'warning' }
-    )
-    await slotDefinitionApi.delete(slot.id)
-    ElMessage.success('删除成功')
-    fetchSlots()
-  } catch (error: any) {
-    if (error !== 'cancel') {
-      ElMessage.error(error.response?.data?.message || '删除失败')
-    }
-  }
-}
-
-const handleSubmit = async () => {
-  if (!formRef.value) return
-
-  await formRef.value.validate(async (valid) => {
-    if (!valid) return
-
-    submitting.value = true
-    try {
-      const data: SlotDefinitionCreateRequest | SlotDefinitionUpdateRequest = {
-        slot_key: formData.slot_key,
-        type: formData.type,
-        required: formData.required,
-        extract_strategy: formData.extract_strategy || undefined,
-        validation_rule: formData.validation_rule || undefined,
-        ask_back_prompt: formData.ask_back_prompt || undefined,
-        linked_field_id: formData.linked_field_id || undefined
-      }
-
-      if (formData.default_value !== '') {
-        data.default_value = formData.default_value
-      }
-
-      if (isEdit.value) {
-        await slotDefinitionApi.update(formData.id, data as SlotDefinitionUpdateRequest)
-        ElMessage.success('更新成功')
-      } else {
-        await slotDefinitionApi.create(data as SlotDefinitionCreateRequest)
-        ElMessage.success('创建成功')
-      }
-
-      dialogVisible.value = false
-      fetchSlots()
-    } catch (error: any) {
-      ElMessage.error(error.response?.data?.message || '操作失败')
-    } finally {
-      submitting.value = false
-    }
-  })
-}
-
-watch(filterRequired, () => {
-  fetchSlots()
-})
-
-onMounted(() => {
-  fetchSlots()
-  fetchSlotFields()
-})
-</script>
-
-<style scoped lang="scss">
-.slot-definition-page {
-  padding: 20px;
-}
-
-.page-header {
-  margin-bottom: 24px;
-}
-
-.header-content {
-  display: flex;
-  justify-content: space-between;
-  align-items: flex-start;
-}
-
-.title-section {
-  .page-title {
-    font-size: 24px;
-    font-weight: 600;
-    margin: 0 0 8px 0;
-    color: var(--el-text-color-primary);
-  }
-
-  .page-desc {
-    font-size: 14px;
-    color: var(--el-text-color-secondary);
-    margin: 0;
-  }
-}
-
-.header-actions {
-  display: flex;
-  align-items: center;
-  gap: 12px;
-}
-
-.slot-card {
-  border-radius: 8px;
-}
-
-.slot-key {
-  padding: 2px 6px;
-  background-color: var(--el-fill-color-light);
-  border-radius: 4px;
-  font-family: monospace;
-  font-size: 12px;
-}
-
-.no-value {
-  color: var(--el-text-color-placeholder);
-}
-
-.linked-field {
-  display: flex;
-  align-items: center;
-  gap: 8px;
-
-  .linked-tag {
-    margin-left: 4px;
-  }
-}
-
-.ask-back-prompt {
-  color: var(--el-text-color-regular);
-  font-size: 13px;
-  overflow: hidden;
-  text-overflow: ellipsis;
-  white-space: nowrap;
-  display: block;
-  max-width: 180px;
-}
-
-.field-hint {
-  margin-top: 4px;
-  font-size: 12px;
-  color: var(--el-text-color-secondary);
-}
-
-.extract-option {
-  display: flex;
-  flex-direction: column;
-  gap: 2px;
-
-  .extract-desc {
-    font-size: 12px;
-    color: var(--el-text-color-secondary);
-  }
-}
-
-.field-option {
-  display: flex;
-  align-items: center;
-  gap: 8px;
-
-  .field-label {
-    font-weight: 500;
-  }
-
-  .field-key-small {
-    font-size: 11px;
-    padding: 1px 4px;
-    background-color: var(--el-fill-color-light);
-    border-radius: 3px;
-    font-family: monospace;
-  }
-}
-</style>
--- a/ai-service-admin/src/views/share/index.vue
+++ b/ai-service-admin/src/views/share/index.vue
@ -1,349 +0,0 @@
-<template>
-  <div class="share-page">
-    <div v-if="loading" class="loading-container">
-      <div class="loading-spinner"></div>
-      <p class="loading-text">正在加载...</p>
-    </div>
-
-    <div v-else-if="error" class="error-container">
-      <div class="error-icon">{{ errorEmoji }}</div>
-      <h2 class="error-title">{{ errorTitle }}</h2>
-      <p class="error-message">{{ errorMessage }}</p>
-      <button class="back-btn" @click="goHome">返回首页</button>
-    </div>
-
-    <template v-else-if="sessionInfo">
-      <div class="page-header">
-        <div class="header-icon">💬</div>
-        <h1 class="page-title">{{ sessionInfo.title || '对话记录' }}</h1>
-        <p v-if="sessionInfo.description" class="page-desc">{{ sessionInfo.description }}</p>
-      </div>
-
-      <div class="chat-container">
-        <div class="chat-list" ref="chatListRef">
-          <template v-for="(msg, idx) in sessionInfo.history" :key="idx">
-            <div class="chat-bubble" :class="`bubble-${msg.role}`">
-              <div class="bubble-avatar">
-                <span v-if="msg.role === 'user' || msg.role === 'human'">👤</span>
-                <span v-else>🤖</span>
-              </div>
-              <div class="bubble-content">
-                <div class="bubble-text">{{ msg.content }}</div>
-              </div>
-            </div>
-          </template>
-          
-          <div v-if="!sessionInfo.history.length" class="empty-state">
-            <div class="empty-icon">📭</div>
-            <p>暂无对话内容</p>
-          </div>
-        </div>
-      </div>
-    </template>
-  </div>
-</template>
-
-<script setup lang="ts">
-import { ref, onMounted, onUnmounted, computed } from 'vue'
-import { useRoute, useRouter } from 'vue-router'
-import {
-  joinSharedSession,
-  leaveSharedSession,
-  type SharedSessionInfo
-} from '@/api/mid-platform'
-
-const route = useRoute()
-const router = useRouter()
-
-const loading = ref(true)
-const error = ref<string | null>(null)
-const errorTitle = ref('无法访问')
-const errorMessage = ref('')
-const sessionInfo = ref<SharedSessionInfo | null>(null)
-const chatListRef = ref<HTMLElement | null>(null)
-
-const errorEmoji = computed(() => {
-  if (error.value === 'expired') return '⏰'
-  if (error.value === 'not_found') return '🔍'
-  if (error.value === 'too_many_users') return '👥'
-  return '😕'
-})
-
-const goHome = () => {
-  router.push('/')
-}
-
-const loadSession = async () => {
-  const shareToken = route.params.token as string
-  
-  if (!shareToken) {
-    error.value = 'not_found'
-    errorTitle.value = '链接无效'
-    errorMessage.value = '这个链接好像不对哦，请检查一下'
-    loading.value = false
-    return
-  }
-
-  try {
-    loading.value = true
-    error.value = null
-    
-    sessionInfo.value = await joinSharedSession(shareToken)
-  } catch (err: any) {
-    const status = err?.response?.status
-    const detail = err?.response?.data?.detail || '出了点问题'
-    
-    if (status === 404) {
-      error.value = 'not_found'
-      errorTitle.value = '找不到内容'
-      errorMessage.value = '这个分享链接不存在或已被删除'
-    } else if (status === 410) {
-      error.value = 'expired'
-      errorTitle.value = '链接已失效'
-      errorMessage.value = '这个分享链接已经过期了'
-    } else if (status === 429) {
-      error.value = 'too_many_users'
-      errorTitle.value = '访问人数太多'
-      errorMessage.value = '当前查看的人太多了，请稍后再试'
-    } else {
-      error.value = 'error'
-      errorTitle.value = '加载失败'
-      errorMessage.value = detail
-    }
-  } finally {
-    loading.value = false
-  }
-}
-
-const handleLeave = async () => {
-  const shareToken = route.params.token as string
-  if (shareToken && sessionInfo.value) {
-    try {
-      await leaveSharedSession(shareToken)
-    } catch {
-    }
-  }
-}
-
-onMounted(() => {
-  loadSession()
-  window.addEventListener('beforeunload', handleLeave)
-})
-
-onUnmounted(() => {
-  handleLeave()
-  window.removeEventListener('beforeunload', handleLeave)
-})
-</script>
-
-<style scoped>
-.share-page {
-  min-height: 100vh;
-  background: #f5f5f5;
-  padding: 0;
-  max-width: 100%;
-  margin: 0 auto;
-}
-
-.loading-container,
-.error-container {
-  display: flex;
-  flex-direction: column;
-  align-items: center;
-  justify-content: center;
-  min-height: 100vh;
-  padding: 24px;
-  background: white;
-}
-
-.loading-spinner {
-  width: 36px;
-  height: 36px;
-  border: 3px solid #f0f0f0;
-  border-top-color: #1677ff;
-  border-radius: 50%;
-  animation: spin 0.7s linear infinite;
-}
-
-@keyframes spin {
-  to { transform: rotate(360deg); }
-}
-
-.loading-text {
-  margin-top: 14px;
-  color: #999;
-  font-size: 14px;
-}
-
-.error-icon {
-  font-size: 48px;
-  margin-bottom: 10px;
-}
-
-.error-title {
-  font-size: 18px;
-  font-weight: 500;
-  color: #333;
-  margin: 0 0 6px 0;
-}
-
-.error-message {
-  font-size: 14px;
-  color: #999;
-  margin: 0 0 18px 0;
-  text-align: center;
-}
-
-.back-btn {
-  padding: 10px 24px;
-  background: #1677ff;
-  color: white;
-  border: none;
-  border-radius: 8px;
-  font-size: 14px;
-  cursor: pointer;
-}
-
-.back-btn:hover {
-  background: #4096ff;
-}
-
-.page-header {
-  background: white;
-  padding: 24px 20px;
-  text-align: center;
-  border-bottom: 1px solid #eee;
-}
-
-.header-icon {
-  font-size: 36px;
-  margin-bottom: 8px;
-}
-
-.page-title {
-  font-size: 18px;
-  font-weight: 600;
-  color: #222;
-  margin: 0 0 4px 0;
-}
-
-.page-desc {
-  font-size: 13px;
-  color: #999;
-  margin: 0;
-}
-
-.chat-container {
-  padding: 16px;
-  max-width: 800px;
-  margin: 0 auto;
-}
-
-.chat-list {
-  display: flex;
-  flex-direction: column;
-  gap: 12px;
-}
-
-.chat-bubble {
-  display: flex;
-  gap: 8px;
-  align-items: flex-start;
-}
-
-.bubble-user,
-.bubble-human {
-  flex-direction: row-reverse;
-}
-
-.bubble-avatar {
-  flex-shrink: 0;
-  width: 28px;
-  height: 28px;
-  border-radius: 50%;
-  display: flex;
-  align-items: center;
-  justify-content: center;
-  font-size: 14px;
-  background: white;
-  box-shadow: 0 1px 3px rgba(0, 0, 0, 0.08);
-}
-
-.bubble-content {
-  max-width: 80%;
-}
-
-.bubble-text {
-  padding: 10px 14px;
-  border-radius: 14px;
-  font-size: 14px;
-  line-height: 1.5;
-  white-space: pre-wrap;
-  word-break: break-word;
-}
-
-.bubble-user .bubble-text {
-  background: #1677ff;
-  color: white;
-  border-bottom-right-radius: 4px;
-}
-
-.bubble-human .bubble-text {
-  background: #fa8c16;
-  color: white;
-  border-bottom-right-radius: 4px;
-}
-
-.bubble-assistant .bubble-text {
-  background: white;
-  color: #333;
-  border-bottom-left-radius: 4px;
-  box-shadow: 0 1px 2px rgba(0, 0, 0, 0.06);
-}
-
-.empty-state {
-  display: flex;
-  flex-direction: column;
-  align-items: center;
-  justify-content: center;
-  padding: 80px 20px;
-  color: #bbb;
-}
-
-.empty-icon {
-  font-size: 48px;
-  margin-bottom: 12px;
-}
-
-.empty-state p {
-  font-size: 14px;
-  margin: 0;
-}
-
-@media (max-width: 640px) {
-  .page-header {
-    padding: 20px 16px;
-  }
-  
-  .header-icon {
-    font-size: 32px;
-  }
-  
-  .page-title {
-    font-size: 16px;
-  }
-  
-  .chat-container {
-    padding: 12px;
-  }
-  
-  .bubble-content {
-    max-width: 85%;
-  }
-  
-  .bubble-text {
-    font-size: 14px;
-    padding: 9px 12px;
-  }
-}
-</style>
--- a/ai-service-admin/vite.config.ts
+++ b/ai-service-admin/vite.config.ts
@ -14,7 +14,7 @@ export default defineConfig({
    port: 3000,
    proxy: {
      '/api': {
-        target: 'http://127.0.0.1:8000',
+        target: 'http://localhost:8000',
        changeOrigin: true,
        rewrite: (path) => path.replace(/^\/api/, ''),
      },
--- a/ai-service/app/api/admin/init.py
+++ b/ai-service/app/api/admin/init.py
@ -1,7 +1,6 @@
 """
 Admin API routes for AI Service management.
 [AC-ASA-01, AC-ASA-02, AC-ASA-05, AC-ASA-07, AC-ASA-08, AC-AISVC-50] Admin management endpoints.
-[AC-MRS-07,08,16] Slot definition management endpoints.
 """

 from app.api.admin.api_key import router as api_key_router
@ -20,7 +19,6 @@ from app.api.admin.prompt_templates import router as prompt_templates_router
 from app.api.admin.rag import router as rag_router
 from app.api.admin.script_flows import router as script_flows_router
 from app.api.admin.sessions import router as sessions_router
-from app.api.admin.slot_definition import router as slot_definition_router
 from app.api.admin.tenants import router as tenants_router

 __all__ = [
@ -40,6 +38,5 @@ __all__ = [
    "rag_router",
    "script_flows_router",
    "sessions_router",
-    "slot_definition_router",
    "tenants_router",
 ]
--- a/ai-service/app/api/admin/api_key.py
+++ b/ai-service/app/api/admin/api_key.py
@ -4,7 +4,6 @@ API Key management endpoints.
 """

 import logging
-from datetime import datetime
 from typing import Annotated

 from fastapi import APIRouter, Depends, HTTPException, status
@ -27,9 +26,6 @@ class ApiKeyResponse(BaseModel):
    key: str = Field(..., description="API key value")
    name: str = Field(..., description="API key name")
    is_active: bool = Field(..., description="Whether the key is active")
-    expires_at: str | None = Field(default=None, description="Expiration time")
-    allowed_ips: list[str] | None = Field(default=None, description="Optional client IP allowlist")
-    rate_limit_qpm: int | None = Field(default=60, description="Per-minute quota")
    created_at: str = Field(..., description="Creation time")
    updated_at: str = Field(..., description="Last update time")

@ -46,9 +42,6 @@ class CreateApiKeyRequest(BaseModel):

    name: str = Field(..., description="API key name/description")
    key: str | None = Field(default=None, description="Custom API key (auto-generated if not provided)")
-    expires_at: datetime | None = Field(default=None, description="Expiration time; null means never expires")
-    allowed_ips: list[str] | None = Field(default=None, description="Optional client IP allowlist")
-    rate_limit_qpm: int | None = Field(default=60, ge=1, le=60000, description="Per-minute quota")


 class ToggleApiKeyRequest(BaseModel):
@ -64,9 +57,6 @@ def api_key_to_response(api_key: ApiKey) -> ApiKeyResponse:
        key=api_key.key,
        name=api_key.name,
        is_active=api_key.is_active,
-        expires_at=api_key.expires_at.isoformat() if api_key.expires_at else None,
-        allowed_ips=api_key.allowed_ips,
-        rate_limit_qpm=api_key.rate_limit_qpm,
        created_at=api_key.created_at.isoformat(),
        updated_at=api_key.updated_at.isoformat(),
    )
@ -104,9 +94,6 @@ async def create_api_key(
        key=key_value,
        name=request.name,
        is_active=True,
-        expires_at=request.expires_at,
-        allowed_ips=request.allowed_ips,
-        rate_limit_qpm=request.rate_limit_qpm,
    )

    api_key = await service.create_key(session, key_create)
--- a/ai-service/app/api/admin/metadata_field_definition.py
+++ b/ai-service/app/api/admin/metadata_field_definition.py
@ -1,7 +1,6 @@
 """
 Metadata Field Definition API.
 [AC-IDSMETA-13, AC-IDSMETA-14] 元数据字段定义管理接口，支持字段级状态治理。
-[AC-MRS-01,04,05,06,16] 支持字段角色分层配置和按角色查询。
 """

 import logging
@ -21,12 +20,10 @@ from app.models.entities import (
    MetadataFieldStatus,
 )
 from app.services.metadata_field_definition_service import MetadataFieldDefinitionService
-from app.services.mid.role_based_field_provider import RoleBasedFieldProvider, InvalidRoleError
-from app.schemas.metadata import VALID_FIELD_ROLES

 logger = logging.getLogger(__name__)

-router = APIRouter(prefix="/admin/metadata-schemas", tags=["MetadataSchema"])
+router = APIRouter(prefix="/admin/metadata-schemas", tags=["MetadataSchemas"])


 def get_current_tenant_id() -> str:
@ -37,33 +34,11 @@ def get_current_tenant_id() -> str:
    return tenant_id


-def _field_to_dict(f: MetadataFieldDefinition) -> dict[str, Any]:
-    """Convert field definition to dict with field_roles"""
-    return {
-        "id": str(f.id),
-        "tenant_id": str(f.tenant_id),
-        "field_key": f.field_key,
-        "label": f.label,
-        "type": f.type,
-        "required": f.required,
-        "options": f.options,
-        "default_value": f.default_value,
-        "scope": f.scope,
-        "is_filterable": f.is_filterable,
-        "is_rank_feature": f.is_rank_feature,
-        "field_roles": f.field_roles or [],
-        "status": f.status,
-        "version": f.version,
-        "created_at": f.created_at.isoformat() if f.created_at else None,
-        "updated_at": f.updated_at.isoformat() if f.updated_at else None,
-    }
-
-
@router.get(
    "",
    operation_id="listMetadataSchemas",
    summary="List metadata schemas",
-    description="[AC-IDSMETA-13] [AC-MRS-06] 获取元数据字段定义列表，支持按状态、范围、角色过滤",
+    description="[AC-IDSMETA-13] 获取元数据字段定义列表，支持按状态和范围过滤",
 )
 async def list_schemas(
    tenant_id: Annotated[str, Depends(get_current_tenant_id)],
@ -74,32 +49,28 @@ async def list_schemas(
    scope: Annotated[str | None, Query(
        description="按适用范围过滤: kb_document/intent_rule/script_flow/prompt_template"
    )] = None,
-    field_role: Annotated[str | None, Query(
-        description="[AC-MRS-06] 按字段角色过滤: resource_filter/slot/prompt_var/routing_signal"
-    )] = None,
    include_deprecated: Annotated[bool, Query(
        description="是否包含已废弃的字段"
    )] = False,
 ) -> JSONResponse:
    """
-    [AC-IDSMETA-13] [AC-MRS-06] 列出元数据字段定义
+    [AC-IDSMETA-13] 列出元数据字段定义
    
    Args:
        status: 按状态过滤
        scope: 按适用范围过滤
-        field_role: [AC-MRS-06] 按字段角色过滤
        include_deprecated: 是否包含已废弃的字段（当 status 未指定时生效）
    """
    logger.info(
-        f"[AC-IDSMETA-13] [AC-MRS-06] Listing metadata field definitions: "
-        f"tenant={tenant_id}, status={status}, scope={scope}, field_role={field_role}"
+        f"[AC-IDSMETA-13] Listing metadata field definitions: "
+        f"tenant={tenant_id}, status={status}, scope={scope}, include_deprecated={include_deprecated}"
    )
    
    if status and status not in [s.value for s in MetadataFieldStatus]:
        return JSONResponse(
            status_code=400,
            content={
-                "error_code": "INVALID_STATUS",
+                "code": "INVALID_STATUS",
                "message": f"Invalid status: {status}",
                "details": {
                    "valid_values": [s.value for s in MetadataFieldStatus]
@ -107,29 +78,34 @@ async def list_schemas(
            }
        )
    
-    if field_role and field_role not in VALID_FIELD_ROLES:
-        return JSONResponse(
-            status_code=400,
-            content={
-                "error_code": "INVALID_ROLE",
-                "message": f"Invalid role '{field_role}'. Valid roles are: {', '.join(VALID_FIELD_ROLES)}",
-                "details": {
-                    "valid_roles": VALID_FIELD_ROLES
-                }
-            }
-        )
-    
    service = MetadataFieldDefinitionService(session)
    
    if include_deprecated and not status:
        fields = await service.get_field_definitions_for_read(tenant_id, scope)
-        if field_role:
-            fields = [f for f in fields if field_role in (f.field_roles or [])]
    else:
-        fields = await service.list_field_definitions(tenant_id, status, scope, field_role)
+        fields = await service.list_field_definitions(tenant_id, status, scope)
    
    return JSONResponse(
-        content=[_field_to_dict(f) for f in fields]
+        content={
+            "items": [
+                {
+                    "id": str(f.id),
+                    "field_key": f.field_key,
+                    "label": f.label,
+                    "type": f.type,
+                    "required": f.required,
+                    "options": f.options,
+                    "default": f.default_value,
+                    "scope": f.scope,
+                    "is_filterable": f.is_filterable,
+                    "is_rank_feature": f.is_rank_feature,
+                    "status": f.status,
+                    "created_at": f.created_at.isoformat() if f.created_at else None,
+                    "updated_at": f.updated_at.isoformat() if f.updated_at else None,
+                }
+                for f in fields
+            ]
+        }
    )


@ -137,7 +113,7 @@ async def list_schemas(
    "",
    operation_id="createMetadataSchema",
    summary="Create metadata schema",
-    description="[AC-IDSMETA-13] [AC-MRS-01,02,03] 创建新的元数据字段定义，支持 field_roles 多选配置",
+    description="[AC-IDSMETA-13] 创建新的元数据字段定义",
    status_code=201,
 )
 async def create_schema(
@ -146,11 +122,11 @@ async def create_schema(
    field_create: MetadataFieldDefinitionCreate,
 ) -> JSONResponse:
    """
-    [AC-IDSMETA-13] [AC-MRS-01,02,03] 创建元数据字段定义
+    [AC-IDSMETA-13] 创建元数据字段定义
    """
    logger.info(
-        f"[AC-IDSMETA-13] [AC-MRS-01] Creating metadata field definition: "
-        f"tenant={tenant_id}, field_key={field_create.field_key}, field_roles={field_create.field_roles}"
+        f"[AC-IDSMETA-13] Creating metadata field definition: "
+        f"tenant={tenant_id}, field_key={field_create.field_key}"
    )
    
    service = MetadataFieldDefinitionService(session)
@ -162,104 +138,36 @@ async def create_schema(
        return JSONResponse(
            status_code=400,
            content={
-                "error_code": "VALIDATION_ERROR",
+                "code": "VALIDATION_ERROR",
                "message": str(e),
            }
        )
    
    return JSONResponse(
        status_code=201,
-        content=_field_to_dict(field)
+        content={
+            "id": str(field.id),
+            "field_key": field.field_key,
+            "label": field.label,
+            "type": field.type,
+            "required": field.required,
+            "options": field.options,
+            "default": field.default_value,
+            "scope": field.scope,
+            "is_filterable": field.is_filterable,
+            "is_rank_feature": field.is_rank_feature,
+            "status": field.status,
+            "created_at": field.created_at.isoformat() if field.created_at else None,
+            "updated_at": field.updated_at.isoformat() if field.updated_at else None,
+        }
    )


-@router.get(
-    "/by-role",
-    operation_id="getMetadataSchemasByRole",
-    summary="Get metadata schemas by role",
-    description="[AC-MRS-04,05] 按指定角色查询所有包含该角色的活跃字段定义",
-)
-async def get_schemas_by_role(
-    tenant_id: Annotated[str, Depends(get_current_tenant_id)],
-    session: Annotated[AsyncSession, Depends(get_session)],
-    role: Annotated[str, Query(
-        description="[AC-MRS-04] 字段角色: resource_filter/slot/prompt_var/routing_signal"
-    )],
-    include_deprecated: Annotated[bool, Query(
-        description="是否包含已废弃字段"
-    )] = False,
-) -> JSONResponse:
-    """
-    [AC-MRS-04,05] 按角色查询字段定义
-    
-    Args:
-        role: 字段角色
-        include_deprecated: 是否包含已废弃字段
-    """
-    logger.info(
-        f"[AC-MRS-04] Getting metadata schemas by role: "
-        f"tenant={tenant_id}, role={role}, include_deprecated={include_deprecated}"
-    )
-    
-    provider = RoleBasedFieldProvider(session)
-    
-    try:
-        fields = await provider.get_fields_by_role(tenant_id, role, include_deprecated)
-    except InvalidRoleError as e:
-        return JSONResponse(
-            status_code=400,
-            content={
-                "error_code": "INVALID_ROLE",
-                "message": str(e),
-                "details": {
-                    "valid_roles": e.valid_roles
-                }
-            }
-        )
-    
-    return JSONResponse(
-        content=[_field_to_dict(f) for f in fields]
-    )
-
-
-@router.get(
-    "/{id}",
-    operation_id="getMetadataSchema",
-    summary="Get metadata schema by ID",
-    description="获取单个元数据字段定义",
-)
-async def get_schema(
-    tenant_id: Annotated[str, Depends(get_current_tenant_id)],
-    session: Annotated[AsyncSession, Depends(get_session)],
-    id: str,
-) -> JSONResponse:
-    """
-    获取单个元数据字段定义
-    """
-    logger.info(
-        f"Getting metadata field definition: tenant={tenant_id}, id={id}"
-    )
-    
-    service = MetadataFieldDefinitionService(session)
-    field = await service.get_field_definition(tenant_id, id)
-    
-    if not field:
-        return JSONResponse(
-            status_code=404,
-            content={
-                "error_code": "NOT_FOUND",
-                "message": f"Field definition {id} not found",
-            }
-        )
-    
-    return JSONResponse(content=_field_to_dict(field))
-
-
@router.put(
    "/{id}",
    operation_id="updateMetadataSchema",
    summary="Update metadata schema",
-    description="[AC-IDSMETA-14] [AC-MRS-01] 更新元数据字段定义，支持修改 field_roles",
+    description="[AC-IDSMETA-14] 更新元数据字段定义，支持状态切换",
 )
 async def update_schema(
    tenant_id: Annotated[str, Depends(get_current_tenant_id)],
@ -268,18 +176,18 @@ async def update_schema(
    field_update: MetadataFieldDefinitionUpdate,
 ) -> JSONResponse:
    """
-    [AC-IDSMETA-14] [AC-MRS-01] 更新元数据字段定义
+    [AC-IDSMETA-14] 更新元数据字段定义
    """
    logger.info(
-        f"[AC-IDSMETA-14] [AC-MRS-01] Updating metadata field definition: "
-        f"tenant={tenant_id}, id={id}, field_roles={field_update.field_roles}"
+        f"[AC-IDSMETA-14] Updating metadata field definition: "
+        f"tenant={tenant_id}, id={id}"
    )
    
    if field_update.status and field_update.status not in [s.value for s in MetadataFieldStatus]:
        return JSONResponse(
            status_code=400,
            content={
-                "error_code": "INVALID_STATUS",
+                "code": "INVALID_STATUS",
                "message": f"Invalid status: {field_update.status}",
                "details": {
                    "valid_values": [s.value for s in MetadataFieldStatus]
@ -295,7 +203,7 @@ async def update_schema(
        return JSONResponse(
            status_code=400,
            content={
-                "error_code": "VALIDATION_ERROR",
+                "code": "VALIDATION_ERROR",
                "message": str(e),
            }
        )
@ -304,49 +212,30 @@ async def update_schema(
        return JSONResponse(
            status_code=404,
            content={
-                "error_code": "NOT_FOUND",
+                "code": "NOT_FOUND",
                "message": f"Field definition {id} not found",
            }
        )
    
    await session.commit()
    
-    return JSONResponse(content=_field_to_dict(field))
-
-
-@router.delete(
-    "/{id}",
-    operation_id="deleteMetadataSchema",
-    summary="Delete metadata schema",
-    description="[AC-MRS-16] 删除元数据字段定义，无需考虑历史数据兼容性",
-    status_code=204,
-)
-async def delete_schema(
-    tenant_id: Annotated[str, Depends(get_current_tenant_id)],
-    session: Annotated[AsyncSession, Depends(get_session)],
-    id: str,
-) -> JSONResponse:
-    """
-    [AC-MRS-16] 删除元数据字段定义
-    """
-    logger.info(
-        f"[AC-MRS-16] Deleting metadata field definition: "
-        f"tenant={tenant_id}, id={id}"
+    return JSONResponse(
+        content={
+            "id": str(field.id),
+            "field_key": field.field_key,
+            "label": field.label,
+            "type": field.type,
+            "required": field.required,
+            "options": field.options,
+            "default": field.default_value,
+            "scope": field.scope,
+            "is_filterable": field.is_filterable,
+            "is_rank_feature": field.is_rank_feature,
+            "status": field.status,
+            "created_at": field.created_at.isoformat() if field.created_at else None,
+            "updated_at": field.updated_at.isoformat() if field.updated_at else None,
+        }
    )
-    
-    service = MetadataFieldDefinitionService(session)
-    success = await service.delete_field_definition(tenant_id, id)
-    
-    if not success:
-        return JSONResponse(
-            status_code=404,
-            content={
-                "error_code": "NOT_FOUND",
-                "message": f"Field definition not found: {id}",
-            }
-        )
-    
-    return JSONResponse(status_code=204, content=None)


@router.get(
@ -374,7 +263,26 @@ async def get_active_schemas(
    fields = await service.get_active_field_definitions(tenant_id, scope)
    
    return JSONResponse(
-        content=[_field_to_dict(f) for f in fields]
+        content={
+            "items": [
+                {
+                    "id": str(f.id),
+                    "field_key": f.field_key,
+                    "label": f.label,
+                    "type": f.type,
+                    "required": f.required,
+                    "options": f.options,
+                    "default": f.default_value,
+                    "scope": f.scope,
+                    "is_filterable": f.is_filterable,
+                    "is_rank_feature": f.is_rank_feature,
+                    "status": f.status,
+                    "created_at": f.created_at.isoformat() if f.created_at else None,
+                    "updated_at": f.updated_at.isoformat() if f.updated_at else None,
+                }
+                for f in fields
+            ]
+        }
    )


@ -403,7 +311,26 @@ async def get_readable_schemas(
    fields = await service.get_field_definitions_for_read(tenant_id, scope)
    
    return JSONResponse(
-        content=[_field_to_dict(f) for f in fields]
+        content={
+            "items": [
+                {
+                    "id": str(f.id),
+                    "field_key": f.field_key,
+                    "label": f.label,
+                    "type": f.type,
+                    "required": f.required,
+                    "options": f.options,
+                    "default": f.default_value,
+                    "scope": f.scope,
+                    "is_filterable": f.is_filterable,
+                    "is_rank_feature": f.is_rank_feature,
+                    "status": f.status,
+                    "created_at": f.created_at.isoformat() if f.created_at else None,
+                    "updated_at": f.updated_at.isoformat() if f.updated_at else None,
+                }
+                for f in fields
+            ]
+        }
    )


@ -446,3 +373,42 @@ async def validate_metadata_for_create(
            "errors": errors,
        }
    )
+
+
+@router.delete(
+    "/{field_id}",
+    operation_id="deleteMetadataSchema",
+    summary="Delete metadata schema",
+    description="[AC-IDSMETA-13] 删除元数据字段定义",
+)
+async def delete_schema(
+    field_id: str,
+    tenant_id: Annotated[str, Depends(get_current_tenant_id)],
+    session: Annotated[AsyncSession, Depends(get_session)],
+) -> JSONResponse:
+    """
+    [AC-IDSMETA-13] 删除元数据字段定义
+    """
+    logger.info(
+        f"[AC-IDSMETA-13] Deleting metadata field definition: "
+        f"tenant={tenant_id}, field_id={field_id}"
+    )
+    
+    service = MetadataFieldDefinitionService(session)
+    success = await service.delete_field_definition(tenant_id, field_id)
+    
+    if not success:
+        return JSONResponse(
+            status_code=404,
+            content={
+                "code": "NOT_FOUND",
+                "message": f"Field definition not found: {field_id}",
+            }
+        )
+    
+    return JSONResponse(
+        content={
+            "success": True,
+            "message": "Field definition deleted successfully",
+        }
+    )
--- a/ai-service/app/api/admin/slot_definition.py
+++ b/ai-service/app/api/admin/slot_definition.py
@ -1,234 +0,0 @@
-"""
-Slot Definition API.
-[AC-MRS-07,08,16] 槽位定义管理接口
-"""
-
-import logging
-from typing import Annotated, Any
-
-from fastapi import APIRouter, Depends, Query
-from fastapi.responses import JSONResponse
-from sqlalchemy.ext.asyncio import AsyncSession
-
-from app.core.database import get_session
-from app.core.exceptions import MissingTenantIdException
-from app.core.tenant import get_tenant_id
-from app.models.entities import SlotDefinitionCreate, SlotDefinitionUpdate
-from app.services.slot_definition_service import SlotDefinitionService
-
-logger = logging.getLogger(__name__)
-
-router = APIRouter(prefix="/admin/slot-definitions", tags=["SlotDefinition"])
-
-
-def get_current_tenant_id() -> str:
-    """Get current tenant ID from context."""
-    tenant_id = get_tenant_id()
-    if not tenant_id:
-        raise MissingTenantIdException()
-    return tenant_id
-
-
-def _slot_to_dict(slot: dict[str, Any] | Any) -> dict[str, Any]:
-    """Convert slot definition to dict"""
-    if isinstance(slot, dict):
-        return slot
-    
-    return {
-        "id": str(slot.id),
-        "tenant_id": str(slot.tenant_id),
-        "slot_key": slot.slot_key,
-        "type": slot.type,
-        "required": slot.required,
-        "extract_strategy": slot.extract_strategy,
-        "validation_rule": slot.validation_rule,
-        "ask_back_prompt": slot.ask_back_prompt,
-        "default_value": slot.default_value,
-        "linked_field_id": str(slot.linked_field_id) if slot.linked_field_id else None,
-        "created_at": slot.created_at.isoformat() if slot.created_at else None,
-        "updated_at": slot.updated_at.isoformat() if slot.updated_at else None,
-    }
-
-
-@router.get(
-    "",
-    operation_id="listSlotDefinitions",
-    summary="List slot definitions",
-    description="获取槽位定义列表",
-)
-async def list_slot_definitions(
-    tenant_id: Annotated[str, Depends(get_current_tenant_id)],
-    session: Annotated[AsyncSession, Depends(get_session)],
-    required: Annotated[bool | None, Query(
-        description="按是否必填过滤"
-    )] = None,
-) -> JSONResponse:
-    """
-    列出槽位定义
-    """
-    logger.info(
-        f"Listing slot definitions: tenant={tenant_id}, required={required}"
-    )
-    
-    service = SlotDefinitionService(session)
-    slots = await service.list_slot_definitions(tenant_id, required)
-    
-    return JSONResponse(
-        content=[_slot_to_dict(s) for s in slots]
-    )
-
-
-@router.post(
-    "",
-    operation_id="createSlotDefinition",
-    summary="Create slot definition",
-    description="[AC-MRS-07,08] 创建新的槽位定义，可关联已有元数据字段",
-    status_code=201,
-)
-async def create_slot_definition(
-    tenant_id: Annotated[str, Depends(get_current_tenant_id)],
-    session: Annotated[AsyncSession, Depends(get_session)],
-    slot_create: SlotDefinitionCreate,
-) -> JSONResponse:
-    """
-    [AC-MRS-07,08] 创建槽位定义
-    """
-    logger.info(
-        f"[AC-MRS-07] Creating slot definition: "
-        f"tenant={tenant_id}, slot_key={slot_create.slot_key}, "
-        f"linked_field_id={slot_create.linked_field_id}"
-    )
-    
-    service = SlotDefinitionService(session)
-    
-    try:
-        slot = await service.create_slot_definition(tenant_id, slot_create)
-        await session.commit()
-    except ValueError as e:
-        return JSONResponse(
-            status_code=400,
-            content={
-                "error_code": "VALIDATION_ERROR",
-                "message": str(e),
-            }
-        )
-    
-    return JSONResponse(
-        status_code=201,
-        content=_slot_to_dict(slot)
-    )
-
-
-@router.get(
-    "/{id}",
-    operation_id="getSlotDefinition",
-    summary="Get slot definition by ID",
-    description="获取单个槽位定义",
-)
-async def get_slot_definition(
-    tenant_id: Annotated[str, Depends(get_current_tenant_id)],
-    session: Annotated[AsyncSession, Depends(get_session)],
-    id: str,
-) -> JSONResponse:
-    """
-    获取单个槽位定义
-    """
-    logger.info(
-        f"Getting slot definition: tenant={tenant_id}, id={id}"
-    )
-    
-    service = SlotDefinitionService(session)
-    slot = await service.get_slot_definition_with_field(tenant_id, id)
-    
-    if not slot:
-        return JSONResponse(
-            status_code=404,
-            content={
-                "error_code": "NOT_FOUND",
-                "message": f"Slot definition {id} not found",
-            }
-        )
-    
-    return JSONResponse(content=slot)
-
-
-@router.put(
-    "/{id}",
-    operation_id="updateSlotDefinition",
-    summary="Update slot definition",
-    description="更新槽位定义",
-)
-async def update_slot_definition(
-    tenant_id: Annotated[str, Depends(get_current_tenant_id)],
-    session: Annotated[AsyncSession, Depends(get_session)],
-    id: str,
-    slot_update: SlotDefinitionUpdate,
-) -> JSONResponse:
-    """
-    更新槽位定义
-    """
-    logger.info(
-        f"Updating slot definition: tenant={tenant_id}, id={id}"
-    )
-    
-    service = SlotDefinitionService(session)
-    
-    try:
-        slot = await service.update_slot_definition(tenant_id, id, slot_update)
-    except ValueError as e:
-        return JSONResponse(
-            status_code=400,
-            content={
-                "error_code": "VALIDATION_ERROR",
-                "message": str(e),
-            }
-        )
-    
-    if not slot:
-        return JSONResponse(
-            status_code=404,
-            content={
-                "error_code": "NOT_FOUND",
-                "message": f"Slot definition {id} not found",
-            }
-        )
-    
-    await session.commit()
-    
-    return JSONResponse(content=_slot_to_dict(slot))
-
-
-@router.delete(
-    "/{id}",
-    operation_id="deleteSlotDefinition",
-    summary="Delete slot definition",
-    description="[AC-MRS-16] 删除槽位定义，无需考虑历史数据兼容性",
-    status_code=204,
-)
-async def delete_slot_definition(
-    tenant_id: Annotated[str, Depends(get_current_tenant_id)],
-    session: Annotated[AsyncSession, Depends(get_session)],
-    id: str,
-) -> JSONResponse:
-    """
-    [AC-MRS-16] 删除槽位定义
-    """
-    logger.info(
-        f"[AC-MRS-16] Deleting slot definition: tenant={tenant_id}, id={id}"
-    )
-    
-    service = SlotDefinitionService(session)
-    success = await service.delete_slot_definition(tenant_id, id)
-    
-    if not success:
-        return JSONResponse(
-            status_code=404,
-            content={
-                "error_code": "NOT_FOUND",
-                "message": f"Slot definition not found: {id}",
-            }
-        )
-    
-    await session.commit()
-    
-    return JSONResponse(status_code=204, content=None)
--- a/ai-service/app/api/mid/init.py
+++ b/ai-service/app/api/mid/init.py
@ -1,30 +0,0 @@
-"""
-Mid Platform API endpoints.
-[AC-IDMP-01~20] Mid platform dialogue, messages, and session management.
-[AC-IDMP-SHARE] Share session via unique token.
-[AC-MRS-09,10] Runtime slot query endpoints.
-"""
-
-from fastapi import APIRouter
-
-from .dialogue import router as dialogue_router
-from .messages import router as messages_router
-from .sessions import router as sessions_router
-from .share import router as share_router
-from .slots import router as slots_router
-
-router = APIRouter()
-router.include_router(dialogue_router)
-router.include_router(messages_router)
-router.include_router(sessions_router)
-router.include_router(share_router)
-router.include_router(slots_router)
-
-__all__ = [
-    "router",
-    "dialogue_router",
-    "messages_router",
-    "sessions_router",
-    "share_router",
-    "slots_router",
-]
--- a/ai-service/app/api/mid/dialogue.py
+++ b/ai-service/app/api/mid/dialogue.py
--- a/ai-service/app/api/mid/messages.py
+++ b/ai-service/app/api/mid/messages.py
@ -1,104 +0,0 @@
-"""
-Messages Controller for Mid Platform.
-[AC-IDMP-08] Message report endpoint: POST /mid/messages/report
-"""
-
-import logging
-import time
-from typing import Annotated, Any
-
-from fastapi import APIRouter, Depends, Request
-from fastapi.responses import JSONResponse
-from sqlalchemy.ext.asyncio import AsyncSession
-
-from app.core.database import get_session
-from app.core.tenant import get_tenant_id
-from app.models.mid.schemas import MessageReportRequest
-from app.services.memory import MemoryService
-
-logger = logging.getLogger(__name__)
-
-router = APIRouter(prefix="/mid", tags=["Mid Platform Messages"])
-
-
-@router.post(
-    "/messages/report",
-    operation_id="reportMessages",
-    summary="Report session messages and events",
-    description="""
-    [AC-IDMP-08] Report messages from channel to mid platform.
-    
-    Accepts messages from bot, human, or channel sources for session closure.
-    Returns 202 Accepted for async processing.
-    """,
-    responses={
-        202: {"description": "Accepted for async processing"},
-        400: {"description": "Invalid request"},
-    },
-)
-async def report_messages(
-    request: Request,
-    report_request: MessageReportRequest,
-    session: Annotated[AsyncSession, Depends(get_session)],
-) -> JSONResponse:
-    """
-    [AC-IDMP-08] Report messages from channel.
-    
-    Accepts and stores messages for session data completeness.
-    Messages are stored asynchronously without blocking response.
-    """
-    tenant_id = get_tenant_id()
-    
-    if not tenant_id:
-        from app.core.exceptions import MissingTenantIdException
-        raise MissingTenantIdException()
-    
-    logger.info(
-        f"[AC-IDMP-08] Message report: tenant={tenant_id}, "
-        f"session={report_request.session_id}, count={len(report_request.messages)}"
-    )
-    
-    try:
-        memory_service = MemoryService(session)
-        
-        await memory_service.get_or_create_session(
-            tenant_id=tenant_id,
-            session_id=report_request.session_id,
-        )
-        
-        messages_to_save = []
-        for msg in report_request.messages:
-            role = msg.role
-            if msg.source == "human":
-                role = "human"
-            elif msg.source == "bot":
-                role = "assistant"
-            
-            messages_to_save.append({
-                "role": role,
-                "content": msg.content,
-            })
-        
-        if messages_to_save:
-            await memory_service.append_messages(
-                tenant_id=tenant_id,
-                session_id=report_request.session_id,
-                messages=messages_to_save,
-            )
-        
-        logger.info(
-            f"[AC-IDMP-08] Messages saved: tenant={tenant_id}, "
-            f"session={report_request.session_id}, count={len(messages_to_save)}"
-        )
-        
-        return JSONResponse(
-            status_code=202,
-            content={"status": "accepted", "session_id": report_request.session_id},
-        )
-    
-    except Exception as e:
-        logger.error(f"[AC-IDMP-08] Message report failed: {e}")
-        return JSONResponse(
-            status_code=202,
-            content={"status": "accepted", "warning": str(e)},
-        )
--- a/ai-service/app/api/mid/sessions.py
+++ b/ai-service/app/api/mid/sessions.py
@ -1,105 +0,0 @@
-"""
-Sessions Controller for Mid Platform.
-[AC-IDMP-09] Session mode switch endpoint: POST /mid/sessions/{sessionId}/mode
-"""
-
-import logging
-from typing import Annotated
-
-from fastapi import APIRouter, Depends, Path
-from fastapi.responses import JSONResponse
-from sqlalchemy.ext.asyncio import AsyncSession
-
-from app.core.database import get_session
-from app.core.tenant import get_tenant_id
-from app.models.mid.schemas import (
-    SessionMode,
-    SwitchModeRequest,
-    SwitchModeResponse,
-)
-from app.services.memory import MemoryService
-
-logger = logging.getLogger(__name__)
-
-router = APIRouter(prefix="/mid", tags=["Mid Platform Sessions"])
-
-_session_modes: dict[str, SessionMode] = {}
-
-
-@router.post(
-    "/sessions/{sessionId}/mode",
-    operation_id="switchSessionMode",
-    summary="Switch session mode",
-    description="""
-    [AC-IDMP-09] Switch session mode between BOT_ACTIVE and HUMAN_ACTIVE.
-    
-    When mode is HUMAN_ACTIVE, dialogue responses will route to transfer mode.
-    """,
-    responses={
-        200: {"description": "Mode switched successfully", "model": SwitchModeResponse},
-        400: {"description": "Invalid request"},
-    },
-)
-async def switch_session_mode(
-    sessionId: Annotated[str, Path(description="Session ID")],
-    switch_request: SwitchModeRequest,
-    session: Annotated[AsyncSession, Depends(get_session)],
-) -> SwitchModeResponse:
-    """
-    [AC-IDMP-09] Switch session mode.
-    
-    Modes:
-    - BOT_ACTIVE: Bot handles responses
-    - HUMAN_ACTIVE: Transfer to human agent
-    """
-    tenant_id = get_tenant_id()
-    
-    if not tenant_id:
-        from app.core.exceptions import MissingTenantIdException
-        raise MissingTenantIdException()
-    
-    logger.info(
-        f"[AC-IDMP-09] Mode switch: tenant={tenant_id}, "
-        f"session={sessionId}, mode={switch_request.mode.value}"
-    )
-    
-    try:
-        memory_service = MemoryService(session)
-        
-        await memory_service.get_or_create_session(
-            tenant_id=tenant_id,
-            session_id=sessionId,
-        )
-        
-        session_key = f"{tenant_id}:{sessionId}"
-        _session_modes[session_key] = switch_request.mode
-        
-        logger.info(
-            f"[AC-IDMP-09] Mode switched: session={sessionId}, "
-            f"mode={switch_request.mode.value}, reason={switch_request.reason}"
-        )
-        
-        return SwitchModeResponse(
-            session_id=sessionId,
-            mode=switch_request.mode,
-        )
-    
-    except Exception as e:
-        logger.error(f"[AC-IDMP-09] Mode switch failed: {e}")
-        return SwitchModeResponse(
-            session_id=sessionId,
-            mode=switch_request.mode,
-        )
-
-
-def get_session_mode(tenant_id: str, session_id: str) -> SessionMode:
-    """Get current session mode."""
-    session_key = f"{tenant_id}:{session_id}"
-    return _session_modes.get(session_key, SessionMode.BOT_ACTIVE)
-
-
-def clear_session_mode(tenant_id: str, session_id: str) -> None:
-    """Clear session mode (reset to BOT_ACTIVE)."""
-    session_key = f"{tenant_id}:{session_id}"
-    if session_key in _session_modes:
-        del _session_modes[session_key]
--- a/ai-service/app/api/mid/share.py
+++ b/ai-service/app/api/mid/share.py
@ -1,441 +0,0 @@
-"""
-Share Controller for Mid Platform.
-[AC-IDMP-SHARE] Share session via unique token with expiration and concurrent user limits.
-"""
-
-import logging
-import uuid
-from datetime import datetime, timedelta, timezone
-from typing import Annotated
-
-from fastapi import APIRouter, Depends, HTTPException, Path, Query
-from sqlalchemy import and_, select
-from sqlalchemy.ext.asyncio import AsyncSession
-
-from app.core.config import get_settings
-from app.core.database import get_session
-from app.core.tenant import get_tenant_id
-from app.models.entities import ChatMessage, SharedSession
-from app.models.mid.schemas import (
-    CreateShareRequest,
-    ShareListResponse,
-    ShareListItem,
-    ShareResponse,
-    SharedMessageRequest,
-    SharedSessionInfo,
-    HistoryMessage,
-)
-
-logger = logging.getLogger(__name__)
-router = APIRouter(prefix="/mid", tags=["Mid Platform Share"])
-
-settings = get_settings()
-
-
-def _utcnow() -> datetime:
-    """Get current UTC time without timezone info (for DB compatibility)."""
-    return datetime.utcnow()
-
-
-def _normalize_datetime(dt: datetime) -> datetime:
-    """Normalize datetime to offset-naive UTC for comparison."""
-    if dt.tzinfo is not None:
-        return dt.replace(tzinfo=None)
-    return dt
-
-
-def _generate_share_url(share_token: str) -> str:
-    """Generate full share URL from token."""
-    base_url = getattr(settings, 'frontend_base_url', 'http://localhost:3000')
-    return f"{base_url}/share/{share_token}"
-
-
-@router.post(
-    "/sessions/{session_id}/share",
-    operation_id="createShare",
-    summary="Create a share link for session",
-    description="""
-    [AC-IDMP-SHARE] Create a share link for a chat session.
-    
-    Returns share_token and share_url for accessing the shared conversation.
-    """,
-    responses={
-        200: {"description": "Share created successfully", "model": ShareResponse},
-        400: {"description": "Invalid request"},
-        404: {"description": "Session not found"},
-    },
-)
-async def create_share(
-    session_id: Annotated[str, Path(description="Session ID to share")],
-    request: CreateShareRequest,
-    db: Annotated[AsyncSession, Depends(get_session)],
-) -> ShareResponse:
-    """Create a share link for a session."""
-    tenant_id = get_tenant_id()
-    
-    if not tenant_id:
-        from app.core.exceptions import MissingTenantIdException
-        raise MissingTenantIdException()
-    
-    share_token = str(uuid.uuid4())
-    expires_at = _utcnow() + timedelta(days=request.expires_in_days)
-    
-    shared_session = SharedSession(
-        share_token=share_token,
-        session_id=session_id,
-        tenant_id=tenant_id,
-        title=request.title,
-        description=request.description,
-        expires_at=expires_at,
-        max_concurrent_users=request.max_concurrent_users,
-        current_users=0,
-        is_active=True,
-    )
-    
-    db.add(shared_session)
-    await db.commit()
-    await db.refresh(shared_session)
-    
-    logger.info(
-        f"[AC-IDMP-SHARE] Share created: tenant={tenant_id}, "
-        f"session={session_id}, token={share_token}, expires={expires_at}"
-    )
-    
-    return ShareResponse(
-        share_token=share_token,
-        share_url=_generate_share_url(share_token),
-        expires_at=expires_at.isoformat(),
-        title=request.title,
-        description=request.description,
-        max_concurrent_users=request.max_concurrent_users,
-    )
-
-
-@router.get(
-    "/share/{share_token}",
-    operation_id="getSharedSession",
-    summary="Get shared session info",
-    description="""
-    [AC-IDMP-SHARE] Get shared session information by share token.
-    
-    Returns session info with history messages. Public endpoint (no tenant required).
-    """,
-    responses={
-        200: {"description": "Shared session info", "model": SharedSessionInfo},
-        404: {"description": "Share not found or expired"},
-        410: {"description": "Share expired or inactive"},
-        429: {"description": "Too many concurrent users"},
-    },
-)
-async def get_shared_session(
-    share_token: Annotated[str, Path(description="Share token")],
-    db: Annotated[AsyncSession, Depends(get_session)],
-) -> SharedSessionInfo:
-    """Get shared session info by token (public endpoint)."""
-    result = await db.execute(
-        select(SharedSession).where(SharedSession.share_token == share_token)
-    )
-    shared = result.scalar_one_or_none()
-    
-    if not shared:
-        raise HTTPException(status_code=404, detail="Share not found")
-    
-    if not shared.is_active:
-        raise HTTPException(status_code=410, detail="Share is inactive")
-    
-    if _normalize_datetime(shared.expires_at) < _utcnow():
-        raise HTTPException(status_code=410, detail="Share has expired")
-    
-    if shared.current_users >= shared.max_concurrent_users:
-        raise HTTPException(status_code=429, detail="Maximum concurrent users reached")
-    
-    messages_result = await db.execute(
-        select(ChatMessage)
-        .where(
-            and_(
-                ChatMessage.tenant_id == shared.tenant_id,
-                ChatMessage.session_id == shared.session_id,
-            )
-        )
-        .order_by(ChatMessage.created_at.asc())
-    )
-    messages = messages_result.scalars().all()
-    
-    history = [
-        HistoryMessage(role=msg.role, content=msg.content)
-        for msg in messages
-    ]
-    
-    return SharedSessionInfo(
-        session_id=shared.session_id,
-        title=shared.title,
-        description=shared.description,
-        expires_at=shared.expires_at.isoformat(),
-        max_concurrent_users=shared.max_concurrent_users,
-        current_users=shared.current_users,
-        history=history,
-    )
-
-
-@router.get(
-    "/sessions/{session_id}/shares",
-    operation_id="listShares",
-    summary="List all shares for a session",
-    description="""
-    [AC-IDMP-SHARE] List all share links for a session.
-    """,
-    responses={
-        200: {"description": "List of shares", "model": ShareListResponse},
-    },
-)
-async def list_shares(
-    session_id: Annotated[str, Path(description="Session ID")],
-    db: Annotated[AsyncSession, Depends(get_session)],
-    include_expired: Annotated[bool, Query(description="Include expired shares")] = False,
-) -> ShareListResponse:
-    """List all shares for a session."""
-    tenant_id = get_tenant_id()
-    
-    if not tenant_id:
-        from app.core.exceptions import MissingTenantIdException
-        raise MissingTenantIdException()
-    
-    query = select(SharedSession).where(
-        and_(
-            SharedSession.tenant_id == tenant_id,
-            SharedSession.session_id == session_id,
-        )
-    )
-    
-    now = _utcnow()
-    if not include_expired:
-        query = query.where(SharedSession.expires_at > now)
-    
-    query = query.order_by(SharedSession.created_at.desc())
-    
-    result = await db.execute(query)
-    shares = result.scalars().all()
-    
-    items = [
-        ShareListItem(
-            share_token=s.share_token,
-            share_url=_generate_share_url(s.share_token),
-            title=s.title,
-            description=s.description,
-            expires_at=s.expires_at.isoformat(),
-            is_active=s.is_active and _normalize_datetime(s.expires_at) > now,
-            max_concurrent_users=s.max_concurrent_users,
-            current_users=s.current_users,
-            created_at=s.created_at.isoformat(),
-        )
-        for s in shares
-    ]
-    
-    return ShareListResponse(shares=items)
-
-
-@router.delete(
-    "/share/{share_token}",
-    operation_id="deleteShare",
-    summary="Delete a share",
-    description="""
-    [AC-IDMP-SHARE] Delete (deactivate) a share link.
-    """,
-    responses={
-        200: {"description": "Share deleted"},
-        404: {"description": "Share not found"},
-    },
-)
-async def delete_share(
-    share_token: Annotated[str, Path(description="Share token")],
-    db: Annotated[AsyncSession, Depends(get_session)],
-) -> dict:
-    """Delete a share (set is_active=False)."""
-    tenant_id = get_tenant_id()
-    
-    if not tenant_id:
-        from app.core.exceptions import MissingTenantIdException
-        raise MissingTenantIdException()
-    
-    result = await db.execute(
-        select(SharedSession).where(
-            and_(
-                SharedSession.share_token == share_token,
-                SharedSession.tenant_id == tenant_id,
-            )
-        )
-    )
-    shared = result.scalar_one_or_none()
-    
-    if not shared:
-        raise HTTPException(status_code=404, detail="Share not found")
-    
-    shared.is_active = False
-    await db.commit()
-    
-    logger.info(f"[AC-IDMP-SHARE] Share deleted: token={share_token}")
-    
-    return {"success": True, "message": "Share deleted"}
-
-
-@router.post(
-    "/share/{share_token}/join",
-    operation_id="joinSharedSession",
-    summary="Join a shared session",
-    description="""
-    [AC-IDMP-SHARE] Join a shared session (increment current_users).
-    
-    Returns updated session info.
-    """,
-    responses={
-        200: {"description": "Joined successfully", "model": SharedSessionInfo},
-        404: {"description": "Share not found"},
-        410: {"description": "Share expired or inactive"},
-        429: {"description": "Too many concurrent users"},
-    },
-)
-async def join_shared_session(
-    share_token: Annotated[str, Path(description="Share token")],
-    db: Annotated[AsyncSession, Depends(get_session)],
-) -> SharedSessionInfo:
-    """Join a shared session (increment current_users)."""
-    result = await db.execute(
-        select(SharedSession).where(SharedSession.share_token == share_token)
-    )
-    shared = result.scalar_one_or_none()
-    
-    if not shared:
-        raise HTTPException(status_code=404, detail="Share not found")
-    
-    if not shared.is_active:
-        raise HTTPException(status_code=410, detail="Share is inactive")
-    
-    if _normalize_datetime(shared.expires_at) < _utcnow():
-        raise HTTPException(status_code=410, detail="Share has expired")
-    
-    if shared.current_users >= shared.max_concurrent_users:
-        raise HTTPException(status_code=429, detail="Maximum concurrent users reached")
-    
-    shared.current_users += 1
-    shared.updated_at = _utcnow()
-    await db.commit()
-    
-    messages_result = await db.execute(
-        select(ChatMessage)
-        .where(
-            and_(
-                ChatMessage.tenant_id == shared.tenant_id,
-                ChatMessage.session_id == shared.session_id,
-            )
-        )
-        .order_by(ChatMessage.created_at.asc())
-    )
-    messages = messages_result.scalars().all()
-    
-    history = [
-        HistoryMessage(role=msg.role, content=msg.content)
-        for msg in messages
-    ]
-    
-    logger.info(f"[AC-IDMP-SHARE] User joined: token={share_token}, users={shared.current_users}")
-    
-    return SharedSessionInfo(
-        session_id=shared.session_id,
-        title=shared.title,
-        description=shared.description,
-        expires_at=shared.expires_at.isoformat(),
-        max_concurrent_users=shared.max_concurrent_users,
-        current_users=shared.current_users,
-        history=history,
-    )
-
-
-@router.post(
-    "/share/{share_token}/leave",
-    operation_id="leaveSharedSession",
-    summary="Leave a shared session",
-    description="""
-    [AC-IDMP-SHARE] Leave a shared session (decrement current_users).
-    """,
-    responses={
-        200: {"description": "Left successfully"},
-        404: {"description": "Share not found"},
-    },
-)
-async def leave_shared_session(
-    share_token: Annotated[str, Path(description="Share token")],
-    db: Annotated[AsyncSession, Depends(get_session)],
-) -> dict:
-    """Leave a shared session (decrement current_users)."""
-    result = await db.execute(
-        select(SharedSession).where(SharedSession.share_token == share_token)
-    )
-    shared = result.scalar_one_or_none()
-    
-    if not shared:
-        raise HTTPException(status_code=404, detail="Share not found")
-    
-    if shared.current_users > 0:
-        shared.current_users -= 1
-        shared.updated_at = _utcnow()
-        await db.commit()
-    
-    logger.info(f"[AC-IDMP-SHARE] User left: token={share_token}, users={shared.current_users}")
-    
-    return {"success": True, "current_users": shared.current_users}
-
-
-@router.post(
-    "/share/{share_token}/message",
-    operation_id="sendSharedMessage",
-    summary="Send message via shared session",
-    description="""
-    [AC-IDMP-SHARE] Send a message via shared session.
-    
-    Creates a new message in the shared session and returns the response.
-    """,
-    responses={
-        200: {"description": "Message sent successfully"},
-        404: {"description": "Share not found"},
-        410: {"description": "Share expired or inactive"},
-    },
-)
-async def send_shared_message(
-    share_token: Annotated[str, Path(description="Share token")],
-    request: SharedMessageRequest,
-    db: Annotated[AsyncSession, Depends(get_session)],
-) -> dict:
-    """Send a message via shared session."""
-    result = await db.execute(
-        select(SharedSession).where(SharedSession.share_token == share_token)
-    )
-    shared = result.scalar_one_or_none()
-    
-    if not shared:
-        raise HTTPException(status_code=404, detail="Share not found")
-    
-    if not shared.is_active:
-        raise HTTPException(status_code=410, detail="Share is inactive")
-    
-    if _normalize_datetime(shared.expires_at) < _utcnow():
-        raise HTTPException(status_code=410, detail="Share has expired")
-    
-    user_message = ChatMessage(
-        tenant_id=shared.tenant_id,
-        session_id=shared.session_id,
-        role="user",
-        content=request.user_message,
-    )
-    db.add(user_message)
-    await db.commit()
-    
-    logger.info(
-        f"[AC-IDMP-SHARE] Message sent via share: token={share_token}, "
-        f"session={shared.session_id}"
-    )
-    
-    return {
-        "success": True,
-        "message": "Message sent successfully",
-        "session_id": shared.session_id,
-    }
--- a/ai-service/app/api/mid/slots.py
+++ b/ai-service/app/api/mid/slots.py
@ -1,140 +0,0 @@
-"""
-Runtime Slot API.
-[AC-MRS-09,10] 运行时槽位查询接口
-"""
-
-import logging
-from datetime import datetime
-from typing import Annotated, Any
-
-from fastapi import APIRouter, Depends, Query
-from fastapi.responses import JSONResponse
-from sqlalchemy.ext.asyncio import AsyncSession
-
-from app.core.database import get_session
-from app.core.exceptions import MissingTenantIdException
-from app.core.tenant import get_tenant_id
-from app.services.mid.role_based_field_provider import RoleBasedFieldProvider, InvalidRoleError
-from app.services.slot_definition_service import SlotDefinitionService
-from app.schemas.metadata import VALID_FIELD_ROLES
-
-logger = logging.getLogger(__name__)
-
-router = APIRouter(prefix="/mid/slots", tags=["RuntimeSlot"])
-
-
-def get_current_tenant_id() -> str:
-    """Get current tenant ID from context."""
-    tenant_id = get_tenant_id()
-    if not tenant_id:
-        raise MissingTenantIdException()
-    return tenant_id
-
-
-@router.get(
-    "/by-role",
-    operation_id="getSlotsByRole",
-    summary="Get slots by role",
-    description="[AC-MRS-10] 运行时接口，按角色获取槽位定义及关联的元数据字段信息",
-)
-async def get_slots_by_role(
-    tenant_id: Annotated[str, Depends(get_current_tenant_id)],
-    session: Annotated[AsyncSession, Depends(get_session)],
-    role: Annotated[str, Query(
-        description="[AC-MRS-10] 字段角色: resource_filter/slot/prompt_var/routing_signal"
-    )] = "slot",
-) -> JSONResponse:
-    """
-    [AC-MRS-10] 按角色获取槽位定义
-    
-    Args:
-        role: 字段角色，默认为 slot
-    """
-    logger.info(
-        f"[AC-MRS-10] Getting slots by role: tenant={tenant_id}, role={role}"
-    )
-    
-    provider = RoleBasedFieldProvider(session)
-    
-    try:
-        slots = await provider.get_slot_definitions_by_role(tenant_id, role)
-    except InvalidRoleError as e:
-        return JSONResponse(
-            status_code=400,
-            content={
-                "error_code": "INVALID_ROLE",
-                "message": str(e),
-                "details": {
-                    "valid_roles": e.valid_roles
-                }
-            }
-        )
-    
-    return JSONResponse(content=slots)
-
-
-@router.get(
-    "/{slot_key}",
-    operation_id="getSlotValue",
-    summary="Get runtime slot value",
-    description="[AC-MRS-09] 获取指定槽位的运行时值，包含来源、置信度、更新时间",
-)
-async def get_slot_value(
-    tenant_id: Annotated[str, Depends(get_current_tenant_id)],
-    session: Annotated[AsyncSession, Depends(get_session)],
-    slot_key: str,
-    user_id: Annotated[str | None, Query(
-        description="用户 ID"
-    )] = None,
-    session_id: Annotated[str | None, Query(
-        description="会话 ID"
-    )] = None,
-) -> JSONResponse:
-    """
-    [AC-MRS-09] 获取运行时槽位值
-    
-    Args:
-        slot_key: 槽位键名
-        user_id: 用户 ID
-        session_id: 会话 ID
-    """
-    logger.info(
-        f"[AC-MRS-09] Getting slot value: tenant={tenant_id}, slot_key={slot_key}, "
-        f"user_id={user_id}, session_id={session_id}"
-    )
-    
-    service = SlotDefinitionService(session)
-    slot_def = await service.get_slot_definition_by_key(tenant_id, slot_key)
-    
-    if not slot_def:
-        return JSONResponse(
-            status_code=404,
-            content={
-                "error_code": "NOT_FOUND",
-                "message": f"Slot '{slot_key}' not found",
-            }
-        )
-    
-    value = slot_def.default_value
-    source = "default"
-    confidence = 1.0
-    
-    if value is None:
-        if slot_def.type == "string":
-            value = ""
-        elif slot_def.type == "number":
-            value = 0
-        elif slot_def.type == "boolean":
-            value = False
-        elif slot_def.type in ["enum", "array_enum"]:
-            value = [] if slot_def.type == "array_enum" else ""
-    
-    return JSONResponse(
-        content={
-            "key": slot_key,
-            "value": value,
-            "source": source,
-            "confidence": confidence,
-            "updated_at": datetime.utcnow().isoformat(),
-        }
-    )
--- a/ai-service/app/api/openapi/init.py
+++ b/ai-service/app/api/openapi/init.py
@ -1,12 +0,0 @@
-"""OpenAPI-facing routers for third-party integrations."""
-
-from fastapi import APIRouter
-
-from .dialogue import router as dialogue_router
-from .share_page import router as share_page_router
-
-router = APIRouter()
-router.include_router(dialogue_router)
-router.include_router(share_page_router)
-
-__all__ = ["router", "dialogue_router", "share_page_router"]
--- a/ai-service/app/api/openapi/dialogue.py
+++ b/ai-service/app/api/openapi/dialogue.py
@ -1,7 +0,0 @@
-"""OpenAPI dialogue router placeholder."""
-
-from fastapi import APIRouter
-
-router = APIRouter(prefix="/openapi/v1/dialogue", tags=["OpenAPI Dialogue"])
-
-__all__ = ["router"]
--- a/ai-service/app/api/openapi/share_page.py
+++ b/ai-service/app/api/openapi/share_page.py
@ -1,466 +0,0 @@
-"""Simple shareable chat page and tokenized public chat APIs."""
-
-from __future__ import annotations
-
-import secrets
-from typing import Annotated
-
-from fastapi import APIRouter, Depends, HTTPException, Query, Request
-from fastapi.responses import HTMLResponse
-from pydantic import BaseModel, Field
-from sqlalchemy.ext.asyncio import AsyncSession
-
-from app.api.mid.dialogue import (
-    get_default_kb_tool_runner,
-    get_feature_flag_service,
-    get_high_risk_handler,
-    get_interrupt_context_enricher,
-    get_metrics_collector,
-    get_output_guardrail_executor,
-    get_policy_router,
-    get_runtime_observer,
-    get_segment_humanizer,
-    get_timeout_governor,
-    get_trace_logger,
-    respond_dialogue,
-)
-from app.core.database import get_session
-from app.core.tenant import clear_tenant_context, set_tenant_context
-from app.models.mid.schemas import DialogueRequest, HistoryMessage
-from app.services.openapi.share_token_service import get_share_token_service
-
-router = APIRouter(prefix="/openapi/v1/share", tags=["OpenAPI Share Page"])
-
-SHARE_DEVICE_COOKIE = "share_device_id"
-
-
-class CreatePublicShareTokenRequest(BaseModel):
-    tenant_id: str | None = Field(default=None, description="Tenant id (optional, fallback to X-Tenant-Id)")
-    api_key: str | None = Field(default=None, description="API key (optional, fallback to X-API-Key)")
-    session_id: str = Field(..., description="Shared session id")
-    user_id: str | None = Field(default=None, description="Optional default user id")
-    expires_in_minutes: int = Field(default=60, ge=1, le=1440, description="Token ttl in minutes")
-
-
-class CreatePublicShareTokenResponse(BaseModel):
-    share_token: str
-    share_url: str
-    expires_at: str
-
-
-class PublicShareChatRequest(BaseModel):
-    message: str = Field(..., min_length=1, max_length=2000)
-    history: list[HistoryMessage] = Field(default_factory=list)
-    user_id: str | None = None
-
-
-def _get_or_create_device_id(request: Request) -> tuple[str, bool]:
-    existing = request.cookies.get(SHARE_DEVICE_COOKIE)
-    if existing:
-        return existing, False
-    return secrets.token_urlsafe(16), True
-
-
-@router.post("/token", response_model=CreatePublicShareTokenResponse, summary="Create a public one-time share token")
-async def create_public_share_token(request: Request, body: CreatePublicShareTokenRequest) -> CreatePublicShareTokenResponse:
-    tenant_id = body.tenant_id or request.headers.get("X-Tenant-Id")
-    api_key = body.api_key or request.headers.get("X-API-Key")
-
-    if not tenant_id or not api_key:
-        raise HTTPException(status_code=400, detail="tenant_id/api_key missing")
-
-    service = get_share_token_service()
-    token, expires_at = await service.create_token(
-        tenant_id=tenant_id,
-        api_key=api_key,
-        session_id=body.session_id,
-        user_id=body.user_id,
-        expires_in_minutes=body.expires_in_minutes,
-    )
-
-    base_url = str(request.base_url).rstrip("/")
-    share_url = f"{base_url}/openapi/v1/share/chat?token={token}"
-    return CreatePublicShareTokenResponse(
-        share_token=token,
-        share_url=share_url,
-        expires_at=expires_at.isoformat(),
-    )
-
-
-@router.post("/chat/{chat_token}", summary="Public chat via consumed share token")
-async def public_chat_via_share_token(
-    chat_token: str,
-    body: PublicShareChatRequest,
-    request: Request,
-    session: Annotated[AsyncSession, Depends(get_session)],
-):
-    service = get_share_token_service()
-    device_id, _ = _get_or_create_device_id(request)
-    grant = await service.get_chat_grant_for_device(chat_token, device_id)
-    if not grant:
-        raise HTTPException(status_code=403, detail="This share link is bound to another device or expired")
-
-    set_tenant_context(grant.tenant_id)
-    request.state.tenant_id = grant.tenant_id
-
-    try:
-        mid_request = DialogueRequest(
-            session_id=grant.session_id,
-            user_id=body.user_id or grant.user_id,
-            user_message=body.message,
-            history=body.history,
-        )
-
-        result = await respond_dialogue(
-            request=request,
-            dialogue_request=mid_request,
-            session=session,
-            policy_router=get_policy_router(),
-            high_risk_handler=get_high_risk_handler(),
-            timeout_governor=get_timeout_governor(),
-            feature_flag_service=get_feature_flag_service(),
-            trace_logger=get_trace_logger(),
-            metrics_collector=get_metrics_collector(),
-            output_guardrail_executor=get_output_guardrail_executor(),
-            interrupt_context_enricher=get_interrupt_context_enricher(),
-            default_kb_tool_runner=get_default_kb_tool_runner(),
-            segment_humanizer=get_segment_humanizer(),
-            runtime_observer=get_runtime_observer(),
-        )
-    finally:
-        clear_tenant_context()
-
-    merged_reply = "\n\n".join([s.text for s in result.segments if s.text])
-    return {
-        "request_id": result.trace.request_id,
-        "reply": merged_reply,
-        "segments": [s.model_dump() for s in result.segments],
-        "trace": result.trace.model_dump(),
-    }
-
-
-@router.get("/chat", response_class=HTMLResponse, summary="Shareable chat page")
-async def share_chat_page(
-    request: Request,
-    token: Annotated[str | None, Query(description="One-time share token")] = None,
-) -> HTMLResponse:
-    service = get_share_token_service()
-    device_id, is_new_cookie = _get_or_create_device_id(request)
-
-    chat_token = ""
-    token_error = ""
-
-    if token:
-        claim = await service.claim_or_reuse(token, device_id)
-        if claim.ok and claim.grant:
-            chat_token = claim.grant.chat_token
-        elif claim.status in {"invalid", "expired"}:
-            token_error = "分享链接已失效"
-        elif claim.status == "forbidden":
-            token_error = "该链接已绑定到其他设备，无法访问"
-        else:
-            token_error = "分享链接不可用"
-    else:
-        token_error = "缺少分享 token"
-
-    html = f"""
-<!doctype html>
-<html lang="zh-CN">
-<head>
-  <meta charset="UTF-8" />
-  <meta name="viewport" content="width=device-width, initial-scale=1.0" />
-  <title>对话分享</title>
-  <style>
-    * {{ box-sizing: border-box; margin: 0; padding: 0; }}
-    body {{
-      font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, 'PingFang SC', 'Microsoft YaHei', sans-serif;
-      background: #f8f9fa;
-      min-height: 100vh;
-      display: flex;
-      flex-direction: column;
-    }}
-    .welcome-screen {{
-      flex: 1;
-      display: flex;
-      flex-direction: column;
-      align-items: center;
-      justify-content: center;
-      padding: 40px 20px;
-    }}
-    .welcome-screen.hidden {{ display: none; }}
-    .welcome-input-wrapper {{
-      width: 100%;
-      max-width: 680px;
-      background: white;
-      border-radius: 16px;
-      padding: 16px 20px;
-      box-shadow: 0 2px 12px rgba(0,0,0,0.08);
-    }}
-    .welcome-textarea, .input-textarea {{
-      width: 100%;
-      min-height: 56px;
-      border: 1px solid #e5e5e5;
-      border-radius: 12px;
-      padding: 12px 16px;
-      resize: none;
-      outline: none;
-      font-size: 15px;
-      line-height: 1.6;
-      font-family: inherit;
-      background: #fafafa;
-      transition: all 0.2s;
-    }}
-    .welcome-textarea:focus, .input-textarea:focus {{
-      border-color: #1677ff;
-      background: white;
-    }}
-    .chat-screen {{ flex: 1; display: none; flex-direction: column; }}
-    .chat-screen.active {{ display: flex; }}
-    .chat-list {{
-      flex: 1;
-      padding: 20px;
-      max-width: 800px;
-      width: 100%;
-      margin: 0 auto;
-      display: flex;
-      flex-direction: column;
-      gap: 16px;
-      overflow-y: auto;
-    }}
-    .bubble {{ display: flex; gap: 10px; align-items: flex-start; }}
-    .bubble.user {{ flex-direction: row-reverse; }}
-    .avatar {{
-      width: 32px; height: 32px; border-radius: 50%;
-      background: white; display: flex; align-items: center; justify-content: center;
-    }}
-    .bubble-content {{ max-width: 75%; }}
-    .bubble-text {{
-      padding: 12px 16px; border-radius: 16px; white-space: pre-wrap; word-break: break-word;
-      font-size: 14px; line-height: 1.6;
-    }}
-    .bubble.user .bubble-text {{ background: #1677ff; color: white; }}
-    .bubble.bot .bubble-text {{ background: white; color: #333; }}
-    .bubble.error .bubble-text {{ background: #fff2f0; color: #ff4d4f; border: 1px solid #ffccc7; }}
-    .thought-block {{
-      background: #f5f5f5;
-      color: #888;
-      padding: 12px 16px;
-      border-radius: 12px;
-      margin-bottom: 12px;
-      font-size: 13px;
-      line-height: 1.6;
-      border-left: 3px solid #ddd;
-    }}
-    .thought-label {{
-      font-weight: 600;
-      color: #999;
-      margin-bottom: 6px;
-      font-size: 12px;
-    }}
-    .final-answer-block {{
-      background: white;
-      color: #333;
-      padding: 12px 16px;
-      border-radius: 12px;
-      font-size: 14px;
-      line-height: 1.6;
-    }}
-    .final-answer-label {{
-      font-weight: 600;
-      color: #1677ff;
-      margin-bottom: 6px;
-      font-size: 12px;
-    }}
-    .input-area {{ background: white; padding: 16px 20px 20px; border-top: 1px solid #eee; }}
-    .input-wrapper {{ max-width: 800px; margin: 0 auto; display: flex; gap: 12px; align-items: flex-end; }}
-    .send-btn, .welcome-send {{
-      width: 40px; height: 40px; border-radius: 50%; border: none; cursor: pointer;
-      background: #1677ff; color: white;
-    }}
-    .status {{ text-align: center; padding: 8px; font-size: 12px; color: #999; }}
-    .status.error {{ color: #ff4d4f; }}
-  </style>
-</head>
-<body>
-  <div class="welcome-screen" id="welcomeScreen">
-    <h1>今天有什么可以帮到你？</h1>
-    <div class="welcome-input-wrapper">
-      <textarea class="welcome-textarea" id="welcomeInput" placeholder="输入消息，按 Enter 发送" rows="1"></textarea>
-      <button class="welcome-send" id="welcomeSendBtn">➤</button>
-    </div>
-  </div>
-
-  <div class="chat-screen" id="chatScreen">
-    <div class="chat-list" id="chatList"></div>
-    <div class="input-area">
-      <div class="input-wrapper">
-        <textarea class="input-textarea" id="chatInput" placeholder="输入消息..." rows="1"></textarea>
-        <button class="send-btn" id="chatSendBtn">➤</button>
-      </div>
-      <div class="status" id="status"></div>
-    </div>
-  </div>
-
-<script>
-if (window.location.search.includes('token=')) {{
-  window.history.replaceState(null, '', '/openapi/v1/share/chat');
-}}
-const CHAT_TOKEN = {chat_token!r};
-const TOKEN_ERROR = {token_error!r};
-
-const welcomeScreen = document.getElementById('welcomeScreen');
-const chatScreen = document.getElementById('chatScreen');
-const welcomeInput = document.getElementById('welcomeInput');
-const welcomeSendBtn = document.getElementById('welcomeSendBtn');
-const chatInput = document.getElementById('chatInput');
-const chatSendBtn = document.getElementById('chatSendBtn');
-const chatList = document.getElementById('chatList');
-const statusEl = document.getElementById('status');
-
-const chatHistory = [];
-let sending = false;
-let started = false;
-
-function setStatus(text, type) {{
-  statusEl.textContent = text || '';
-  statusEl.className = 'status ' + (type || '');
-}}
-
-function formatBotMessage(text) {{
-  const thoughtKey = 'Thought:';
-  const finalKey = 'Final Answer:';
-  const thoughtIdx = text.indexOf(thoughtKey);
-  const finalIdx = text.indexOf(finalKey);
-
-  if (thoughtIdx === -1 && finalIdx === -1) {{
-    return '<div class="bubble-text">' + text + '</div>';
-  }}
-
-  let html = '';
-
-  if (thoughtIdx !== -1) {{
-    const thoughtStart = thoughtIdx + thoughtKey.length;
-    const thoughtEnd = finalIdx !== -1 ? finalIdx : text.length;
-    const thoughtContent = text.slice(thoughtStart, thoughtEnd).trim().split('\\n').join('<br>');
-    if (thoughtContent) {{
-      html += '<div class="thought-block"><div class="thought-label">💭 思考过程</div><div>' + thoughtContent + '</div></div>';
-    }}
-  }}
-
-  if (finalIdx !== -1) {{
-    const answerStart = finalIdx + finalKey.length;
-    const answerContent = text.slice(answerStart).trim().split('\\n').join('<br>');
-    if (answerContent) {{
-      html += '<div class="final-answer-block"><div class="final-answer-label">✨ 回答</div><div>' + answerContent + '</div></div>';
-    }}
-  }}
-
-  return html || '<div class="bubble-text">' + text + '</div>';
-}}
-
-function addMessage(role, text) {{
-  const div = document.createElement('div');
-  div.className = 'bubble ' + role;
-  const avatar = role === 'user' ? '👤' : (role === 'bot' ? '🤖' : '⚠️');
-  
-  let contentHtml;
-  if (role === 'bot') {{
-    contentHtml = formatBotMessage(text);
-  }} else {{
-    contentHtml = '<div class="bubble-text">' + text + '</div>';
-  }}
-  
-  div.innerHTML = '<div class="avatar">' + avatar + '</div><div class="bubble-content">' + contentHtml + '</div>';
-  chatList.appendChild(div);
-  chatList.scrollTop = chatList.scrollHeight;
-}}
-
-function switchToChat() {{
-  if (!started) {{
-    welcomeScreen.classList.add('hidden');
-    chatScreen.classList.add('active');
-    started = true;
-  }}
-}}
-
-async function sendMessage(fromWelcome) {{
-  if (sending) return;
-  const input = fromWelcome ? welcomeInput : chatInput;
-  const message = (input.value || '').trim();
-  if (!message) return;
-  if (!CHAT_TOKEN) {{
-    setStatus(TOKEN_ERROR || '链接无效', 'error');
-    return;
-  }}
-
-  switchToChat();
-  addMessage('user', message);
-  chatHistory.push({{ role: 'user', content: message }});
-  input.value = '';
-
-  sending = true;
-  chatSendBtn.disabled = true;
-  welcomeSendBtn.disabled = true;
-  setStatus('发送中...');
-
-  try {{
-    const resp = await fetch('/openapi/v1/share/chat/' + encodeURIComponent(CHAT_TOKEN), {{
-      method: 'POST',
-      headers: {{ 'Content-Type': 'application/json' }},
-      body: JSON.stringify({{ message, history: chatHistory }})
-    }});
-    const data = await resp.json().catch(() => ({{}}));
-
-    if (!resp.ok) {{
-      const err = data?.detail || data?.message || '请求失败';
-      addMessage('error', '发送失败：' + err);
-      setStatus('发送失败', 'error');
-      return;
-    }}
-
-    const reply = data.reply || '(无回复)';
-    addMessage('bot', reply);
-    chatHistory.push({{ role: 'assistant', content: reply }});
-    setStatus('');
-  }} catch (e) {{
-    addMessage('error', '网络异常，请稍后重试');
-    setStatus('网络异常', 'error');
-  }} finally {{
-    sending = false;
-    chatSendBtn.disabled = false;
-    welcomeSendBtn.disabled = false;
-  }}
-}}
-
-welcomeSendBtn.addEventListener('click', () => sendMessage(true));
-chatSendBtn.addEventListener('click', () => sendMessage(false));
-welcomeInput.addEventListener('keydown', (e) => {{ if (e.key === 'Enter' && !e.shiftKey) {{ e.preventDefault(); sendMessage(true); }} }});
-chatInput.addEventListener('keydown', (e) => {{ if (e.key === 'Enter' && !e.shiftKey) {{ e.preventDefault(); sendMessage(false); }} }});
-
-if (!CHAT_TOKEN) {{
-  welcomeInput.disabled = true;
-  welcomeSendBtn.disabled = true;
-  setStatus(TOKEN_ERROR || '链接无效', 'error');
-}}
-</script>
-</body>
-</html>
-"""
-    response = HTMLResponse(
-        content=html,
-        headers={
-            "Cache-Control": "no-store, no-cache, must-revalidate, max-age=0",
-            "Pragma": "no-cache",
-            "Expires": "0",
-        },
-    )
-    if is_new_cookie:
-        response.set_cookie(
-            key=SHARE_DEVICE_COOKIE,
-            value=device_id,
-            httponly=True,
-            samesite="lax",
-            secure=False,
-            max_age=30 * 24 * 3600,
-        )
-    return response
--- a/ai-service/app/core/config.py
+++ b/ai-service/app/core/config.py
@ -65,8 +65,6 @@ class Settings(BaseSettings):
    dashboard_cache_ttl: int = 60
    stats_counter_ttl: int = 7776000

-    frontend_base_url: str = "http://localhost:3000"
-

@lru_cache
 def get_settings() -> Settings:
--- a/ai-service/app/core/middleware.py
+++ b/ai-service/app/core/middleware.py
@ -5,7 +5,6 @@ Middleware for AI Service.

 import logging
 import re
-import uuid
 from collections.abc import Callable

 from fastapi import Request, Response, status
@ -21,17 +20,6 @@ TENANT_ID_HEADER = "X-Tenant-Id"
 API_KEY_HEADER = "X-API-Key"
 ACCEPT_HEADER = "Accept"
 SSE_CONTENT_TYPE = "text/event-stream"
-REQUEST_ID_HEADER = "X-Request-Id"
-
-# Prompt template protected variable names injected by system/runtime.
-# These are reserved for internal orchestration and should not be overridden by user input.
-PROMPT_PROTECTED_VARIABLES = {
-    "available_tools",
-    "query",
-    "history",
-    "internal_protocol",
-    "output_contract",
-}

 TENANT_ID_PATTERN = re.compile(r'^[^@]+@ash@\d{4}$')

@ -41,15 +29,6 @@ PATHS_SKIP_API_KEY = {
    "/docs",
    "/redoc",
    "/openapi.json",
-    "/favicon.ico",
-    "/openapi/v1/share/chat",
-}
-
-PATHS_SKIP_TENANT = {
-    "/health",
-    "/ai/health",
-    "/favicon.ico",
-    "/openapi/v1/share/chat",
 }


@ -84,24 +63,17 @@ class ApiKeyMiddleware(BaseHTTPMiddleware):
        if self._should_skip_api_key(request.url.path):
            return await call_next(request)

-        request_id = request.headers.get(REQUEST_ID_HEADER) or str(uuid.uuid4())
-        request.state.request_id = request_id
-
        api_key = request.headers.get(API_KEY_HEADER)

        if not api_key or not api_key.strip():
-            logger.warning(
-                f"[AC-AISVC-50] Missing X-API-Key header for {request.url.path}, request_id={request_id}"
-            )
-            response = JSONResponse(
+            logger.warning(f"[AC-AISVC-50] Missing X-API-Key header for {request.url.path}")
+            return JSONResponse(
                status_code=status.HTTP_401_UNAUTHORIZED,
                content=ErrorResponse(
                    code=ErrorCode.UNAUTHORIZED.value,
                    message="Missing required header: X-API-Key",
                ).model_dump(exclude_none=True),
            )
-            response.headers[REQUEST_ID_HEADER] = request_id
-            return response

        api_key = api_key.strip()

@ -118,45 +90,17 @@ class ApiKeyMiddleware(BaseHTTPMiddleware):
            except Exception as e:
                logger.error(f"[AC-AISVC-50] Failed to initialize API key service: {e}")

-        client_ip = request.client.host if request.client else None
-        tenant_id = request.headers.get(TENANT_ID_HEADER, "")
-
-        validation = service.validate_key_with_context(api_key, client_ip=client_ip)
-        if not validation.ok:
-            if validation.reason == "rate_limited":
-                logger.warning(
-                    f"[AC-AISVC-50] Rate limited: path={request.url.path}, tenant={tenant_id}, "
-                    f"ip={client_ip}, qpm={validation.rate_limit_qpm}, request_id={request_id}"
-                )
-                response = JSONResponse(
-                    status_code=status.HTTP_429_TOO_MANY_REQUESTS,
-                    content=ErrorResponse(
-                        code=ErrorCode.SERVICE_UNAVAILABLE.value,
-                        message="Rate limit exceeded",
-                        details=[{"reason": "rate_limited", "limit_qpm": validation.rate_limit_qpm}],
-                    ).model_dump(exclude_none=True),
-                )
-                response.headers[REQUEST_ID_HEADER] = request_id
-                return response
-
-            logger.warning(
-                f"[AC-AISVC-50] API key validation failed: reason={validation.reason}, "
-                f"path={request.url.path}, tenant={tenant_id}, ip={client_ip}, request_id={request_id}"
-            )
-            response = JSONResponse(
+        if not service.validate_key(api_key):
+            logger.warning(f"[AC-AISVC-50] Invalid API key for {request.url.path}")
+            return JSONResponse(
                status_code=status.HTTP_401_UNAUTHORIZED,
                content=ErrorResponse(
                    code=ErrorCode.UNAUTHORIZED.value,
                    message="Invalid API key",
-                    details=[{"reason": validation.reason}],
                ).model_dump(exclude_none=True),
            )
-            response.headers[REQUEST_ID_HEADER] = request_id
-            return response

-        response = await call_next(request)
-        response.headers[REQUEST_ID_HEADER] = request_id
-        return response
+        return await call_next(request)

    def _should_skip_api_key(self, path: str) -> bool:
        """Check if the path should skip API key validation."""
@ -178,7 +122,7 @@ class TenantContextMiddleware(BaseHTTPMiddleware):
    async def dispatch(self, request: Request, call_next: Callable) -> Response:
        clear_tenant_context()

-        if self._should_skip_tenant(request.url.path):
+        if request.url.path in ("/health", "/ai/health"):
            return await call_next(request)

        tenant_id = request.headers.get(TENANT_ID_HEADER)
@ -229,15 +173,6 @@ class TenantContextMiddleware(BaseHTTPMiddleware):

        return response

-    def _should_skip_tenant(self, path: str) -> bool:
-        """Check if the path should skip tenant validation."""
-        if path in PATHS_SKIP_TENANT:
-            return True
-        for skip_path in PATHS_SKIP_TENANT:
-            if path.startswith(skip_path):
-                return True
-        return False
-
    async def _ensure_tenant_exists(self, request: Request, tenant_id: str) -> None:
        """
        [AC-AISVC-10] Ensure tenant exists in database, create if not.
--- a/ai-service/app/main.py
+++ b/ai-service/app/main.py
@ -9,11 +9,9 @@ from contextlib import asynccontextmanager
 from fastapi import FastAPI, Request, status
 from fastapi.exceptions import HTTPException, RequestValidationError
 from fastapi.middleware.cors import CORSMiddleware
-from fastapi.responses import JSONResponse, Response
+from fastapi.responses import JSONResponse

 from app.api import chat_router, health_router
-from app.api.mid import router as mid_router
-from app.api.openapi import router as openapi_router
 from app.api.admin import (
    api_key_router,
    dashboard_router,
@ -31,7 +29,6 @@ from app.api.admin import (
    rag_router,
    script_flows_router,
    sessions_router,
-    slot_definition_router,
    tenants_router,
 )
 from app.api.admin.kb_optimized import router as kb_optimized_router
@ -131,11 +128,6 @@ app.add_exception_handler(HTTPException, http_exception_handler)
 app.add_exception_handler(Exception, generic_exception_handler)


-@app.get("/favicon.ico", include_in_schema=False)
-async def favicon() -> Response:
-    return Response(status_code=204)
-
-
@app.exception_handler(RequestValidationError)
 async def validation_exception_handler(request: Request, exc: RequestValidationError):
    """
@ -173,12 +165,8 @@ app.include_router(prompt_templates_router)
 app.include_router(rag_router)
 app.include_router(script_flows_router)
 app.include_router(sessions_router)
-app.include_router(slot_definition_router)
 app.include_router(tenants_router)

-app.include_router(mid_router)
-app.include_router(openapi_router)
-

 if __name__ == "__main__":
    import uvicorn
--- a/ai-service/app/models/entities.py
+++ b/ai-service/app/models/entities.py
@ -110,33 +110,6 @@ class ChatMessageCreate(SQLModel):
    content: str


-class SharedSession(SQLModel, table=True):
-    """
-    [AC-IDMP-SHARE] Shared session entity for dialogue sharing.
-    Allows sharing chat sessions via unique token with expiration and concurrent user limits.
-    """
-
-    __tablename__ = "shared_sessions"
-    __table_args__ = (
-        Index("ix_shared_sessions_share_token", "share_token", unique=True),
-        Index("ix_shared_sessions_tenant_session", "tenant_id", "session_id"),
-        Index("ix_shared_sessions_expires_at", "expires_at"),
-    )
-
-    id: uuid.UUID = Field(default_factory=uuid.uuid4, primary_key=True)
-    share_token: str = Field(..., description="Unique share token (UUID)", index=True)
-    session_id: str = Field(..., description="Associated session ID", index=True)
-    tenant_id: str = Field(..., description="Tenant ID for multi-tenant isolation", index=True)
-    title: str | None = Field(default=None, description="Share title")
-    description: str | None = Field(default=None, description="Share description")
-    expires_at: datetime = Field(..., description="Expiration time")
-    is_active: bool = Field(default=True, description="Whether share is active")
-    max_concurrent_users: int = Field(default=10, description="Maximum concurrent users allowed")
-    current_users: int = Field(default=0, description="Current number of online users")
-    created_at: datetime = Field(default_factory=datetime.utcnow, description="Creation time")
-    updated_at: datetime = Field(default_factory=datetime.utcnow, description="Last update time")
-
-
 class DocumentStatus(str, Enum):
    PENDING = "pending"
    PROCESSING = "processing"
@ -294,13 +267,6 @@ class ApiKey(SQLModel, table=True):
    key: str = Field(..., description="API Key (unique)", unique=True, index=True)
    name: str = Field(..., description="Key name/description for identification")
    is_active: bool = Field(default=True, description="Whether the key is active")
-    expires_at: datetime | None = Field(default=None, description="Expiration time; null means never expires")
-    allowed_ips: list[str] | None = Field(
-        default=None,
-        sa_column=Column("allowed_ips", JSON, nullable=True),
-        description="Optional IP allowlist for this key",
-    )
-    rate_limit_qpm: int | None = Field(default=60, description="Per-minute quota for this key")
    created_at: datetime = Field(default_factory=datetime.utcnow, description="Creation time")
    updated_at: datetime = Field(default_factory=datetime.utcnow, description="Last update time")

@ -311,9 +277,6 @@ class ApiKeyCreate(SQLModel):
    key: str
    name: str
    is_active: bool = True
-    expires_at: datetime | None = None
-    allowed_ips: list[str] | None = None
-    rate_limit_qpm: int | None = 60


 class TemplateVersionStatus(str, Enum):
@ -993,17 +956,6 @@ class MetadataFieldType(str, Enum):
    ARRAY_ENUM = "array_enum"


-class FieldRole(str, Enum):
-    """
-    [AC-MRS-01] 字段角色枚举
-    用于标识元数据字段的职责分层
-    """
-    RESOURCE_FILTER = "resource_filter"
-    SLOT = "slot"
-    PROMPT_VAR = "prompt_var"
-    ROUTING_SIGNAL = "routing_signal"
-
-
 class MetadataFieldStatus(str, Enum):
    """[AC-IDSMETA-13] 元数据字段状态"""
    DRAFT = "draft"
@ -1040,7 +992,6 @@ class MetadataField(SQLModel):
 class MetadataFieldDefinition(SQLModel, table=True):
    """
    [AC-IDSMETA-13] 元数据字段定义表
-    [AC-MRS-01,02,03] 支持字段角色分层配置
    每个字段独立存储，支持字段级状态管理（draft/active/deprecated）
    """

@ -1078,11 +1029,6 @@ class MetadataFieldDefinition(SQLModel, table=True):
    )
    is_filterable: bool = Field(default=True, description="是否可用于过滤")
    is_rank_feature: bool = Field(default=False, description="是否用于排序特征")
-    field_roles: list[str] = Field(
-        default_factory=list,
-        sa_column=Column("field_roles", JSON, nullable=False, server_default="'[]'"),
-        description="[AC-MRS-01] 字段角色列表: resource_filter/slot/prompt_var/routing_signal"
-    )
    status: str = Field(
        default=MetadataFieldStatus.DRAFT.value,
        description="字段状态: draft/active/deprecated"
@ -1093,7 +1039,7 @@ class MetadataFieldDefinition(SQLModel, table=True):


 class MetadataFieldDefinitionCreate(SQLModel):
-    """[AC-IDSMETA-13] [AC-MRS-01] 创建元数据字段定义"""
+    """[AC-IDSMETA-13] 创建元数据字段定义"""

    field_key: str = Field(..., min_length=1, max_length=64)
    label: str = Field(..., min_length=1, max_length=64)
@ -1104,12 +1050,11 @@ class MetadataFieldDefinitionCreate(SQLModel):
    scope: list[str] = Field(default_factory=lambda: [MetadataScope.KB_DOCUMENT.value])
    is_filterable: bool = Field(default=True)
    is_rank_feature: bool = Field(default=False)
-    field_roles: list[str] = Field(default_factory=list)
    status: str = Field(default=MetadataFieldStatus.DRAFT.value)


 class MetadataFieldDefinitionUpdate(SQLModel):
-    """[AC-IDSMETA-14] [AC-MRS-01] 更新元数据字段定义"""
+    """[AC-IDSMETA-14] 更新元数据字段定义"""

    label: str | None = Field(default=None, min_length=1, max_length=64)
    required: bool | None = None
@ -1118,129 +1063,9 @@ class MetadataFieldDefinitionUpdate(SQLModel):
    scope: list[str] | None = None
    is_filterable: bool | None = None
    is_rank_feature: bool | None = None
-    field_roles: list[str] | None = None
    status: str | None = None


-class ExtractStrategy(str, Enum):
-    """
-    [AC-MRS-07] 槽位值提取策略
-    """
-    RULE = "rule"
-    LLM = "llm"
-    USER_INPUT = "user_input"
-
-
-class SlotValueSource(str, Enum):
-    """
-    [AC-MRS-09] 槽位值来源
-    """
-    USER_CONFIRMED = "user_confirmed"
-    RULE_EXTRACTED = "rule_extracted"
-    LLM_INFERRED = "llm_inferred"
-    DEFAULT = "default"
-
-
-class SlotDefinition(SQLModel, table=True):
-    """
-    [AC-MRS-07,08] 槽位定义表
-    独立的槽位定义模型，与元数据字段解耦但可复用
-    """
-
-    __tablename__ = "slot_definitions"
-    __table_args__ = (
-        Index("ix_slot_definitions_tenant", "tenant_id"),
-        Index("ix_slot_definitions_tenant_key", "tenant_id", "slot_key", unique=True),
-        Index("ix_slot_definitions_linked_field", "linked_field_id"),
-    )
-
-    id: uuid.UUID = Field(default_factory=uuid.uuid4, primary_key=True)
-    tenant_id: str = Field(..., description="Tenant ID for multi-tenant isolation", index=True)
-    slot_key: str = Field(
-        ...,
-        description="槽位键名，可与元数据字段 field_key 关联",
-        min_length=1,
-        max_length=100,
-    )
-    type: str = Field(
-        default=MetadataFieldType.STRING.value,
-        description="槽位类型: string/number/boolean/enum/array_enum"
-    )
-    required: bool = Field(default=False, description="是否必填槽位")
-    extract_strategy: str | None = Field(
-        default=None,
-        description="提取策略: rule/llm/user_input"
-    )
-    validation_rule: str | None = Field(
-        default=None,
-        description="校验规则（正则或 JSON Schema）"
-    )
-    ask_back_prompt: str | None = Field(
-        default=None,
-        description="追问提示语模板"
-    )
-    default_value: dict[str, Any] | None = Field(
-        default=None,
-        sa_column=Column("default_value", JSON, nullable=True),
-        description="默认值"
-    )
-    linked_field_id: uuid.UUID | None = Field(
-        default=None,
-        description="关联的元数据字段 ID",
-        foreign_key="metadata_field_definitions.id",
-    )
-    created_at: datetime = Field(default_factory=datetime.utcnow, description="创建时间")
-    updated_at: datetime = Field(default_factory=datetime.utcnow, description="更新时间")
-
-
-class SlotDefinitionCreate(SQLModel):
-    """[AC-MRS-07,08] 创建槽位定义"""
-
-    slot_key: str = Field(..., min_length=1, max_length=100)
-    type: str = Field(default=MetadataFieldType.STRING.value)
-    required: bool = Field(default=False)
-    extract_strategy: str | None = None
-    validation_rule: str | None = None
-    ask_back_prompt: str | None = None
-    default_value: dict[str, Any] | None = None
-    linked_field_id: str | None = None
-
-
-class SlotDefinitionUpdate(SQLModel):
-    """[AC-MRS-07] 更新槽位定义"""
-
-    type: str | None = None
-    required: bool | None = None
-    extract_strategy: str | None = None
-    validation_rule: str | None = None
-    ask_back_prompt: str | None = None
-    default_value: dict[str, Any] | None = None
-    linked_field_id: str | None = None
-
-
-class SlotValue(SQLModel):
-    """
-    [AC-MRS-09] 运行时槽位值
-    """
-
-    key: str = Field(..., description="槽位键名")
-    value: Any = Field(..., description="槽位值")
-    source: str = Field(
-        default=SlotValueSource.DEFAULT.value,
-        description="来源: user_confirmed/rule_extracted/llm_inferred/default"
-    )
-    confidence: float = Field(
-        default=1.0,
-        ge=0.0,
-        le=1.0,
-        description="置信度 0.0~1.0"
-    )
-    updated_at: datetime = Field(
-        default_factory=datetime.utcnow,
-        description="最后更新时间"
-    )
-
-
 class MetadataSchema(SQLModel, table=True):
    """
    元数据模式定义（保留兼容性）
@ -1388,137 +1213,3 @@ class DecompositionResult(SQLModel):
    confidence: float | None = Field(default=None, description="拆解置信度")
    error: str | None = Field(default=None, description="错误信息")
    latency_ms: int | None = Field(default=None, description="处理耗时（毫秒）")
-
-
-class HighRiskScenarioType(str, Enum):
-    """[AC-IDMP-20] 高风险场景类型"""
-    REFUND = "refund"
-    COMPLAINT_ESCALATION = "complaint_escalation"
-    PRIVACY_SENSITIVE_PROMISE = "privacy_sensitive_promise"
-    TRANSFER = "transfer"
-
-
-class HighRiskPolicy(SQLModel, table=True):
-    """
-    [AC-IDMP-20] 高风险场景策略配置
-    定义高风险场景的最小集，支持动态配置
-    """
-
-    __tablename__ = "high_risk_policies"
-    __table_args__ = (
-        Index("ix_high_risk_policies_tenant", "tenant_id"),
-        Index("ix_high_risk_policies_tenant_enabled", "tenant_id", "is_enabled"),
-    )
-
-    id: uuid.UUID = Field(default_factory=uuid.uuid4, primary_key=True)
-    tenant_id: str = Field(..., description="Tenant ID for multi-tenant isolation", index=True)
-    scenario: str = Field(..., description="场景类型: refund/complaint_escalation/privacy_sensitive_promise/transfer")
-    handler_mode: str = Field(
-        default="micro_flow",
-        description="处理模式: micro_flow/transfer"
-    )
-    flow_id: uuid.UUID | None = Field(
-        default=None,
-        description="微流程ID (handler_mode=micro_flow时使用)",
-        foreign_key="script_flows.id"
-    )
-    transfer_message: str | None = Field(
-        default=None,
-        description="转人工消息 (handler_mode=transfer时使用)"
-    )
-    keywords: list[str] | None = Field(
-        default=None,
-        sa_column=Column("keywords", JSON, nullable=True),
-        description="触发关键词列表"
-    )
-    patterns: list[str] | None = Field(
-        default=None,
-        sa_column=Column("patterns", JSON, nullable=True),
-        description="触发正则模式列表"
-    )
-    priority: int = Field(default=0, description="优先级 (值越高优先级越高)")
-    is_enabled: bool = Field(default=True, description="是否启用")
-    created_at: datetime = Field(default_factory=datetime.utcnow, description="创建时间")
-    updated_at: datetime = Field(default_factory=datetime.utcnow, description="更新时间")
-
-
-class HighRiskPolicyCreate(SQLModel):
-    """[AC-IDMP-20] 创建高风险策略"""
-
-    scenario: str
-    handler_mode: str = "micro_flow"
-    flow_id: str | None = None
-    transfer_message: str | None = None
-    keywords: list[str] | None = None
-    patterns: list[str] | None = None
-    priority: int = 0
-
-
-class HighRiskPolicyUpdate(SQLModel):
-    """[AC-IDMP-20] 更新高风险策略"""
-
-    handler_mode: str | None = None
-    flow_id: str | None = None
-    transfer_message: str | None = None
-    keywords: list[str] | None = None
-    patterns: list[str] | None = None
-    priority: int | None = None
-    is_enabled: bool | None = None
-
-
-class SessionModeRecord(SQLModel, table=True):
-    """
-    [AC-IDMP-09] 会话模式记录
-    记录会话的当前模式状态
-    """
-
-    __tablename__ = "session_mode_records"
-    __table_args__ = (
-        Index("ix_session_mode_records_tenant_session", "tenant_id", "session_id", unique=True),
-    )
-
-    id: uuid.UUID = Field(default_factory=uuid.uuid4, primary_key=True)
-    tenant_id: str = Field(..., description="Tenant ID for multi-tenant isolation", index=True)
-    session_id: str = Field(..., description="会话ID", index=True)
-    mode: str = Field(
-        default="BOT_ACTIVE",
-        description="会话模式: BOT_ACTIVE/HUMAN_ACTIVE"
-    )
-    reason: str | None = Field(default=None, description="模式切换原因")
-    switched_at: datetime | None = Field(default=None, description="模式切换时间")
-    created_at: datetime = Field(default_factory=datetime.utcnow, description="创建时间")
-    updated_at: datetime = Field(default_factory=datetime.utcnow, description="更新时间")
-
-
-class MidAuditLog(SQLModel, table=True):
-    """
-    [AC-IDMP-07] 中台审计日志
-    记录 generation/request 维度的审计字段
-    """
-
-    __tablename__ = "mid_audit_logs"
-    __table_args__ = (
-        Index("ix_mid_audit_logs_tenant_session", "tenant_id", "session_id"),
-        Index("ix_mid_audit_logs_tenant_request", "tenant_id", "request_id"),
-        Index("ix_mid_audit_logs_tenant_generation", "tenant_id", "generation_id"),
-        Index("ix_mid_audit_logs_created", "created_at"),
-    )
-
-    id: uuid.UUID = Field(default_factory=uuid.uuid4, primary_key=True)
-    tenant_id: str = Field(..., description="Tenant ID for multi-tenant isolation", index=True)
-    session_id: str = Field(..., description="会话ID", index=True)
-    request_id: str = Field(..., description="请求ID", index=True)
-    generation_id: str = Field(..., description="生成ID", index=True)
-    mode: str = Field(..., description="执行模式: agent/micro_flow/fixed/transfer")
-    intent: str | None = Field(default=None, description="意图")
-    tool_calls: list[dict[str, Any]] | None = Field(
-        default=None,
-        sa_column=Column("tool_calls", JSON, nullable=True),
-        description="工具调用记录"
-    )
-    guardrail_triggered: bool = Field(default=False, description="护栏是否触发")
-    fallback_reason_code: str | None = Field(default=None, description="降级原因码")
-    react_iterations: int | None = Field(default=None, description="ReAct循环次数")
-    high_risk_scenario: str | None = Field(default=None, description="触发的高风险场景")
-    latency_ms: int | None = Field(default=None, description="总耗时(ms)")
-    created_at: datetime = Field(default_factory=datetime.utcnow, description="创建时间", index=True)
--- a/ai-service/app/models/mid/init.py
+++ b/ai-service/app/models/mid/init.py
@ -1,224 +0,0 @@
-"""
-Mid Platform models for Intent-Driven Agent.
-[AC-IDMP-01~20] 中台统一响应协议模型
-"""
-
-import uuid
-from datetime import datetime
-from enum import Enum
-from typing import Any
-
-from pydantic import BaseModel, Field
-
-
-class Mode(str, Enum):
-    """[AC-IDMP-02] 执行模式"""
-    AGENT = "agent"
-    MICRO_FLOW = "micro_flow"
-    FIXED = "fixed"
-    TRANSFER = "transfer"
-
-
-class SessionMode(str, Enum):
-    """[AC-IDMP-09] 会话模式"""
-    BOT_ACTIVE = "BOT_ACTIVE"
-    HUMAN_ACTIVE = "HUMAN_ACTIVE"
-
-
-class HighRiskScenario(str, Enum):
-    """[AC-IDMP-20] 高风险场景最小集"""
-    REFUND = "refund"
-    COMPLAINT_ESCALATION = "complaint_escalation"
-    PRIVACY_SENSITIVE_PROMISE = "privacy_sensitive_promise"
-    TRANSFER = "transfer"
-
-
-class ToolCallStatus(str, Enum):
-    """[AC-IDMP-15] 工具调用状态"""
-    OK = "ok"
-    TIMEOUT = "timeout"
-    ERROR = "error"
-    REJECTED = "rejected"
-
-
-class ToolType(str, Enum):
-    """[AC-IDMP-19] 工具类型"""
-    INTERNAL = "internal"
-    MCP = "mcp"
-
-
-class HistoryMessage(BaseModel):
-    """[AC-IDMP-03] 已送达历史消息"""
-    role: str = Field(..., description="消息角色: user/assistant/human")
-    content: str = Field(..., description="消息内容")
-
-
-class InterruptedSegment(BaseModel):
-    """[AC-IDMP-04] 打断的分段"""
-    segment_id: str = Field(..., description="分段ID")
-    content: str = Field(..., description="分段内容")
-
-
-class FeatureFlags(BaseModel):
-    """[AC-IDMP-17] 特性开关"""
-    agent_enabled: bool | None = Field(default=None, description="会话级 Agent 灰度开关")
-    rollback_to_legacy: bool | None = Field(default=None, description="强制回滚传统链路")
-
-
-class DialogueRequest(BaseModel):
-    """[AC-IDMP-01~04] 会话响应请求"""
-    session_id: str = Field(..., description="会话ID")
-    user_id: str | None = Field(default=None, description="用户ID，用于记忆召回与更新")
-    user_message: str = Field(..., min_length=1, max_length=2000, description="用户消息")
-    history: list[HistoryMessage] = Field(default_factory=list, description="已送达历史")
-    interrupted_segments: list[InterruptedSegment] | None = Field(default=None, description="打断的分段")
-    feature_flags: FeatureFlags | None = Field(default=None, description="特性开关")
-
-
-class Segment(BaseModel):
-    """[AC-IDMP-01] 响应分段"""
-    segment_id: str = Field(..., description="分段ID")
-    text: str = Field(..., description="分段文本")
-    delay_after: int = Field(default=0, ge=0, description="分段后延迟(ms)")
-
-
-class TimeoutProfile(BaseModel):
-    """[AC-IDMP-12] 超时配置"""
-    per_tool_timeout_ms: int | None = Field(default=30000, le=60000, description="单工具超时(ms)")
-    end_to_end_timeout_ms: int | None = Field(default=120000, le=180000, description="端到端超时(ms)")
-
-
-class MetricsSnapshot(BaseModel):
-    """[AC-IDMP-18] 运行指标快照"""
-    task_completion_rate: float | None = Field(default=None, ge=0, le=1, description="任务达成率")
-    slot_completion_rate: float | None = Field(default=None, ge=0, le=1, description="槽位完整率")
-    wrong_transfer_rate: float | None = Field(default=None, ge=0, le=1, description="误转人工率")
-    no_recall_rate: float | None = Field(default=None, ge=0, le=1, description="无召回率")
-    avg_latency_ms: float | None = Field(default=None, ge=0, description="平均时延(ms)")
-
-
-class ToolCallTraceModel(BaseModel):
-    """[AC-IDMP-15/19] 工具调用追踪 (Pydantic 模型)"""
-    tool_name: str = Field(..., description="工具名称")
-    tool_type: ToolType | None = Field(default=None, description="工具类型: internal/mcp")
-    registry_version: str | None = Field(default=None, description="注册版本")
-    auth_applied: bool | None = Field(default=None, description="是否应用鉴权")
-    duration_ms: int = Field(..., ge=0, description="耗时(ms)")
-    status: ToolCallStatus = Field(..., description="状态: ok/timeout/error/rejected")
-    error_code: str | None = Field(default=None, description="错误码")
-    args_digest: str | None = Field(default=None, description="参数摘要")
-    result_digest: str | None = Field(default=None, description="结果摘要")
-
-
-class TraceInfo(BaseModel):
-    """[AC-IDMP-02/07] 追踪信息"""
-    mode: Mode = Field(..., description="执行模式")
-    intent: str | None = Field(default=None, description="意图")
-    request_id: str | None = Field(default=None, description="请求ID")
-    generation_id: str | None = Field(default=None, description="生成ID")
-    guardrail_triggered: bool | None = Field(default=False, description="护栏是否触发")
-    fallback_reason_code: str | None = Field(default=None, description="降级原因码")
-    react_iterations: int | None = Field(default=None, ge=0, le=5, description="ReAct循环次数")
-    timeout_profile: TimeoutProfile | None = Field(default=None, description="超时配置")
-    metrics_snapshot: MetricsSnapshot | None = Field(default=None, description="指标快照")
-    high_risk_policy_set: list[HighRiskScenario] | None = Field(
-        default=None, 
-        description="当前启用的高风险最小场景集"
-    )
-    tools_used: list[str] | None = Field(default=None, description="使用的工具列表")
-    tool_calls: list[ToolCallTraceModel] | None = Field(default=None, description="工具调用追踪")
-
-
-class DialogueResponse(BaseModel):
-    """[AC-IDMP-01/02] 会话响应"""
-    segments: list[Segment] = Field(..., description="响应分段列表")
-    trace: TraceInfo = Field(..., description="追踪信息")
-
-
-class ReportedMessage(BaseModel):
-    """[AC-IDMP-08] 上报的消息"""
-    role: str = Field(..., description="角色: user/assistant/human/system")
-    content: str = Field(..., description="消息内容")
-    source: str = Field(..., description="来源: bot/human/channel")
-    timestamp: datetime = Field(..., description="时间戳")
-    segment_id: str | None = Field(default=None, description="分段ID")
-
-
-class MessageReportRequest(BaseModel):
-    """[AC-IDMP-08] 消息上报请求"""
-    session_id: str = Field(..., description="会话ID")
-    messages: list[ReportedMessage] = Field(..., description="消息列表")
-
-
-class SwitchModeRequest(BaseModel):
-    """[AC-IDMP-09] 切换模式请求"""
-    mode: SessionMode = Field(..., description="目标模式")
-    reason: str | None = Field(default=None, description="切换原因")
-
-
-class SwitchModeResponse(BaseModel):
-    """[AC-IDMP-09] 切换模式响应"""
-    session_id: str = Field(..., description="会话ID")
-    mode: SessionMode = Field(..., description="当前模式")
-
-
-from app.models.mid.memory import (
-    RecallRequest,
-    RecallResponse,
-    UpdateRequest,
-    MemoryProfile,
-    MemoryFact,
-    MemoryPreferences,
-)
-from app.models.mid.tool_trace import (
-    ToolCallTrace,
-    ToolCallBuilder,
-    ToolCallStatus as ToolCallStatusEnum,
-    ToolType as ToolTypeEnum,
-)
-from app.models.mid.tool_registry import (
-    ToolDefinition,
-    ToolAuthConfig,
-    ToolTimeoutPolicy,
-    ToolRegistryEntity,
-    ToolRegistryCreate,
-    ToolRegistryUpdate,
-)
-
-__all__ = [
-    "Mode",
-    "SessionMode",
-    "HighRiskScenario",
-    "ToolCallStatus",
-    "ToolType",
-    "HistoryMessage",
-    "InterruptedSegment",
-    "FeatureFlags",
-    "DialogueRequest",
-    "Segment",
-    "TimeoutProfile",
-    "MetricsSnapshot",
-    "ToolCallTraceModel",
-    "TraceInfo",
-    "DialogueResponse",
-    "ReportedMessage",
-    "MessageReportRequest",
-    "SwitchModeRequest",
-    "SwitchModeResponse",
-    "RecallRequest",
-    "RecallResponse",
-    "UpdateRequest",
-    "MemoryProfile",
-    "MemoryFact",
-    "MemoryPreferences",
-    "ToolCallTrace",
-    "ToolCallBuilder",
-    "ToolCallStatusEnum",
-    "ToolTypeEnum",
-    "ToolDefinition",
-    "ToolAuthConfig",
-    "ToolTimeoutPolicy",
-    "ToolRegistryEntity",
-    "ToolRegistryCreate",
-    "ToolRegistryUpdate",
-]
--- a/ai-service/app/models/mid/memory.py
+++ b/ai-service/app/models/mid/memory.py
@ -1,182 +0,0 @@
-"""
-Memory models for Mid Platform.
-[AC-IDMP-13] 记忆召回数据模型
-[AC-IDMP-14] 记忆更新数据模型
-
-Reference: spec/intent-driven-mid-platform/openapi.deps.yaml
-"""
-
-from dataclasses import dataclass, field
-from datetime import datetime
-from typing import Any
-from uuid import UUID
-
-
-@dataclass
-class MemoryProfile:
-    """
-    [AC-IDMP-13] 用户基础属性记忆
-    包含年级、地区、渠道等基础信息
-    """
-    grade: str | None = None
-    region: str | None = None
-    channel: str | None = None
-    vip_level: str | None = None
-    registration_date: datetime | None = None
-    extra: dict[str, Any] = field(default_factory=dict)
-
-    def to_dict(self) -> dict[str, Any]:
-        result = {}
-        if self.grade:
-            result["grade"] = self.grade
-        if self.region:
-            result["region"] = self.region
-        if self.channel:
-            result["channel"] = self.channel
-        if self.vip_level:
-            result["vip_level"] = self.vip_level
-        if self.registration_date:
-            result["registration_date"] = self.registration_date.isoformat()
-        if self.extra:
-            result.update(self.extra)
-        return result
-
-
-@dataclass
-class MemoryFact:
-    """
-    [AC-IDMP-13] 事实型记忆
-    包含已购课程、学习结论等客观事实
-    """
-    content: str
-    source: str | None = None
-    confidence: float | None = None
-    created_at: datetime | None = None
-    expires_at: datetime | None = None
-
-    def to_string(self) -> str:
-        return self.content
-
-
-@dataclass
-class MemoryPreferences:
-    """
-    [AC-IDMP-13] 偏好记忆
-    包含语气偏好、关注科目等用户偏好
-    """
-    tone: str | None = None
-    focus_subjects: list[str] = field(default_factory=list)
-    communication_style: str | None = None
-    preferred_time: str | None = None
-    extra: dict[str, Any] = field(default_factory=dict)
-
-    def to_dict(self) -> dict[str, Any]:
-        result = {}
-        if self.tone:
-            result["tone"] = self.tone
-        if self.focus_subjects:
-            result["focus_subjects"] = self.focus_subjects
-        if self.communication_style:
-            result["communication_style"] = self.communication_style
-        if self.preferred_time:
-            result["preferred_time"] = self.preferred_time
-        if self.extra:
-            result.update(self.extra)
-        return result
-
-
-@dataclass
-class RecallRequest:
-    """
-    [AC-IDMP-13] 记忆召回请求
-    Reference: openapi.deps.yaml - RecallRequest
-    """
-    user_id: str
-    session_id: str
-
-    def to_dict(self) -> dict[str, Any]:
-        return {
-            "user_id": self.user_id,
-            "session_id": self.session_id,
-        }
-
-
-@dataclass
-class RecallResponse:
-    """
-    [AC-IDMP-13] 记忆召回响应
-    Reference: openapi.deps.yaml - RecallResponse
-    """
-    profile: MemoryProfile | None = None
-    facts: list[MemoryFact] = field(default_factory=list)
-    preferences: MemoryPreferences | None = None
-    last_summary: str | None = None
-
-    def to_dict(self) -> dict[str, Any]:
-        result = {}
-        if self.profile:
-            result["profile"] = self.profile.to_dict()
-        if self.facts:
-            result["facts"] = [f.to_string() for f in self.facts]
-        if self.preferences:
-            result["preferences"] = self.preferences.to_dict()
-        if self.last_summary:
-            result["last_summary"] = self.last_summary
-        return result
-
-    def get_context_for_prompt(self) -> str:
-        """
-        生成用于注入 Prompt 的上下文字符串
-        """
-        parts = []
-        
-        if self.profile:
-            profile_parts = []
-            if self.profile.grade:
-                profile_parts.append(f"年级: {self.profile.grade}")
-            if self.profile.region:
-                profile_parts.append(f"地区: {self.profile.region}")
-            if self.profile.vip_level:
-                profile_parts.append(f"会员等级: {self.profile.vip_level}")
-            if profile_parts:
-                parts.append("【用户属性】" + "、".join(profile_parts))
-        
-        if self.facts:
-            fact_strs = [f.content for f in self.facts[:5]]
-            parts.append("【已知事实】" + "；".join(fact_strs))
-        
-        if self.preferences:
-            pref_parts = []
-            if self.preferences.tone:
-                pref_parts.append(f"语气偏好: {self.preferences.tone}")
-            if self.preferences.focus_subjects:
-                pref_parts.append(f"关注科目: {', '.join(self.preferences.focus_subjects)}")
-            if pref_parts:
-                parts.append("【用户偏好】" + "、".join(pref_parts))
-        
-        if self.last_summary:
-            parts.append(f"【上次会话摘要】{self.last_summary}")
-        
-        return "\n".join(parts) if parts else ""
-
-
-@dataclass
-class UpdateRequest:
-    """
-    [AC-IDMP-14] 记忆更新请求
-    Reference: openapi.deps.yaml - UpdateRequest
-    """
-    user_id: str
-    session_id: str
-    messages: list[dict[str, Any]]
-    summary: str | None = None
-
-    def to_dict(self) -> dict[str, Any]:
-        result = {
-            "user_id": self.user_id,
-            "session_id": self.session_id,
-            "messages": self.messages,
-        }
-        if self.summary:
-            result["summary"] = self.summary
-        return result
--- a/ai-service/app/models/mid/schemas.py
+++ b/ai-service/app/models/mid/schemas.py
@ -1,407 +0,0 @@
-"""
-Mid Platform schemas.
-[AC-IDMP-01, AC-IDMP-02, AC-IDMP-07, AC-IDMP-11, AC-IDMP-12, AC-IDMP-15, AC-IDMP-17, AC-IDMP-18, AC-IDMP-19, AC-IDMP-20]
-Aligned with spec/intent-driven-mid-platform/openapi.provider.yaml
-"""
-
-from __future__ import annotations
-
-import uuid
-from enum import Enum
-from typing import Any
-
-from pydantic import BaseModel, Field
-
-
-class ExecutionMode(str, Enum):
-    """[AC-IDMP-02] Execution mode for dialogue response."""
-    AGENT = "agent"
-    MICRO_FLOW = "micro_flow"
-    FIXED = "fixed"
-    TRANSFER = "transfer"
-
-
-class HighRiskScenario(str, Enum):
-    """[AC-IDMP-20] High risk scenario types for mandatory takeover."""
-    REFUND = "refund"
-    COMPLAINT_ESCALATION = "complaint_escalation"
-    PRIVACY_SENSITIVE_PROMISE = "privacy_sensitive_promise"
-    TRANSFER = "transfer"
-
-
-class ToolCallStatus(str, Enum):
-    """[AC-IDMP-15] Tool call status."""
-    OK = "ok"
-    TIMEOUT = "timeout"
-    ERROR = "error"
-    REJECTED = "rejected"
-
-
-class ToolType(str, Enum):
-    """[AC-IDMP-19] Tool type for registry governance."""
-    INTERNAL = "internal"
-    MCP = "mcp"
-
-
-class SessionMode(str, Enum):
-    """[AC-IDMP-09] Session mode for bot/human switching."""
-    BOT_ACTIVE = "BOT_ACTIVE"
-    HUMAN_ACTIVE = "HUMAN_ACTIVE"
-
-
-class HistoryMessage(BaseModel):
-    """[AC-IDMP-03] History message with only delivered content."""
-    role: str = Field(..., description="Message role: user, assistant, or human")
-    content: str = Field(..., description="Message content")
-
-
-class InterruptedSegment(BaseModel):
-    """[AC-IDMP-04] Interrupted segment for handling user interruption."""
-    segment_id: str = Field(..., description="Segment ID")
-    content: str = Field(..., description="Segment content")
-
-
-class FeatureFlags(BaseModel):
-    """[AC-IDMP-17] Feature flags for session-level grayscale and rollback."""
-    agent_enabled: bool | None = Field(default=True, description="Session-level Agent grayscale switch")
-    rollback_to_legacy: bool | None = Field(default=False, description="Force rollback to legacy pipeline")
-
-
-class HumanizeConfigRequest(BaseModel):
-    """[AC-MARH-11] 拟人化配置请求。"""
-    enabled: bool | None = Field(default=True, description="Enable humanize strategy")
-    min_delay_ms: int | None = Field(default=50, ge=0, description="Minimum delay in milliseconds")
-    max_delay_ms: int | None = Field(default=500, ge=0, description="Maximum delay in milliseconds")
-    length_bucket_strategy: str | None = Field(default="simple", description="Strategy: simple or semantic")
-
-
-class DialogueRequest(BaseModel):
-    """[AC-IDMP-01, AC-IDMP-03, AC-IDMP-04, AC-IDMP-17, AC-MARH-11] Dialogue request schema."""
-    session_id: str = Field(..., description="Session ID for conversation tracking")
-    user_id: str | None = Field(default=None, description="User ID for memory recall and update")
-    user_message: str = Field(..., min_length=1, max_length=2000, description="User message content")
-    history: list[HistoryMessage] = Field(default_factory=list, description="Only delivered history")
-    interrupted_segments: list[InterruptedSegment] | None = Field(default=None, description="Interrupted segments")
-    feature_flags: FeatureFlags | None = Field(default=None, description="Feature flags for grayscale control")
-    humanize_config: HumanizeConfigRequest | None = Field(
-        default=None, description="Humanize config for segment delay"
-    )
-
-
-class Segment(BaseModel):
-    """[AC-IDMP-01] Response segment with delay control."""
-    segment_id: str = Field(default_factory=lambda: str(uuid.uuid4()), description="Segment ID")
-    text: str = Field(..., description="Segment text content")
-    delay_after: int = Field(default=0, ge=0, description="Delay after this segment in milliseconds")
-
-
-class TimeoutProfile(BaseModel):
-    """[AC-MARH-08, AC-MARH-09] Timeout configuration profile."""
-    per_tool_timeout_ms: int = Field(default=30000, le=60000, description="Per-tool timeout in milliseconds")
-    llm_timeout_ms: int = Field(default=60000, le=120000, description="LLM call timeout in milliseconds")
-    end_to_end_timeout_ms: int = Field(default=120000, le=180000, description="End-to-end timeout in milliseconds")
-
-
-class MetricsSnapshot(BaseModel):
-    """[AC-IDMP-18] Runtime metrics snapshot."""
-    task_completion_rate: float | None = Field(default=None, ge=0.0, le=1.0, description="Task completion rate")
-    slot_completion_rate: float | None = Field(default=None, ge=0.0, le=1.0, description="Slot completion rate")
-    wrong_transfer_rate: float | None = Field(default=None, ge=0.0, le=1.0, description="Wrong transfer rate")
-    no_recall_rate: float | None = Field(default=None, ge=0.0, le=1.0, description="No recall rate")
-    avg_latency_ms: float | None = Field(default=None, ge=0.0, description="Average latency in milliseconds")
-
-
-class ToolCallTrace(BaseModel):
-    """[AC-IDMP-15, AC-IDMP-19] Tool call trace for observability."""
-    tool_name: str = Field(..., description="Tool name")
-    tool_type: ToolType | None = Field(default=ToolType.INTERNAL, description="Tool type: internal or mcp")
-    registry_version: str | None = Field(default=None, description="Tool registry version")
-    auth_applied: bool | None = Field(default=False, description="Whether auth was applied")
-    duration_ms: int = Field(..., ge=0, description="Duration in milliseconds")
-    status: ToolCallStatus = Field(..., description="Tool call status")
-    error_code: str | None = Field(default=None, description="Error code if failed")
-    args_digest: str | None = Field(default=None, description="Arguments digest for logging")
-    result_digest: str | None = Field(default=None, description="Result digest for logging")
-
-
-class SegmentStats(BaseModel):
-    """[AC-MARH-12] Segment statistics for humanize strategy."""
-    segment_count: int = Field(default=0, ge=0, description="Number of segments")
-    avg_segment_length: float = Field(default=0.0, ge=0.0, description="Average segment length")
-    humanize_strategy: str | None = Field(default=None, description="Humanize strategy used")
-
-
-class TraceInfo(BaseModel):
-    """[AC-MARH-02, AC-MARH-03, AC-MARH-05, AC-MARH-06, AC-MARH-07, AC-MARH-11,
-    AC-MARH-12, AC-MARH-18, AC-MARH-19, AC-MARH-20] Trace info for observability."""
-    mode: ExecutionMode = Field(..., description="Execution mode")
-    intent: str | None = Field(default=None, description="Matched intent")
-    request_id: str | None = Field(
-        default_factory=lambda: str(uuid.uuid4()), description="Request ID"
-    )
-    generation_id: str | None = Field(
-        default_factory=lambda: str(uuid.uuid4()),
-        description="Generation ID for interrupt handling",
-    )
-    guardrail_triggered: bool | None = Field(default=False, description="Whether guardrail was triggered")
-    guardrail_rule_id: str | None = Field(default=None, description="Guardrail rule ID that triggered")
-    interrupt_consumed: bool | None = Field(default=False, description="Whether interrupted segments were consumed")
-    kb_tool_called: bool | None = Field(default=False, description="Whether KB tool was called")
-    kb_hit: bool | None = Field(default=False, description="Whether KB search had results")
-    fallback_reason_code: str | None = Field(default=None, description="Fallback reason code")
-    react_iterations: int | None = Field(default=0, ge=0, le=5, description="ReAct loop iterations")
-    timeout_profile: TimeoutProfile | None = Field(default=None, description="Timeout profile")
-    segment_stats: SegmentStats | None = Field(default=None, description="Segment statistics")
-    metrics_snapshot: MetricsSnapshot | None = Field(default=None, description="Metrics snapshot")
-    high_risk_policy_set: list[HighRiskScenario] | None = Field(default=None, description="Active high-risk policy set")
-    tools_used: list[str] | None = Field(default=None, description="Tools used in this request")
-    tool_calls: list[ToolCallTrace] | None = Field(default=None, description="Tool call traces")
-
-
-class DialogueResponse(BaseModel):
-    """[AC-IDMP-01, AC-IDMP-02] Dialogue response with segments and trace."""
-    segments: list[Segment] = Field(..., description="Response segments")
-    trace: TraceInfo = Field(..., description="Trace info for observability")
-
-
-class ReportedMessage(BaseModel):
-    """[AC-IDMP-08] Reported message for message report API."""
-    role: str = Field(..., description="Message role: user, assistant, human, or system")
-    content: str = Field(..., description="Message content")
-    source: str = Field(..., description="Message source: bot, human, or channel")
-    timestamp: str = Field(..., description="Message timestamp in ISO format")
-    segment_id: str | None = Field(default=None, description="Segment ID if applicable")
-
-
-class MessageReportRequest(BaseModel):
-    """[AC-IDMP-08] Message report request schema."""
-    session_id: str = Field(..., description="Session ID")
-    messages: list[ReportedMessage] = Field(..., description="Messages to report")
-
-
-class SwitchModeRequest(BaseModel):
-    """[AC-IDMP-09] Switch session mode request."""
-    mode: SessionMode = Field(..., description="Target mode: BOT_ACTIVE or HUMAN_ACTIVE")
-    reason: str | None = Field(default=None, description="Reason for mode switch")
-
-
-class SwitchModeResponse(BaseModel):
-    """[AC-IDMP-09] Switch session mode response."""
-    session_id: str = Field(..., description="Session ID")
-    mode: SessionMode = Field(..., description="Current mode after switch")
-
-
-class MidSessionState(BaseModel):
-    """Internal session state for mid platform."""
-    session_id: str
-    tenant_id: str
-    mode: SessionMode = SessionMode.BOT_ACTIVE
-    generation_id: str = Field(default_factory=lambda: str(uuid.uuid4()))
-    active_flow_id: str | None = None
-    context: dict[str, Any] | None = None
-    created_at: str | None = None
-    updated_at: str | None = None
-
-
-class PolicyRouterResult(BaseModel):
-    """[AC-IDMP-02, AC-IDMP-05, AC-IDMP-16] Policy router decision result."""
-    mode: ExecutionMode = Field(..., description="Decided execution mode")
-    intent: str | None = Field(default=None, description="Matched intent")
-    confidence: float | None = Field(default=None, ge=0.0, le=1.0, description="Intent confidence")
-    fallback_reason_code: str | None = Field(default=None, description="Fallback reason if applicable")
-    high_risk_triggered: bool = Field(default=False, description="Whether high-risk scenario triggered")
-    target_flow_id: str | None = Field(default=None, description="Target flow ID for micro_flow mode")
-    fixed_reply: str | None = Field(default=None, description="Fixed reply for fixed mode")
-    transfer_message: str | None = Field(default=None, description="Transfer message for transfer mode")
-
-
-class ReActContext(BaseModel):
-    """[AC-IDMP-11] ReAct loop context for iteration control."""
-    iteration: int = Field(default=0, ge=0, le=5, description="Current iteration count")
-    max_iterations: int = Field(default=5, ge=3, le=5, description="Maximum iterations allowed")
-    tool_calls: list[ToolCallTrace] = Field(default_factory=list, description="Tool call history")
-    should_continue: bool = Field(default=True, description="Whether to continue ReAct loop")
-    final_answer: str | None = Field(default=None, description="Final answer if completed")
-
-
-class CreateShareRequest(BaseModel):
-    """[AC-IDMP-SHARE] Request to create a shared session."""
-    title: str | None = Field(default=None, max_length=255, description="Share title")
-    description: str | None = Field(default=None, max_length=1000, description="Share description")
-    expires_in_days: int = Field(default=7, ge=1, le=365, description="Expiration time in days")
-    max_concurrent_users: int = Field(default=10, ge=1, le=100, description="Maximum concurrent users")
-
-
-class ShareResponse(BaseModel):
-    """[AC-IDMP-SHARE] Response after creating a share."""
-    share_token: str = Field(..., description="Unique share token")
-    share_url: str = Field(..., description="Full share URL")
-    expires_at: str = Field(..., description="Expiration time in ISO format")
-    title: str | None = Field(default=None, description="Share title")
-    description: str | None = Field(default=None, description="Share description")
-    max_concurrent_users: int = Field(..., description="Maximum concurrent users")
-
-
-class SharedSessionInfo(BaseModel):
-    """[AC-IDMP-SHARE] Information about a shared session."""
-    session_id: str = Field(..., description="Session ID")
-    title: str | None = Field(default=None, description="Share title")
-    description: str | None = Field(default=None, description="Share description")
-    expires_at: str = Field(..., description="Expiration time in ISO format")
-    max_concurrent_users: int = Field(..., description="Maximum concurrent users")
-    current_users: int = Field(..., description="Current online users")
-    history: list[HistoryMessage] = Field(default_factory=list, description="Historical messages")
-
-
-class SharedMessageRequest(BaseModel):
-    """[AC-IDMP-SHARE] Request to send a message via shared session."""
-    user_message: str = Field(..., min_length=1, max_length=2000, description="User message content")
-
-
-class ShareListItem(BaseModel):
-    """[AC-IDMP-SHARE] Share list item for listing all shares of a session."""
-    share_token: str = Field(..., description="Share token")
-    share_url: str = Field(..., description="Full share URL")
-    title: str | None = Field(default=None, description="Share title")
-    description: str | None = Field(default=None, description="Share description")
-    expires_at: str = Field(..., description="Expiration time in ISO format")
-    is_active: bool = Field(..., description="Whether share is active")
-    max_concurrent_users: int = Field(..., description="Maximum concurrent users")
-    current_users: int = Field(..., description="Current online users")
-    created_at: str = Field(..., description="Creation time in ISO format")
-
-
-class ShareListResponse(BaseModel):
-    """[AC-IDMP-SHARE] Response for listing shares."""
-    shares: list[ShareListItem] = Field(..., description="List of shares")
-
-
-class KbSearchDynamicHit(BaseModel):
-    """[AC-MARH-05] Single KB search hit."""
-    id: str = Field(..., description="Hit ID")
-    content: str = Field(..., description="Hit content")
-    score: float = Field(..., ge=0.0, le=1.0, description="Relevance score")
-    metadata: dict[str, Any] = Field(default_factory=dict, description="Hit metadata")
-
-
-class MissingRequiredSlot(BaseModel):
-    """[AC-MARH-05] Missing required slot info."""
-    field_key: str = Field(..., description="Field key")
-    label: str = Field(..., description="Field label")
-    reason: str = Field(..., description="Missing reason")
-
-
-class KbSearchDynamicResultSchema(BaseModel):
-    """[AC-MARH-05, AC-MARH-06] KB dynamic search result schema."""
-    success: bool = Field(..., description="Whether search succeeded")
-    hits: list[KbSearchDynamicHit] = Field(default_factory=list, description="Search hits")
-    applied_filter: dict[str, Any] = Field(default_factory=dict, description="Applied filter")
-    missing_required_slots: list[MissingRequiredSlot] = Field(
-        default_factory=list, description="Missing required slots"
-    )
-    filter_debug: dict[str, Any] = Field(default_factory=dict, description="Filter debug info")
-    fallback_reason_code: str | None = Field(default=None, description="Fallback reason code")
-    duration_ms: int = Field(default=0, ge=0, description="Duration in milliseconds")
-
-
-class IntentHintOutput(BaseModel):
-    """[AC-IDMP-02, AC-IDMP-16] 轻量意图提示工具输出。"""
-    intent: str | None = Field(default=None, description="识别到的意图名称")
-    confidence: float = Field(default=0.0, ge=0.0, le=1.0, description="置信度 0~1")
-    response_type: str | None = Field(
-        default=None,
-        description="响应类型: fixed|rag|flow|transfer|null"
-    )
-    suggested_mode: ExecutionMode | None = Field(
-        default=None,
-        description="建议执行模式: agent|micro_flow|fixed|transfer"
-    )
-    target_flow_id: str | None = Field(default=None, description="目标流程ID（flow模式）")
-    target_kb_ids: list[str] | None = Field(default=None, description="目标知识库ID列表")
-    fallback_reason_code: str | None = Field(default=None, description="降级原因码")
-    high_risk_detected: bool = Field(default=False, description="是否检测到高风险场景")
-    duration_ms: int = Field(default=0, ge=0, description="执行耗时（毫秒）")
-
-
-class HighRiskCheckResult(BaseModel):
-    """[AC-IDMP-05, AC-IDMP-20] 高风险检测工具输出。"""
-    matched: bool = Field(default=False, description="是否命中高风险场景")
-    risk_scenario: HighRiskScenario | None = Field(
-        default=None,
-        description="风险场景: refund|complaint_escalation|privacy_sensitive_promise|transfer|none"
-    )
-    confidence: float = Field(default=0.0, ge=0.0, le=1.0, description="置信度 0~1")
-    recommended_mode: ExecutionMode | None = Field(
-        default=None,
-        description="推荐执行模式: micro_flow|transfer|agent"
-    )
-    rule_id: str | None = Field(default=None, description="匹配的规则ID")
-    reason: str | None = Field(default=None, description="匹配原因说明")
-    fallback_reason_code: str | None = Field(default=None, description="降级原因码（工具失败时）")
-    duration_ms: int = Field(default=0, ge=0, description="执行耗时（毫秒）")
-    matched_text: str | None = Field(default=None, description="匹配到的文本片段")
-    matched_pattern: str | None = Field(default=None, description="匹配到的模式（关键词或正则）")
-
-
-class SlotSource(str, Enum):
-    """[AC-IDMP-13] 槽位来源类型。"""
-    USER_CONFIRMED = "user_confirmed"
-    RULE_EXTRACTED = "rule_extracted"
-    LLM_INFERRED = "llm_inferred"
-    DEFAULT = "default"
-
-
-class MemorySlot(BaseModel):
-    """[AC-IDMP-13] 单个槽位信息。"""
-    key: str = Field(..., description="槽位键名")
-    value: Any = Field(..., description="槽位值")
-    source: SlotSource = Field(default=SlotSource.DEFAULT, description="槽位来源")
-    confidence: float = Field(default=1.0, ge=0.0, le=1.0, description="置信度")
-    updated_at: str | None = Field(default=None, description="最后更新时间")
-
-
-class MemoryRecallResult(BaseModel):
-    """[AC-IDMP-13] 记忆召回工具输出。"""
-    profile: dict[str, Any] = Field(default_factory=dict, description="用户基础属性")
-    facts: list[str] = Field(default_factory=list, description="事实型记忆列表")
-    preferences: dict[str, Any] = Field(default_factory=dict, description="用户偏好")
-    last_summary: str | None = Field(default=None, description="最近会话摘要")
-    slots: dict[str, MemorySlot] = Field(default_factory=dict, description="结构化槽位")
-    missing_slots: list[str] = Field(default_factory=list, description="缺失的必填槽位")
-    fallback_reason_code: str | None = Field(default=None, description="降级原因码")
-    duration_ms: int = Field(default=0, ge=0, description="执行耗时（毫秒）")
-
-    def get_context_for_prompt(self) -> str:
-        """生成用于注入 Prompt 的上下文字符串。"""
-        parts = []
-
-        if self.profile:
-            profile_parts = []
-            for key, value in self.profile.items():
-                if value:
-                    profile_parts.append(f"{key}: {value}")
-            if profile_parts:
-                parts.append("【用户属性】" + "、".join(profile_parts))
-
-        if self.facts:
-            parts.append("【已知事实】" + "；".join(self.facts[:5]))
-
-        if self.preferences:
-            pref_parts = []
-            for key, value in self.preferences.items():
-                if value:
-                    pref_parts.append(f"{key}: {value}")
-            if pref_parts:
-                parts.append("【用户偏好】" + "、".join(pref_parts))
-
-        if self.last_summary:
-            parts.append(f"【上次会话摘要】{self.last_summary}")
-
-        if self.slots:
-            slot_parts = []
-            for key, slot in self.slots.items():
-                slot_parts.append(f"{key}={slot.value}")
-            if slot_parts:
-                parts.append("【已知槽位】" + ", ".join(slot_parts))
-
-        return "\n".join(parts) if parts else ""
--- a/ai-service/app/models/mid/tool_registry.py
+++ b/ai-service/app/models/mid/tool_registry.py
@ -1,222 +0,0 @@
-"""
-Tool Registry models for Mid Platform.
-[AC-IDMP-19] Tool Registry 治理模型
-
-Reference: spec/intent-driven-mid-platform/openapi.provider.yaml
-"""
-
-import uuid
-from dataclasses import dataclass, field
-from datetime import datetime
-from enum import Enum
-from typing import Any
-
-from sqlalchemy import JSON, Column
-from sqlmodel import Field, Index, SQLModel
-
-
-class ToolStatus(str, Enum):
-    """工具状态"""
-    ENABLED = "enabled"
-    DISABLED = "disabled"
-    DEPRECATED = "deprecated"
-
-
-class ToolAuthType(str, Enum):
-    """工具鉴权类型"""
-    NONE = "none"
-    API_KEY = "api_key"
-    OAUTH = "oauth"
-    CUSTOM = "custom"
-
-
-@dataclass
-class ToolAuthConfig:
-    """
-    [AC-IDMP-19] 工具鉴权配置
-    """
-    auth_type: ToolAuthType = ToolAuthType.NONE
-    required_scopes: list[str] = field(default_factory=list)
-    api_key_header: str | None = None
-    oauth_url: str | None = None
-    custom_validator: str | None = None
-
-    def to_dict(self) -> dict[str, Any]:
-        result = {"auth_type": self.auth_type.value}
-        if self.required_scopes:
-            result["required_scopes"] = self.required_scopes
-        if self.api_key_header:
-            result["api_key_header"] = self.api_key_header
-        if self.oauth_url:
-            result["oauth_url"] = self.oauth_url
-        if self.custom_validator:
-            result["custom_validator"] = self.custom_validator
-        return result
-
-    @classmethod
-    def from_dict(cls, data: dict[str, Any]) -> "ToolAuthConfig":
-        return cls(
-            auth_type=ToolAuthType(data.get("auth_type", "none")),
-            required_scopes=data.get("required_scopes", []),
-            api_key_header=data.get("api_key_header"),
-            oauth_url=data.get("oauth_url"),
-            custom_validator=data.get("custom_validator"),
-        )
-
-
-@dataclass
-class ToolTimeoutPolicy:
-    """
-    [AC-IDMP-19] 工具超时策略
-    Reference: openapi.provider.yaml - TimeoutProfile
-    """
-    per_tool_timeout_ms: int = 30000
-    end_to_end_timeout_ms: int = 120000
-    retry_count: int = 0
-    retry_delay_ms: int = 100
-
-    def to_dict(self) -> dict[str, Any]:
-        return {
-            "per_tool_timeout_ms": self.per_tool_timeout_ms,
-            "end_to_end_timeout_ms": self.end_to_end_timeout_ms,
-            "retry_count": self.retry_count,
-            "retry_delay_ms": self.retry_delay_ms,
-        }
-
-    @classmethod
-    def from_dict(cls, data: dict[str, Any]) -> "ToolTimeoutPolicy":
-        return cls(
-            per_tool_timeout_ms=data.get("per_tool_timeout_ms", 30000),
-            end_to_end_timeout_ms=data.get("end_to_end_timeout_ms", 120000),
-            retry_count=data.get("retry_count", 0),
-            retry_delay_ms=data.get("retry_delay_ms", 100),
-        )
-
-
-@dataclass
-class ToolDefinition:
-    """
-    [AC-IDMP-19] 工具定义（内存模型）
-    
-    包含：
-    - name: 工具名称
-    - type: 工具类型 (internal | mcp)
-    - version: 版本号
-    - timeout_policy: 超时策略
-    - auth_config: 鉴权配置
-    - is_enabled: 启停状态
-    """
-    name: str
-    type: str = "internal"
-    version: str = "1.0.0"
-    description: str | None = None
-    timeout_policy: ToolTimeoutPolicy = field(default_factory=ToolTimeoutPolicy)
-    auth_config: ToolAuthConfig = field(default_factory=ToolAuthConfig)
-    is_enabled: bool = True
-    metadata: dict[str, Any] = field(default_factory=dict)
-    created_at: datetime = field(default_factory=datetime.utcnow)
-    updated_at: datetime = field(default_factory=datetime.utcnow)
-
-    def to_dict(self) -> dict[str, Any]:
-        return {
-            "name": self.name,
-            "type": self.type,
-            "version": self.version,
-            "description": self.description,
-            "timeout_policy": self.timeout_policy.to_dict(),
-            "auth_config": self.auth_config.to_dict(),
-            "is_enabled": self.is_enabled,
-            "metadata": self.metadata,
-            "created_at": self.created_at.isoformat(),
-            "updated_at": self.updated_at.isoformat(),
-        }
-
-    @classmethod
-    def from_dict(cls, data: dict[str, Any]) -> "ToolDefinition":
-        return cls(
-            name=data["name"],
-            type=data.get("type", "internal"),
-            version=data.get("version", "1.0.0"),
-            description=data.get("description"),
-            timeout_policy=ToolTimeoutPolicy.from_dict(data.get("timeout_policy", {})),
-            auth_config=ToolAuthConfig.from_dict(data.get("auth_config", {})),
-            is_enabled=data.get("is_enabled", True),
-            metadata=data.get("metadata", {}),
-            created_at=datetime.fromisoformat(data["created_at"]) if data.get("created_at") else datetime.utcnow(),
-            updated_at=datetime.fromisoformat(data["updated_at"]) if data.get("updated_at") else datetime.utcnow(),
-        )
-
-
-class ToolRegistryEntity(SQLModel, table=True):
-    """
-    [AC-IDMP-19] 工具注册表数据库实体
-    支持动态配置更新
-    """
-    __tablename__ = "tool_registry"
-    __table_args__ = (
-        Index("ix_tool_registry_tenant_name", "tenant_id", "name", unique=True),
-        Index("ix_tool_registry_tenant_enabled", "tenant_id", "is_enabled"),
-    )
-
-    id: uuid.UUID = Field(default_factory=uuid.uuid4, primary_key=True)
-    tenant_id: str = Field(..., description="租户ID", index=True)
-    name: str = Field(..., description="工具名称", max_length=128)
-    type: str = Field(default="internal", description="工具类型: internal | mcp")
-    version: str = Field(default="1.0.0", description="版本号", max_length=32)
-    description: str | None = Field(default=None, description="工具描述")
-    timeout_policy: dict[str, Any] | None = Field(
-        default=None,
-        sa_column=Column("timeout_policy", JSON, nullable=True),
-        description="超时策略配置"
-    )
-    auth_config: dict[str, Any] | None = Field(
-        default=None,
-        sa_column=Column("auth_config", JSON, nullable=True),
-        description="鉴权配置"
-    )
-    is_enabled: bool = Field(default=True, description="是否启用")
-    metadata_: dict[str, Any] | None = Field(
-        default=None,
-        sa_column=Column("metadata", JSON, nullable=True),
-        description="扩展元数据"
-    )
-    created_at: datetime = Field(default_factory=datetime.utcnow, description="创建时间")
-    updated_at: datetime = Field(default_factory=datetime.utcnow, description="更新时间")
-
-    def to_definition(self) -> ToolDefinition:
-        """转换为内存模型"""
-        return ToolDefinition(
-            name=self.name,
-            type=self.type,
-            version=self.version,
-            description=self.description,
-            timeout_policy=ToolTimeoutPolicy.from_dict(self.timeout_policy or {}),
-            auth_config=ToolAuthConfig.from_dict(self.auth_config or {}),
-            is_enabled=self.is_enabled,
-            metadata=self.metadata_ or {},
-            created_at=self.created_at,
-            updated_at=self.updated_at,
-        )
-
-
-class ToolRegistryCreate(SQLModel):
-    """创建工具注册请求"""
-    name: str = Field(..., max_length=128)
-    type: str = "internal"
-    version: str = "1.0.0"
-    description: str | None = None
-    timeout_policy: dict[str, Any] | None = None
-    auth_config: dict[str, Any] | None = None
-    is_enabled: bool = True
-    metadata_: dict[str, Any] | None = None
-
-
-class ToolRegistryUpdate(SQLModel):
-    """更新工具注册请求"""
-    type: str | None = None
-    version: str | None = None
-    description: str | None = None
-    timeout_policy: dict[str, Any] | None = None
-    auth_config: dict[str, Any] | None = None
-    is_enabled: bool | None = None
-    metadata_: dict[str, Any] | None = None
--- a/ai-service/app/models/mid/tool_trace.py
+++ b/ai-service/app/models/mid/tool_trace.py
@ -1,174 +0,0 @@
-"""
-Tool trace models for Mid Platform.
-[AC-IDMP-15] 工具调用结构化记录
-
-Reference: spec/intent-driven-mid-platform/openapi.provider.yaml - ToolCallTrace
-"""
-
-import hashlib
-import json
-from dataclasses import dataclass, field
-from datetime import datetime
-from enum import Enum
-from typing import Any
-
-
-class ToolCallStatus(str, Enum):
-    """工具调用状态"""
-    OK = "ok"
-    TIMEOUT = "timeout"
-    ERROR = "error"
-    REJECTED = "rejected"
-
-
-class ToolType(str, Enum):
-    """工具类型"""
-    INTERNAL = "internal"
-    MCP = "mcp"
-
-
-@dataclass
-class ToolCallTrace:
-    """
-    [AC-IDMP-15] 工具调用追踪记录
-    Reference: openapi.provider.yaml - ToolCallTrace
-    
-    记录字段：
-    - tool_name: 工具名称
-    - tool_type: 工具类型 (internal | mcp)
-    - registry_version: 注册表版本
-    - auth_applied: 是否应用鉴权
-    - duration_ms: 调用耗时（毫秒）
-    - status: 调用状态 (ok | timeout | error | rejected)
-    - error_code: 错误码
-    - args_digest: 参数摘要（脱敏）
-    - result_digest: 结果摘要
-    """
-    tool_name: str
-    duration_ms: int
-    status: ToolCallStatus
-    tool_type: ToolType = ToolType.INTERNAL
-    registry_version: str | None = None
-    auth_applied: bool = False
-    error_code: str | None = None
-    args_digest: str | None = None
-    result_digest: str | None = None
-    started_at: datetime = field(default_factory=datetime.utcnow)
-    completed_at: datetime | None = None
-
-    def to_dict(self) -> dict[str, Any]:
-        result = {
-            "tool_name": self.tool_name,
-            "duration_ms": self.duration_ms,
-            "status": self.status.value,
-        }
-        if self.tool_type != ToolType.INTERNAL:
-            result["tool_type"] = self.tool_type.value
-        if self.registry_version:
-            result["registry_version"] = self.registry_version
-        if self.auth_applied:
-            result["auth_applied"] = self.auth_applied
-        if self.error_code:
-            result["error_code"] = self.error_code
-        if self.args_digest:
-            result["args_digest"] = self.args_digest
-        if self.result_digest:
-            result["result_digest"] = self.result_digest
-        return result
-
-    @staticmethod
-    def compute_digest(data: Any, max_length: int = 64) -> str:
-        """
-        计算数据摘要（用于脱敏记录）
-        
-        Args:
-            data: 原始数据
-            max_length: 最大长度限制
-            
-        Returns:
-            摘要字符串
-        """
-        if data is None:
-            return ""
-        
-        if isinstance(data, (dict, list)):
-            data_str = json.dumps(data, ensure_ascii=False, sort_keys=True)
-        else:
-            data_str = str(data)
-        
-        if len(data_str) <= max_length:
-            return data_str
-        
-        hash_value = hashlib.sha256(data_str.encode("utf-8")).hexdigest()[:16]
-        preview = data_str[:32]
-        return f"{preview}...[hash:{hash_value}]"
-
-
-@dataclass
-class ToolCallBuilder:
-    """
-    [AC-IDMP-15] 工具调用记录构建器
-    用于在工具执行过程中逐步构建追踪记录
-    """
-    tool_name: str
-    tool_type: ToolType = ToolType.INTERNAL
-    registry_version: str | None = None
-    auth_applied: bool = False
-    _started_at: datetime = field(default_factory=datetime.utcnow)
-    _args: Any = None
-    _result: Any = None
-    _error: Exception | None = None
-    _status: ToolCallStatus = ToolCallStatus.OK
-    _error_code: str | None = None
-
-    def with_args(self, args: Any) -> "ToolCallBuilder":
-        """设置调用参数"""
-        self._args = args
-        return self
-
-    def with_registry_info(self, version: str, auth_applied: bool) -> "ToolCallBuilder":
-        """设置注册表信息"""
-        self.registry_version = version
-        self.auth_applied = auth_applied
-        return self
-
-    def with_result(self, result: Any) -> "ToolCallBuilder":
-        """设置调用结果"""
-        self._result = result
-        self._status = ToolCallStatus.OK
-        return self
-
-    def with_error(self, error: Exception, error_code: str | None = None) -> "ToolCallBuilder":
-        """设置错误信息"""
-        self._error = error
-        self._error_code = error_code
-        if isinstance(error, TimeoutError):
-            self._status = ToolCallStatus.TIMEOUT
-        else:
-            self._status = ToolCallStatus.ERROR
-        return self
-
-    def with_rejected(self, reason: str) -> "ToolCallBuilder":
-        """设置拒绝状态"""
-        self._status = ToolCallStatus.REJECTED
-        self._error_code = reason
-        return self
-
-    def build(self) -> ToolCallTrace:
-        """构建追踪记录"""
-        completed_at = datetime.utcnow()
-        duration_ms = int((completed_at - self._started_at).total_seconds() * 1000)
-        
-        return ToolCallTrace(
-            tool_name=self.tool_name,
-            tool_type=self.tool_type,
-            registry_version=self.registry_version,
-            auth_applied=self.auth_applied,
-            duration_ms=duration_ms,
-            status=self._status,
-            error_code=self._error_code,
-            args_digest=ToolCallTrace.compute_digest(self._args) if self._args else None,
-            result_digest=ToolCallTrace.compute_digest(self._result) if self._result else None,
-            started_at=self._started_at,
-            completed_at=completed_at,
-        )
--- a/ai-service/app/schemas/init.py
+++ b/ai-service/app/schemas/init.py
@ -1,39 +0,0 @@
-"""
-Schemas package for API request/response models.
-"""
-
-from app.schemas.metadata import (
-    FieldRole,
-    ExtractStrategy,
-    SlotValueSource,
-    MetadataFieldDefinitionResponse,
-    MetadataFieldDefinitionCreateRequest,
-    MetadataFieldDefinitionUpdateRequest,
-    SlotDefinitionResponse,
-    SlotDefinitionCreateRequest,
-    SlotDefinitionUpdateRequest,
-    SlotValueResponse,
-    SlotWithFieldDefinitionResponse,
-    GetFieldsByRoleQuery,
-    GetSlotsByRoleQuery,
-    InvalidRoleErrorResponse,
-    VALID_FIELD_ROLES,
-)
-
-__all__ = [
-    "FieldRole",
-    "ExtractStrategy",
-    "SlotValueSource",
-    "MetadataFieldDefinitionResponse",
-    "MetadataFieldDefinitionCreateRequest",
-    "MetadataFieldDefinitionUpdateRequest",
-    "SlotDefinitionResponse",
-    "SlotDefinitionCreateRequest",
-    "SlotDefinitionUpdateRequest",
-    "SlotValueResponse",
-    "SlotWithFieldDefinitionResponse",
-    "GetFieldsByRoleQuery",
-    "GetSlotsByRoleQuery",
-    "InvalidRoleErrorResponse",
-    "VALID_FIELD_ROLES",
-]
--- a/ai-service/app/schemas/metadata.py
+++ b/ai-service/app/schemas/metadata.py
@ -1,242 +0,0 @@
-"""
-Metadata schemas for API request/response.
-[AC-MRS-01, AC-MRS-07] 元数据职责分层相关的 Pydantic Schema
-"""
-
-from __future__ import annotations
-
-from datetime import datetime
-from enum import Enum
-from typing import Any
-
-from pydantic import BaseModel, Field
-
-
-class FieldRole(str, Enum):
-    """
-    [AC-MRS-01] 字段角色枚举
-    用于标识元数据字段的职责分层
-    """
-    RESOURCE_FILTER = "resource_filter"
-    SLOT = "slot"
-    PROMPT_VAR = "prompt_var"
-    ROUTING_SIGNAL = "routing_signal"
-
-
-class ExtractStrategy(str, Enum):
-    """
-    [AC-MRS-07] 槽位值提取策略
-    """
-    RULE = "rule"
-    LLM = "llm"
-    USER_INPUT = "user_input"
-
-
-class SlotValueSource(str, Enum):
-    """
-    [AC-MRS-09] 槽位值来源
-    """
-    USER_CONFIRMED = "user_confirmed"
-    RULE_EXTRACTED = "rule_extracted"
-    LLM_INFERRED = "llm_inferred"
-    DEFAULT = "default"
-
-
-VALID_FIELD_ROLES = [role.value for role in FieldRole]
-
-
-class MetadataFieldDefinitionResponse(BaseModel):
-    """[AC-MRS-01] 元数据字段定义响应"""
-
-    id: str = Field(..., description="字段定义 ID")
-    field_key: str = Field(..., description="字段键名")
-    label: str = Field(..., description="字段显示名称")
-    type: str = Field(..., description="字段类型")
-    required: bool = Field(default=False, description="是否必填")
-    options: list[str] | None = Field(default=None, description="选项列表")
-    default_value: Any | None = Field(default=None, description="默认值")
-    scope: list[str] = Field(default_factory=list, description="适用范围")
-    is_filterable: bool = Field(default=True, description="是否可用于过滤")
-    is_rank_feature: bool = Field(default=False, description="是否用于排序特征")
-    field_roles: list[str] = Field(
-        default_factory=list,
-        description="[AC-MRS-01] 字段角色列表"
-    )
-    status: str = Field(..., description="字段状态")
-    version: int = Field(default=1, description="版本号")
-    created_at: datetime = Field(..., description="创建时间")
-    updated_at: datetime = Field(..., description="更新时间")
-
-    class Config:
-        from_attributes = True
-
-
-class MetadataFieldDefinitionCreateRequest(BaseModel):
-    """[AC-MRS-01,02,03] 创建元数据字段定义请求"""
-
-    field_key: str = Field(..., min_length=1, max_length=64, description="字段键名")
-    label: str = Field(..., min_length=1, max_length=64, description="字段显示名称")
-    type: str = Field(default="string", description="字段类型")
-    required: bool = Field(default=False, description="是否必填")
-    options: list[str] | None = Field(default=None, description="选项列表")
-    default_value: Any | None = Field(default=None, description="默认值")
-    scope: list[str] = Field(
-        default_factory=lambda: ["kb_document"],
-        description="适用范围"
-    )
-    is_filterable: bool = Field(default=True, description="是否可用于过滤")
-    is_rank_feature: bool = Field(default=False, description="是否用于排序特征")
-    field_roles: list[str] = Field(
-        default_factory=list,
-        description="[AC-MRS-01] 字段角色列表，可选值: resource_filter, slot, prompt_var, routing_signal"
-    )
-    status: str = Field(default="draft", description="字段状态")
-
-    def validate_field_roles(self) -> list[str]:
-        """验证 field_roles 中的角色值是否有效"""
-        invalid_roles = [r for r in self.field_roles if r not in VALID_FIELD_ROLES]
-        if invalid_roles:
-            raise ValueError(
-                f"Invalid field_roles: {invalid_roles}. "
-                f"Valid roles are: {VALID_FIELD_ROLES}"
-            )
-        return self.field_roles
-
-
-class MetadataFieldDefinitionUpdateRequest(BaseModel):
-    """[AC-MRS-01] 更新元数据字段定义请求"""
-
-    label: str | None = Field(default=None, min_length=1, max_length=64)
-    required: bool | None = None
-    options: list[str] | None = None
-    default_value: Any | None = None
-    scope: list[str] | None = None
-    is_filterable: bool | None = None
-    is_rank_feature: bool | None = None
-    field_roles: list[str] | None = Field(
-        default=None,
-        description="[AC-MRS-01] 字段角色列表"
-    )
-    status: str | None = None
-
-    def validate_field_roles(self) -> list[str] | None:
-        """验证 field_roles 中的角色值是否有效"""
-        if self.field_roles is None:
-            return None
-        invalid_roles = [r for r in self.field_roles if r not in VALID_FIELD_ROLES]
-        if invalid_roles:
-            raise ValueError(
-                f"Invalid field_roles: {invalid_roles}. "
-                f"Valid roles are: {VALID_FIELD_ROLES}"
-            )
-        return self.field_roles
-
-
-class SlotDefinitionResponse(BaseModel):
-    """[AC-MRS-07,08] 槽位定义响应"""
-
-    id: str = Field(..., description="槽位定义 ID")
-    slot_key: str = Field(..., description="槽位键名")
-    type: str = Field(..., description="槽位类型")
-    required: bool = Field(default=False, description="是否必填槽位")
-    extract_strategy: str | None = Field(default=None, description="提取策略")
-    validation_rule: str | None = Field(default=None, description="校验规则")
-    ask_back_prompt: str | None = Field(default=None, description="追问提示语模板")
-    default_value: dict[str, Any] | None = Field(default=None, description="默认值")
-    linked_field_id: str | None = Field(default=None, description="关联的元数据字段 ID")
-    created_at: datetime = Field(..., description="创建时间")
-    updated_at: datetime = Field(..., description="更新时间")
-
-    class Config:
-        from_attributes = True
-
-
-class SlotDefinitionCreateRequest(BaseModel):
-    """[AC-MRS-07,08] 创建槽位定义请求"""
-
-    slot_key: str = Field(..., min_length=1, max_length=100, description="槽位键名")
-    type: str = Field(default="string", description="槽位类型")
-    required: bool = Field(default=False, description="是否必填槽位")
-    extract_strategy: str | None = Field(
-        default=None,
-        description="提取策略: rule/llm/user_input"
-    )
-    validation_rule: str | None = Field(default=None, description="校验规则")
-    ask_back_prompt: str | None = Field(default=None, description="追问提示语模板")
-    default_value: dict[str, Any] | None = Field(default=None, description="默认值")
-    linked_field_id: str | None = Field(default=None, description="关联的元数据字段 ID")
-
-
-class SlotDefinitionUpdateRequest(BaseModel):
-    """[AC-MRS-07] 更新槽位定义请求"""
-
-    type: str | None = None
-    required: bool | None = None
-    extract_strategy: str | None = None
-    validation_rule: str | None = None
-    ask_back_prompt: str | None = None
-    default_value: dict[str, Any] | None = None
-    linked_field_id: str | None = None
-
-
-class SlotValueResponse(BaseModel):
-    """[AC-MRS-09] 运行时槽位值响应"""
-
-    key: str = Field(..., description="槽位键名")
-    value: Any = Field(..., description="槽位值")
-    source: str = Field(
-        default="default",
-        description="来源: user_confirmed/rule_extracted/llm_inferred/default"
-    )
-    confidence: float = Field(
-        default=1.0,
-        ge=0.0,
-        le=1.0,
-        description="置信度 0.0~1.0"
-    )
-    updated_at: datetime = Field(..., description="最后更新时间")
-
-
-class SlotWithFieldDefinitionResponse(BaseModel):
-    """[AC-MRS-10] 槽位定义与关联字段定义响应"""
-
-    slot_definition: SlotDefinitionResponse | None = Field(
-        default=None,
-        description="槽位定义"
-    )
-    field_definition: MetadataFieldDefinitionResponse | None = Field(
-        default=None,
-        description="关联的元数据字段定义"
-    )
-
-
-class GetFieldsByRoleQuery(BaseModel):
-    """[AC-MRS-04,05] 按角色查询字段定义请求"""
-
-    role: str = Field(..., description="字段角色")
-
-    def validate_role(self) -> str:
-        """验证角色值是否有效"""
-        if self.role not in VALID_FIELD_ROLES:
-            raise ValueError(
-                f"[AC-MRS-05] Invalid role '{self.role}'. "
-                f"Valid roles are: {VALID_FIELD_ROLES}"
-            )
-        return self.role
-
-
-class GetSlotsByRoleQuery(BaseModel):
-    """[AC-MRS-10] 按角色获取槽位定义请求"""
-
-    role: str = Field(default="slot", description="字段角色，默认为 slot")
-
-
-class InvalidRoleErrorResponse(BaseModel):
-    """[AC-MRS-05] 无效角色错误响应"""
-
-    error: str = Field(default="INVALID_ROLE", description="错误码")
-    message: str = Field(..., description="错误消息")
-    valid_roles: list[str] = Field(
-        default_factory=lambda: VALID_FIELD_ROLES,
-        description="有效的角色列表"
-    )
--- a/ai-service/app/services/api_key.py
+++ b/ai-service/app/services/api_key.py
@ -3,13 +3,9 @@ API Key management service.
 [AC-AISVC-50] Lightweight authentication with in-memory cache.
 """

-from __future__ import annotations
-
 import logging
 import secrets
-from collections import deque
-from dataclasses import dataclass, field
-from datetime import datetime, timedelta
+from datetime import datetime
 from typing import Optional

 from sqlalchemy import select
@ -20,25 +16,6 @@ from app.models.entities import ApiKey, ApiKeyCreate
 logger = logging.getLogger(__name__)


-@dataclass
-class CachedApiKeyMeta:
-    """Cached metadata for API key policy checks."""
-
-    is_active: bool
-    expires_at: datetime | None
-    allowed_ips: set[str] = field(default_factory=set)
-    rate_limit_qpm: int = 60
-
-
-@dataclass
-class ValidationResult:
-    """Validation output for middleware auth + policy checks."""
-
-    ok: bool
-    reason: str | None = None
-    rate_limit_qpm: int = 60
-
-
 class ApiKeyService:
    """
    [AC-AISVC-50] API Key management service.
@ -51,8 +28,6 @@ class ApiKeyService:

    def __init__(self):
        self._keys_cache: set[str] = set()
-        self._key_meta: dict[str, CachedApiKeyMeta] = {}
-        self._rate_buckets: dict[str, deque[datetime]] = {}
        self._initialized: bool = False

    async def initialize(self, session: AsyncSession) -> None:
@ -60,50 +35,15 @@ class ApiKeyService:
        Load all active API keys from database into memory.
        Should be called on application startup.
        """
-        try:
-            result = await session.execute(
-                select(ApiKey).where(ApiKey.is_active == True)
-            )
-            keys = result.scalars().all()
+        result = await session.execute(
+            select(ApiKey).where(ApiKey.is_active == True)
+        )
+        keys = result.scalars().all()

-            self._keys_cache = {key.key for key in keys}
-            self._key_meta = {
-                key.key: CachedApiKeyMeta(
-                    is_active=key.is_active,
-                    expires_at=key.expires_at,
-                    allowed_ips=set(key.allowed_ips or []),
-                    rate_limit_qpm=key.rate_limit_qpm or 60,
-                )
-                for key in keys
-            }
-            self._initialized = True
-            logger.info(f"[AC-AISVC-50] Loaded {len(self._keys_cache)} API keys into memory")
-            return
-        except Exception as e:
-            logger.warning(f"[AC-AISVC-50] Full API key schema load failed, fallback to legacy columns: {e}")
-            await session.rollback()
+        self._keys_cache = {key.key for key in keys}
+        self._initialized = True

-        # Backward-compat fallback for environments without new columns
-        try:
-            result = await session.execute(
-                select(ApiKey.key, ApiKey.is_active).where(ApiKey.is_active == True)
-            )
-            rows = result.all()
-            self._keys_cache = {row[0] for row in rows}
-            self._key_meta = {
-                row[0]: CachedApiKeyMeta(
-                    is_active=bool(row[1]),
-                    expires_at=None,
-                    allowed_ips=set(),
-                    rate_limit_qpm=60,
-                )
-                for row in rows
-            }
-            self._initialized = True
-            logger.info(f"[AC-AISVC-50] Loaded {len(self._keys_cache)} API keys in legacy compatibility mode")
-        except Exception as fallback_error:
-            self._initialized = False
-            logger.error(f"[AC-AISVC-50] API key initialization failed in both full/legacy mode: {fallback_error}")
+        logger.info(f"[AC-AISVC-50] Loaded {len(self._keys_cache)} API keys into memory")

    def validate_key(self, key: str) -> bool:
        """
@ -121,41 +61,6 @@ class ApiKeyService:

        return key in self._keys_cache

-    def validate_key_with_context(self, key: str, client_ip: str | None) -> ValidationResult:
-        """Validate key and policy constraints: expiration, IP allowlist, and per-minute rate."""
-        if not self._initialized:
-            return ValidationResult(ok=False, reason="service_not_initialized")
-
-        if key not in self._keys_cache:
-            return ValidationResult(ok=False, reason="invalid_key")
-
-        meta = self._key_meta.get(key)
-        if not meta or not meta.is_active:
-            return ValidationResult(ok=False, reason="inactive_key")
-
-        now = datetime.utcnow()
-        if meta.expires_at and now > meta.expires_at:
-            return ValidationResult(ok=False, reason="expired_key")
-
-        if meta.allowed_ips and client_ip and client_ip not in meta.allowed_ips:
-            return ValidationResult(ok=False, reason="ip_not_allowed")
-
-        self._evict_stale_rate_entries(key, now)
-        bucket = self._rate_buckets.setdefault(key, deque())
-        limit = meta.rate_limit_qpm or 60
-        if len(bucket) >= limit:
-            return ValidationResult(ok=False, reason="rate_limited", rate_limit_qpm=limit)
-
-        bucket.append(now)
-        return ValidationResult(ok=True, rate_limit_qpm=limit)
-
-    def _evict_stale_rate_entries(self, key: str, now: datetime) -> None:
-        """Keep only requests in the latest 60 seconds for token bucket emulation."""
-        bucket = self._rate_buckets.setdefault(key, deque())
-        threshold = now - timedelta(seconds=60)
-        while bucket and bucket[0] < threshold:
-            bucket.popleft()
-
    def generate_key(self) -> str:
        """
        Generate a new secure API key.
@ -184,9 +89,6 @@ class ApiKeyService:
            key=key_create.key,
            name=key_create.name,
            is_active=key_create.is_active,
-            expires_at=key_create.expires_at,
-            allowed_ips=key_create.allowed_ips,
-            rate_limit_qpm=key_create.rate_limit_qpm or 60,
        )

        session.add(api_key)
@ -195,12 +97,6 @@ class ApiKeyService:

        if api_key.is_active:
            self._keys_cache.add(api_key.key)
-            self._key_meta[api_key.key] = CachedApiKeyMeta(
-                is_active=api_key.is_active,
-                expires_at=api_key.expires_at,
-                allowed_ips=set(api_key.allowed_ips or []),
-                rate_limit_qpm=api_key.rate_limit_qpm or 60,
-            )

        logger.info(f"[AC-AISVC-50] Created API key: {api_key.name}")
        return api_key
@ -212,14 +108,8 @@ class ApiKeyService:
        Returns:
            The created ApiKey or None if keys already exist
        """
-        try:
-            result = await session.execute(select(ApiKey).limit(1))
-            existing = result.scalar_one_or_none()
-        except Exception as e:
-            logger.warning(f"[AC-AISVC-50] Full schema query failed in create_default_key, using fallback: {e}")
-            await session.rollback()
-            result = await session.execute(select(ApiKey.key).limit(1))
-            existing = result.scalar_one_or_none()
+        result = await session.execute(select(ApiKey).limit(1))
+        existing = result.scalar_one_or_none()

        if existing:
            return None
@ -236,12 +126,6 @@ class ApiKeyService:
        await session.refresh(api_key)

        self._keys_cache.add(api_key.key)
-        self._key_meta[api_key.key] = CachedApiKeyMeta(
-            is_active=api_key.is_active,
-            expires_at=getattr(api_key, 'expires_at', None),
-            allowed_ips=set(getattr(api_key, 'allowed_ips', []) or []),
-            rate_limit_qpm=getattr(api_key, 'rate_limit_qpm', 60) or 60,
-        )

        logger.info(f"[AC-AISVC-50] Created default API key: {api_key.key}")
        return api_key
@ -281,8 +165,6 @@ class ApiKeyService:
        await session.commit()

        self._keys_cache.discard(key_value)
-        self._key_meta.pop(key_value, None)
-        self._rate_buckets.pop(key_value, None)

        logger.info(f"[AC-AISVC-50] Deleted API key: {api_key.name}")
        return True
@ -328,16 +210,8 @@ class ApiKeyService:

        if is_active:
            self._keys_cache.add(api_key.key)
-            self._key_meta[api_key.key] = CachedApiKeyMeta(
-                is_active=api_key.is_active,
-                expires_at=api_key.expires_at,
-                allowed_ips=set(api_key.allowed_ips or []),
-                rate_limit_qpm=api_key.rate_limit_qpm or 60,
-            )
        else:
            self._keys_cache.discard(api_key.key)
-            self._key_meta.pop(api_key.key, None)
-            self._rate_buckets.pop(api_key.key, None)

        logger.info(f"[AC-AISVC-50] Toggled API key {api_key.name}: active={is_active}")
        return api_key
@ -360,8 +234,6 @@ class ApiKeyService:
        Reload all API keys from database into memory.
        """
        self._keys_cache.clear()
-        self._key_meta.clear()
-        self._rate_buckets.clear()
        await self.initialize(session)
        logger.info("[AC-AISVC-50] API key cache reloaded")

--- a/ai-service/app/services/flow/template_engine.py
+++ b/ai-service/app/services/flow/template_engine.py
@ -1,7 +1,6 @@
 """
 Template Engine for Intent-Driven Script Flow.
 [AC-IDS-06] Template mode script generation with variable filling.
-[AC-MRS-14] 只消费 field_roles 包含 prompt_var 的字段
 """

 import asyncio
@ -9,53 +8,40 @@ import logging
 import re
 from typing import Any

-from sqlalchemy.ext.asyncio import AsyncSession
-
-from app.models.entities import FieldRole
-from app.services.mid.role_based_field_provider import RoleBasedFieldProvider
-
 logger = logging.getLogger(__name__)


 class TemplateEngine:
    """
    [AC-IDS-06] Template script engine.
-    [AC-MRS-14] 只消费 field_roles 包含 prompt_var 的字段
-    
    Fills template variables using context or LLM generation.
    """

    VARIABLE_PATTERN = re.compile(r'\{(\w+)\}')
    DEFAULT_TIMEOUT = 5.0

-    def __init__(self, llm_client: Any = None, session: AsyncSession | None = None):
+    def __init__(self, llm_client: Any = None):
        """
        Initialize TemplateEngine.

        Args:
            llm_client: LLM client for variable generation (optional)
-            session: Database session for role-based field provider (optional)
        """
        self._llm_client = llm_client
-        self._session = session
-        self._role_provider = RoleBasedFieldProvider(session) if session else None

    async def fill_template(
        self,
        template: str,
        context: dict[str, Any] | None,
        history: list[dict[str, str]] | None,
-        tenant_id: str | None = None,
    ) -> str:
        """
        [AC-IDS-06] Fill template variables with context or LLM-generated values.
-        [AC-MRS-14] 只消费 prompt_var 角色的字段

        Args:
            template: Script template with {variable} placeholders
            context: Session context with collected inputs
            history: Conversation history for context
-            tenant_id: Tenant ID for role-based field filtering

        Returns:
            Filled template string
@ -66,28 +52,11 @@ class TemplateEngine:
            if not variables:
                return template

-            prompt_var_fields = []
-            if tenant_id and self._role_provider:
-                prompt_var_fields = await self._role_provider.get_prompt_var_field_keys(tenant_id)
-                logger.info(
-                    f"[AC-MRS-14] Retrieved {len(prompt_var_fields)} prompt_var fields for tenant={tenant_id}: {prompt_var_fields}"
-                )
-
-            filtered_context = {}
-            if context:
-                if prompt_var_fields:
-                    filtered_context = {k: v for k, v in context.items() if k in prompt_var_fields}
-                    logger.info(
-                        f"[AC-MRS-14] Applied prompt_var context: {list(filtered_context.keys())}"
-                    )
-                else:
-                    filtered_context = context
-
            variable_values = {}
            for var in variables:
                value = await self._generate_variable_value(
                    variable_name=var,
-                    context=filtered_context,
+                    context=context,
                    history=history,
                )
                variable_values[var] = value
--- a/ai-service/app/services/metadata_field_definition_service.py
+++ b/ai-service/app/services/metadata_field_definition_service.py
@ -1,7 +1,6 @@
 """
 Metadata Field Definition Service.
 [AC-IDSMETA-13, AC-IDSMETA-14] 元数据字段定义管理服务，支持字段级状态治理。
-[AC-MRS-01, AC-MRS-02, AC-MRS-03, AC-MRS-06] 支持字段角色分层配置。
 """

 import logging
@ -22,9 +21,7 @@ from app.models.entities import (
    MetadataFieldStatus,
    MetadataFieldType,
    MetadataScope,
-    FieldRole,
 )
-from app.schemas.metadata import VALID_FIELD_ROLES

 logger = logging.getLogger(__name__)

@ -45,16 +42,14 @@ class MetadataFieldDefinitionService:
        tenant_id: str,
        status: str | None = None,
        scope: str | None = None,
-        role: str | None = None,
    ) -> list[MetadataFieldDefinition]:
        """
-        [AC-IDSMETA-13] [AC-MRS-06] 列出租户所有元数据字段定义
+        [AC-IDSMETA-13] 列出租户所有元数据字段定义
        
        Args:
            tenant_id: 租户 ID
            status: 按状态过滤（draft/active/deprecated）
            scope: 按适用范围过滤
-            role: [AC-MRS-06] 按字段角色过滤
        
        Returns:
            MetadataFieldDefinition 列表
@ -71,11 +66,6 @@ class MetadataFieldDefinitionService:
                cast(MetadataFieldDefinition.scope, JSONB).op('?')(scope)
            )
        
-        if role:
-            stmt = stmt.where(
-                cast(MetadataFieldDefinition.field_roles, JSONB).op('?')(role)
-            )
-        
        stmt = stmt.order_by(col(MetadataFieldDefinition.created_at).desc())
        
        result = await self._session.execute(stmt)
@ -131,7 +121,7 @@ class MetadataFieldDefinitionService:
        field_create: MetadataFieldDefinitionCreate,
    ) -> MetadataFieldDefinition:
        """
-        [AC-IDSMETA-13] [AC-MRS-01,02,03] 创建元数据字段定义
+        [AC-IDSMETA-13] 创建元数据字段定义
        
        Args:
            tenant_id: 租户 ID
@ -159,8 +149,6 @@ class MetadataFieldDefinitionService:
            field_create.field_key,
        )
        
-        self._validate_field_roles(field_create.field_roles)
-        
        field = MetadataFieldDefinition(
            tenant_id=tenant_id,
            field_key=field_create.field_key,
@ -172,7 +160,6 @@ class MetadataFieldDefinitionService:
            scope=field_create.scope,
            is_filterable=field_create.is_filterable,
            is_rank_feature=field_create.is_rank_feature,
-            field_roles=field_create.field_roles or [],
            status=field_create.status,
            version=1,
        )
@ -181,8 +168,8 @@ class MetadataFieldDefinitionService:
        await self._session.flush()
        
        logger.info(
-            f"[AC-IDSMETA-13] [AC-MRS-01] Created field definition: tenant={tenant_id}, "
-            f"field_key={field.field_key}, status={field.status}, field_roles={field.field_roles}"
+            f"[AC-IDSMETA-13] Created field definition: tenant={tenant_id}, "
+            f"field_key={field.field_key}, status={field.status}"
        )
        
        return field
@ -450,28 +437,6 @@ class MetadataFieldDefinitionService:
                    f"字段 '{field_key}' 的 options 存在重复值"
                )

-    def _validate_field_roles(
-        self,
-        field_roles: list[str] | None,
-    ) -> None:
-        """
-        [AC-MRS-01,02,03] 验证字段角色配置的有效性
-        
-        Args:
-            field_roles: 字段角色列表
-        
-        Raises:
-            ValueError: 如果包含无效的字段角色
-        """
-        if not field_roles:
-            return
-        
-        for role in field_roles:
-            if role not in VALID_FIELD_ROLES:
-                raise ValueError(
-                    f"无效的字段角色 '{role}'，有效角色为: {VALID_FIELD_ROLES}"
-                )
-
    async def get_field_definitions_map(
        self,
        tenant_id: str,
--- a/ai-service/app/services/mid/init.py
+++ b/ai-service/app/services/mid/init.py
@ -1,57 +0,0 @@
-"""
-Mid Platform services.
-[AC-IDMP-02, AC-IDMP-05, AC-IDMP-11, AC-IDMP-12, AC-IDMP-13, AC-IDMP-14, AC-IDMP-15, AC-IDMP-16, AC-IDMP-17, AC-IDMP-19, AC-IDMP-20]
-[AC-MARH-05, AC-MARH-06, AC-MARH-10, AC-MARH-11, AC-MARH-12]
-"""
-
-from .policy_router import PolicyRouter, PolicyRouterResult, IntentMatch
-from .agent_orchestrator import AgentOrchestrator, ReActContext
-from .timeout_governor import TimeoutGovernor
-from .feature_flags import FeatureFlagService
-from .high_risk_handler import HighRiskHandler, HighRiskMatch
-from .trace_logger import TraceLogger, AuditRecord
-from .metrics_collector import MetricsCollector, SessionMetrics, AggregatedMetrics
-from .tool_registry import ToolRegistry, ToolDefinition, ToolExecutionResult, get_tool_registry, init_tool_registry
-from .tool_call_recorder import ToolCallRecorder, ToolCallStatistics, get_tool_call_recorder
-from .memory_adapter import MemoryAdapter, UserMemory
-from .default_kb_tool_runner import DefaultKbToolRunner, KbToolResult, KbToolConfig, get_default_kb_tool_runner
-from .segment_humanizer import SegmentHumanizer, HumanizeConfig, LengthBucket, get_segment_humanizer
-from .runtime_observer import RuntimeObserver, RuntimeContext, get_runtime_observer
-
-__all__ = [
-    "PolicyRouter",
-    "PolicyRouterResult",
-    "IntentMatch",
-    "AgentOrchestrator",
-    "ReActContext",
-    "TimeoutGovernor",
-    "FeatureFlagService",
-    "HighRiskHandler",
-    "HighRiskMatch",
-    "TraceLogger",
-    "AuditRecord",
-    "MetricsCollector",
-    "SessionMetrics",
-    "AggregatedMetrics",
-    "ToolRegistry",
-    "ToolDefinition",
-    "ToolExecutionResult",
-    "get_tool_registry",
-    "init_tool_registry",
-    "ToolCallRecorder",
-    "ToolCallStatistics",
-    "get_tool_call_recorder",
-    "MemoryAdapter",
-    "UserMemory",
-    "DefaultKbToolRunner",
-    "KbToolResult",
-    "KbToolConfig",
-    "get_default_kb_tool_runner",
-    "SegmentHumanizer",
-    "HumanizeConfig",
-    "LengthBucket",
-    "get_segment_humanizer",
-    "RuntimeObserver",
-    "RuntimeContext",
-    "get_runtime_observer",
-]
--- a/ai-service/app/services/mid/agent_orchestrator.py
+++ b/ai-service/app/services/mid/agent_orchestrator.py
@ -1,546 +0,0 @@
-"""
-Agent Orchestrator for Mid Platform.
-[AC-MARH-07] ReAct loop with iteration limit (3-5 iterations).
-
-ReAct Flow:
-1. Thought: Agent thinks about what to do
-2. Action: Agent decides to use a tool
-3. Observation: Tool execution result
-4. Repeat until final answer or max iterations reached
-"""
-
-import asyncio
-import json
-import logging
-import re
-import time
-import uuid
-from dataclasses import dataclass
-from typing import Any
-
-from app.models.mid.schemas import (
-    ExecutionMode,
-    ReActContext,
-    ToolCallStatus,
-    ToolCallTrace,
-    ToolType,
-    TraceInfo,
-)
-from app.services.mid.timeout_governor import TimeoutGovernor
-from app.services.prompt.template_service import PromptTemplateService
-from app.services.prompt.variable_resolver import VariableResolver
-
-logger = logging.getLogger(__name__)
-
-DEFAULT_MAX_ITERATIONS = 5
-MIN_ITERATIONS = 3
-
-
-@dataclass
-class ToolResult:
-    """Tool execution result."""
-    success: bool
-    output: str | None = None
-    error: str | None = None
-    duration_ms: int = 0
-
-
-@dataclass
-class AgentThought:
-    """Agent thought in ReAct loop."""
-    content: str
-    action: str | None = None
-    action_input: dict[str, Any] | None = None
-
-
-class AgentOrchestrator:
-    """
-    [AC-MARH-07] Agent orchestrator with ReAct loop control.
-
-    Features:
-    - ReAct loop with max 5 iterations (min 3)
-    - Per-tool timeout (2s) and end-to-end timeout (8s)
-    - Automatic fallback on iteration limit or timeout
-    - Template-based prompt with variable injection
-    """
-
-    def __init__(
-        self,
-        max_iterations: int = DEFAULT_MAX_ITERATIONS,
-        timeout_governor: TimeoutGovernor | None = None,
-        llm_client: Any = None,
-        tool_registry: Any = None,
-        template_service: PromptTemplateService | None = None,
-        variable_resolver: VariableResolver | None = None,
-        tenant_id: str | None = None,
-    ):
-        self._max_iterations = max(min(max_iterations, 5), MIN_ITERATIONS)
-        self._timeout_governor = timeout_governor or TimeoutGovernor()
-        self._llm_client = llm_client
-        self._tool_registry = tool_registry
-        self._template_service = template_service
-        self._variable_resolver = variable_resolver or VariableResolver()
-        self._tenant_id = tenant_id
-
-    async def execute(
-        self,
-        user_message: str,
-        context: dict[str, Any] | None = None,
-        on_thought: Any = None,
-        on_action: Any = None,
-    ) -> tuple[str, ReActContext, TraceInfo]:
-        """
-        [AC-MARH-07] Execute ReAct loop with iteration control.
-
-        Args:
-            user_message: User input message
-            context: Execution context (history, retrieval results, etc.)
-            on_thought: Callback for thought events
-            on_action: Callback for action events
-
-        Returns:
-            Tuple of (final_answer, react_context, trace_info)
-        """
-        react_ctx = ReActContext(max_iterations=self._max_iterations)
-        tool_calls: list[ToolCallTrace] = []
-        start_time = time.time()
-
-        logger.info(
-            f"[AC-MARH-07] Starting ReAct loop: max_iterations={self._max_iterations}"
-        )
-
-        try:
-            overall_start = time.time()
-            end_to_end_timeout = self._timeout_governor.end_to_end_timeout_seconds
-            llm_timeout = (
-                self._timeout_governor.llm_timeout_seconds
-                if hasattr(self._timeout_governor, 'llm_timeout_seconds')
-                else 15.0
-            )
-
-            while react_ctx.should_continue and react_ctx.iteration < react_ctx.max_iterations:
-                react_ctx.iteration += 1
-
-                elapsed = time.time() - overall_start
-                remaining_time = end_to_end_timeout - elapsed
-                if remaining_time <= 0:
-                    logger.warning(
-                        "[AC-MARH-09] ReAct loop exceeded end-to-end timeout"
-                    )
-                    react_ctx.final_answer = "抱歉，处理超时，请稍后重试或联系人工客服。"
-                    break
-
-                logger.info(
-                    f"[AC-MARH-07] ReAct iteration {react_ctx.iteration}/"
-                    f"{react_ctx.max_iterations}, remaining_time={remaining_time:.1f}s"
-                )
-
-                thought = await asyncio.wait_for(
-                    self._think(user_message, context, react_ctx),
-                    timeout=min(llm_timeout, remaining_time)
-                )
-                if on_thought:
-                    await on_thought(thought)
-
-                if not thought.action:
-                    logger.info(
-                        f"[AC-MARH-07] No action, setting final_answer: "
-                        f"{thought.content[:200] if thought.content else 'None'}"
-                    )
-                    react_ctx.final_answer = thought.content
-                    react_ctx.should_continue = False
-                    break
-
-                tool_result, tool_trace = await self._act(thought, react_ctx)
-                tool_calls.append(tool_trace)
-                react_ctx.tool_calls.append(tool_trace)
-
-                if on_action:
-                    await on_action(thought.action, tool_result)
-
-                if tool_result.success:
-                    context = context or {}
-                    context["last_observation"] = tool_result.output
-                else:
-                    if tool_trace.status == ToolCallStatus.TIMEOUT:
-                        logger.warning(f"[AC-MARH-08] Tool timeout: {thought.action}")
-                        react_ctx.final_answer = "抱歉，操作超时，请稍后重试或联系人工客服。"
-                        react_ctx.should_continue = False
-                        break
-
-            if react_ctx.should_continue and not react_ctx.final_answer:
-                logger.warning(f"[AC-MARH-07] ReAct reached max iterations: {react_ctx.iteration}")
-                react_ctx.final_answer = await self._force_final_answer(user_message, context, react_ctx)
-
-        except asyncio.TimeoutError:
-            logger.error("[AC-MARH-09] ReAct loop timed out (end-to-end)")
-            react_ctx.final_answer = "抱歉，处理超时，请稍后重试或联系人工客服。"
-            tool_calls.append(ToolCallTrace(
-                tool_name="react_loop",
-                tool_type=ToolType.INTERNAL,
-                duration_ms=int((time.time() - start_time) * 1000),
-                status=ToolCallStatus.TIMEOUT,
-                error_code="E2E_TIMEOUT",
-            ))
-
-        total_duration_ms = int((time.time() - start_time) * 1000)
-        trace = TraceInfo(
-            mode=ExecutionMode.AGENT,
-            request_id=str(uuid.uuid4()),
-            generation_id=str(uuid.uuid4()),
-            react_iterations=react_ctx.iteration,
-            tools_used=[tc.tool_name for tc in tool_calls if tc.tool_name != "react_loop"],
-            tool_calls=tool_calls if tool_calls else None,
-        )
-
-        logger.info(
-            f"[AC-MARH-07] ReAct completed: iterations={react_ctx.iteration}, "
-            f"duration_ms={total_duration_ms}"
-        )
-
-        return react_ctx.final_answer or "抱歉，我暂时无法处理您的请求。", react_ctx, trace
-
-    async def _think(
-        self,
-        user_message: str,
-        context: dict[str, Any] | None,
-        react_ctx: ReActContext,
-    ) -> AgentThought:
-        """
-        [AC-MARH-07] Agent thinks about next action.
-
-        In real implementation, this would call LLM with ReAct prompt.
-        For now, returns a simple thought without action.
-        """
-        if not self._llm_client:
-            return AgentThought(content=f"思考中... 用户消息: {user_message}")
-
-        try:
-            observations = []
-            if context and "last_observation" in context:
-                observations.append(f"上一步结果: {context['last_observation']}")
-
-            for tc in react_ctx.tool_calls[-3:]:
-                observations.append(f"工具 {tc.tool_name}: {tc.result_digest or '无结果'}")
-
-            prompt = await self._build_react_prompt(user_message, observations)
-            response = await self._llm_client.generate([{"role": "user", "content": prompt}])
-
-            logger.info(f"[AC-MARH-07] LLM response content: {response.content[:500] if response.content else 'None'}")
-
-            return self._parse_thought(response.content)
-
-        except Exception as e:
-            logger.error(f"[AC-MARH-07] Think failed: {e}")
-            return AgentThought(content=f"思考失败: {str(e)}")
-
-    async def _build_react_prompt(self, user_message: str, observations: list[str]) -> str:
-        """Build ReAct prompt for LLM with template support."""
-        obs_text = "\n".join(observations) if observations else "无"
-        tools_text = self._build_tools_section()
-
-        internal_protocol = f"""你必须遵循以下决策协议：
-1. 优先使用已有观察信息（历史观察、上一步工具结果），避免重复调用同类工具。
-2. 当问题需要外部事实或结构化状态时再调用工具；如果可直接回答则不要调用。
-3. 缺少关键参数时，优先向用户追问，不要使用空参数调用工具。
-4. 工具失败时，先说明已尝试，再给出降级方案或下一步引导。
-5. 只能调用可用工具列表中的工具，工具名必须完全匹配（区分大小写）。
-6. tenant_id 由系统自动注入，绝不能由你填写、猜测或修改。
-7. 对用户输出必须拟人、自然、有同理心，不暴露“工具调用/路由/策略”等内部术语。
-"""
-
-        output_contract = """输出格式（二选一）：
-A) 直接回答用户：
-Final Answer: [给用户的最终回答]
-
-B) 调用工具：
-Thought: [你的思考]
-Action: [工具名称]
-Action Input:
-```json
-{"param1": "value1"}
-```
-
-要求：
- Action Input 必须是合法 JSON 对象。
- 不要输出不存在的工具名。
- 如果要调用工具，Action 和 Action Input 必须同时出现。
-"""
-
-        default_template = f"""你是一个智能客服助手，正在使用 ReAct 模式处理用户请求。
-
-{tools_text}
-
-用户消息: {user_message}
-
-历史观察:
-{obs_text}
-
-{internal_protocol}
-
-{output_contract}
-"""
-
-        if not self._template_service or not self._tenant_id:
-            return default_template
-
-        try:
-            template_version = await self._template_service.get_published_template(
-                tenant_id=self._tenant_id,
-                scene="agent_react",
-            )
-
-            if not template_version:
-                return default_template
-
-            extra_context = {
-                "available_tools": tools_text,
-                "query": user_message,
-                "history": obs_text,
-                "internal_protocol": internal_protocol,
-                "output_contract": output_contract,
-            }
-
-            resolved_template = self._variable_resolver.resolve(
-                template=template_version.system_instruction,
-                variables=template_version.variables,
-                extra_context=extra_context,
-            )
-
-            final_prompt = (
-                f"{resolved_template}\n\n"
-                f"【系统强制规则】\n{internal_protocol}\n"
-                f"【输出契约】\n{output_contract}"
-            )
-
-            logger.info(f"[AC-MARH-07] Using template: scene=agent_react, version={template_version.version}")
-            return final_prompt
-
-        except Exception as e:
-            logger.warning(f"[AC-MARH-07] Failed to load template, using default: {e}")
-            return default_template
-    
-    def _build_tools_section(self) -> str:
-        """Build rich tools section for ReAct prompt."""
-        if not self._tool_registry:
-            return "当前没有可用的工具。"
-
-        tools = self._tool_registry.list_tools(enabled_only=True)
-        if not tools:
-            return "当前没有可用的工具。"
-
-        lines = ["## 可用工具列表", "", "以下是你可以使用的工具，只能使用这些工具：", ""]
-
-        for tool in tools:
-            meta = tool.metadata or {}
-            lines.append(f"### {tool.name}")
-            lines.append(f"用途: {tool.description}")
-
-            when_to_use = meta.get("when_to_use")
-            when_not_to_use = meta.get("when_not_to_use")
-            if when_to_use:
-                lines.append(f"何时使用: {when_to_use}")
-            if when_not_to_use:
-                lines.append(f"何时不要使用: {when_not_to_use}")
-
-            params = meta.get("parameters")
-            if isinstance(params, dict):
-                properties = params.get("properties", {})
-                required = params.get("required", [])
-                if properties:
-                    lines.append("参数:")
-                    for param_name, param_info in properties.items():
-                        param_desc = param_info.get("description", "") if isinstance(param_info, dict) else ""
-                        line = f"  - {param_name}: {param_desc}".strip()
-                        if param_name == "tenant_id":
-                            line += " (系统注入，模型不要填写)"
-                        elif param_name in required:
-                            line += " (必填)"
-                        lines.append(line)
-
-            if meta.get("example_action_input"):
-                lines.append("示例入参(JSON):")
-                try:
-                    example_text = json.dumps(meta["example_action_input"], ensure_ascii=False)
-                except Exception:
-                    example_text = str(meta["example_action_input"])
-                lines.append(example_text)
-
-            if meta.get("result_interpretation"):
-                lines.append(f"结果解释: {meta['result_interpretation']}")
-
-            lines.append("")
-
-        return "\n".join(lines)
-
-    def _extract_json_object(self, text: str) -> dict[str, Any] | None:
-        """Extract the first valid JSON object from free text."""
-        candidates = []
-
-        code_block_match = re.search(r"```json\s*([\s\S]*?)\s*```", text, re.IGNORECASE)
-        if code_block_match:
-            candidates.append(code_block_match.group(1).strip())
-
-        fence_match = re.search(r"```\s*([\s\S]*?)\s*```", text)
-        if fence_match:
-            candidates.append(fence_match.group(1).strip())
-
-        brace_match = re.search(r"\{[\s\S]*\}", text)
-        if brace_match:
-            candidates.append(brace_match.group(0).strip())
-
-        for candidate in candidates:
-            if not candidate:
-                continue
-            try:
-                obj = json.loads(candidate)
-                if isinstance(obj, dict):
-                    return obj
-            except json.JSONDecodeError:
-                fixed = candidate.replace("'", '"')
-                try:
-                    obj = json.loads(fixed)
-                    if isinstance(obj, dict):
-                        return obj
-                except json.JSONDecodeError:
-                    continue
-
-        return None
-
-    def _parse_thought(self, content: str) -> AgentThought:
-        """Parse LLM response into AgentThought with robust format handling."""
-        action = None
-        action_input: dict[str, Any] | None = None
-
-        action_match = re.search(r"^Action:\s*(.+)$", content, re.MULTILINE)
-        if action_match:
-            action = action_match.group(1).strip()
-
-        action_input_match = re.search(
-            r"Action Input:\s*([\s\S]*)$",
-            content,
-            re.IGNORECASE,
-        )
-        if action_input_match:
-            raw_input_text = action_input_match.group(1).strip()
-            parsed = self._extract_json_object(raw_input_text)
-            action_input = parsed if parsed is not None else {}
-
-        if action and action_input is None:
-            action_input = {}
-
-        return AgentThought(content=content, action=action, action_input=action_input)
-
-    async def _act(
-        self,
-        thought: AgentThought,
-        react_ctx: ReActContext,
-    ) -> tuple[ToolResult, ToolCallTrace]:
-        """
-        [AC-MARH-07, AC-MARH-08] Execute tool action with timeout.
-        """
-        tool_name = thought.action or "unknown"
-        start_time = time.time()
-
-        if not self._tool_registry:
-            duration_ms = int((time.time() - start_time) * 1000)
-            return ToolResult(
-                success=False,
-                error="Tool registry not configured",
-                duration_ms=duration_ms,
-            ), ToolCallTrace(
-                tool_name=tool_name,
-                tool_type=ToolType.INTERNAL,
-                duration_ms=duration_ms,
-                status=ToolCallStatus.ERROR,
-                error_code="NO_REGISTRY",
-            )
-
-        try:
-            tool_args = dict(thought.action_input or {})
-            if self._tenant_id:
-                tool_args["tenant_id"] = self._tenant_id
-
-            result = await asyncio.wait_for(
-                self._tool_registry.execute(
-                    tool_name=tool_name,
-                    args=tool_args,
-                ),
-                timeout=self._timeout_governor.per_tool_timeout_seconds
-            )
-
-            duration_ms = int((time.time() - start_time) * 1000)
-            return ToolResult(
-                success=result.success,
-                output=result.output,
-                error=result.error,
-                duration_ms=duration_ms,
-            ), ToolCallTrace(
-                tool_name=tool_name,
-                tool_type=ToolType.INTERNAL,
-                duration_ms=duration_ms,
-                status=ToolCallStatus.OK if result.success else ToolCallStatus.ERROR,
-                args_digest=str(thought.action_input)[:100] if thought.action_input else None,
-                result_digest=str(result.output)[:100] if result.output else None,
-            )
-
-        except asyncio.TimeoutError:
-            duration_ms = int((time.time() - start_time) * 1000)
-            logger.warning(f"[AC-MARH-08] Tool timeout: {tool_name}, duration={duration_ms}ms")
-            return ToolResult(
-                success=False,
-                error="Tool timeout",
-                duration_ms=duration_ms,
-            ), ToolCallTrace(
-                tool_name=tool_name,
-                tool_type=ToolType.INTERNAL,
-                duration_ms=duration_ms,
-                status=ToolCallStatus.TIMEOUT,
-                error_code="TOOL_TIMEOUT",
-            )
-
-        except Exception as e:
-            duration_ms = int((time.time() - start_time) * 1000)
-            logger.error(f"[AC-MARH-07] Tool error: {tool_name}, error={e}")
-            return ToolResult(
-                success=False,
-                error=str(e),
-                duration_ms=duration_ms,
-            ), ToolCallTrace(
-                tool_name=tool_name,
-                tool_type=ToolType.INTERNAL,
-                duration_ms=duration_ms,
-                status=ToolCallStatus.ERROR,
-                error_code="TOOL_ERROR",
-            )
-
-    async def _force_final_answer(
-        self,
-        user_message: str,
-        context: dict[str, Any] | None,
-        react_ctx: ReActContext,
-    ) -> str:
-        """Force final answer when max iterations reached."""
-        observations = []
-        for tc in react_ctx.tool_calls:
-            if tc.result_digest:
-                observations.append(f"- {tc.tool_name}: {tc.result_digest}")
-
-        obs_text = "\n".join(observations) if observations else "无"
-
-        if self._llm_client:
-            try:
-                prompt = f"""基于以下信息，请给出最终回答:
-
-用户消息: {user_message}
-
-收集到的信息:
-{obs_text}
-
-请直接给出回答，不要再调用工具。"""
-                response = await self._llm_client.generate([{"role": "user", "content": prompt}])
-                return response.content
-            except Exception as e:
-                logger.error(f"[AC-MARH-07] Force final answer failed: {e}")
-
-        return "抱歉，我已经尽力处理您的请求，但可能需要更多信息。请稍后重试或联系人工客服。"
--- a/ai-service/app/services/mid/default_kb_tool_runner.py
+++ b/ai-service/app/services/mid/default_kb_tool_runner.py
@ -1,244 +0,0 @@
-"""
-Default KB Tool Runner for Mid Platform.
-[AC-MARH-05] Agent 默认 KB 检索工具调用。
-[AC-MARH-06] KB 失败时可观测降级。
-
-当 Agent 模式处理开放咨询时，默认尝试调用 KB 检索工具获取事实依据。
-"""
-
-import logging
-import time
-import uuid
-from dataclasses import dataclass, field
-from typing import Any
-
-from app.models.mid.schemas import ToolCallStatus, ToolCallTrace, ToolType
-from app.services.mid.timeout_governor import TimeoutGovernor
-
-logger = logging.getLogger(__name__)
-
-DEFAULT_KB_TOP_K = 5
-DEFAULT_KB_TIMEOUT_MS = 2000
-
-
-@dataclass
-class KbToolResult:
-    """KB 检索结果。"""
-    success: bool
-    hits: list[dict[str, Any]] = field(default_factory=list)
-    error: str | None = None
-    fallback_reason_code: str | None = None
-    duration_ms: int = 0
-    tool_trace: ToolCallTrace | None = None
-
-
-@dataclass
-class KbToolConfig:
-    """KB 工具配置。"""
-    enabled: bool = True
-    top_k: int = DEFAULT_KB_TOP_K
-    timeout_ms: int = DEFAULT_KB_TIMEOUT_MS
-    min_score_threshold: float = 0.5
-
-
-class DefaultKbToolRunner:
-    """
-    [AC-MARH-05] Agent 默认 KB 检索工具执行器。
-
-    Features:
-    - Agent 模式下默认调用 KB 检索
-    - 支持超时控制
-    - 失败时返回可观测降级信号
-    - 记录 kb_tool_called, kb_hit 状态
-    """
-
-    def __init__(
-        self,
-        timeout_governor: TimeoutGovernor | None = None,
-        config: KbToolConfig | None = None,
-    ):
-        self._timeout_governor = timeout_governor or TimeoutGovernor()
-        self._config = config or KbToolConfig()
-        self._vector_retriever = None
-
-    async def execute(
-        self,
-        tenant_id: str,
-        query: str,
-        metadata_filter: dict[str, Any] | None = None,
-    ) -> KbToolResult:
-        """
-        [AC-MARH-05] 执行 KB 检索。
-
-        Args:
-            tenant_id: 租户 ID
-            query: 检索查询
-            metadata_filter: 元数据过滤条件
-
-        Returns:
-            KbToolResult 包含检索结果和追踪信息
-        """
-        if not self._config.enabled:
-            logger.info(f"[AC-MARH-05] KB tool disabled for tenant={tenant_id}")
-            return KbToolResult(
-                success=False,
-                fallback_reason_code="KB_DISABLED",
-            )
-
-        start_time = time.time()
-        tool_trace_id = str(uuid.uuid4())
-
-        logger.info(
-            f"[AC-MARH-05] Starting KB retrieval: tenant={tenant_id}, "
-            f"query={query[:50]}..., top_k={self._config.top_k}"
-        )
-
-        try:
-            hits = await self._retrieve_with_timeout(
-                tenant_id=tenant_id,
-                query=query,
-                metadata_filter=metadata_filter,
-            )
-
-            duration_ms = int((time.time() - start_time) * 1000)
-            kb_hit = len(hits) > 0
-
-            tool_trace = ToolCallTrace(
-                tool_name="kb_retrieval",
-                tool_type=ToolType.INTERNAL,
-                duration_ms=duration_ms,
-                status=ToolCallStatus.OK,
-                args_digest=f"query={query[:50]}",
-                result_digest=f"hits={len(hits)}",
-            )
-
-            logger.info(
-                f"[AC-MARH-05] KB retrieval completed: tenant={tenant_id}, "
-                f"hits={len(hits)}, duration_ms={duration_ms}, kb_hit={kb_hit}"
-            )
-
-            return KbToolResult(
-                success=True,
-                hits=hits,
-                duration_ms=duration_ms,
-                tool_trace=tool_trace,
-            )
-
-        except TimeoutError:
-            duration_ms = int((time.time() - start_time) * 1000)
-            logger.warning(
-                f"[AC-MARH-06] KB retrieval timeout: tenant={tenant_id}, "
-                f"duration_ms={duration_ms}"
-            )
-
-            tool_trace = ToolCallTrace(
-                tool_name="kb_retrieval",
-                tool_type=ToolType.INTERNAL,
-                duration_ms=duration_ms,
-                status=ToolCallStatus.TIMEOUT,
-                error_code="KB_TIMEOUT",
-            )
-
-            return KbToolResult(
-                success=False,
-                error="KB retrieval timeout",
-                fallback_reason_code="KB_TIMEOUT",
-                duration_ms=duration_ms,
-                tool_trace=tool_trace,
-            )
-
-        except Exception as e:
-            duration_ms = int((time.time() - start_time) * 1000)
-            logger.error(
-                f"[AC-MARH-06] KB retrieval failed: tenant={tenant_id}, "
-                f"error={e}"
-            )
-
-            tool_trace = ToolCallTrace(
-                tool_name="kb_retrieval",
-                tool_type=ToolType.INTERNAL,
-                duration_ms=duration_ms,
-                status=ToolCallStatus.ERROR,
-                error_code="KB_ERROR",
-            )
-
-            return KbToolResult(
-                success=False,
-                error=str(e),
-                fallback_reason_code="KB_ERROR",
-                duration_ms=duration_ms,
-                tool_trace=tool_trace,
-            )
-
-    async def _retrieve_with_timeout(
-        self,
-        tenant_id: str,
-        query: str,
-        metadata_filter: dict[str, Any] | None = None,
-    ) -> list[dict[str, Any]]:
-        """带超时控制的检索。"""
-        import asyncio
-
-        timeout_seconds = self._config.timeout_ms / 1000.0
-
-        try:
-            return await asyncio.wait_for(
-                self._do_retrieve(tenant_id, query, metadata_filter),
-                timeout=timeout_seconds,
-            )
-        except asyncio.TimeoutError:
-            raise TimeoutError("KB retrieval timeout")
-
-    async def _do_retrieve(
-        self,
-        tenant_id: str,
-        query: str,
-        metadata_filter: dict[str, Any] | None = None,
-    ) -> list[dict[str, Any]]:
-        """执行实际检索。"""
-        if self._vector_retriever is None:
-            from app.services.retrieval.vector_retriever import get_vector_retriever
-            self._vector_retriever = await get_vector_retriever()
-
-        from app.services.retrieval.base import RetrievalContext
-
-        ctx = RetrievalContext(
-            tenant_id=tenant_id,
-            query=query,
-            metadata_filter=metadata_filter,
-        )
-
-        result = await self._vector_retriever.retrieve(ctx)
-
-        hits = []
-        for hit in result.hits:
-            if hit.score >= self._config.min_score_threshold:
-                hits.append({
-                    "id": hit.metadata.get("chunk_id", str(uuid.uuid4())),
-                    "content": hit.text,
-                    "score": hit.score,
-                    "metadata": hit.metadata,
-                })
-
-        return hits[:self._config.top_k]
-
-    def get_config(self) -> KbToolConfig:
-        """获取当前配置。"""
-        return self._config
-
-
-_default_kb_tool_runner: DefaultKbToolRunner | None = None
-
-
-def get_default_kb_tool_runner(
-    timeout_governor: TimeoutGovernor | None = None,
-    config: KbToolConfig | None = None,
-) -> DefaultKbToolRunner:
-    """获取或创建 DefaultKbToolRunner 实例。"""
-    global _default_kb_tool_runner
-    if _default_kb_tool_runner is None:
-        _default_kb_tool_runner = DefaultKbToolRunner(
-            timeout_governor=timeout_governor,
-            config=config,
-        )
-    return _default_kb_tool_runner
--- a/ai-service/app/services/mid/feature_flags.py
+++ b/ai-service/app/services/mid/feature_flags.py
@ -1,100 +0,0 @@
-"""
-Feature Flags Service for Mid Platform.
-[AC-IDMP-17] Session-level grayscale and rollback support.
-"""
-
-import logging
-from dataclasses import dataclass
-from typing import Any
-
-from app.models.mid.schemas import FeatureFlags
-
-logger = logging.getLogger(__name__)
-
-
-@dataclass
-class FeatureFlagConfig:
-    """Feature flag configuration."""
-    agent_enabled: bool = True
-    rollback_to_legacy: bool = False
-    react_max_iterations: int = 5
-    enable_tool_registry: bool = True
-    enable_trace_logging: bool = True
-
-
-class FeatureFlagService:
-    """
-    [AC-IDMP-17] Feature flag service for session-level control.
-    
-    Supports:
-    - Session-level Agent enable/disable
-    - Force rollback to legacy pipeline
-    - Dynamic configuration per session
-    """
-    
-    def __init__(self):
-        self._session_flags: dict[str, FeatureFlagConfig] = {}
-        self._global_config = FeatureFlagConfig()
-    
-    def get_flags(self, session_id: str) -> FeatureFlags:
-        """
-        [AC-IDMP-17] Get feature flags for a session.
-        
-        Args:
-            session_id: Session ID
-        
-        Returns:
-            FeatureFlags for the session
-        """
-        config = self._session_flags.get(session_id, self._global_config)
-        
-        return FeatureFlags(
-            agent_enabled=config.agent_enabled,
-            rollback_to_legacy=config.rollback_to_legacy,
-        )
-    
-    def set_flags(self, session_id: str, flags: FeatureFlags) -> None:
-        """
-        [AC-IDMP-17] Set feature flags for a session.
-        
-        Args:
-            session_id: Session ID
-            flags: Feature flags to set
-        """
-        config = FeatureFlagConfig(
-            agent_enabled=flags.agent_enabled if flags.agent_enabled is not None else self._global_config.agent_enabled,
-            rollback_to_legacy=flags.rollback_to_legacy if flags.rollback_to_legacy is not None else self._global_config.rollback_to_legacy,
-        )
-        
-        self._session_flags[session_id] = config
-        
-        logger.info(
-            f"[AC-IDMP-17] Feature flags set for session {session_id}: "
-            f"agent_enabled={config.agent_enabled}, rollback_to_legacy={config.rollback_to_legacy}"
-        )
-    
-    def clear_flags(self, session_id: str) -> None:
-        """Clear feature flags for a session."""
-        if session_id in self._session_flags:
-            del self._session_flags[session_id]
-            logger.info(f"[AC-IDMP-17] Feature flags cleared for session {session_id}")
-    
-    def is_agent_enabled(self, session_id: str) -> bool:
-        """Check if Agent mode is enabled for a session."""
-        config = self._session_flags.get(session_id, self._global_config)
-        return config.agent_enabled
-    
-    def should_rollback(self, session_id: str) -> bool:
-        """Check if should rollback to legacy for a session."""
-        config = self._session_flags.get(session_id, self._global_config)
-        return config.rollback_to_legacy
-    
-    def set_global_config(self, config: FeatureFlagConfig) -> None:
-        """Set global default configuration."""
-        self._global_config = config
-        logger.info(f"[AC-IDMP-17] Global config updated: {config}")
-    
-    def get_react_max_iterations(self, session_id: str) -> int:
-        """Get ReAct max iterations for a session."""
-        config = self._session_flags.get(session_id, self._global_config)
-        return config.react_max_iterations
--- a/ai-service/app/services/mid/high_risk_check_tool.py
+++ b/ai-service/app/services/mid/high_risk_check_tool.py
@ -1,495 +0,0 @@
-"""
-High Risk Check Tool for Mid Platform.
-[AC-IDMP-05, AC-IDMP-20] 高风险场景检测工具，支持元数据驱动配置。
-[AC-MRS-13] 只消费 field_roles 包含 routing_signal 的字段
-
-核心特性：
- 基于租户元数据配置进行风险判定（不是写死关键词）
- 支持关键词 + 正则 + 优先级
- 租户隔离（tenant_id 必须参与查询）
- 超时 <= 500ms（可配置）
- 返回结构化结果，不抛硬异常
- 工具失败时返回可降级结果
- 只消费 routing_signal 角色的字段
-
-高风险场景最小集：
-1. refund（退款）
-2. complaint_escalation（投诉升级）
-3. privacy_sensitive_promise（隐私敏感承诺）
-4. transfer（转人工）
-"""
-
-from __future__ import annotations
-
-import asyncio
-import logging
-import re
-import time
-from dataclasses import dataclass, field
-from typing import TYPE_CHECKING, Any
-
-from sqlalchemy import select
-from sqlalchemy.ext.asyncio import AsyncSession
-
-from app.models.entities import HighRiskPolicy
-from app.models.entities import FieldRole
-from app.models.mid.schemas import (
-    ExecutionMode,
-    HighRiskCheckResult,
-    HighRiskScenario,
-    ToolCallStatus,
-    ToolCallTrace,
-    ToolType,
-)
-from app.services.mid.role_based_field_provider import RoleBasedFieldProvider
-
-if TYPE_CHECKING:
-    from app.services.mid.tool_registry import ToolRegistry
-
-logger = logging.getLogger(__name__)
-
-HIGH_RISK_CHECK_TOOL_NAME = "high_risk_check"
-DEFAULT_TIMEOUT_MS = 500
-DEFAULT_CONFIDENCE = 0.9
-
-
-@dataclass
-class HighRiskCheckConfig:
-    """高风险检测工具配置。"""
-    enabled: bool = True
-    timeout_ms: int = DEFAULT_TIMEOUT_MS
-    default_confidence: float = DEFAULT_CONFIDENCE
-
-
-@dataclass
-class CompiledRule:
-    """编译后的高风险规则。"""
-    rule_id: str
-    scenario: HighRiskScenario
-    handler_mode: ExecutionMode
-    flow_id: str | None
-    transfer_message: str | None
-    priority: int
-    keywords: list[str] = field(default_factory=list)
-    patterns: list[re.Pattern] = field(default_factory=list)
-
-
-class HighRiskCheckTool:
-    """
-    [AC-IDMP-05, AC-IDMP-20] 高风险场景检测工具。
-    [AC-MRS-13] 只消费 field_roles 包含 routing_signal 的字段
-
-    通过元数据配置动态加载风险规则，支持：
-    - 关键词匹配（大小写不敏感）
-    - 正则表达式匹配
-    - 优先级排序（优先匹配高优先级规则）
-    - 租户隔离
-    - 只消费 routing_signal 角色的字段
-
-    工具输入：
-    - message: str（用户消息）
-    - tenant_id: str（租户ID）
-    - domain?: str（领域，可选）
-    - scene?: str（场景，可选）
-    - context?: dict（上下文，可选，只消费 routing_signal 字段）
-
-    工具输出：
-    - matched: bool
-    - risk_scenario: refund|complaint_escalation|privacy_sensitive_promise|transfer|none
-    - confidence: float
-    - recommended_mode: micro_flow|transfer|agent
-    - rule_id?: str
-    - reason?: str
-    - fallback_reason_code?: str
-    """
-
-    def __init__(
-        self,
-        session: AsyncSession,
-        config: HighRiskCheckConfig | None = None,
-    ):
-        self._session = session
-        self._config = config or HighRiskCheckConfig()
-        self._rules_cache: dict[str, list[CompiledRule]] = {}
-        self._cache_time: dict[str, float] = {}
-        self._cache_ttl_seconds = 60
-        self._role_provider = RoleBasedFieldProvider(session)
-
-    @property
-    def name(self) -> str:
-        return HIGH_RISK_CHECK_TOOL_NAME
-
-    @property
-    def description(self) -> str:
-        return (
-            "高风险场景检测工具。"
-            "基于租户配置的风险规则，检测用户消息是否命中退款、投诉、隐私承诺、转人工等高风险场景。"
-            "返回结构化风险结果供 policy_router 使用。"
-        )
-
-    def get_tool_schema(self) -> dict[str, Any]:
-        return {
-            "name": self.name,
-            "description": self.description,
-            "parameters": {
-                "type": "object",
-                "properties": {
-                    "message": {
-                        "type": "string",
-                        "description": "用户消息内容",
-                    },
-                    "tenant_id": {
-                        "type": "string",
-                        "description": "租户ID",
-                    },
-                    "domain": {
-                        "type": "string",
-                        "description": "领域标识（可选）",
-                    },
-                    "scene": {
-                        "type": "string",
-                        "description": "场景标识（可选）",
-                    },
-                    "context": {
-                        "type": "object",
-                        "description": "额外上下文信息",
-                    },
-                },
-                "required": ["message", "tenant_id"],
-            },
-        }
-
-    async def execute(
-        self,
-        message: str,
-        tenant_id: str,
-        domain: str | None = None,
-        scene: str | None = None,
-        context: dict[str, Any] | None = None,
-    ) -> HighRiskCheckResult:
-        """
-        [AC-IDMP-05, AC-IDMP-20] 执行高风险检测。
-        [AC-MRS-13] 只消费 routing_signal 角色的字段
-
-        Args:
-            message: 用户消息
-            tenant_id: 租户ID
-            domain: 领域（可选）
-            scene: 场景（可选）
-            context: 上下文（可选，只消费 routing_signal 字段）
-
-        Returns:
-            HighRiskCheckResult 结构化结果
-        """
-        if not self._config.enabled:
-            logger.info(f"[AC-IDMP-05] High risk check disabled for tenant={tenant_id}")
-            return HighRiskCheckResult(
-                matched=False,
-                fallback_reason_code="TOOL_DISABLED",
-            )
-
-        start_time = time.time()
-
-        routing_signal_fields = await self._role_provider.get_routing_signal_field_keys(tenant_id)
-        logger.info(
-            f"[AC-MRS-13] Retrieved {len(routing_signal_fields)} routing_signal fields for tenant={tenant_id}: {routing_signal_fields}"
-        )
-
-        routing_context = {}
-        if context:
-            routing_context = {k: v for k, v in context.items() if k in routing_signal_fields}
-            if routing_context:
-                logger.info(
-                    f"[AC-MRS-13] Applied routing_signal context: {list(routing_context.keys())}"
-                )
-
-        logger.info(
-            f"[AC-IDMP-05] Starting high risk check: tenant={tenant_id}, "
-            f"message={message[:50]}..."
-        )
-
-        try:
-            timeout_seconds = self._config.timeout_ms / 1000.0
-
-            rules = await asyncio.wait_for(
-                self._get_rules(tenant_id),
-                timeout=timeout_seconds,
-            )
-
-            if not rules:
-                logger.info(f"[AC-IDMP-05] No high risk rules for tenant={tenant_id}")
-                duration_ms = int((time.time() - start_time) * 1000)
-                return HighRiskCheckResult(
-                    matched=False,
-                    duration_ms=duration_ms,
-                )
-
-            sorted_rules = sorted(rules, key=lambda r: r.priority, reverse=True)
-
-            for rule in sorted_rules:
-                match_result = self._match_rule(message, rule)
-                if match_result:
-                    duration_ms = int((time.time() - start_time) * 1000)
-                    recommended_mode = rule.handler_mode
-
-                    logger.info(
-                        f"[AC-IDMP-05] High risk matched: tenant={tenant_id}, "
-                        f"scenario={rule.scenario.value}, rule_id={rule.rule_id}, "
-                        f"duration_ms={duration_ms}"
-                    )
-
-                    return HighRiskCheckResult(
-                        matched=True,
-                        risk_scenario=rule.scenario,
-                        confidence=self._config.default_confidence,
-                        recommended_mode=recommended_mode,
-                        rule_id=rule.rule_id,
-                        reason=f"匹配到高风险规则: {rule.scenario.value}",
-                        duration_ms=duration_ms,
-                        matched_text=match_result.get("text"),
-                        matched_pattern=match_result.get("pattern"),
-                    )
-
-            duration_ms = int((time.time() - start_time) * 1000)
-            logger.info(
-                f"[AC-IDMP-05] No high risk matched: tenant={tenant_id}, "
-                f"duration_ms={duration_ms}"
-            )
-
-            return HighRiskCheckResult(
-                matched=False,
-                duration_ms=duration_ms,
-            )
-
-        except asyncio.TimeoutError:
-            duration_ms = int((time.time() - start_time) * 1000)
-            logger.warning(
-                f"[AC-IDMP-05] High risk check timeout: tenant={tenant_id}, "
-                f"duration_ms={duration_ms}"
-            )
-            return HighRiskCheckResult(
-                matched=False,
-                fallback_reason_code="CHECK_TIMEOUT",
-                duration_ms=duration_ms,
-            )
-
-        except Exception as e:
-            duration_ms = int((time.time() - start_time) * 1000)
-            logger.error(
-                f"[AC-IDMP-05] High risk check error: tenant={tenant_id}, "
-                f"error={e}"
-            )
-            return HighRiskCheckResult(
-                matched=False,
-                fallback_reason_code="CHECK_ERROR",
-                duration_ms=duration_ms,
-            )
-
-    async def _get_rules(self, tenant_id: str) -> list[CompiledRule]:
-        """
-        获取租户的高风险规则（带缓存）。
-
-        只返回：
-        - is_enabled = True
-        - 状态生效的规则
-        """
-        current_time = time.time()
-
-        if (
-            tenant_id in self._rules_cache
-            and tenant_id in self._cache_time
-            and current_time - self._cache_time[tenant_id] < self._cache_ttl_seconds
-        ):
-            return self._rules_cache[tenant_id]
-
-        rules = await self._load_rules_from_db(tenant_id)
-
-        self._rules_cache[tenant_id] = rules
-        self._cache_time[tenant_id] = current_time
-
-        return rules
-
-    async def _load_rules_from_db(self, tenant_id: str) -> list[CompiledRule]:
-        """从数据库加载租户的高风险规则。"""
-        stmt = select(HighRiskPolicy).where(
-            HighRiskPolicy.tenant_id == tenant_id,
-            HighRiskPolicy.is_enabled.is_(True),
-        ).order_by(HighRiskPolicy.priority.desc())
-
-        result = await self._session.execute(stmt)
-        policies = result.scalars().all()
-
-        rules = []
-        for policy in policies:
-            try:
-                scenario = HighRiskScenario(policy.scenario)
-            except ValueError:
-                logger.warning(
-                    f"[AC-IDMP-05] Unknown scenario: {policy.scenario}, "
-                    f"policy_id={policy.id}"
-                )
-                continue
-
-            handler_mode = ExecutionMode.MICRO_FLOW
-            if policy.handler_mode == "transfer":
-                handler_mode = ExecutionMode.TRANSFER
-
-            compiled_patterns = []
-            if policy.patterns:
-                for pattern_str in policy.patterns:
-                    try:
-                        compiled = re.compile(pattern_str, re.IGNORECASE)
-                        compiled_patterns.append(compiled)
-                    except re.error as e:
-                        logger.warning(
-                            f"[AC-IDMP-05] Invalid pattern: {pattern_str}, "
-                            f"error={e}"
-                        )
-
-            rule = CompiledRule(
-                rule_id=str(policy.id),
-                scenario=scenario,
-                handler_mode=handler_mode,
-                flow_id=str(policy.flow_id) if policy.flow_id else None,
-                transfer_message=policy.transfer_message,
-                priority=policy.priority,
-                keywords=policy.keywords or [],
-                patterns=compiled_patterns,
-            )
-            rules.append(rule)
-
-        logger.info(
-            f"[AC-IDMP-05] Loaded {len(rules)} high risk rules for tenant={tenant_id}"
-        )
-
-        return rules
-
-    def _match_rule(self, message: str, rule: CompiledRule) -> dict[str, Any] | None:
-        """
-        检查消息是否匹配规则。
-
-        优先匹配关键词，再匹配正则。
-
-        Returns:
-            匹配结果字典 {"text": str, "pattern": str} 或 None
-        """
-        message_lower = message.lower()
-
-        for keyword in rule.keywords:
-            if keyword.lower() in message_lower:
-                return {
-                    "text": keyword,
-                    "pattern": f"keyword:{keyword}",
-                }
-
-        for pattern in rule.patterns:
-            match = pattern.search(message)
-            if match:
-                return {
-                    "text": match.group(),
-                    "pattern": f"regex:{pattern.pattern}",
-                }
-
-        return None
-
-    def create_trace(
-        self,
-        result: HighRiskCheckResult,
-        tenant_id: str,
-    ) -> ToolCallTrace:
-        """创建工具调用追踪记录。"""
-        status = ToolCallStatus.OK
-        error_code = None
-
-        if result.fallback_reason_code:
-            if "TIMEOUT" in result.fallback_reason_code:
-                status = ToolCallStatus.TIMEOUT
-            else:
-                status = ToolCallStatus.ERROR
-            error_code = result.fallback_reason_code
-
-        return ToolCallTrace(
-            tool_name=self.name,
-            tool_type=ToolType.INTERNAL,
-            duration_ms=result.duration_ms,
-            status=status,
-            error_code=error_code,
-            args_digest=f"tenant={tenant_id}",
-            result_digest=f"matched={result.matched},scenario={result.risk_scenario}",
-        )
-
-
-def register_high_risk_check_tool(
-    registry: ToolRegistry,
-    session: AsyncSession,
-    config: HighRiskCheckConfig | None = None,
-) -> None:
-    """
-    [AC-IDMP-05, AC-IDMP-20] 将 high_risk_check 注册到 ToolRegistry。
-    [AC-MRS-13] 只消费 routing_signal 角色的字段
-
-    Args:
-        registry: ToolRegistry 实例
-        session: 数据库会话
-        config: 工具配置
-    """
-    from app.services.mid.tool_registry import ToolType as RegistryToolType
-
-    async def handler(
-        message: str,
-        tenant_id: str = "",
-        domain: str | None = None,
-        scene: str | None = None,
-        context: dict[str, Any] | None = None,
-    ) -> dict[str, Any]:
-        tool = HighRiskCheckTool(
-            session=session,
-            config=config,
-        )
-
-        result = await tool.execute(
-            message=message,
-            tenant_id=tenant_id,
-            domain=domain,
-            scene=scene,
-            context=context,
-        )
-
-        return result.model_dump()
-
-    registry.register(
-        name=HIGH_RISK_CHECK_TOOL_NAME,
-        description="[AC-IDMP-05, AC-IDMP-20, AC-MRS-13] 高风险场景检测工具，基于租户配置检测退款、投诉、隐私承诺、转人工等高风险场景 (only consumes routing_signal fields)",
-        handler=handler,
-        tool_type=RegistryToolType.INTERNAL,
-        version="1.0.0",
-        auth_required=False,
-        timeout_ms=config.timeout_ms if config else DEFAULT_TIMEOUT_MS,
-        enabled=True,
-        metadata={
-            "supports_metadata_driven": True,
-            "min_scenarios": ["refund", "complaint_escalation", "privacy_sensitive_promise", "transfer"],
-            "supports_routing_signal_filter": True,
-            "when_to_use": "当用户消息可能涉及退款、投诉升级、隐私承诺、转人工等高风险场景时使用。",
-            "when_not_to_use": "当已完成高风险判定且结果未变化，或当前仅需知识检索时不要重复调用。",
-            "parameters": {
-                "type": "object",
-                "properties": {
-                    "message": {"type": "string", "description": "用户消息原文"},
-                    "tenant_id": {"type": "string", "description": "租户 ID"},
-                    "domain": {"type": "string", "description": "业务域（可选）"},
-                    "scene": {"type": "string", "description": "场景标识（可选）"},
-                    "context": {"type": "object", "description": "上下文（仅 routing_signal 字段会被消费）"}
-                },
-                "required": ["message", "tenant_id"]
-            },
-            "example_action_input": {
-                "message": "我要投诉你们并且现在就给我退款，不然我去12315",
-                "tenant_id": "default",
-                "scene": "open_consult"
-            },
-            "result_interpretation": "matched=true 时优先按 recommended_mode 执行；关注 risk_scenario、rule_id、fallback_reason_code。"
-        },
-    )
-
-    logger.info(f"[AC-IDMP-05] Tool registered: {HIGH_RISK_CHECK_TOOL_NAME}")
--- a/ai-service/app/services/mid/high_risk_handler.py
+++ b/ai-service/app/services/mid/high_risk_handler.py
@ -1,232 +0,0 @@
-"""
-High Risk Handler for Mid Platform.
-[AC-IDMP-05, AC-IDMP-20] High-risk scenario detection and mandatory takeover.
-
-High-Risk Scenarios (minimum set):
-1. Refund (退款)
-2. Complaint Escalation (投诉升级)
-3. Privacy/Sensitive Promise (隐私与敏感承诺)
-4. Transfer (转人工)
-"""
-
-import logging
-from dataclasses import dataclass
-from typing import Any
-
-from app.models.mid.schemas import (
-    ExecutionMode,
-    HighRiskScenario,
-    PolicyRouterResult,
-)
-
-logger = logging.getLogger(__name__)
-
-DEFAULT_HIGH_RISK_SCENARIOS = [
-    HighRiskScenario.REFUND,
-    HighRiskScenario.COMPLAINT_ESCALATION,
-    HighRiskScenario.PRIVACY_SENSITIVE_PROMISE,
-    HighRiskScenario.TRANSFER,
-]
-
-HIGH_RISK_PATTERNS = {
-    HighRiskScenario.REFUND: [
-        r"退款",
-        r"退货",
-        r"退钱",
-        r"退费",
-        r"还钱",
-        r"申请退款",
-        r"我要退",
-    ],
-    HighRiskScenario.COMPLAINT_ESCALATION: [
-        r"投诉",
-        r"升级投诉",
-        r"举报",
-        r"12315",
-        r"消费者协会",
-        r"工商局",
-        r"市场监督管理局",
-    ],
-    HighRiskScenario.PRIVACY_SENSITIVE_PROMISE: [
-        r"承诺.{0,10}(退款|赔偿|补偿)",
-        r"保证.{0,10}(退款|赔偿|补偿)",
-        r"一定.{0,10}(退款|赔偿|补偿)",
-        r"肯定能.{0,10}(退款|赔偿|补偿)",
-        r"绝对.{0,10}(退款|赔偿|补偿)",
-        r"担保",
-    ],
-    HighRiskScenario.TRANSFER: [
-        r"转人工",
-        r"人工客服",
-        r"人工服务",
-        r"真人",
-        r"人工",
-        r"转接人工",
-    ],
-}
-
-
-@dataclass
-class HighRiskMatch:
-    """High-risk scenario match result."""
-    scenario: HighRiskScenario
-    matched_pattern: str
-    matched_text: str
-    confidence: float = 1.0
-
-
-class HighRiskHandler:
-    """
-    [AC-IDMP-05, AC-IDMP-20] High-risk scenario handler.
-    
-    Features:
-    - Configurable high-risk scenario set (minimum set required)
-    - Pattern-based detection
-    - Mandatory takeover to micro_flow or transfer
-    """
-    
-    def __init__(
-        self,
-        enabled_scenarios: list[HighRiskScenario] | None = None,
-        custom_patterns: dict[HighRiskScenario, list[str]] | None = None,
-    ):
-        self._enabled_scenarios = enabled_scenarios or DEFAULT_HIGH_RISK_SCENARIOS.copy()
-        
-        if not self._enabled_scenarios:
-            raise ValueError("[AC-IDMP-20] High-risk scenario set cannot be empty")
-        
-        self._patterns = HIGH_RISK_PATTERNS.copy()
-        if custom_patterns:
-            for scenario, patterns in custom_patterns.items():
-                if scenario not in self._patterns:
-                    self._patterns[scenario] = []
-                self._patterns[scenario].extend(patterns)
-        
-        self._compiled_patterns = self._compile_patterns()
-        
-        logger.info(
-            f"[AC-IDMP-20] HighRiskHandler initialized with scenarios: "
-            f"{[s.value for s in self._enabled_scenarios]}"
-        )
-    
-    def _compile_patterns(self) -> dict[HighRiskScenario, list]:
-        """Compile regex patterns for better performance."""
-        import re
-        compiled = {}
-        for scenario in self._enabled_scenarios:
-            patterns = self._patterns.get(scenario, [])
-            compiled[scenario] = [
-                re.compile(p, re.IGNORECASE) for p in patterns
-            ]
-        return compiled
-    
-    def detect(self, message: str) -> HighRiskMatch | None:
-        """
-        [AC-IDMP-05, AC-IDMP-20] Detect high-risk scenario in message.
-        
-        Args:
-            message: User message to check
-        
-        Returns:
-            HighRiskMatch if detected, None otherwise
-        """
-        for scenario in self._enabled_scenarios:
-            patterns = self._compiled_patterns.get(scenario, [])
-            for pattern in patterns:
-                match = pattern.search(message)
-                if match:
-                    logger.info(
-                        f"[AC-IDMP-05] High-risk scenario detected: "
-                        f"scenario={scenario.value}, pattern={pattern.pattern}"
-                    )
-                    return HighRiskMatch(
-                        scenario=scenario,
-                        matched_pattern=pattern.pattern,
-                        matched_text=match.group(),
-                    )
-        
-        return None
-    
-    def handle(
-        self,
-        match: HighRiskMatch,
-        context: dict[str, Any] | None = None,
-    ) -> PolicyRouterResult:
-        """
-        [AC-IDMP-05] Handle high-risk scenario by routing to appropriate mode.
-        
-        Args:
-            match: High-risk match result
-            context: Additional context (intent match, flow config, etc.)
-        
-        Returns:
-            PolicyRouterResult with execution mode decision
-        """
-        context = context or {}
-        
-        if match.scenario == HighRiskScenario.TRANSFER:
-            return PolicyRouterResult(
-                mode=ExecutionMode.TRANSFER,
-                high_risk_triggered=True,
-                transfer_message="正在为您转接人工客服，请稍候...",
-            )
-        
-        flow_id = context.get("flow_id")
-        if flow_id:
-            return PolicyRouterResult(
-                mode=ExecutionMode.MICRO_FLOW,
-                high_risk_triggered=True,
-                target_flow_id=flow_id,
-            )
-        
-        if match.scenario == HighRiskScenario.REFUND:
-            return PolicyRouterResult(
-                mode=ExecutionMode.MICRO_FLOW,
-                high_risk_triggered=True,
-                fallback_reason_code="high_risk_refund",
-            )
-        
-        if match.scenario == HighRiskScenario.COMPLAINT_ESCALATION:
-            return PolicyRouterResult(
-                mode=ExecutionMode.TRANSFER,
-                high_risk_triggered=True,
-                transfer_message="检测到您可能需要投诉处理，正在为您转接人工客服...",
-            )
-        
-        if match.scenario == HighRiskScenario.PRIVACY_SENSITIVE_PROMISE:
-            return PolicyRouterResult(
-                mode=ExecutionMode.MICRO_FLOW,
-                high_risk_triggered=True,
-                fallback_reason_code="high_risk_privacy_promise",
-            )
-        
-        return PolicyRouterResult(
-            mode=ExecutionMode.TRANSFER,
-            high_risk_triggered=True,
-            transfer_message="正在为您转接人工客服...",
-        )
-    
-    def get_enabled_scenarios(self) -> list[HighRiskScenario]:
-        """[AC-IDMP-20] Get enabled high-risk scenarios."""
-        return self._enabled_scenarios.copy()
-    
-    def add_scenario(self, scenario: HighRiskScenario) -> None:
-        """Add a high-risk scenario to the enabled set."""
-        if scenario not in self._enabled_scenarios:
-            self._enabled_scenarios.append(scenario)
-            if scenario in HIGH_RISK_PATTERNS:
-                self._compiled_patterns[scenario] = [
-                    __import__('re').compile(p, __import__('re').IGNORECASE)
-                    for p in HIGH_RISK_PATTERNS[scenario]
-                ]
-            logger.info(f"[AC-IDMP-20] Added high-risk scenario: {scenario.value}")
-    
-    def remove_scenario(self, scenario: HighRiskScenario) -> bool:
-        """Remove a high-risk scenario from the enabled set."""
-        if scenario in self._enabled_scenarios and len(self._enabled_scenarios) > 1:
-            self._enabled_scenarios.remove(scenario)
-            if scenario in self._compiled_patterns:
-                del self._compiled_patterns[scenario]
-            logger.info(f"[AC-IDMP-20] Removed high-risk scenario: {scenario.value}")
-            return True
-        return False
--- a/ai-service/app/services/mid/intent_hint_tool.py
+++ b/ai-service/app/services/mid/intent_hint_tool.py
@ -1,372 +0,0 @@
-"""
-Intent Hint Tool for Mid Platform.
-[AC-IDMP-02, AC-IDMP-16] Lightweight intent recognition and routing suggestion.
-[AC-MRS-13] 只消费 field_roles 包含 routing_signal 的字段
-
-This tool provides a "soft signal" for policy routing decisions.
-It does NOT make final decisions - policy_router retains final authority.
-"""
-
-import logging
-import time
-from dataclasses import dataclass
-from typing import Any
-
-from sqlalchemy.ext.asyncio import AsyncSession
-
-from app.models.mid.schemas import (
-    ExecutionMode,
-    HighRiskScenario,
-    IntentHintOutput,
-    ToolCallStatus,
-    ToolCallTrace,
-    ToolType,
-)
-from app.models.entities import FieldRole
-from app.services.intent.router import IntentRouter
-from app.services.intent.rule_service import IntentRuleService
-from app.services.mid.role_based_field_provider import RoleBasedFieldProvider
-
-logger = logging.getLogger(__name__)
-
-DEFAULT_HIGH_RISK_KEYWORDS: dict[HighRiskScenario, list[str]] = {
-    HighRiskScenario.REFUND: ["退款", "退货", "退钱", "退费", "还钱", "退款申请"],
-    HighRiskScenario.COMPLAINT_ESCALATION: ["投诉", "升级投诉", "举报", "12315", "消费者协会"],
-    HighRiskScenario.PRIVACY_SENSITIVE_PROMISE: ["承诺", "保证", "一定", "肯定能", "绝对", "担保"],
-    HighRiskScenario.TRANSFER: ["转人工", "人工客服", "人工服务", "真人", "人工"],
-}
-
-LOW_CONFIDENCE_THRESHOLD = 0.3
-DEFAULT_TIMEOUT_MS = 500
-
-
-@dataclass
-class IntentHintConfig:
-    """Configuration for intent hint tool."""
-    enabled: bool = True
-    timeout_ms: int = DEFAULT_TIMEOUT_MS
-    top_n: int = 3
-    low_confidence_threshold: float = LOW_CONFIDENCE_THRESHOLD
-
-
-class IntentHintTool:
-    """
-    [AC-IDMP-02, AC-IDMP-16] Lightweight intent hint tool.
-    [AC-MRS-13] 只消费 field_roles 包含 routing_signal 的字段
-    
-    Provides soft signals for policy routing:
-    - Intent recognition via existing rule engine
-    - High-risk scenario detection
-    - Routing suggestions (not final decisions)
-    
-    The policy_router consumes these hints but retains final decision authority.
-    """
-    
-    def __init__(
-        self,
-        session: AsyncSession,
-        config: IntentHintConfig | None = None,
-    ):
-        self._session = session
-        self._config = config or IntentHintConfig()
-        self._rule_service = IntentRuleService(session)
-        self._router = IntentRouter()
-        self._role_provider = RoleBasedFieldProvider(session)
-    
-    async def execute(
-        self,
-        message: str,
-        tenant_id: str,
-        history: list[dict[str, Any]] | None = None,
-        top_n: int | None = None,
-        context: dict[str, Any] | None = None,
-    ) -> IntentHintOutput:
-        """
-        [AC-IDMP-02] Execute intent hint analysis.
-        [AC-MRS-13] 只消费 routing_signal 角色的字段
-        
-        Args:
-            message: User input message
-            tenant_id: Tenant ID for rule lookup
-            history: Optional conversation history
-            top_n: Number of top suggestions (default from config)
-            context: Optional context with routing_signal fields
-        
-        Returns:
-            IntentHintOutput with routing suggestions
-        """
-        start_time = time.time()
-        
-        routing_signal_fields = await self._role_provider.get_routing_signal_field_keys(tenant_id)
-        logger.info(
-            f"[AC-MRS-13] Retrieved {len(routing_signal_fields)} routing_signal fields for tenant={tenant_id}: {routing_signal_fields}"
-        )
-        
-        routing_context = {}
-        if context:
-            routing_context = {k: v for k, v in context.items() if k in routing_signal_fields}
-            if routing_context:
-                logger.info(
-                    f"[AC-MRS-13] Applied routing_signal context: {list(routing_context.keys())}"
-                )
-        
-        if not self._config.enabled:
-            return IntentHintOutput(
-                intent=None,
-                confidence=0.0,
-                response_type=None,
-                suggested_mode=None,
-                fallback_reason_code="tool_disabled",
-                duration_ms=0,
-            )
-        
-        try:
-            high_risk_scenario = self._check_high_risk(message)
-            if high_risk_scenario:
-                logger.info(
-                    f"[AC-IDMP-05, AC-IDMP-20] High-risk detected in hint: {high_risk_scenario}"
-                )
-                duration_ms = int((time.time() - start_time) * 1000)
-                return IntentHintOutput(
-                    intent=None,
-                    confidence=1.0,
-                    response_type="flow" if high_risk_scenario != HighRiskScenario.TRANSFER else "transfer",
-                    suggested_mode=ExecutionMode.TRANSFER if high_risk_scenario == HighRiskScenario.TRANSFER else ExecutionMode.MICRO_FLOW,
-                    high_risk_detected=True,
-                    fallback_reason_code=f"high_risk_{high_risk_scenario.value}",
-                    duration_ms=duration_ms,
-                )
-            
-            rules = await self._rule_service.get_enabled_rules_for_matching(tenant_id)
-            
-            if not rules:
-                logger.info(f"[AC-IDMP-02] No intent rules found for tenant: {tenant_id}")
-                duration_ms = int((time.time() - start_time) * 1000)
-                return IntentHintOutput(
-                    intent=None,
-                    confidence=0.0,
-                    response_type="rag",
-                    suggested_mode=ExecutionMode.AGENT,
-                    fallback_reason_code="no_rules_configured",
-                    duration_ms=duration_ms,
-                )
-            
-            match_result = self._router.match(message, rules)
-            
-            if match_result:
-                rule = match_result.rule
-                confidence = 0.8
-                
-                suggested_mode = self._determine_suggested_mode(
-                    rule.response_type,
-                    confidence,
-                )
-                
-                duration_ms = int((time.time() - start_time) * 1000)
-                logger.info(
-                    f"[AC-IDMP-02] Intent hint matched: intent={rule.name}, "
-                    f"response_type={rule.response_type}, confidence={confidence}"
-                )
-                
-                return IntentHintOutput(
-                    intent=rule.name,
-                    confidence=confidence,
-                    response_type=rule.response_type,
-                    suggested_mode=suggested_mode,
-                    target_flow_id=str(rule.flow_id) if rule.flow_id else None,
-                    target_kb_ids=rule.target_kb_ids,
-                    duration_ms=duration_ms,
-                )
-            
-            duration_ms = int((time.time() - start_time) * 1000)
-            logger.info(
-                f"[AC-IDMP-02] No intent matched, suggesting agent mode with low confidence"
-            )
-            
-            return IntentHintOutput(
-                intent=None,
-                confidence=0.2,
-                response_type="rag",
-                suggested_mode=ExecutionMode.AGENT,
-                fallback_reason_code="no_intent_matched",
-                duration_ms=duration_ms,
-            )
-        
-        except Exception as e:
-            duration_ms = int((time.time() - start_time) * 1000)
-            logger.error(f"[AC-IDMP-02] Intent hint failed: {e}")
-            
-            return IntentHintOutput(
-                intent=None,
-                confidence=0.0,
-                response_type=None,
-                suggested_mode=ExecutionMode.FIXED,
-                fallback_reason_code=f"hint_error: {str(e)[:50]}",
-                duration_ms=duration_ms,
-            )
-    
-    def _check_high_risk(self, message: str) -> HighRiskScenario | None:
-        """
-        [AC-IDMP-05, AC-IDMP-20] Check for high-risk scenarios.
-        
-        Returns the first matched high-risk scenario or None.
-        """
-        message_lower = message.lower()
-        
-        for scenario, keywords in DEFAULT_HIGH_RISK_KEYWORDS.items():
-            for keyword in keywords:
-                if keyword.lower() in message_lower:
-                    return scenario
-        
-        return None
-    
-    def _determine_suggested_mode(
-        self,
-        response_type: str,
-        confidence: float,
-    ) -> ExecutionMode:
-        """
-        Determine suggested execution mode based on response type and confidence.
-        """
-        if confidence < self._config.low_confidence_threshold:
-            return ExecutionMode.FIXED
-        
-        mode_mapping = {
-            "fixed": ExecutionMode.FIXED,
-            "transfer": ExecutionMode.TRANSFER,
-            "flow": ExecutionMode.MICRO_FLOW,
-            "rag": ExecutionMode.AGENT,
-        }
-        
-        return mode_mapping.get(response_type, ExecutionMode.AGENT)
-    
-    def create_trace(
-        self,
-        result: IntentHintOutput,
-    ) -> ToolCallTrace:
-        """Create ToolCallTrace for this tool execution."""
-        status = ToolCallStatus.OK
-        if result.fallback_reason_code and "error" in result.fallback_reason_code:
-            status = ToolCallStatus.ERROR
-        
-        return ToolCallTrace(
-            tool_name="intent_hint",
-            tool_type=ToolType.INTERNAL,
-            duration_ms=result.duration_ms,
-            status=status,
-            error_code=result.fallback_reason_code if status == ToolCallStatus.ERROR else None,
-            result_digest=f"intent={result.intent}, mode={result.suggested_mode}",
-        )
-
-
-async def intent_hint_handler(
-    message: str,
-    tenant_id: str,
-    history: list[dict[str, Any]] | None = None,
-    top_n: int | None = None,
-    context: dict[str, Any] | None = None,
-    session: AsyncSession | None = None,
-    config: IntentHintConfig | None = None,
-) -> dict[str, Any]:
-    """
-    Handler function for ToolRegistry registration.
-    [AC-MRS-13] 支持 context 参数用于 routing_signal 字段
-    
-    This is the async handler that gets registered to ToolRegistry.
-    """
-    if not session:
-        return {
-            "success": False,
-            "error": "Database session required",
-            "output": None,
-        }
-    
-    tool = IntentHintTool(session=session, config=config)
-    result = await tool.execute(
-        message=message,
-        tenant_id=tenant_id,
-        history=history,
-        top_n=top_n,
-        context=context,
-    )
-    
-    return {
-        "success": True,
-        "output": result.model_dump(),
-        "hint": result,
-    }
-
-
-def register_intent_hint_tool(
-    registry: Any,
-    session: AsyncSession,
-    config: IntentHintConfig | None = None,
-) -> None:
-    """
-    [AC-IDMP-19] Register intent_hint tool to ToolRegistry.
-    [AC-MRS-13] 支持 context 参数用于 routing_signal 字段
-    
-    Args:
-        registry: ToolRegistry instance
-        session: Database session for intent rule lookup
-        config: Tool configuration
-    """
-    from app.models.mid.schemas import ToolType
-    
-    effective_config = config or IntentHintConfig()
-    
-    async def handler(
-        message: str,
-        tenant_id: str,
-        history: list[dict[str, Any]] | None = None,
-        top_n: int | None = None,
-        context: dict[str, Any] | None = None,
-    ) -> dict[str, Any]:
-        return await intent_hint_handler(
-            message=message,
-            tenant_id=tenant_id,
-            history=history,
-            top_n=top_n,
-            context=context,
-            session=session,
-            config=effective_config,
-        )
-    
-    registry.register(
-        name="intent_hint",
-        description="[AC-IDMP-02, AC-IDMP-16, AC-MRS-13] Lightweight intent recognition and routing suggestion tool (only consumes routing_signal fields)",
-        handler=handler,
-        tool_type=ToolType.INTERNAL,
-        version="1.0.0",
-        enabled=True,
-        timeout_ms=min(effective_config.timeout_ms, 500),
-        metadata={
-            "low_confidence_threshold": effective_config.low_confidence_threshold,
-            "top_n": effective_config.top_n,
-            "supports_routing_signal_filter": True,
-            "when_to_use": "当用户意图不明确、需要给 policy_router 提供软路由信号时使用。",
-            "when_not_to_use": "当已经明确进入固定模式/流程模式，或已有确定意图结果时不重复调用。",
-            "parameters": {
-                "type": "object",
-                "properties": {
-                    "message": {"type": "string", "description": "用户输入原文"},
-                    "tenant_id": {"type": "string", "description": "租户 ID"},
-                    "history": {"type": "array", "description": "会话历史（可选）"},
-                    "top_n": {"type": "integer", "description": "返回建议数量（可选）"},
-                    "context": {"type": "object", "description": "上下文字段（仅 routing_signal 字段会被消费）"}
-                },
-                "required": ["message", "tenant_id"]
-            },
-            "example_action_input": {
-                "message": "我想退款，但是也想先咨询下怎么处理",
-                "tenant_id": "default",
-                "top_n": 3,
-                "context": {"order_status": "delivered", "channel": "web"}
-            },
-            "result_interpretation": "关注输出中的 intent / confidence / suggested_mode。该工具只提供建议，不做最终决策。"
-        },
-    )
-    
-    logger.info(
-        f"[AC-IDMP-19] intent_hint tool registered: timeout_ms={effective_config.timeout_ms}"
-    )
--- a/ai-service/app/services/mid/interrupt_context_enricher.py
+++ b/ai-service/app/services/mid/interrupt_context_enricher.py
@ -1,161 +0,0 @@
-"""
-Interrupt Context Enricher for Mid Platform.
-[AC-MARH-03, AC-MARH-04] Interrupted segments processing and fallback handling.
-
-Features:
- Consumes interrupted_segments from request
- Provides context for re-planning
- Fallback handling for invalid/empty interrupt data
-"""
-
-import logging
-from dataclasses import dataclass
-from typing import Any
-
-from app.models.mid.schemas import InterruptedSegment
-
-logger = logging.getLogger(__name__)
-
-
-@dataclass
-class InterruptContext:
-    """Context derived from interrupted segments."""
-    consumed: bool = False
-    interrupted_content: str | None = None
-    interrupted_segment_ids: list[str] | None = None
-    fallback_triggered: bool = False
-    fallback_reason: str | None = None
-
-
-class InterruptContextEnricher:
-    """
-    [AC-MARH-03, AC-MARH-04] Interrupt context enricher for handling user interruption.
-
-    This component processes interrupted_segments from the request and provides
-    context for re-planning. It handles edge cases where interrupt data is
-    invalid or empty, ensuring the main flow continues without disruption.
-    """
-
-    def __init__(self):
-        pass
-
-    def enrich(
-        self,
-        interrupted_segments: list[InterruptedSegment] | None,
-        generation_id: str | None = None,
-    ) -> InterruptContext:
-        """
-        [AC-MARH-03, AC-MARH-04] Process interrupted segments and return context.
-
-        Args:
-            interrupted_segments: List of interrupted segments from request
-            generation_id: Current generation ID for matching
-
-        Returns:
-            InterruptContext with processed interrupt information
-        """
-        if not interrupted_segments:
-            logger.debug("[AC-MARH-04] No interrupted segments provided")
-            return InterruptContext(
-                consumed=False,
-                fallback_triggered=False,
-            )
-
-        try:
-            valid_segments = [
-                s for s in interrupted_segments
-                if s.content and s.content.strip() and s.segment_id
-            ]
-
-            if not valid_segments:
-                logger.info("[AC-MARH-04] Interrupted segments empty or invalid, using fallback")
-                return InterruptContext(
-                    consumed=False,
-                    fallback_triggered=True,
-                    fallback_reason="empty_or_invalid_segments",
-                )
-
-            interrupted_content = "\n".join(s.content for s in valid_segments)
-            segment_ids = [s.segment_id for s in valid_segments]
-
-            logger.info(
-                f"[AC-MARH-03] Interrupted segments consumed: "
-                f"count={len(valid_segments)}, segment_ids={segment_ids[:3]}..."
-            )
-
-            return InterruptContext(
-                consumed=True,
-                interrupted_content=interrupted_content,
-                interrupted_segment_ids=segment_ids,
-                fallback_triggered=False,
-            )
-
-        except Exception as e:
-            logger.warning(f"[AC-MARH-04] Failed to process interrupted segments: {e}")
-            return InterruptContext(
-                consumed=False,
-                fallback_triggered=True,
-                fallback_reason=f"error:{str(e)[:50]}",
-            )
-
-    def build_replan_context(
-        self,
-        interrupt_context: InterruptContext,
-        base_context: dict[str, Any] | None = None,
-    ) -> dict[str, Any]:
-        """
-        [AC-MARH-03] Build context for re-planning after interruption.
-
-        Args:
-            interrupt_context: Processed interrupt context
-            base_context: Existing context to enrich
-
-        Returns:
-            Enriched context for re-planning
-        """
-        context = base_context.copy() if base_context else {}
-
-        if interrupt_context.consumed and interrupt_context.interrupted_content:
-            context["interrupted_content"] = interrupt_context.interrupted_content
-            context["interrupted_segment_ids"] = interrupt_context.interrupted_segment_ids
-            context["avoid_duplicate"] = True
-            logger.debug(
-                f"[AC-MARH-03] Re-plan context enriched with interrupted content: "
-                f"{len(interrupt_context.interrupted_content)} chars"
-            )
-        elif interrupt_context.fallback_triggered:
-            context["interrupt_fallback"] = True
-            context["interrupt_fallback_reason"] = interrupt_context.fallback_reason
-            logger.debug(
-                f"[AC-MARH-04] Re-plan context marked with fallback: "
-                f"{interrupt_context.fallback_reason}"
-            )
-
-        return context
-
-    def should_skip_content(
-        self,
-        content: str,
-        interrupt_context: InterruptContext,
-    ) -> bool:
-        """
-        [AC-MARH-03] Check if content should be skipped to avoid duplicate.
-
-        Args:
-            content: Content to check
-            interrupt_context: Processed interrupt context
-
-        Returns:
-            True if content matches interrupted content and should be skipped
-        """
-        if not interrupt_context.consumed or not interrupt_context.interrupted_content:
-            return False
-
-        if content.strip() in interrupt_context.interrupted_content:
-            logger.info(
-                f"[AC-MARH-03] Content matches interrupted segment, skipping: "
-                f"{content[:50]}..."
-            )
-            return True
-
-        return False
--- a/ai-service/app/services/mid/kb_search_dynamic_tool.py
+++ b/ai-service/app/services/mid/kb_search_dynamic_tool.py
@ -1,508 +0,0 @@
-"""
-KB Search Dynamic Tool for Mid Platform.
-[AC-MARH-05] Agent 默认 KB 检索工具，支持元数据驱动参数。
-[AC-MARH-06] KB 失败时可观测降级。
-
-核心特性：
- 通过元数据配置动态生成检索参数/过滤器
- 必填字段缺失时返回 missing_required_slots
- 工具执行可观测（tool_call/tool_result）
- 超时降级返回 fallback_reason_code
-"""
-
-from __future__ import annotations
-
-import asyncio
-import logging
-import time
-import uuid
-from dataclasses import dataclass, field
-from typing import TYPE_CHECKING, Any
-
-from sqlalchemy.ext.asyncio import AsyncSession
-
-from app.models.mid.schemas import ToolCallStatus, ToolCallTrace, ToolType
-from app.services.mid.metadata_filter_builder import (
-    FilterBuildResult,
-    MetadataFilterBuilder,
-)
-from app.services.mid.timeout_governor import TimeoutGovernor
-
-if TYPE_CHECKING:
-    from app.services.mid.tool_registry import ToolRegistry
-
-logger = logging.getLogger(__name__)
-
-DEFAULT_TOP_K = 5
-DEFAULT_TIMEOUT_MS = 2000
-KB_SEARCH_DYNAMIC_TOOL_NAME = "kb_search_dynamic"
-
-
-@dataclass
-class KbSearchDynamicResult:
-    """KB 动态检索结果。"""
-    success: bool = True
-    hits: list[dict[str, Any]] = field(default_factory=list)
-    applied_filter: dict[str, Any] = field(default_factory=dict)
-    missing_required_slots: list[dict[str, str]] = field(default_factory=list)
-    filter_debug: dict[str, Any] = field(default_factory=dict)
-    fallback_reason_code: str | None = None
-    duration_ms: int = 0
-    tool_trace: ToolCallTrace | None = None
-
-
-@dataclass
-class KbSearchDynamicConfig:
-    """KB 动态检索配置。"""
-    enabled: bool = True
-    top_k: int = DEFAULT_TOP_K
-    timeout_ms: int = DEFAULT_TIMEOUT_MS
-    min_score_threshold: float = 0.5
-
-
-class KbSearchDynamicTool:
-    """
-    [AC-MARH-05] KB 动态检索工具。
-
-    支持通过元数据配置动态生成检索参数/过滤器，而不是固定入参写死。
-
-    固定外壳入参：
-    - query: 检索查询
-    - scene: 场景标识
-    - tenant_id: 租户 ID
-    - top_k: 返回数量
-    - context: 上下文（包含动态过滤值）
-
-    返回结构：
-    - hits: 检索结果
-    - applied_filter: 已应用的过滤条件
-    - missing_required_slots: 缺失的必填字段
-    - filter_debug: 过滤器调试信息
-    - fallback_reason_code: 降级原因码
-    """
-
-    def __init__(
-        self,
-        session: AsyncSession,
-        timeout_governor: TimeoutGovernor | None = None,
-        config: KbSearchDynamicConfig | None = None,
-    ):
-        self._session = session
-        self._timeout_governor = timeout_governor or TimeoutGovernor()
-        self._config = config or KbSearchDynamicConfig()
-        self._vector_retriever = None
-        self._filter_builder: MetadataFilterBuilder | None = None
-
-    @property
-    def name(self) -> str:
-        """工具名称。"""
-        return KB_SEARCH_DYNAMIC_TOOL_NAME
-
-    @property
-    def description(self) -> str:
-        """工具描述。"""
-        return (
-            "知识库动态检索工具。"
-            "根据租户配置的元数据字段定义，动态构建检索过滤器。"
-            "支持必填字段检测和可观测降级。"
-        )
-
-    def get_tool_schema(self) -> dict[str, Any]:
-        """
-        获取工具 Schema，用于 Agent 工具描述。
-        动态生成基于租户配置的过滤字段。
-        """
-        return {
-            "name": self.name,
-            "description": self.description,
-            "parameters": {
-                "type": "object",
-                "properties": {
-                    "query": {
-                        "type": "string",
-                        "description": "检索查询文本",
-                    },
-                    "scene": {
-                        "type": "string",
-                        "description": "场景标识，如 'open_consult', 'intent_match'",
-                    },
-                    "top_k": {
-                        "type": "integer",
-                        "description": "返回结果数量",
-                        "default": DEFAULT_TOP_K,
-                    },
-                    "context": {
-                        "type": "object",
-                        "description": "上下文信息，包含动态过滤字段值",
-                    },
-                },
-                "required": ["query"],
-            },
-        }
-
-    async def execute(
-        self,
-        query: str,
-        tenant_id: str,
-        scene: str = "open_consult",
-        top_k: int | None = None,
-        context: dict[str, Any] | None = None,
-    ) -> KbSearchDynamicResult:
-        """
-        [AC-MARH-05] 执行 KB 动态检索。
-
-        Args:
-            query: 检索查询
-            tenant_id: 租户 ID
-            scene: 场景标识
-            top_k: 返回数量
-            context: 上下文（包含动态过滤值）
-
-        Returns:
-            KbSearchDynamicResult 包含检索结果和追踪信息
-        """
-        if not self._config.enabled:
-            logger.info(f"[AC-MARH-05] KB search dynamic disabled for tenant={tenant_id}")
-            return KbSearchDynamicResult(
-                success=False,
-                fallback_reason_code="KB_DISABLED",
-            )
-
-        start_time = time.time()
-        top_k = top_k or self._config.top_k
-
-        logger.info(
-            f"[AC-MARH-05] Starting KB dynamic search: tenant={tenant_id}, "
-            f"query={query[:50]}..., scene={scene}, top_k={top_k}"
-        )
-
-        filter_result: FilterBuildResult | None = None
-
-        try:
-            if self._filter_builder is None:
-                self._filter_builder = MetadataFilterBuilder(self._session)
-
-            filter_result = await self._filter_builder.build_filter(
-                tenant_id=tenant_id,
-                context=context,
-            )
-
-            if filter_result.missing_required_slots:
-                logger.warning(
-                    f"[AC-MARH-05] Missing required slots: "
-                    f"{filter_result.missing_required_slots}"
-                )
-                duration_ms = int((time.time() - start_time) * 1000)
-
-                tool_trace = ToolCallTrace(
-                    tool_name=self.name,
-                    tool_type=ToolType.INTERNAL,
-                    duration_ms=duration_ms,
-                    status=ToolCallStatus.ERROR,
-                    error_code="MISSING_REQUIRED_SLOTS",
-                    args_digest=f"query={query[:50]}, scene={scene}",
-                    result_digest=f"missing={len(filter_result.missing_required_slots)}",
-                )
-
-                return KbSearchDynamicResult(
-                    success=False,
-                    applied_filter=filter_result.applied_filter,
-                    missing_required_slots=filter_result.missing_required_slots,
-                    filter_debug=filter_result.debug_info,
-                    fallback_reason_code="MISSING_REQUIRED_SLOTS",
-                    duration_ms=duration_ms,
-                    tool_trace=tool_trace,
-                )
-
-            metadata_filter = filter_result.applied_filter if filter_result.success else None
-
-            hits = await self._retrieve_with_timeout(
-                tenant_id=tenant_id,
-                query=query,
-                metadata_filter=metadata_filter,
-                top_k=top_k,
-            )
-
-            duration_ms = int((time.time() - start_time) * 1000)
-            kb_hit = len(hits) > 0
-
-            tool_trace = ToolCallTrace(
-                tool_name=self.name,
-                tool_type=ToolType.INTERNAL,
-                duration_ms=duration_ms,
-                status=ToolCallStatus.OK,
-                args_digest=f"query={query[:50]}, scene={scene}",
-                result_digest=f"hits={len(hits)}",
-            )
-
-            logger.info(
-                f"[AC-MARH-05] KB dynamic search completed: tenant={tenant_id}, "
-                f"hits={len(hits)}, duration_ms={duration_ms}, kb_hit={kb_hit}"
-            )
-
-            return KbSearchDynamicResult(
-                success=True,
-                hits=hits,
-                applied_filter=filter_result.applied_filter if filter_result else {},
-                filter_debug=filter_result.debug_info if filter_result else {},
-                duration_ms=duration_ms,
-                tool_trace=tool_trace,
-            )
-
-        except asyncio.TimeoutError:
-            duration_ms = int((time.time() - start_time) * 1000)
-            logger.warning(
-                f"[AC-MARH-06] KB dynamic search timeout: tenant={tenant_id}, "
-                f"duration_ms={duration_ms}"
-            )
-
-            tool_trace = ToolCallTrace(
-                tool_name=self.name,
-                tool_type=ToolType.INTERNAL,
-                duration_ms=duration_ms,
-                status=ToolCallStatus.TIMEOUT,
-                error_code="KB_TIMEOUT",
-            )
-
-            return KbSearchDynamicResult(
-                success=False,
-                applied_filter=filter_result.applied_filter if filter_result else {},
-                missing_required_slots=filter_result.missing_required_slots if filter_result else [],
-                filter_debug=filter_result.debug_info if filter_result else {},
-                fallback_reason_code="KB_TIMEOUT",
-                duration_ms=duration_ms,
-                tool_trace=tool_trace,
-            )
-
-        except Exception as e:
-            duration_ms = int((time.time() - start_time) * 1000)
-            logger.error(
-                f"[AC-MARH-06] KB dynamic search failed: tenant={tenant_id}, "
-                f"error={e}"
-            )
-
-            tool_trace = ToolCallTrace(
-                tool_name=self.name,
-                tool_type=ToolType.INTERNAL,
-                duration_ms=duration_ms,
-                status=ToolCallStatus.ERROR,
-                error_code="KB_ERROR",
-            )
-
-            return KbSearchDynamicResult(
-                success=False,
-                applied_filter=filter_result.applied_filter if filter_result else {},
-                missing_required_slots=filter_result.missing_required_slots if filter_result else [],
-                filter_debug={"error": str(e)},
-                fallback_reason_code="KB_ERROR",
-                duration_ms=duration_ms,
-                tool_trace=tool_trace,
-            )
-
-    async def _retrieve_with_timeout(
-        self,
-        tenant_id: str,
-        query: str,
-        metadata_filter: dict[str, Any] | None = None,
-        top_k: int = DEFAULT_TOP_K,
-    ) -> list[dict[str, Any]]:
-        """带超时控制的检索。"""
-        timeout_seconds = self._config.timeout_ms / 1000.0
-
-        try:
-            return await asyncio.wait_for(
-                self._do_retrieve(tenant_id, query, metadata_filter, top_k),
-                timeout=timeout_seconds,
-            )
-        except asyncio.TimeoutError:
-            raise asyncio.TimeoutError("KB dynamic search timeout")
-
-    async def _do_retrieve(
-        self,
-        tenant_id: str,
-        query: str,
-        metadata_filter: dict[str, Any] | None = None,
-        top_k: int = DEFAULT_TOP_K,
-    ) -> list[dict[str, Any]]:
-        """执行实际检索。"""
-        if self._vector_retriever is None:
-            from app.services.retrieval.vector_retriever import get_vector_retriever
-            self._vector_retriever = await get_vector_retriever()
-
-        from app.services.retrieval.base import RetrievalContext
-
-        ctx = RetrievalContext(
-            tenant_id=tenant_id,
-            query=query,
-            metadata=metadata_filter,
-        )
-
-        result = await self._vector_retriever.retrieve(ctx)
-
-        hits = []
-        for hit in result.hits:
-            if hit.score >= self._config.min_score_threshold:
-                hits.append({
-                    "id": hit.metadata.get("chunk_id", str(uuid.uuid4())),
-                    "content": hit.text,
-                    "score": hit.score,
-                    "metadata": hit.metadata,
-                })
-
-        return hits[:top_k]
-
-    def get_config(self) -> KbSearchDynamicConfig:
-        """获取当前配置。"""
-        return self._config
-
-
-async def create_kb_search_dynamic_handler(
-    session: AsyncSession,
-    timeout_governor: TimeoutGovernor | None = None,
-    config: KbSearchDynamicConfig | None = None,
-) -> callable:
-    """
-    创建 kb_search_dynamic 工具的 handler 函数，用于注册到 ToolRegistry。
-
-    Args:
-        session: 数据库会话
-        timeout_governor: 超时治理器
-        config: 工具配置
-
-    Returns:
-        异步 handler 函数
-    """
-    tool = KbSearchDynamicTool(
-        session=session,
-        timeout_governor=timeout_governor,
-        config=config,
-    )
-
-    async def handler(
-        query: str,
-        tenant_id: str = "",
-        scene: str = "open_consult",
-        top_k: int = DEFAULT_TOP_K,
-        context: dict[str, Any] | None = None,
-    ) -> dict[str, Any]:
-        """
-        KB 动态检索 handler。
-
-        Args:
-            query: 检索查询
-            tenant_id: 租户 ID
-            scene: 场景标识
-            top_k: 返回数量
-            context: 上下文
-
-        Returns:
-            检索结果字典
-        """
-        result = await tool.execute(
-            query=query,
-            tenant_id=tenant_id,
-            scene=scene,
-            top_k=top_k,
-            context=context,
-        )
-
-        return {
-            "success": result.success,
-            "hits": result.hits,
-            "applied_filter": result.applied_filter,
-            "missing_required_slots": result.missing_required_slots,
-            "filter_debug": result.filter_debug,
-            "fallback_reason_code": result.fallback_reason_code,
-            "duration_ms": result.duration_ms,
-        }
-
-    return handler
-
-
-def register_kb_search_dynamic_tool(
-    registry: ToolRegistry,
-    session: AsyncSession,
-    timeout_governor: TimeoutGovernor | None = None,
-    config: KbSearchDynamicConfig | None = None,
-) -> None:
-    """
-    [AC-MARH-05] 将 kb_search_dynamic 注册到 ToolRegistry。
-
-    Args:
-        registry: ToolRegistry 实例
-        session: 数据库会话
-        timeout_governor: 超时治理器
-        config: 工具配置
-    """
-    from app.services.mid.tool_registry import ToolType as RegistryToolType
-
-    async def handler(
-        query: str,
-        tenant_id: str = "",
-        scene: str = "open_consult",
-        top_k: int = DEFAULT_TOP_K,
-        context: dict[str, Any] | None = None,
-    ) -> dict[str, Any]:
-        tool = KbSearchDynamicTool(
-            session=session,
-            timeout_governor=timeout_governor,
-            config=config,
-        )
-
-        result = await tool.execute(
-            query=query,
-            tenant_id=tenant_id,
-            scene=scene,
-            top_k=top_k,
-            context=context,
-        )
-
-        return {
-            "success": result.success,
-            "hits": result.hits,
-            "applied_filter": result.applied_filter,
-            "missing_required_slots": result.missing_required_slots,
-            "filter_debug": result.filter_debug,
-            "fallback_reason_code": result.fallback_reason_code,
-            "duration_ms": result.duration_ms,
-        }
-
-    registry.register(
-        name=KB_SEARCH_DYNAMIC_TOOL_NAME,
-        description="知识库动态检索工具，支持元数据驱动过滤",
-        handler=handler,
-        tool_type=RegistryToolType.INTERNAL,
-        version="1.0.0",
-        auth_required=False,
-        timeout_ms=config.timeout_ms if config else DEFAULT_TIMEOUT_MS,
-        enabled=True,
-        metadata={
-            "supports_dynamic_filter": True,
-            "min_score_threshold": config.min_score_threshold if config else 0.5,
-            "when_to_use": "当需要知识库事实支撑回答，且需按租户元数据动态过滤时使用。",
-            "when_not_to_use": "当用户问题不依赖知识库（纯闲聊/仅流程确认）或已有充分 KB 结果时不重复调用。",
-            "parameters": {
-                "type": "object",
-                "properties": {
-                    "query": {"type": "string", "description": "检索查询文本"},
-                    "tenant_id": {"type": "string", "description": "租户 ID"},
-                    "scene": {"type": "string", "description": "场景标识，如 open_consult"},
-                    "top_k": {"type": "integer", "description": "返回条数"},
-                    "context": {"type": "object", "description": "上下文，用于动态过滤字段"}
-                },
-                "required": ["query", "tenant_id"]
-            },
-            "example_action_input": {
-                "query": "退款到账一般要多久",
-                "tenant_id": "default",
-                "scene": "open_consult",
-                "top_k": 5,
-                "context": {"product_line": "vip_course", "region": "beijing"}
-            },
-            "result_interpretation": "success=true 且 hits 非空表示命中知识；missing_required_slots 非空时应先向用户补采信息。"
-        },
-    )
-
-    logger.info(
-        f"[AC-MARH-05] Tool registered: {KB_SEARCH_DYNAMIC_TOOL_NAME}"
-    )
--- a/ai-service/app/services/mid/memory_adapter.py
+++ b/ai-service/app/services/mid/memory_adapter.py
@ -1,355 +0,0 @@
-"""
-Memory Adapter for Mid Platform.
-[AC-IDMP-13] 记忆召回服务 - 在响应前执行 recall 并注入 profile/facts/preferences
-[AC-IDMP-14] 记忆更新服务 - 异步执行记忆更新（含会话摘要）
-
-Reference: 
- spec/intent-driven-mid-platform/openapi.deps.yaml
- spec/intent-driven-mid-platform/requirements.md AC-IDMP-13/14
-"""
-
-import asyncio
-import logging
-from dataclasses import dataclass, field
-from datetime import datetime
-from typing import Any, Callable
-
-from sqlalchemy import select
-from sqlalchemy.ext.asyncio import AsyncSession
-
-from app.models.mid.memory import (
-    MemoryFact,
-    MemoryProfile,
-    MemoryPreferences,
-    RecallRequest,
-    RecallResponse,
-    UpdateRequest,
-)
-
-logger = logging.getLogger(__name__)
-
-
-class MemoryStoreType(str):
-    """记忆存储类型"""
-    PROFILE = "profile"
-    FACT = "fact"
-    PREFERENCES = "preferences"
-    SUMMARY = "summary"
-
-
-@dataclass
-class UserMemory:
-    """
-    用户记忆存储实体
-    用于持久化存储用户的三层记忆
-    """
-    id: str
-    user_id: str
-    tenant_id: str
-    memory_type: str
-    content: dict[str, Any]
-    created_at: datetime = field(default_factory=datetime.utcnow)
-    updated_at: datetime = field(default_factory=datetime.utcnow)
-    expires_at: datetime | None = None
-    confidence: float | None = None
-    source: str | None = None
-
-
-class MemoryAdapter:
-    """
-    [AC-IDMP-13/14] 记忆适配器
-    
-    功能：
-    1. recall: 在对话响应前召回用户记忆（profile/facts/preferences）
-    2. update: 在对话完成后异步更新用户记忆
-    
-    设计原则：
-    - recall 失败不阻断主链路（降级处理）
-    - update 异步执行，不阻塞主响应
-    """
-
-    DEFAULT_RECALL_TIMEOUT_MS = 500
-    DEFAULT_UPDATE_TIMEOUT_MS = 2000
-
-    def __init__(
-        self,
-        session: AsyncSession,
-        recall_timeout_ms: int = DEFAULT_RECALL_TIMEOUT_MS,
-        update_timeout_ms: int = DEFAULT_UPDATE_TIMEOUT_MS,
-    ):
-        self._session = session
-        self._recall_timeout_ms = recall_timeout_ms
-        self._update_timeout_ms = update_timeout_ms
-        self._pending_updates: list[asyncio.Task] = []
-
-    async def recall(
-        self,
-        user_id: str,
-        session_id: str,
-        tenant_id: str | None = None,
-    ) -> RecallResponse:
-        """
-        [AC-IDMP-13] 召回用户记忆
-        
-        在响应前执行，注入基础属性、事实记忆与偏好记忆。
-        失败时返回空记忆，不阻断主链路。
-        
-        Args:
-            user_id: 用户ID
-            session_id: 会话ID
-            tenant_id: 租户ID（可选）
-            
-        Returns:
-            RecallResponse: 包含 profile/facts/preferences 的响应
-        """
-        try:
-            return await asyncio.wait_for(
-                self._recall_internal(user_id, session_id, tenant_id),
-                timeout=self._recall_timeout_ms / 1000,
-            )
-        except asyncio.TimeoutError:
-            logger.warning(
-                f"[AC-IDMP-13] Memory recall timeout for user={user_id}, "
-                f"session={session_id}, timeout_ms={self._recall_timeout_ms}"
-            )
-            return RecallResponse()
-        except Exception as e:
-            logger.error(
-                f"[AC-IDMP-13] Memory recall failed for user={user_id}, "
-                f"session={session_id}, error={e}"
-            )
-            return RecallResponse()
-
-    async def _recall_internal(
-        self,
-        user_id: str,
-        session_id: str,
-        tenant_id: str | None,
-    ) -> RecallResponse:
-        """
-        内部召回实现
-        """
-        profile = await self._recall_profile(user_id, tenant_id)
-        facts = await self._recall_facts(user_id, tenant_id)
-        preferences = await self._recall_preferences(user_id, tenant_id)
-        last_summary = await self._recall_last_summary(user_id, tenant_id)
-
-        logger.info(
-            f"[AC-IDMP-13] Memory recalled for user={user_id}: "
-            f"profile={bool(profile)}, facts={len(facts)}, "
-            f"preferences={bool(preferences)}, summary={bool(last_summary)}"
-        )
-
-        return RecallResponse(
-            profile=profile,
-            facts=facts,
-            preferences=preferences,
-            last_summary=last_summary,
-        )
-
-    async def _recall_profile(
-        self,
-        user_id: str,
-        tenant_id: str | None,
-    ) -> MemoryProfile | None:
-        """召回用户基础属性"""
-        return MemoryProfile(
-            grade="初一",
-            region="北京",
-            channel="wechat",
-            vip_level="gold",
-        )
-
-    async def _recall_facts(
-        self,
-        user_id: str,
-        tenant_id: str | None,
-    ) -> list[MemoryFact]:
-        """召回用户事实记忆"""
-        return [
-            MemoryFact(content="已购课程：数学思维训练营", source="order", confidence=1.0),
-            MemoryFact(content="学习目标：提高数学成绩", source="profile", confidence=0.9),
-            MemoryFact(content="上次咨询：课程退费政策", source="conversation", confidence=0.8),
-        ]
-
-    async def _recall_preferences(
-        self,
-        user_id: str,
-        tenant_id: str | None,
-    ) -> MemoryPreferences | None:
-        """召回用户偏好"""
-        return MemoryPreferences(
-            tone="friendly",
-            focus_subjects=["数学", "物理"],
-            communication_style="详细解释",
-        )
-
-    async def _recall_last_summary(
-        self,
-        user_id: str,
-        tenant_id: str | None,
-    ) -> str | None:
-        """召回最近会话摘要"""
-        return "上次讨论了数学学习计划，用户对课程安排比较满意"
-
-    async def update(
-        self,
-        user_id: str,
-        session_id: str,
-        messages: list[dict[str, Any]],
-        summary: str | None = None,
-        tenant_id: str | None = None,
-    ) -> bool:
-        """
-        [AC-IDMP-14] 异步更新用户记忆
-        
-        在对话完成后异步执行，不阻塞主响应。
-        包含会话摘要的回写。
-        
-        Args:
-            user_id: 用户ID
-            session_id: 会话ID
-            messages: 本轮对话消息
-            summary: 会话摘要（可选）
-            tenant_id: 租户ID
-            
-        Returns:
-            bool: 是否成功提交更新任务
-        """
-        request = UpdateRequest(
-            user_id=user_id,
-            session_id=session_id,
-            messages=messages,
-            summary=summary,
-        )
-
-        task = asyncio.create_task(
-            self._update_internal(request, tenant_id),
-            name=f"memory_update_{user_id}_{session_id}",
-        )
-        self._pending_updates.append(task)
-        task.add_done_callback(lambda t: self._pending_updates.remove(t))
-
-        logger.info(
-            f"[AC-IDMP-14] Memory update scheduled for user={user_id}, "
-            f"session={session_id}, messages_count={len(messages)}"
-        )
-
-        return True
-
-    async def _update_internal(
-        self,
-        request: UpdateRequest,
-        tenant_id: str | None,
-    ) -> None:
-        """
-        内部更新实现
-        """
-        try:
-            await asyncio.wait_for(
-                self._do_update(request, tenant_id),
-                timeout=self._update_timeout_ms / 1000,
-            )
-            logger.info(
-                f"[AC-IDMP-14] Memory updated for user={request.user_id}, "
-                f"session={request.session_id}"
-            )
-        except asyncio.TimeoutError:
-            logger.warning(
-                f"[AC-IDMP-14] Memory update timeout for user={request.user_id}, "
-                f"session={request.session_id}"
-            )
-        except Exception as e:
-            logger.error(
-                f"[AC-IDMP-14] Memory update failed for user={request.user_id}, "
-                f"session={request.session_id}, error={e}"
-            )
-
-    async def _do_update(
-        self,
-        request: UpdateRequest,
-        tenant_id: str | None,
-    ) -> None:
-        """
-        执行实际的记忆更新
-        """
-        if request.summary:
-            await self._save_summary(request.user_id, request.summary, tenant_id)
-
-        await self._extract_and_save_facts(
-            request.user_id, request.messages, tenant_id
-        )
-
-    async def _save_summary(
-        self,
-        user_id: str,
-        summary: str,
-        tenant_id: str | None,
-    ) -> None:
-        """保存会话摘要"""
-        pass
-
-    async def _extract_and_save_facts(
-        self,
-        user_id: str,
-        messages: list[dict[str, Any]],
-        tenant_id: str | None,
-    ) -> None:
-        """从消息中提取并保存事实"""
-        pass
-
-    async def update_with_summary_generation(
-        self,
-        user_id: str,
-        session_id: str,
-        messages: list[dict[str, Any]],
-        tenant_id: str | None = None,
-        summary_generator: Callable | None = None,
-    ) -> bool:
-        """
-        [AC-IDMP-14] 带摘要生成的记忆更新
-        
-        如果未提供摘要，会尝试生成摘要后回写
-        """
-        summary = None
-        if summary_generator:
-            try:
-                summary = await summary_generator(messages)
-            except Exception as e:
-                logger.warning(
-                    f"[AC-IDMP-14] Summary generation failed: {e}"
-                )
-
-        return await self.update(
-            user_id=user_id,
-            session_id=session_id,
-            messages=messages,
-            summary=summary,
-            tenant_id=tenant_id,
-        )
-
-    async def wait_pending_updates(self, timeout: float = 5.0) -> int:
-        """
-        等待所有待处理的更新任务完成
-        
-        用于优雅关闭时确保所有更新完成
-        
-        Args:
-            timeout: 最大等待时间（秒）
-            
-        Returns:
-            int: 完成的任务数
-        """
-        if not self._pending_updates:
-            return 0
-
-        try:
-            done, _ = await asyncio.wait(
-                self._pending_updates,
-                timeout=timeout,
-                return_when=asyncio.ALL_COMPLETED,
-            )
-            return len(done)
-        except Exception as e:
-            logger.error(f"[AC-IDMP-14] Error waiting for pending updates: {e}")
-            return 0
--- a/ai-service/app/services/mid/memory_recall_tool.py
+++ b/ai-service/app/services/mid/memory_recall_tool.py
@ -1,603 +0,0 @@
-"""
-Memory Recall Tool for Mid Platform.
-[AC-IDMP-13] 记忆召回工具 - 短期可用记忆注入
-[AC-MRS-12] 只消费 field_roles 包含 slot 的字段
-
-定位：短期可用记忆注入，不是完整中长期记忆系统。
-功能：读取可用记忆包（profile/facts/preferences/last_summary/slots）
-
-关键特性：
-1. 优先读取已有结构化记忆
-2. 若缺失，使用最近窗口历史做最小回填
-3. 槽位冲突优先级：user_confirmed > rule_extracted > llm_inferred > default
-4. 超时 <= 1000ms，失败不抛硬异常
-5. 多租户隔离正确
-6. 只消费 slot 角色的字段
-"""
-
-import asyncio
-import logging
-import time
-from dataclasses import dataclass, field
-from typing import Any
-
-from sqlalchemy import select
-from sqlalchemy.ext.asyncio import AsyncSession
-
-from app.models.mid.schemas import (
-    MemoryRecallResult,
-    MemorySlot,
-    SlotSource,
-    ToolCallStatus,
-    ToolCallTrace,
-    ToolType,
-)
-from app.models.entities import FieldRole
-from app.services.mid.role_based_field_provider import RoleBasedFieldProvider
-from app.services.mid.timeout_governor import TimeoutGovernor
-
-logger = logging.getLogger(__name__)
-
-DEFAULT_RECALL_TIMEOUT_MS = 1000
-DEFAULT_MAX_RECENT_MESSAGES = 8
-
-
-@dataclass
-class MemoryRecallConfig:
-    """记忆召回工具配置。"""
-    enabled: bool = True
-    timeout_ms: int = DEFAULT_RECALL_TIMEOUT_MS
-    max_recent_messages: int = DEFAULT_MAX_RECENT_MESSAGES
-    default_recall_scope: list[str] = field(
-        default_factory=lambda: ["profile", "facts", "preferences", "summary", "slots"]
-    )
-
-
-class MemoryRecallTool:
-    """
-    [AC-IDMP-13] 记忆召回工具。
-    [AC-MRS-12] 只消费 field_roles 包含 slot 的字段
-
-    用于在对话前读取用户可用记忆，减少重复追问。
-
-    Features:
-    - 读取 profile/facts/preferences/last_summary/slots
-    - 槽位冲突优先级处理
-    - 超时控制与降级
-    - 多租户隔离
-    - 只消费 slot 角色的字段
-    """
-
-    SLOT_PRIORITY: dict[SlotSource, int] = {
-        SlotSource.USER_CONFIRMED: 4,
-        SlotSource.RULE_EXTRACTED: 3,
-        SlotSource.LLM_INFERRED: 2,
-        SlotSource.DEFAULT: 1,
-    }
-
-    def __init__(
-        self,
-        session: AsyncSession,
-        timeout_governor: TimeoutGovernor | None = None,
-        config: MemoryRecallConfig | None = None,
-    ):
-        self._session = session
-        self._timeout_governor = timeout_governor or TimeoutGovernor()
-        self._config = config or MemoryRecallConfig()
-        self._role_provider = RoleBasedFieldProvider(session)
-
-    async def execute(
-        self,
-        tenant_id: str,
-        user_id: str,
-        session_id: str,
-        recall_scope: list[str] | None = None,
-        max_recent_messages: int | None = None,
-    ) -> MemoryRecallResult:
-        """
-        [AC-IDMP-13] 执行记忆召回。
-
-        Args:
-            tenant_id: 租户 ID
-            user_id: 用户 ID
-            session_id: 会话 ID
-            recall_scope: 召回范围，默认 ["profile","facts","preferences","summary","slots"]
-            max_recent_messages: 最大最近消息数，默认 8
-
-        Returns:
-            MemoryRecallResult: 记忆召回结果
-        """
-        if not self._config.enabled:
-            logger.info(f"[AC-IDMP-13] Memory recall disabled for tenant={tenant_id}")
-            return MemoryRecallResult(
-                fallback_reason_code="MEMORY_RECALL_DISABLED",
-            )
-
-        start_time = time.time()
-        scope = recall_scope or self._config.default_recall_scope
-        max_msgs = max_recent_messages or self._config.max_recent_messages
-
-        logger.info(
-            f"[AC-IDMP-13] Starting memory recall: tenant={tenant_id}, "
-            f"user={user_id}, session={session_id}, scope={scope}"
-        )
-
-        try:
-            result = await asyncio.wait_for(
-                self._recall_internal(
-                    tenant_id=tenant_id,
-                    user_id=user_id,
-                    session_id=session_id,
-                    scope=scope,
-                    max_recent_messages=max_msgs,
-                ),
-                timeout=self._config.timeout_ms / 1000.0,
-            )
-
-            duration_ms = int((time.time() - start_time) * 1000)
-            result.duration_ms = duration_ms
-
-            logger.info(
-                f"[AC-IDMP-13] Memory recall completed: tenant={tenant_id}, "
-                f"user={user_id}, duration_ms={duration_ms}, "
-                f"profile={bool(result.profile)}, facts={len(result.facts)}, "
-                f"slots={len(result.slots)}, missing_slots={len(result.missing_slots)}"
-            )
-
-            return result
-
-        except asyncio.TimeoutError:
-            duration_ms = int((time.time() - start_time) * 1000)
-            logger.warning(
-                f"[AC-IDMP-13] Memory recall timeout: tenant={tenant_id}, "
-                f"user={user_id}, duration_ms={duration_ms}"
-            )
-            return MemoryRecallResult(
-                fallback_reason_code="MEMORY_RECALL_TIMEOUT",
-                duration_ms=duration_ms,
-            )
-
-        except Exception as e:
-            duration_ms = int((time.time() - start_time) * 1000)
-            logger.error(
-                f"[AC-IDMP-13] Memory recall failed: tenant={tenant_id}, "
-                f"user={user_id}, error={e}"
-            )
-            return MemoryRecallResult(
-                fallback_reason_code=f"MEMORY_RECALL_ERROR:{str(e)[:50]}",
-                duration_ms=duration_ms,
-            )
-
-    async def _recall_internal(
-        self,
-        tenant_id: str,
-        user_id: str,
-        session_id: str,
-        scope: list[str],
-        max_recent_messages: int,
-    ) -> MemoryRecallResult:
-        """内部召回实现。"""
-        profile: dict[str, Any] = {}
-        facts: list[str] = []
-        preferences: dict[str, Any] = {}
-        last_summary: str | None = None
-        slots: dict[str, MemorySlot] = {}
-        missing_slots: list[str] = []
-
-        if "profile" in scope:
-            profile = await self._recall_profile(tenant_id, user_id)
-
-        if "facts" in scope:
-            facts = await self._recall_facts(tenant_id, user_id)
-
-        if "preferences" in scope:
-            preferences = await self._recall_preferences(tenant_id, user_id)
-
-        if "summary" in scope:
-            last_summary = await self._recall_last_summary(tenant_id, user_id)
-
-        if "slots" in scope:
-            slots, missing_slots = await self._recall_slots(
-                tenant_id, user_id, session_id
-            )
-
-        if not profile and not facts and not preferences and not last_summary and not slots:
-            if "history" in scope or max_recent_messages > 0:
-                history_facts = await self._recall_from_history(
-                    tenant_id, session_id, max_recent_messages
-                )
-                facts.extend(history_facts)
-
-        return MemoryRecallResult(
-            profile=profile,
-            facts=facts,
-            preferences=preferences,
-            last_summary=last_summary,
-            slots=slots,
-            missing_slots=missing_slots,
-        )
-
-    async def _recall_profile(
-        self,
-        tenant_id: str,
-        user_id: str,
-    ) -> dict[str, Any]:
-        """召回用户基础属性。"""
-        try:
-            from app.models.entities import ChatMessage
-            from sqlmodel import col
-
-            stmt = (
-                select(ChatMessage)
-                .where(
-                    ChatMessage.tenant_id == tenant_id,
-                    ChatMessage.role == "user",
-                )
-                .order_by(col(ChatMessage.created_at).desc())
-                .limit(5)
-            )
-            result = await self._session.execute(stmt)
-            messages = result.scalars().all()
-
-            profile: dict[str, Any] = {}
-            for msg in messages:
-                content = msg.content.lower()
-                if "年级" in content or "初" in content or "高" in content:
-                    if "grade" not in profile:
-                        profile["grade"] = self._extract_grade(msg.content)
-                if "北京" in content or "上海" in content or "广州" in content:
-                    if "region" not in profile:
-                        profile["region"] = self._extract_region(msg.content)
-
-            return profile
-
-        except Exception as e:
-            logger.warning(f"[AC-IDMP-13] Failed to recall profile: {e}")
-            return {}
-
-    async def _recall_facts(
-        self,
-        tenant_id: str,
-        user_id: str,
-    ) -> list[str]:
-        """召回用户事实记忆。"""
-        try:
-            from app.models.entities import ChatMessage
-            from sqlmodel import col
-
-            stmt = (
-                select(ChatMessage)
-                .where(
-                    ChatMessage.tenant_id == tenant_id,
-                    ChatMessage.role == "assistant",
-                )
-                .order_by(col(ChatMessage.created_at).desc())
-                .limit(10)
-            )
-            result = await self._session.execute(stmt)
-            messages = result.scalars().all()
-
-            facts: list[str] = []
-            for msg in messages:
-                content = msg.content
-                if "已购" in content or "购买" in content:
-                    facts.append(self._extract_purchase_info(content))
-                if "订单" in content:
-                    facts.append(self._extract_order_info(content))
-
-            return [f for f in facts if f]
-
-        except Exception as e:
-            logger.warning(f"[AC-IDMP-13] Failed to recall facts: {e}")
-            return []
-
-    async def _recall_preferences(
-        self,
-        tenant_id: str,
-        user_id: str,
-    ) -> dict[str, Any]:
-        """召回用户偏好。"""
-        try:
-            from app.models.entities import ChatMessage
-            from sqlmodel import col
-
-            stmt = (
-                select(ChatMessage)
-                .where(
-                    ChatMessage.tenant_id == tenant_id,
-                    ChatMessage.role == "user",
-                )
-                .order_by(col(ChatMessage.created_at).desc())
-                .limit(10)
-            )
-            result = await self._session.execute(stmt)
-            messages = result.scalars().all()
-
-            preferences: dict[str, Any] = {}
-            for msg in messages:
-                content = msg.content.lower()
-                if "详细" in content or "详细解释" in content:
-                    preferences["communication_style"] = "详细解释"
-                elif "简单" in content or "简洁" in content:
-                    preferences["communication_style"] = "简洁"
-
-            return preferences
-
-        except Exception as e:
-            logger.warning(f"[AC-IDMP-13] Failed to recall preferences: {e}")
-            return {}
-
-    async def _recall_last_summary(
-        self,
-        tenant_id: str,
-        user_id: str,
-    ) -> str | None:
-        """召回最近会话摘要。"""
-        try:
-            from app.models.entities import MidAuditLog
-            from sqlmodel import col
-
-            stmt = (
-                select(MidAuditLog)
-                .where(
-                    MidAuditLog.tenant_id == tenant_id,
-                )
-                .order_by(col(MidAuditLog.created_at).desc())
-                .limit(1)
-            )
-            result = await self._session.execute(stmt)
-            audit = result.scalar_one_or_none()
-
-            if audit:
-                return f"上次会话模式: {audit.mode}"
-
-            return None
-
-        except Exception as e:
-            logger.warning(f"[AC-IDMP-13] Failed to recall last summary: {e}")
-            return None
-
-    async def _recall_slots(
-        self,
-        tenant_id: str,
-        user_id: str,
-        session_id: str,
-    ) -> tuple[dict[str, MemorySlot], list[str]]:
-        """
-        [AC-MRS-12] 召回结构化槽位，只消费 slot 角色的字段。
-
-        Returns:
-            Tuple of (slots_dict, missing_required_slots)
-        """
-        try:
-            slot_field_keys = await self._role_provider.get_slot_field_keys(tenant_id)
-
-            logger.info(
-                f"[AC-MRS-12] Retrieved {len(slot_field_keys)} slot fields for tenant={tenant_id}: {slot_field_keys}"
-            )
-
-            from app.models.entities import FlowInstance
-            from sqlalchemy import desc
-
-            stmt = (
-                select(FlowInstance)
-                .where(
-                    FlowInstance.tenant_id == tenant_id,
-                )
-                .order_by(desc(FlowInstance.updated_at))
-                .limit(1)
-            )
-            result = await self._session.execute(stmt)
-            flow_instance = result.scalar_one_or_none()
-
-            slots: dict[str, MemorySlot] = {}
-            missing_slots: list[str] = []
-
-            if flow_instance and flow_instance.context:
-                context = flow_instance.context
-                for key, value in context.items():
-                    if key in slot_field_keys and value is not None:
-                        slots[key] = MemorySlot(
-                            key=key,
-                            value=value,
-                            source=SlotSource.USER_CONFIRMED,
-                            confidence=1.0,
-                            updated_at=str(flow_instance.updated_at),
-                        )
-
-            return slots, missing_slots
-
-        except Exception as e:
-            logger.warning(f"[AC-IDMP-13] Failed to recall slots: {e}")
-            return {}, []
-
-    async def _recall_from_history(
-        self,
-        tenant_id: str,
-        session_id: str,
-        max_messages: int,
-    ) -> list[str]:
-        """从最近历史中提取最小回填信息。"""
-        try:
-            from app.models.entities import ChatMessage
-            from sqlmodel import col
-
-            stmt = (
-                select(ChatMessage)
-                .where(
-                    ChatMessage.tenant_id == tenant_id,
-                    ChatMessage.session_id == session_id,
-                )
-                .order_by(col(ChatMessage.created_at).desc())
-                .limit(max_messages)
-            )
-            result = await self._session.execute(stmt)
-            messages = result.scalars().all()
-
-            facts: list[str] = []
-            for msg in messages:
-                if msg.role == "user":
-                    facts.append(f"用户说过: {msg.content[:50]}")
-
-            return facts
-
-        except Exception as e:
-            logger.warning(f"[AC-IDMP-13] Failed to recall from history: {e}")
-            return []
-
-    def _extract_grade(self, content: str) -> str:
-        """从内容中提取年级信息。"""
-        grades = ["初一", "初二", "初三", "高一", "高二", "高三"]
-        for grade in grades:
-            if grade in content:
-                return grade
-        return "未知年级"
-
-    def _extract_region(self, content: str) -> str:
-        """从内容中提取地区信息。"""
-        regions = ["北京", "上海", "广州", "深圳", "杭州", "成都", "武汉", "南京"]
-        for region in regions:
-            if region in content:
-                return region
-        return "未知地区"
-
-    def _extract_purchase_info(self, content: str) -> str:
-        """从内容中提取购买信息。"""
-        return f"购买记录: {content[:30]}..."
-
-    def _extract_order_info(self, content: str) -> str:
-        """从内容中提取订单信息。"""
-        return f"订单信息: {content[:30]}..."
-
-    def merge_slots(
-        self,
-        existing_slots: dict[str, MemorySlot],
-        new_slots: dict[str, MemorySlot],
-    ) -> dict[str, MemorySlot]:
-        """
-        合并槽位，按优先级处理冲突。
-
-        优先级：user_confirmed > rule_extracted > llm_inferred > default
-        """
-        merged = dict(existing_slots)
-
-        for key, new_slot in new_slots.items():
-            if key not in merged:
-                merged[key] = new_slot
-            else:
-                existing_slot = merged[key]
-                existing_priority = self.SLOT_PRIORITY.get(existing_slot.source, 0)
-                new_priority = self.SLOT_PRIORITY.get(new_slot.source, 0)
-
-                if new_priority > existing_priority:
-                    merged[key] = new_slot
-                elif new_priority == existing_priority:
-                    if new_slot.confidence > existing_slot.confidence:
-                        merged[key] = new_slot
-
-        return merged
-
-    def create_trace(
-        self,
-        result: MemoryRecallResult,
-        tenant_id: str,
-    ) -> ToolCallTrace:
-        """创建工具调用追踪记录。"""
-        status = ToolCallStatus.OK
-        error_code = None
-
-        if result.fallback_reason_code:
-            if "TIMEOUT" in result.fallback_reason_code:
-                status = ToolCallStatus.TIMEOUT
-            else:
-                status = ToolCallStatus.ERROR
-            error_code = result.fallback_reason_code
-
-        return ToolCallTrace(
-            tool_name="memory_recall",
-            tool_type=ToolType.INTERNAL,
-            duration_ms=result.duration_ms,
-            status=status,
-            error_code=error_code,
-            args_digest=f"tenant={tenant_id}",
-            result_digest=f"profile={len(result.profile)}, facts={len(result.facts)}, slots={len(result.slots)}",
-        )
-
-
-def register_memory_recall_tool(
-    registry: Any,
-    session: AsyncSession,
-    timeout_governor: TimeoutGovernor | None = None,
-    config: MemoryRecallConfig | None = None,
-) -> None:
-    """
-    [AC-IDMP-13] 注册 memory_recall 工具到 ToolRegistry。
-
-    Args:
-        registry: ToolRegistry 实例
-        session: 数据库会话
-        timeout_governor: 超时治理器
-        config: 工具配置
-    """
-    cfg = config or MemoryRecallConfig()
-
-    async def memory_recall_handler(
-        tenant_id: str,
-        user_id: str,
-        session_id: str,
-        recall_scope: list[str] | None = None,
-        max_recent_messages: int | None = None,
-    ) -> dict[str, Any]:
-        """memory_recall 工具处理器。"""
-        tool = MemoryRecallTool(
-            session=session,
-            timeout_governor=timeout_governor,
-            config=cfg,
-        )
-        result = await tool.execute(
-            tenant_id=tenant_id,
-            user_id=user_id,
-            session_id=session_id,
-            recall_scope=recall_scope,
-            max_recent_messages=max_recent_messages,
-        )
-        return result.model_dump()
-
-    registry.register(
-        name="memory_recall",
-        description="[AC-IDMP-13] 记忆召回工具，读取用户可用记忆包（profile/facts/preferences/summary/slots）",
-        handler=memory_recall_handler,
-        tool_type=ToolType.INTERNAL,
-        version="1.0.0",
-        auth_required=False,
-        timeout_ms=min(cfg.timeout_ms, 1000),
-        enabled=True,
-        metadata={
-            "ac_ids": ["AC-IDMP-13"],
-            "recall_scope": cfg.default_recall_scope,
-            "max_recent_messages": cfg.max_recent_messages,
-            "when_to_use": "当需要补全用户画像、历史事实、偏好、槽位，避免重复追问时使用。",
-            "when_not_to_use": "当当前轮次已经有完整上下文且无需个性化记忆支撑时可不调用。",
-            "parameters": {
-                "type": "object",
-                "properties": {
-                    "tenant_id": {"type": "string", "description": "租户 ID"},
-                    "user_id": {"type": "string", "description": "用户 ID"},
-                    "session_id": {"type": "string", "description": "会话 ID"},
-                    "recall_scope": {"type": "array", "description": "召回范围，例如 profile/facts/preferences/summary/slots"},
-                    "max_recent_messages": {"type": "integer", "description": "历史回填窗口大小"}
-                },
-                "required": ["tenant_id", "user_id", "session_id"]
-            },
-            "example_action_input": {
-                "tenant_id": "default",
-                "user_id": "u_10086",
-                "session_id": "s_abc_001",
-                "recall_scope": ["profile", "facts", "preferences", "summary", "slots"],
-                "max_recent_messages": 8
-            },
-            "result_interpretation": "关注 profile/facts/preferences/slots/missing_slots。若 fallback_reason_code 存在，需降级处理。"
-        },
-    )
-
-    logger.info("[AC-IDMP-13] memory_recall tool registered to registry")
--- a/ai-service/app/services/mid/metadata_filter_builder.py
+++ b/ai-service/app/services/mid/metadata_filter_builder.py
@ -1,284 +0,0 @@
-"""
-Metadata Filter Builder for KB Search Dynamic.
-[AC-MARH-05] 基于元数据字段定义动态构建检索过滤器。
-[AC-MRS-11] 只消费 field_roles 包含 resource_filter 的字段
-
-核心逻辑：
-1. 查询状态=生效 且 field_roles 包含 resource_filter 的字段定义
-2. 根据 context 中的值构建过滤条件
-3. 必填字段缺失时返回 missing_required_slots
-"""
-
-import logging
-from dataclasses import dataclass, field
-from typing import Any
-
-from sqlalchemy import cast, select
-from sqlalchemy.dialects.postgresql import JSONB
-from sqlalchemy.ext.asyncio import AsyncSession
-
-from app.models.entities import (
-    MetadataFieldDefinition,
-    MetadataFieldStatus,
-    MetadataFieldType,
-    MetadataScope,
-)
-from app.models.entities import FieldRole
-from app.services.mid.role_based_field_provider import RoleBasedFieldProvider
-
-logger = logging.getLogger(__name__)
-
-KB_DOCUMENT_SCOPE = MetadataScope.KB_DOCUMENT.value
-
-
-@dataclass
-class FilterBuildResult:
-    """过滤器构建结果。"""
-    applied_filter: dict[str, Any] = field(default_factory=dict)
-    missing_required_slots: list[dict[str, str]] = field(default_factory=list)
-    debug_info: dict[str, Any] = field(default_factory=dict)
-    success: bool = True
-    error_message: str | None = None
-
-
-@dataclass
-class FilterFieldInfo:
-    """过滤字段信息。"""
-    field_key: str
-    label: str
-    field_type: str
-    required: bool
-    options: list[str] | None
-    default_value: Any
-    is_filterable: bool
-
-
-class MetadataFilterBuilder:
-    """
-    [AC-MARH-05] 元数据过滤器构建器。
-    [AC-MRS-11] 只消费 field_roles 包含 resource_filter 的字段
-
-    根据租户的元数据字段定义，动态构建 KB 检索的过滤条件。
-    支持：
-    - 状态=生效的字段
-    - field_roles 包含 resource_filter
-    - 字段类型决定过滤操作（单选/多选/文本）
-    - 必填字段缺失检测
-    """
-
-    def __init__(self, session: AsyncSession):
-        self._session = session
-        self._role_provider = RoleBasedFieldProvider(session)
-
-    async def build_filter(
-        self,
-        tenant_id: str,
-        context: dict[str, Any] | None = None,
-    ) -> FilterBuildResult:
-        """
-        构建元数据过滤器。
-
-        Args:
-            tenant_id: 租户 ID
-            context: 上下文信息，包含可能的过滤值
-
-        Returns:
-            FilterBuildResult 包含:
-            - applied_filter: 已应用的过滤条件
-            - missing_required_slots: 缺失的必填字段
-            - debug_info: 调试信息
-        """
-        context = context or {}
-        debug_info = {
-            "tenant_id": tenant_id,
-            "context_keys": list(context.keys()),
-            "filterable_fields": [],
-            "applied_fields": [],
-        }
-
-        try:
-            filterable_fields = await self._get_filterable_fields(tenant_id)
-            debug_info["filterable_fields"] = [f.field_key for f in filterable_fields]
-
-            if not filterable_fields:
-                logger.info(
-                    f"[AC-MARH-05] No filterable fields found for tenant={tenant_id}"
-                )
-                return FilterBuildResult(
-                    applied_filter={},
-                    missing_required_slots=[],
-                    debug_info=debug_info,
-                    success=True,
-                )
-
-            applied_filter: dict[str, Any] = {}
-            missing_required_slots: list[dict[str, str]] = []
-
-            for field_info in filterable_fields:
-                field_key = field_info.field_key
-                value = context.get(field_key)
-
-                if value is None and field_info.default_value is not None:
-                    value = field_info.default_value
-
-                if value is None:
-                    if field_info.required:
-                        missing_required_slots.append({
-                            "field_key": field_key,
-                            "label": field_info.label,
-                            "reason": "required_field_missing",
-                        })
-                    continue
-
-                filter_value = self._build_field_filter(field_info, value)
-                if filter_value is not None:
-                    applied_filter[field_key] = filter_value
-                    debug_info["applied_fields"].append(field_key)
-
-            logger.info(
-                f"[AC-MARH-05] Filter built: tenant={tenant_id}, "
-                f"applied={len(applied_filter)}, missing={len(missing_required_slots)}"
-            )
-
-            return FilterBuildResult(
-                applied_filter=applied_filter,
-                missing_required_slots=missing_required_slots,
-                debug_info=debug_info,
-                success=True,
-            )
-
-        except Exception as e:
-            logger.error(
-                f"[AC-MARH-05] Filter build failed: tenant={tenant_id}, error={e}"
-            )
-            return FilterBuildResult(
-                applied_filter={},
-                missing_required_slots=[],
-                debug_info={"error": str(e)},
-                success=False,
-                error_message=str(e),
-            )
-
-    async def _get_filterable_fields(
-        self,
-        tenant_id: str,
-    ) -> list[FilterFieldInfo]:
-        """
-        [AC-MRS-11] 获取可过滤的字段定义。
-
-        条件：
-        - 状态=生效 (active)
-        - field_roles 包含 resource_filter
-        """
-        fields = await self._role_provider.get_fields_by_role(
-            tenant_id=tenant_id,
-            role=FieldRole.RESOURCE_FILTER.value,
-        )
-
-        logger.info(
-            f"[AC-MRS-11] Retrieved {len(fields)} resource_filter fields for tenant={tenant_id}"
-        )
-
-        return [
-            FilterFieldInfo(
-                field_key=f.field_key,
-                label=f.label,
-                field_type=f.type,
-                required=f.required,
-                options=f.options,
-                default_value=f.default_value,
-                is_filterable=f.is_filterable,
-            )
-            for f in fields
-        ]
-
-    def _build_field_filter(
-        self,
-        field_info: FilterFieldInfo,
-        value: Any,
-    ) -> dict[str, Any] | str | list[str] | None:
-        """
-        根据字段类型构建过滤条件。
-
-        Args:
-            field_info: 字段信息
-            value: 字段值
-
-        Returns:
-            过滤条件（格式取决于字段类型）
-        """
-        field_type = field_info.field_type
-
-        if field_type == MetadataFieldType.ENUM.value:
-            if field_info.options and value not in field_info.options:
-                logger.warning(
-                    f"[AC-MARH-05] Invalid enum value: field={field_info.field_key}, "
-                    f"value={value}, options={field_info.options}"
-                )
-                return None
-            return {"$eq": value}
-
-        elif field_type == MetadataFieldType.ARRAY_ENUM.value:
-            if not isinstance(value, list):
-                value = [value]
-            if field_info.options:
-                invalid = [v for v in value if v not in field_info.options]
-                if invalid:
-                    logger.warning(
-                        f"[AC-MARH-05] Invalid array_enum values: field={field_info.field_key}, "
-                        f"invalid={invalid}"
-                    )
-                    value = [v for v in value if v in field_info.options]
-            if not value:
-                return None
-            return {"$in": value}
-
-        elif field_type == MetadataFieldType.NUMBER.value:
-            try:
-                num_value = float(value)
-                return {"$eq": num_value}
-            except (ValueError, TypeError):
-                logger.warning(
-                    f"[AC-MARH-05] Invalid number value: field={field_info.field_key}, "
-                    f"value={value}"
-                )
-                return None
-
-        elif field_type == MetadataFieldType.BOOLEAN.value:
-            if isinstance(value, bool):
-                return {"$eq": value}
-            if value in ["true", "1", 1, "True"]:
-                return {"$eq": True}
-            if value in ["false", "0", 0, "False"]:
-                return {"$eq": False}
-            return None
-
-        else:
-            return {"$eq": str(value)}
-
-    async def get_filter_schema(
-        self,
-        tenant_id: str,
-    ) -> list[dict[str, Any]]:
-        """
-        获取过滤字段 Schema，用于前端动态渲染或 Agent 工具描述。
-
-        Args:
-            tenant_id: 租户 ID
-
-        Returns:
-            字段 Schema 列表
-        """
-        filterable_fields = await self._get_filterable_fields(tenant_id)
-
-        return [
-            {
-                "field_key": f.field_key,
-                "label": f.label,
-                "type": f.field_type,
-                "required": f.required,
-                "options": f.options,
-                "default": f.default_value,
-            }
-            for f in filterable_fields
-        ]
--- a/ai-service/app/services/mid/metrics_collector.py
+++ b/ai-service/app/services/mid/metrics_collector.py
@ -1,219 +0,0 @@
-"""
-Metrics Collector for Mid Platform.
-[AC-IDMP-18] Runtime metrics collection.
-
-Metrics:
- task_completion_rate: Task completion rate
- slot_completion_rate: Slot completion rate
- wrong_transfer_rate: Wrong transfer rate
- no_recall_rate: No recall rate
- avg_latency_ms: Average latency
-"""
-
-import logging
-import time
-from collections import defaultdict
-from dataclasses import dataclass, field
-from typing import Any
-
-from app.models.mid.schemas import MetricsSnapshot
-
-logger = logging.getLogger(__name__)
-
-
-@dataclass
-class SessionMetrics:
-    """Session-level metrics."""
-    session_id: str
-    total_turns: int = 0
-    completed_tasks: int = 0
-    transfers: int = 0
-    wrong_transfers: int = 0
-    no_recall_turns: int = 0
-    total_latency_ms: int = 0
-    slots_expected: int = 0
-    slots_filled: int = 0
-
-
-@dataclass
-class AggregatedMetrics:
-    """Aggregated metrics over time window."""
-    total_sessions: int = 0
-    total_turns: int = 0
-    completed_tasks: int = 0
-    total_transfers: int = 0
-    wrong_transfers: int = 0
-    no_recall_turns: int = 0
-    total_latency_ms: int = 0
-    slots_expected: int = 0
-    slots_filled: int = 0
-    
-    def to_snapshot(self) -> MetricsSnapshot:
-        """Convert to MetricsSnapshot."""
-        task_rate = self.completed_tasks / self.total_turns if self.total_turns > 0 else 0.0
-        slot_rate = self.slots_filled / self.slots_expected if self.slots_expected > 0 else 1.0
-        wrong_transfer_rate = self.wrong_transfers / self.total_transfers if self.total_transfers > 0 else 0.0
-        no_recall_rate = self.no_recall_turns / self.total_turns if self.total_turns > 0 else 0.0
-        avg_latency = self.total_latency_ms / self.total_turns if self.total_turns > 0 else 0.0
-        
-        return MetricsSnapshot(
-            task_completion_rate=round(task_rate, 4),
-            slot_completion_rate=round(slot_rate, 4),
-            wrong_transfer_rate=round(wrong_transfer_rate, 4),
-            no_recall_rate=round(no_recall_rate, 4),
-            avg_latency_ms=round(avg_latency, 2),
-        )
-
-
-class MetricsCollector:
-    """
-    [AC-IDMP-18] Metrics collector for runtime observability.
-    
-    Features:
-    - Session-level metrics tracking
-    - Aggregated metrics over time windows
-    - Real-time snapshot generation
-    """
-    
-    def __init__(self):
-        self._session_metrics: dict[str, SessionMetrics] = {}
-        self._tenant_metrics: dict[str, AggregatedMetrics] = defaultdict(AggregatedMetrics)
-        self._global_metrics = AggregatedMetrics()
-    
-    def start_session(self, session_id: str) -> None:
-        """Start tracking a new session."""
-        if session_id not in self._session_metrics:
-            self._session_metrics[session_id] = SessionMetrics(session_id=session_id)
-            logger.debug(f"[AC-IDMP-18] Session started: {session_id}")
-    
-    def record_turn(
-        self,
-        session_id: str,
-        tenant_id: str,
-        latency_ms: int,
-        task_completed: bool = False,
-        transferred: bool = False,
-        wrong_transfer: bool = False,
-        no_recall: bool = False,
-        slots_expected: int = 0,
-        slots_filled: int = 0,
-    ) -> None:
-        """
-        [AC-IDMP-18] Record a conversation turn.
-        
-        Args:
-            session_id: Session ID
-            tenant_id: Tenant ID
-            latency_ms: Turn latency in milliseconds
-            task_completed: Whether the task was completed
-            transferred: Whether transfer occurred
-            wrong_transfer: Whether it was a wrong transfer
-            no_recall: Whether no recall occurred
-            slots_expected: Expected slots count
-            slots_filled: Filled slots count
-        """
-        session = self._session_metrics.get(session_id)
-        if not session:
-            session = SessionMetrics(session_id=session_id)
-            self._session_metrics[session_id] = session
-        
-        session.total_turns += 1
-        session.total_latency_ms += latency_ms
-        
-        if task_completed:
-            session.completed_tasks += 1
-        
-        if transferred:
-            session.transfers += 1
-            if wrong_transfer:
-                session.wrong_transfers += 1
-        
-        if no_recall:
-            session.no_recall_turns += 1
-        
-        session.slots_expected += slots_expected
-        session.slots_filled += slots_filled
-        
-        self._tenant_metrics[tenant_id].total_turns += 1
-        self._tenant_metrics[tenant_id].total_latency_ms += latency_ms
-        
-        if task_completed:
-            self._tenant_metrics[tenant_id].completed_tasks += 1
-        
-        if transferred:
-            self._tenant_metrics[tenant_id].total_transfers += 1
-            if wrong_transfer:
-                self._tenant_metrics[tenant_id].wrong_transfers += 1
-        
-        if no_recall:
-            self._tenant_metrics[tenant_id].no_recall_turns += 1
-        
-        self._tenant_metrics[tenant_id].slots_expected += slots_expected
-        self._tenant_metrics[tenant_id].slots_filled += slots_filled
-        
-        self._global_metrics.total_turns += 1
-        self._global_metrics.total_latency_ms += latency_ms
-        
-        if task_completed:
-            self._global_metrics.completed_tasks += 1
-        
-        if transferred:
-            self._global_metrics.total_transfers += 1
-            if wrong_transfer:
-                self._global_metrics.wrong_transfers += 1
-        
-        if no_recall:
-            self._global_metrics.no_recall_turns += 1
-        
-        self._global_metrics.slots_expected += slots_expected
-        self._global_metrics.slots_filled += slots_filled
-        
-        logger.debug(
-            f"[AC-IDMP-18] Turn recorded: session={session_id}, "
-            f"latency_ms={latency_ms}, task_completed={task_completed}"
-        )
-    
-    def end_session(self, session_id: str) -> SessionMetrics | None:
-        """End session tracking and return final metrics."""
-        session = self._session_metrics.pop(session_id, None)
-        if session:
-            self._tenant_metrics[session_id.split("_")[0]].total_sessions += 1
-            self._global_metrics.total_sessions += 1
-            logger.info(
-                f"[AC-IDMP-18] Session ended: {session_id}, "
-                f"turns={session.total_turns}, completed={session.completed_tasks}"
-            )
-        return session
-    
-    def get_session_metrics(self, session_id: str) -> SessionMetrics | None:
-        """Get metrics for a specific session."""
-        return self._session_metrics.get(session_id)
-    
-    def get_tenant_metrics(self, tenant_id: str) -> MetricsSnapshot:
-        """[AC-IDMP-18] Get metrics snapshot for a tenant."""
-        return self._tenant_metrics[tenant_id].to_snapshot()
-    
-    def get_global_metrics(self) -> MetricsSnapshot:
-        """[AC-IDMP-18] Get global metrics snapshot."""
-        return self._global_metrics.to_snapshot()
-    
-    def reset_metrics(self, tenant_id: str | None = None) -> None:
-        """Reset metrics for a tenant or globally."""
-        if tenant_id:
-            self._tenant_metrics[tenant_id] = AggregatedMetrics()
-            logger.info(f"[AC-IDMP-18] Metrics reset for tenant: {tenant_id}")
-        else:
-            self._tenant_metrics.clear()
-            self._global_metrics = AggregatedMetrics()
-            logger.info("[AC-IDMP-18] Global metrics reset")
-    
-    def get_metrics_dict(self, tenant_id: str | None = None) -> dict[str, Any]:
-        """Get metrics as dictionary for logging/export."""
-        snapshot = self.get_tenant_metrics(tenant_id) if tenant_id else self.get_global_metrics()
-        return {
-            "task_completion_rate": snapshot.task_completion_rate,
-            "slot_completion_rate": snapshot.slot_completion_rate,
-            "wrong_transfer_rate": snapshot.wrong_transfer_rate,
-            "no_recall_rate": snapshot.no_recall_rate,
-            "avg_latency_ms": snapshot.avg_latency_ms,
-        }
--- a/ai-service/app/services/mid/output_guardrail_executor.py
+++ b/ai-service/app/services/mid/output_guardrail_executor.py
@ -1,152 +0,0 @@
-"""
-Output Guardrail Executor for Mid Platform.
-[AC-MARH-01, AC-MARH-02] Output guardrail enforcement before returning segments.
-
-Features:
- Mandatory output filtering before segments are returned
- Guardrail trigger logging with rule_id
- Integration with existing OutputFilter
-"""
-
-import logging
-from dataclasses import dataclass
-from typing import Any
-
-from app.models.entities import GuardrailResult
-from app.services.guardrail.output_filter import OutputFilter
-from app.services.guardrail.word_service import ForbiddenWordService
-
-logger = logging.getLogger(__name__)
-
-
-@dataclass
-class GuardrailExecutionResult:
-    """Result of guardrail execution."""
-    filtered_text: str
-    triggered: bool = False
-    blocked: bool = False
-    rule_id: str | None = None
-    triggered_words: list[str] | None = None
-    triggered_categories: list[str] | None = None
-
-
-class OutputGuardrailExecutor:
-    """
-    [AC-MARH-01, AC-MARH-02] Output guardrail executor for mandatory filtering.
-
-    This component enforces output guardrail filtering before any segments
-    are returned to the client. It wraps the existing OutputFilter and adds
-    trace/logging capabilities required by MARH.
-    """
-
-    def __init__(
-        self,
-        output_filter: OutputFilter | None = None,
-        word_service: ForbiddenWordService | None = None,
-    ):
-        if output_filter:
-            self._output_filter = output_filter
-        elif word_service:
-            self._output_filter = OutputFilter(word_service)
-        else:
-            self._output_filter = None
-
-    async def execute(
-        self,
-        text: str,
-        tenant_id: str,
-    ) -> GuardrailExecutionResult:
-        """
-        [AC-MARH-01] Execute guardrail filtering on output text.
-
-        Args:
-            text: The text to filter
-            tenant_id: Tenant ID for isolation
-
-        Returns:
-            GuardrailExecutionResult with filtered text and trigger info
-        """
-        if not text or not text.strip():
-            return GuardrailExecutionResult(filtered_text=text)
-
-        if not self._output_filter:
-            logger.debug("[AC-MARH-01] No output filter configured, skipping guardrail")
-            return GuardrailExecutionResult(filtered_text=text)
-
-        try:
-            result: GuardrailResult = await self._output_filter.filter(text, tenant_id)
-
-            triggered = bool(result.triggered_words)
-            rule_id = None
-            if triggered and result.triggered_categories:
-                rule_id = f"forbidden_word:{','.join(result.triggered_categories[:3])}"
-
-            if triggered:
-                logger.info(
-                    f"[AC-MARH-02] Guardrail triggered: tenant={tenant_id}, "
-                    f"blocked={result.blocked}, rule_id={rule_id}, "
-                    f"words={result.triggered_words}"
-                )
-
-            return GuardrailExecutionResult(
-                filtered_text=result.reply,
-                triggered=triggered,
-                blocked=result.blocked,
-                rule_id=rule_id,
-                triggered_words=result.triggered_words,
-                triggered_categories=result.triggered_categories,
-            )
-
-        except Exception as e:
-            logger.error(f"[AC-MARH-01] Guardrail execution failed: {e}")
-            return GuardrailExecutionResult(
-                filtered_text=text,
-                triggered=False,
-                rule_id=f"error:{str(e)[:50]}",
-            )
-
-    async def filter_segments(
-        self,
-        segments: list[Any],
-        tenant_id: str,
-    ) -> tuple[list[Any], GuardrailExecutionResult]:
-        """
-        [AC-MARH-01] Filter all segments and return combined result.
-
-        Args:
-            segments: List of Segment objects with text field
-            tenant_id: Tenant ID for isolation
-
-        Returns:
-            Tuple of (filtered_segments, combined_result)
-        """
-        if not segments:
-            return segments, GuardrailExecutionResult(filtered_text="")
-
-        if not self._output_filter:
-            return segments, GuardrailExecutionResult(filtered_text="")
-
-        combined_text = "\n\n".join(s.text for s in segments)
-        result = await self.execute(combined_text, tenant_id)
-
-        if result.blocked:
-            from app.models.mid.schemas import Segment
-            return [
-                Segment(text=result.filtered_text, delay_after=0)
-            ], result
-
-        if result.triggered:
-            filtered_paragraphs = result.filtered_text.split("\n\n")
-            filtered_segments = []
-            for i, para in enumerate(filtered_paragraphs):
-                if para.strip():
-                    from app.models.mid.schemas import Segment
-                    filtered_segments.append(
-                        Segment(
-                            text=para.strip(),
-                            delay_after=segments[0].delay_after if segments and i < len(segments) else 0,
-                        )
-                    )
-            return filtered_segments, result
-
-        return segments, result
--- a/ai-service/app/services/mid/policy_router.py
+++ b/ai-service/app/services/mid/policy_router.py
@ -1,411 +0,0 @@
-"""
-Policy Router for Mid Platform.
-[AC-IDMP-02, AC-IDMP-05, AC-IDMP-16, AC-IDMP-20] Routes to agent/micro_flow/fixed/transfer based on policy.
-
-Decision Matrix:
-1. High-risk scenario (refund/complaint/privacy/transfer) -> micro_flow or transfer
-2. Low confidence or missing key info -> micro_flow or fixed
-3. Tool unavailable -> fixed
-4. Human mode active -> transfer
-5. Normal case -> agent
-
-Intent Hint Integration:
- intent_hint provides soft signals (suggested_mode, confidence, high_risk_detected)
- policy_router consumes hints but retains final decision authority
- When hint suggests high_risk, policy_router validates and may override
-
-High Risk Check Integration:
- high_risk_check provides structured risk detection result
- High-risk check takes priority over normal intent routing
- When high_risk_check matched, skip normal intent matching
-"""
-
-import logging
-from dataclasses import dataclass
-from typing import TYPE_CHECKING, Any
-
-from app.models.mid.schemas import (
-    ExecutionMode,
-    FeatureFlags,
-    HighRiskScenario,
-    PolicyRouterResult,
-)
-
-if TYPE_CHECKING:
-    from app.models.mid.schemas import HighRiskCheckResult, IntentHintOutput
-
-logger = logging.getLogger(__name__)
-
-DEFAULT_HIGH_RISK_SCENARIOS: list[HighRiskScenario] = [
-    HighRiskScenario.REFUND,
-    HighRiskScenario.COMPLAINT_ESCALATION,
-    HighRiskScenario.PRIVACY_SENSITIVE_PROMISE,
-    HighRiskScenario.TRANSFER,
-]
-
-HIGH_RISK_KEYWORDS: dict[HighRiskScenario, list[str]] = {
-    HighRiskScenario.REFUND: ["退款", "退货", "退钱", "退费", "还钱", "退款申请"],
-    HighRiskScenario.COMPLAINT_ESCALATION: ["投诉", "升级投诉", "举报", "12315", "消费者协会"],
-    HighRiskScenario.PRIVACY_SENSITIVE_PROMISE: ["承诺", "保证", "一定", "肯定能", "绝对", "担保"],
-    HighRiskScenario.TRANSFER: ["转人工", "人工客服", "人工服务", "真人", "人工"],
-}
-
-LOW_CONFIDENCE_THRESHOLD = 0.3
-
-
-@dataclass
-class IntentMatch:
-    """Intent match result."""
-    intent_id: str
-    intent_name: str
-    confidence: float
-    response_type: str
-    target_kb_ids: list[str] | None = None
-    flow_id: str | None = None
-    fixed_reply: str | None = None
-    transfer_message: str | None = None
-
-
-class PolicyRouter:
-    """
-    [AC-IDMP-02, AC-IDMP-05, AC-IDMP-16, AC-IDMP-20] Policy router for execution mode decision.
-
-    Decision Flow:
-    1. Check feature flags (rollback_to_legacy -> fixed)
-    2. Check session mode (HUMAN_ACTIVE -> transfer)
-    3. Check high-risk scenarios -> micro_flow or transfer
-    4. Check intent match confidence -> fallback if low
-    5. Default -> agent
-    """
-
-    def __init__(
-        self,
-        high_risk_scenarios: list[HighRiskScenario] | None = None,
-        low_confidence_threshold: float = LOW_CONFIDENCE_THRESHOLD,
-    ):
-        self._high_risk_scenarios = high_risk_scenarios or DEFAULT_HIGH_RISK_SCENARIOS
-        self._low_confidence_threshold = low_confidence_threshold
-
-    def route(
-        self,
-        user_message: str,
-        session_mode: str = "BOT_ACTIVE",
-        feature_flags: FeatureFlags | None = None,
-        intent_match: IntentMatch | None = None,
-        intent_hint: "IntentHintOutput | None" = None,
-        context: dict[str, Any] | None = None,
-    ) -> PolicyRouterResult:
-        """
-        [AC-IDMP-02] Route to appropriate execution mode.
-
-        Args:
-            user_message: User input message
-            session_mode: Current session mode (BOT_ACTIVE/HUMAN_ACTIVE)
-            feature_flags: Feature flags for grayscale control
-            intent_match: Intent match result if available
-            intent_hint: Soft signal from intent_hint tool (optional)
-            context: Additional context for decision
-
-        Returns:
-            PolicyRouterResult with decided mode and metadata
-        """
-        logger.info(
-            f"[AC-IDMP-02] PolicyRouter routing: session_mode={session_mode}, "
-            f"feature_flags={feature_flags}, intent_match={intent_match}, "
-            f"intent_hint_mode={intent_hint.suggested_mode if intent_hint else None}"
-        )
-
-        if feature_flags and feature_flags.rollback_to_legacy:
-            logger.info("[AC-IDMP-17] Rollback to legacy requested, using fixed mode")
-            return PolicyRouterResult(
-                mode=ExecutionMode.FIXED,
-                fallback_reason_code="rollback_to_legacy",
-            )
-
-        if session_mode == "HUMAN_ACTIVE":
-            logger.info("[AC-IDMP-09] Session in HUMAN_ACTIVE mode, routing to transfer")
-            return PolicyRouterResult(
-                mode=ExecutionMode.TRANSFER,
-                transfer_message="正在为您转接人工客服...",
-            )
-
-        if intent_hint and intent_hint.high_risk_detected:
-            logger.info(
-                f"[AC-IDMP-05, AC-IDMP-20] High-risk from hint: {intent_hint.fallback_reason_code}"
-            )
-            return self._handle_high_risk_from_hint(intent_hint, intent_match)
-
-        high_risk_scenario = self._check_high_risk(user_message)
-        if high_risk_scenario:
-            logger.info(f"[AC-IDMP-05, AC-IDMP-20] High-risk scenario detected: {high_risk_scenario}")
-            return self._handle_high_risk(high_risk_scenario, intent_match)
-
-        if intent_hint and intent_hint.confidence < self._low_confidence_threshold:
-            logger.info(
-                f"[AC-IDMP-16] Low confidence from hint ({intent_hint.confidence}), "
-                f"considering fallback"
-            )
-            if intent_hint.suggested_mode in (ExecutionMode.FIXED, ExecutionMode.MICRO_FLOW):
-                return PolicyRouterResult(
-                    mode=intent_hint.suggested_mode,
-                    intent=intent_hint.intent,
-                    confidence=intent_hint.confidence,
-                    fallback_reason_code=intent_hint.fallback_reason_code or "low_confidence_hint",
-                    target_flow_id=intent_hint.target_flow_id,
-                )
-
-        if intent_match:
-            if intent_match.confidence < self._low_confidence_threshold:
-                logger.info(
-                    f"[AC-IDMP-16] Low confidence ({intent_match.confidence}), "
-                    f"falling back from agent mode"
-                )
-                return self._handle_low_confidence(intent_match)
-
-            if intent_match.response_type == "fixed":
-                return PolicyRouterResult(
-                    mode=ExecutionMode.FIXED,
-                    intent=intent_match.intent_name,
-                    confidence=intent_match.confidence,
-                    fixed_reply=intent_match.fixed_reply,
-                )
-
-            if intent_match.response_type == "transfer":
-                return PolicyRouterResult(
-                    mode=ExecutionMode.TRANSFER,
-                    intent=intent_match.intent_name,
-                    confidence=intent_match.confidence,
-                    transfer_message=intent_match.transfer_message or "正在为您转接人工客服...",
-                )
-
-            if intent_match.response_type == "flow":
-                return PolicyRouterResult(
-                    mode=ExecutionMode.MICRO_FLOW,
-                    intent=intent_match.intent_name,
-                    confidence=intent_match.confidence,
-                    target_flow_id=intent_match.flow_id,
-                )
-
-        if feature_flags and not feature_flags.agent_enabled:
-            logger.info("[AC-IDMP-17] Agent disabled by feature flag, using fixed mode")
-            return PolicyRouterResult(
-                mode=ExecutionMode.FIXED,
-                fallback_reason_code="agent_disabled",
-            )
-
-        logger.info("[AC-IDMP-02] Default routing to agent mode")
-        return PolicyRouterResult(
-            mode=ExecutionMode.AGENT,
-            intent=intent_match.intent_name if intent_match else None,
-            confidence=intent_match.confidence if intent_match else None,
-        )
-
-    def _check_high_risk(self, message: str) -> HighRiskScenario | None:
-        """
-        [AC-IDMP-05, AC-IDMP-20] Check if message matches high-risk scenarios.
-
-        Returns the first matched high-risk scenario or None.
-        """
-        message_lower = message.lower()
-
-        for scenario in self._high_risk_scenarios:
-            keywords = HIGH_RISK_KEYWORDS.get(scenario, [])
-            for keyword in keywords:
-                if keyword.lower() in message_lower:
-                    return scenario
-
-        return None
-
-    def _handle_high_risk(
-        self,
-        scenario: HighRiskScenario,
-        intent_match: IntentMatch | None,
-    ) -> PolicyRouterResult:
-        """
-        [AC-IDMP-05] Handle high-risk scenario by routing to micro_flow or transfer.
-        """
-        if scenario == HighRiskScenario.TRANSFER:
-            return PolicyRouterResult(
-                mode=ExecutionMode.TRANSFER,
-                high_risk_triggered=True,
-                transfer_message="正在为您转接人工客服...",
-            )
-
-        if intent_match and intent_match.flow_id:
-            return PolicyRouterResult(
-                mode=ExecutionMode.MICRO_FLOW,
-                intent=intent_match.intent_name,
-                confidence=intent_match.confidence,
-                high_risk_triggered=True,
-                target_flow_id=intent_match.flow_id,
-            )
-
-        return PolicyRouterResult(
-            mode=ExecutionMode.MICRO_FLOW,
-            high_risk_triggered=True,
-            fallback_reason_code=f"high_risk_{scenario.value}",
-        )
-
-    def _handle_high_risk_from_hint(
-        self,
-        intent_hint: "IntentHintOutput",
-        intent_match: IntentMatch | None,
-    ) -> PolicyRouterResult:
-        """
-        [AC-IDMP-05, AC-IDMP-20] Handle high-risk from intent_hint.
-
-        Policy_router validates hint suggestion but may override.
-        """
-        if intent_hint.suggested_mode == ExecutionMode.TRANSFER:
-            return PolicyRouterResult(
-                mode=ExecutionMode.TRANSFER,
-                high_risk_triggered=True,
-                transfer_message="正在为您转接人工客服...",
-            )
-
-        if intent_match and intent_match.flow_id:
-            return PolicyRouterResult(
-                mode=ExecutionMode.MICRO_FLOW,
-                intent=intent_match.intent_name,
-                confidence=intent_match.confidence,
-                high_risk_triggered=True,
-                target_flow_id=intent_match.flow_id,
-            )
-
-        if intent_hint.target_flow_id:
-            return PolicyRouterResult(
-                mode=ExecutionMode.MICRO_FLOW,
-                intent=intent_hint.intent,
-                confidence=intent_hint.confidence,
-                high_risk_triggered=True,
-                target_flow_id=intent_hint.target_flow_id,
-            )
-
-        return PolicyRouterResult(
-            mode=ExecutionMode.MICRO_FLOW,
-            high_risk_triggered=True,
-            fallback_reason_code=intent_hint.fallback_reason_code or "high_risk_hint",
-        )
-
-    def _handle_low_confidence(self, intent_match: IntentMatch) -> PolicyRouterResult:
-        """
-        [AC-IDMP-16] Handle low confidence by falling back to micro_flow or fixed.
-        """
-        if intent_match.flow_id:
-            return PolicyRouterResult(
-                mode=ExecutionMode.MICRO_FLOW,
-                intent=intent_match.intent_name,
-                confidence=intent_match.confidence,
-                fallback_reason_code="low_confidence",
-                target_flow_id=intent_match.flow_id,
-            )
-
-        if intent_match.fixed_reply:
-            return PolicyRouterResult(
-                mode=ExecutionMode.FIXED,
-                intent=intent_match.intent_name,
-                confidence=intent_match.confidence,
-                fallback_reason_code="low_confidence",
-                fixed_reply=intent_match.fixed_reply,
-            )
-
-        return PolicyRouterResult(
-            mode=ExecutionMode.FIXED,
-            fallback_reason_code="low_confidence_no_flow",
-        )
-
-    def get_active_high_risk_set(self) -> list[HighRiskScenario]:
-        """[AC-IDMP-20] Get active high-risk scenario set."""
-        return self._high_risk_scenarios
-
-    def route_with_high_risk_check(
-        self,
-        user_message: str,
-        high_risk_check_result: "HighRiskCheckResult | None",
-        session_mode: str = "BOT_ACTIVE",
-        feature_flags: FeatureFlags | None = None,
-        intent_match: IntentMatch | None = None,
-        intent_hint: "IntentHintOutput | None" = None,
-        context: dict[str, Any] | None = None,
-    ) -> PolicyRouterResult:
-        """
-        [AC-IDMP-05, AC-IDMP-20] Route with high_risk_check result.
-
-        高风险优先于普通意图路由：
-        1. 如果 high_risk_check 匹配，直接返回高风险路由结果
-        2. 否则继续正常的路由决策
-
-        Args:
-            user_message: User input message
-            high_risk_check_result: Result from high_risk_check tool
-            session_mode: Current session mode (BOT_ACTIVE/HUMAN_ACTIVE)
-            feature_flags: Feature flags for grayscale control
-            intent_match: Intent match result if available
-            intent_hint: Soft signal from intent_hint tool (optional)
-            context: Additional context for decision
-
-        Returns:
-            PolicyRouterResult with decided mode and metadata
-        """
-        if high_risk_check_result and high_risk_check_result.matched:
-            logger.info(
-                f"[AC-IDMP-05, AC-IDMP-20] High-risk check matched: "
-                f"scenario={high_risk_check_result.risk_scenario}, "
-                f"rule_id={high_risk_check_result.rule_id}"
-            )
-            return self._handle_high_risk_check_result(
-                high_risk_check_result, intent_match
-            )
-
-        return self.route(
-            user_message=user_message,
-            session_mode=session_mode,
-            feature_flags=feature_flags,
-            intent_match=intent_match,
-            intent_hint=intent_hint,
-            context=context,
-        )
-
-    def _handle_high_risk_check_result(
-        self,
-        high_risk_result: "HighRiskCheckResult",
-        intent_match: IntentMatch | None,
-    ) -> PolicyRouterResult:
-        """
-        [AC-IDMP-05] Handle high_risk_check result.
-
-        高风险检测结果优先于普通意图路由。
-        """
-        recommended_mode = high_risk_result.recommended_mode or ExecutionMode.MICRO_FLOW
-        risk_scenario = high_risk_result.risk_scenario
-
-        if recommended_mode == ExecutionMode.TRANSFER:
-            transfer_msg = "正在为您转接人工客服..."
-            if risk_scenario:
-                if risk_scenario == HighRiskScenario.COMPLAINT_ESCALATION:
-                    transfer_msg = "检测到您可能需要投诉处理，正在为您转接人工客服..."
-                elif risk_scenario == HighRiskScenario.REFUND:
-                    transfer_msg = "您的退款请求需要人工处理，正在为您转接..."
-
-            return PolicyRouterResult(
-                mode=ExecutionMode.TRANSFER,
-                high_risk_triggered=True,
-                transfer_message=transfer_msg,
-                fallback_reason_code=high_risk_result.rule_id,
-            )
-
-        if intent_match and intent_match.flow_id:
-            return PolicyRouterResult(
-                mode=ExecutionMode.MICRO_FLOW,
-                intent=intent_match.intent_name,
-                confidence=intent_match.confidence,
-                high_risk_triggered=True,
-                target_flow_id=intent_match.flow_id,
-            )
-
-        return PolicyRouterResult(
-            mode=ExecutionMode.MICRO_FLOW,
-            high_risk_triggered=True,
-            fallback_reason_code=high_risk_result.rule_id
-            or f"high_risk_{risk_scenario.value if risk_scenario else 'unknown'}",
-        )
--- a/ai-service/app/services/mid/role_based_field_provider.py
+++ b/ai-service/app/services/mid/role_based_field_provider.py
@ -1,347 +0,0 @@
-"""
-Role Based Field Provider Service.
-[AC-MRS-04, AC-MRS-05, AC-MRS-10] 基于角色的字段提供者服务
-"""
-
-import logging
-from typing import Any
-
-from sqlalchemy import select, cast
-from sqlalchemy.dialects.postgresql import JSONB
-from sqlalchemy.ext.asyncio import AsyncSession
-
-from app.models.entities import (
-    FieldRole,
-    MetadataFieldDefinition,
-    MetadataFieldStatus,
-    SlotDefinition,
-)
-from app.schemas.metadata import VALID_FIELD_ROLES
-
-logger = logging.getLogger(__name__)
-
-
-class InvalidRoleError(Exception):
-    """[AC-MRS-05] 无效角色异常"""
-    
-    def __init__(self, role: str):
-        self.role = role
-        self.valid_roles = VALID_FIELD_ROLES
-        super().__init__(
-            f"Invalid role '{role}'. Valid roles are: {', '.join(self.valid_roles)}"
-        )
-
-
-class RoleBasedFieldProvider:
-    """
-    [AC-MRS-04, AC-MRS-05, AC-MRS-10] 基于角色的字段提供者
-    
-    提供按角色查询字段定义的能力，供工具链按需消费
-    """
-    
-    CACHE_KEY_PREFIX = "field_roles"
-    CACHE_TTL = 300
-    
-    def __init__(self, session: AsyncSession):
-        self._session = session
-    
-    def _validate_role(self, role: str) -> str:
-        """
-        [AC-MRS-05] 验证角色值是否有效
-        
-        Args:
-            role: 角色字符串
-            
-        Returns:
-            验证通过的角色字符串
-            
-        Raises:
-            InvalidRoleError: 如果角色无效
-        """
-        if role not in VALID_FIELD_ROLES:
-            raise InvalidRoleError(role)
-        return role
-    
-    async def get_fields_by_role(
-        self,
-        tenant_id: str,
-        role: str,
-        include_deprecated: bool = False,
-    ) -> list[MetadataFieldDefinition]:
-        """
-        [AC-MRS-04] 按角色获取字段定义
-        
-        Args:
-            tenant_id: 租户 ID
-            role: 字段角色 (resource_filter/slot/prompt_var/routing_signal)
-            include_deprecated: 是否包含已废弃字段
-            
-        Returns:
-            字段定义列表
-            
-        Raises:
-            InvalidRoleError: 如果角色无效
-        """
-        self._validate_role(role)
-        
-        logger.info(
-            f"[AC-MRS-04] Getting fields by role: tenant={tenant_id}, "
-            f"role={role}, include_deprecated={include_deprecated}"
-        )
-        
-        stmt = select(MetadataFieldDefinition).where(
-            MetadataFieldDefinition.tenant_id == tenant_id,
-            cast(MetadataFieldDefinition.field_roles, JSONB).op('?')(role),
-        )
-        
-        if not include_deprecated:
-            stmt = stmt.where(
-                MetadataFieldDefinition.status == MetadataFieldStatus.ACTIVE.value
-            )
-        else:
-            stmt = stmt.where(
-                MetadataFieldDefinition.status.in_([
-                    MetadataFieldStatus.ACTIVE.value,
-                    MetadataFieldStatus.DEPRECATED.value,
-                ])
-            )
-        
-        result = await self._session.execute(stmt)
-        fields = list(result.scalars().all())
-        
-        logger.info(
-            f"[AC-MRS-04] Found {len(fields)} fields for role={role}"
-        )
-        
-        return fields
-    
-    async def get_field_keys_by_role(
-        self,
-        tenant_id: str,
-        role: str,
-        include_deprecated: bool = False,
-    ) -> list[str]:
-        """
-        按角色获取字段键名列表
-        
-        Args:
-            tenant_id: 租户 ID
-            role: 字段角色
-            include_deprecated: 是否包含已废弃字段
-            
-        Returns:
-            字段键名列表
-        """
-        fields = await self.get_fields_by_role(tenant_id, role, include_deprecated)
-        return [f.field_key for f in fields]
-    
-    async def get_slot_definitions_by_role(
-        self,
-        tenant_id: str,
-        role: str = "slot",
-    ) -> list[dict[str, Any]]:
-        """
-        [AC-MRS-10] 按角色获取槽位定义及关联字段信息
-        
-        Args:
-            tenant_id: 租户 ID
-            role: 字段角色，默认为 slot
-            
-        Returns:
-            槽位定义列表，包含关联字段信息
-        """
-        self._validate_role(role)
-        
-        logger.info(
-            f"[AC-MRS-10] Getting slot definitions by role: tenant={tenant_id}, role={role}"
-        )
-        
-        fields = await self.get_fields_by_role(tenant_id, role)
-        field_ids = [f.id for f in fields]
-        field_map = {str(f.id): f for f in fields}
-        
-        stmt = select(SlotDefinition).where(
-            SlotDefinition.tenant_id == tenant_id,
-        )
-        
-        result = await self._session.execute(stmt)
-        all_slots = list(result.scalars().all())
-        
-        slot_with_fields = []
-        for slot in all_slots:
-            slot_data = {
-                "id": str(slot.id),
-                "tenant_id": slot.tenant_id,
-                "slot_key": slot.slot_key,
-                "type": slot.type,
-                "required": slot.required,
-                "extract_strategy": slot.extract_strategy,
-                "validation_rule": slot.validation_rule,
-                "ask_back_prompt": slot.ask_back_prompt,
-                "default_value": slot.default_value,
-                "linked_field_id": str(slot.linked_field_id) if slot.linked_field_id else None,
-                "created_at": slot.created_at.isoformat() if slot.created_at else None,
-                "updated_at": slot.updated_at.isoformat() if slot.updated_at else None,
-                "linked_field": None,
-            }
-            
-            if slot.linked_field_id and str(slot.linked_field_id) in field_map:
-                linked_field = field_map[str(slot.linked_field_id)]
-                slot_data["linked_field"] = {
-                    "id": str(linked_field.id),
-                    "field_key": linked_field.field_key,
-                    "label": linked_field.label,
-                    "type": linked_field.type,
-                    "required": linked_field.required,
-                    "options": linked_field.options,
-                    "default_value": linked_field.default_value,
-                    "scope": linked_field.scope,
-                    "is_filterable": linked_field.is_filterable,
-                    "is_rank_feature": linked_field.is_rank_feature,
-                    "field_roles": linked_field.field_roles,
-                    "status": linked_field.status,
-                }
-            
-            slot_with_fields.append(slot_data)
-        
-        for field in fields:
-            field_id_str = str(field.id)
-            has_slot = any(
-                s.get("linked_field_id") == field_id_str 
-                for s in slot_with_fields
-            )
-            if not has_slot and role == "slot":
-                slot_with_fields.append({
-                    "id": None,
-                    "tenant_id": field.tenant_id,
-                    "slot_key": field.field_key,
-                    "type": field.type,
-                    "required": field.required,
-                    "extract_strategy": None,
-                    "validation_rule": None,
-                    "ask_back_prompt": None,
-                    "default_value": field.default_value,
-                    "linked_field_id": str(field.id),
-                    "created_at": None,
-                    "updated_at": None,
-                    "linked_field": {
-                        "id": str(field.id),
-                        "field_key": field.field_key,
-                        "label": field.label,
-                        "type": field.type,
-                        "required": field.required,
-                        "options": field.options,
-                        "default_value": field.default_value,
-                        "scope": field.scope,
-                        "is_filterable": field.is_filterable,
-                        "is_rank_feature": field.is_rank_feature,
-                        "field_roles": field.field_roles,
-                        "status": field.status,
-                    },
-                })
-        
-        logger.info(
-            f"[AC-MRS-10] Found {len(slot_with_fields)} slot definitions for role={role}"
-        )
-        
-        return slot_with_fields
-    
-    async def get_resource_filter_fields(
-        self,
-        tenant_id: str,
-    ) -> list[MetadataFieldDefinition]:
-        """
-        [AC-MRS-11] 获取资源过滤角色字段
-        供 kb_search_dynamic 工具使用
-        """
-        return await self.get_fields_by_role(
-            tenant_id, 
-            FieldRole.RESOURCE_FILTER.value
-        )
-    
-    async def get_resource_filter_field_keys(
-        self,
-        tenant_id: str,
-    ) -> list[str]:
-        """
-        [AC-MRS-11] 获取资源过滤角色字段键名列表
-        """
-        return await self.get_field_keys_by_role(
-            tenant_id,
-            FieldRole.RESOURCE_FILTER.value
-        )
-    
-    async def get_slot_fields(
-        self,
-        tenant_id: str,
-    ) -> list[MetadataFieldDefinition]:
-        """
-        [AC-MRS-12] 获取槽位角色字段
-        供 memory_recall 工具使用
-        """
-        return await self.get_fields_by_role(
-            tenant_id,
-            FieldRole.SLOT.value
-        )
-    
-    async def get_slot_field_keys(
-        self,
-        tenant_id: str,
-    ) -> list[str]:
-        """
-        [AC-MRS-12] 获取槽位角色字段键名列表
-        """
-        return await self.get_field_keys_by_role(
-            tenant_id,
-            FieldRole.SLOT.value
-        )
-    
-    async def get_routing_signal_fields(
-        self,
-        tenant_id: str,
-    ) -> list[MetadataFieldDefinition]:
-        """
-        [AC-MRS-13] 获取路由信号角色字段
-        供 intent_hint/high_risk_check 工具使用
-        """
-        return await self.get_fields_by_role(
-            tenant_id,
-            FieldRole.ROUTING_SIGNAL.value
-        )
-    
-    async def get_routing_signal_field_keys(
-        self,
-        tenant_id: str,
-    ) -> list[str]:
-        """
-        [AC-MRS-13] 获取路由信号角色字段键名列表
-        """
-        return await self.get_field_keys_by_role(
-            tenant_id,
-            FieldRole.ROUTING_SIGNAL.value
-        )
-    
-    async def get_prompt_var_fields(
-        self,
-        tenant_id: str,
-    ) -> list[MetadataFieldDefinition]:
-        """
-        [AC-MRS-14] 获取提示词变量角色字段
-        供 template_engine 使用
-        """
-        return await self.get_fields_by_role(
-            tenant_id,
-            FieldRole.PROMPT_VAR.value
-        )
-    
-    async def get_prompt_var_field_keys(
-        self,
-        tenant_id: str,
-    ) -> list[str]:
-        """
-        [AC-MRS-14] 获取提示词变量角色字段键名列表
-        """
-        return await self.get_field_keys_by_role(
-            tenant_id,
-            FieldRole.PROMPT_VAR.value
-        )
--- a/ai-service/app/services/mid/runtime_observer.py
+++ b/ai-service/app/services/mid/runtime_observer.py
@ -1,289 +0,0 @@
-"""
-Runtime Observer for Mid Platform.
-[AC-MARH-12] 运行时观测闭环。
-
-汇总 guardrail、interrupt、kb_hit、timeouts、segment_stats 等观测字段。
-"""
-
-import logging
-import time
-from dataclasses import dataclass, field
-from typing import Any
-
-from app.models.mid.schemas import (
-    ExecutionMode,
-    MetricsSnapshot,
-    SegmentStats,
-    TimeoutProfile,
-    ToolCallTrace,
-    TraceInfo,
-)
-
-logger = logging.getLogger(__name__)
-
-
-@dataclass
-class RuntimeContext:
-    """运行时上下文。"""
-    tenant_id: str = ""
-    session_id: str = ""
-    request_id: str = ""
-    generation_id: str = ""
-    mode: ExecutionMode = ExecutionMode.AGENT
-    intent: str | None = None
-
-    guardrail_triggered: bool = False
-    guardrail_rule_id: str | None = None
-
-    interrupt_consumed: bool = False
-
-    kb_tool_called: bool = False
-    kb_hit: bool = False
-
-    fallback_reason_code: str | None = None
-
-    react_iterations: int = 0
-    tool_calls: list[ToolCallTrace] = field(default_factory=list)
-
-    timeout_profile: TimeoutProfile | None = None
-    segment_stats: SegmentStats | None = None
-    metrics_snapshot: MetricsSnapshot | None = None
-
-    start_time: float = field(default_factory=time.time)
-
-    def to_trace_info(self) -> TraceInfo:
-        """转换为 TraceInfo。"""
-        return TraceInfo(
-            mode=self.mode,
-            intent=self.intent,
-            request_id=self.request_id,
-            generation_id=self.generation_id,
-            guardrail_triggered=self.guardrail_triggered,
-            guardrail_rule_id=self.guardrail_rule_id,
-            interrupt_consumed=self.interrupt_consumed,
-            kb_tool_called=self.kb_tool_called,
-            kb_hit=self.kb_hit,
-            fallback_reason_code=self.fallback_reason_code,
-            react_iterations=self.react_iterations,
-            timeout_profile=self.timeout_profile,
-            segment_stats=self.segment_stats,
-            metrics_snapshot=self.metrics_snapshot,
-            tools_used=[tc.tool_name for tc in self.tool_calls] if self.tool_calls else None,
-            tool_calls=self.tool_calls if self.tool_calls else None,
-        )
-
-
-class RuntimeObserver:
-    """
-    [AC-MARH-12] 运行时观测器。
-
-    Features:
-    - 汇总 guardrail、interrupt、kb_hit、timeouts、segment_stats
-    - 生成完整 TraceInfo
-    - 记录观测日志
-    """
-
-    def __init__(self):
-        self._contexts: dict[str, RuntimeContext] = {}
-
-    def start_observation(
-        self,
-        tenant_id: str,
-        session_id: str,
-        request_id: str,
-        generation_id: str,
-    ) -> RuntimeContext:
-        """
-        [AC-MARH-12] 开始观测。
-
-        Args:
-            tenant_id: 租户 ID
-            session_id: 会话 ID
-            request_id: 请求 ID
-            generation_id: 生成 ID
-
-        Returns:
-            RuntimeContext 实例
-        """
-        ctx = RuntimeContext(
-            tenant_id=tenant_id,
-            session_id=session_id,
-            request_id=request_id,
-            generation_id=generation_id,
-        )
-
-        self._contexts[request_id] = ctx
-
-        logger.info(
-            f"[AC-MARH-12] Observation started: request_id={request_id}, "
-            f"session_id={session_id}"
-        )
-
-        return ctx
-
-    def get_context(self, request_id: str) -> RuntimeContext | None:
-        """获取观测上下文。"""
-        return self._contexts.get(request_id)
-
-    def update_mode(
-        self,
-        request_id: str,
-        mode: ExecutionMode,
-        intent: str | None = None,
-    ) -> None:
-        """更新执行模式。"""
-        ctx = self._contexts.get(request_id)
-        if ctx:
-            ctx.mode = mode
-            ctx.intent = intent
-
-    def record_guardrail(
-        self,
-        request_id: str,
-        triggered: bool,
-        rule_id: str | None = None,
-    ) -> None:
-        """[AC-MARH-12] 记录护栏触发。"""
-        ctx = self._contexts.get(request_id)
-        if ctx:
-            ctx.guardrail_triggered = triggered
-            ctx.guardrail_rule_id = rule_id
-
-            logger.info(
-                f"[AC-MARH-12] Guardrail recorded: request_id={request_id}, "
-                f"triggered={triggered}, rule_id={rule_id}"
-            )
-
-    def record_interrupt(
-        self,
-        request_id: str,
-        consumed: bool,
-    ) -> None:
-        """[AC-MARH-12] 记录中断处理。"""
-        ctx = self._contexts.get(request_id)
-        if ctx:
-            ctx.interrupt_consumed = consumed
-
-            logger.info(
-                f"[AC-MARH-12] Interrupt recorded: request_id={request_id}, "
-                f"consumed={consumed}"
-            )
-
-    def record_kb(
-        self,
-        request_id: str,
-        tool_called: bool,
-        hit: bool,
-        fallback_reason: str | None = None,
-    ) -> None:
-        """[AC-MARH-12] 记录 KB 检索。"""
-        ctx = self._contexts.get(request_id)
-        if ctx:
-            ctx.kb_tool_called = tool_called
-            ctx.kb_hit = hit
-
-            if fallback_reason:
-                ctx.fallback_reason_code = fallback_reason
-
-            logger.info(
-                f"[AC-MARH-12] KB recorded: request_id={request_id}, "
-                f"tool_called={tool_called}, hit={hit}, fallback={fallback_reason}"
-            )
-
-    def record_react(
-        self,
-        request_id: str,
-        iterations: int,
-        tool_calls: list[ToolCallTrace] | None = None,
-    ) -> None:
-        """[AC-MARH-12] 记录 ReAct 循环。"""
-        ctx = self._contexts.get(request_id)
-        if ctx:
-            ctx.react_iterations = iterations
-            if tool_calls:
-                ctx.tool_calls = tool_calls
-
-    def record_timeout_profile(
-        self,
-        request_id: str,
-        profile: TimeoutProfile,
-    ) -> None:
-        """[AC-MARH-12] 记录超时配置。"""
-        ctx = self._contexts.get(request_id)
-        if ctx:
-            ctx.timeout_profile = profile
-
-    def record_segment_stats(
-        self,
-        request_id: str,
-        stats: SegmentStats,
-    ) -> None:
-        """[AC-MARH-12] 记录分段统计。"""
-        ctx = self._contexts.get(request_id)
-        if ctx:
-            ctx.segment_stats = stats
-
-    def record_metrics(
-        self,
-        request_id: str,
-        metrics: MetricsSnapshot,
-    ) -> None:
-        """[AC-MARH-12] 记录指标快照。"""
-        ctx = self._contexts.get(request_id)
-        if ctx:
-            ctx.metrics_snapshot = metrics
-
-    def set_fallback_reason(
-        self,
-        request_id: str,
-        reason: str,
-    ) -> None:
-        """设置降级原因。"""
-        ctx = self._contexts.get(request_id)
-        if ctx:
-            ctx.fallback_reason_code = reason
-
-    def end_observation(
-        self,
-        request_id: str,
-    ) -> TraceInfo:
-        """
-        [AC-MARH-12] 结束观测并生成 TraceInfo。
-
-        Args:
-            request_id: 请求 ID
-
-        Returns:
-            完整的 TraceInfo
-        """
-        ctx = self._contexts.get(request_id)
-        if not ctx:
-            logger.warning(f"[AC-MARH-12] Context not found: {request_id}")
-            return TraceInfo(mode=ExecutionMode.FIXED)
-
-        duration_ms = int((time.time() - ctx.start_time) * 1000)
-
-        trace_info = ctx.to_trace_info()
-
-        logger.info(
-            f"[AC-MARH-12] Observation ended: request_id={request_id}, "
-            f"mode={ctx.mode.value}, duration_ms={duration_ms}, "
-            f"guardrail={ctx.guardrail_triggered}, kb_hit={ctx.kb_hit}, "
-            f"segments={ctx.segment_stats.segment_count if ctx.segment_stats else 0}"
-        )
-
-        if request_id in self._contexts:
-            del self._contexts[request_id]
-
-        return trace_info
-
-
-_runtime_observer: RuntimeObserver | None = None
-
-
-def get_runtime_observer() -> RuntimeObserver:
-    """获取或创建 RuntimeObserver 实例。"""
-    global _runtime_observer
-    if _runtime_observer is None:
-        _runtime_observer = RuntimeObserver()
-    return _runtime_observer
--- a/ai-service/app/services/mid/segment_humanizer.py
+++ b/ai-service/app/services/mid/segment_humanizer.py
@ -1,282 +0,0 @@
-"""
-Segment Humanizer for Mid Platform.
-[AC-MARH-10] 分段策略组件（语义/长度切分）。
-[AC-MARH-11] delay 策略租户化配置。
-
-将文本按语义/长度切分为 segments，并生成拟人化 delay。
-"""
-
-import logging
-import re
-import uuid
-from dataclasses import dataclass, field
-from typing import Any
-
-from app.models.mid.schemas import Segment, SegmentStats
-
-logger = logging.getLogger(__name__)
-
-DEFAULT_MIN_DELAY_MS = 50
-DEFAULT_MAX_DELAY_MS = 500
-DEFAULT_SEGMENT_MIN_LENGTH = 10
-DEFAULT_SEGMENT_MAX_LENGTH = 200
-
-
-@dataclass
-class HumanizeConfig:
-    """拟人化配置。"""
-    enabled: bool = True
-    min_delay_ms: int = DEFAULT_MIN_DELAY_MS
-    max_delay_ms: int = DEFAULT_MAX_DELAY_MS
-    length_bucket_strategy: str = "simple"
-    segment_min_length: int = DEFAULT_SEGMENT_MIN_LENGTH
-    segment_max_length: int = DEFAULT_SEGMENT_MAX_LENGTH
-
-    def to_dict(self) -> dict[str, Any]:
-        return {
-            "enabled": self.enabled,
-            "min_delay_ms": self.min_delay_ms,
-            "max_delay_ms": self.max_delay_ms,
-            "length_bucket_strategy": self.length_bucket_strategy,
-            "segment_min_length": self.segment_min_length,
-            "segment_max_length": self.segment_max_length,
-        }
-
-    @classmethod
-    def from_dict(cls, data: dict[str, Any]) -> "HumanizeConfig":
-        return cls(
-            enabled=data.get("enabled", True),
-            min_delay_ms=data.get("min_delay_ms", DEFAULT_MIN_DELAY_MS),
-            max_delay_ms=data.get("max_delay_ms", DEFAULT_MAX_DELAY_MS),
-            length_bucket_strategy=data.get("length_bucket_strategy", "simple"),
-            segment_min_length=data.get("segment_min_length", DEFAULT_SEGMENT_MIN_LENGTH),
-            segment_max_length=data.get("segment_max_length", DEFAULT_SEGMENT_MAX_LENGTH),
-        )
-
-
-@dataclass
-class LengthBucket:
-    """长度区间与对应 delay。"""
-    min_length: int
-    max_length: int
-    delay_ms: int
-
-
-DEFAULT_LENGTH_BUCKETS = [
-    LengthBucket(min_length=0, max_length=20, delay_ms=100),
-    LengthBucket(min_length=20, max_length=50, delay_ms=200),
-    LengthBucket(min_length=50, max_length=100, delay_ms=300),
-    LengthBucket(min_length=100, max_length=200, delay_ms=400),
-    LengthBucket(min_length=200, max_length=10000, delay_ms=500),
-]
-
-
-class SegmentHumanizer:
-    """
-    [AC-MARH-10, AC-MARH-11] 分段拟人化组件。
-
-    Features:
-    - 按语义/长度切分文本
-    - 生成拟人化 delay
-    - 支持租户配置覆盖
-    - 输出 segment_stats 统计
-    """
-
-    def __init__(
-        self,
-        config: HumanizeConfig | None = None,
-        length_buckets: list[LengthBucket] | None = None,
-    ):
-        self._config = config or HumanizeConfig()
-        self._length_buckets = length_buckets or DEFAULT_LENGTH_BUCKETS
-
-    def humanize(
-        self,
-        text: str,
-        override_config: HumanizeConfig | None = None,
-    ) -> tuple[list[Segment], SegmentStats]:
-        """
-        [AC-MARH-10] 将文本转换为拟人化分段。
-
-        Args:
-            text: 输入文本
-            override_config: 租户覆盖配置
-
-        Returns:
-            Tuple of (segments, segment_stats)
-        """
-        config = override_config or self._config
-
-        if not config.enabled:
-            segments = [Segment(
-                segment_id=str(uuid.uuid4()),
-                text=text,
-                delay_after=0,
-            )]
-            stats = SegmentStats(
-                segment_count=1,
-                avg_segment_length=len(text),
-                humanize_strategy="disabled",
-            )
-            return segments, stats
-
-        raw_segments = self._split_text(text, config)
-        segments = []
-
-        for i, seg_text in enumerate(raw_segments):
-            is_last = i == len(raw_segments) - 1
-            delay_after = 0 if is_last else self._calculate_delay(seg_text, config)
-
-            segments.append(Segment(
-                segment_id=str(uuid.uuid4()),
-                text=seg_text,
-                delay_after=delay_after,
-            ))
-
-        total_length = sum(len(s.text) for s in segments)
-        avg_length = total_length / len(segments) if segments else 0.0
-
-        stats = SegmentStats(
-            segment_count=len(segments),
-            avg_segment_length=avg_length,
-            humanize_strategy=config.length_bucket_strategy,
-        )
-
-        logger.info(
-            f"[AC-MARH-10] Humanized text: segments={len(segments)}, "
-            f"avg_length={avg_length:.1f}, strategy={config.length_bucket_strategy}"
-        )
-
-        return segments, stats
-
-    def _split_text(
-        self,
-        text: str,
-        config: HumanizeConfig,
-    ) -> list[str]:
-        """切分文本。"""
-        if config.length_bucket_strategy == "semantic":
-            return self._split_semantic(text, config)
-        else:
-            return self._split_simple(text, config)
-
-    def _split_simple(
-        self,
-        text: str,
-        config: HumanizeConfig,
-    ) -> list[str]:
-        """简单切分：按段落。"""
-        paragraphs = re.split(r'\n\s*\n', text.strip())
-        segments = []
-
-        for para in paragraphs:
-            para = para.strip()
-            if not para:
-                continue
-
-            if len(para) <= config.segment_max_length:
-                segments.append(para)
-            else:
-                sub_segments = self._split_by_length(para, config.segment_max_length)
-                segments.extend(sub_segments)
-
-        if not segments:
-            segments = [text.strip()]
-
-        return [s for s in segments if s.strip()]
-
-    def _split_semantic(
-        self,
-        text: str,
-        config: HumanizeConfig,
-    ) -> list[str]:
-        """语义切分：按句子边界。"""
-        sentence_endings = re.compile(r'([。！？.!?]+)')
-        parts = sentence_endings.split(text.strip())
-
-        sentences = []
-        current = ""
-        for i, part in enumerate(parts):
-            current += part
-            if sentence_endings.match(part):
-                sentences.append(current.strip())
-                current = ""
-
-        if current.strip():
-            sentences.append(current.strip())
-
-        if not sentences:
-            sentences = [text.strip()]
-
-        segments = []
-        current_segment = ""
-
-        for sentence in sentences:
-            if len(current_segment) + len(sentence) <= config.segment_max_length:
-                current_segment += sentence
-            else:
-                if current_segment:
-                    segments.append(current_segment)
-                current_segment = sentence
-
-        if current_segment:
-            segments.append(current_segment)
-
-        return [s for s in segments if s.strip()]
-
-    def _split_by_length(
-        self,
-        text: str,
-        max_length: int,
-    ) -> list[str]:
-        """按长度切分。"""
-        segments = []
-        remaining = text
-
-        while remaining:
-            if len(remaining) <= max_length:
-                segments.append(remaining.strip())
-                break
-
-            split_pos = max_length
-            for i in range(max_length - 1, max(0, max_length - 20), -1):
-                if remaining[i] in '，,；;：: ':
-                    split_pos = i + 1
-                    break
-
-            segments.append(remaining[:split_pos].strip())
-            remaining = remaining[split_pos:]
-
-        return [s for s in segments if s.strip()]
-
-    def _calculate_delay(
-        self,
-        text: str,
-        config: HumanizeConfig,
-    ) -> int:
-        """[AC-MARH-11] 计算拟人化 delay。"""
-        text_length = len(text)
-
-        for bucket in self._length_buckets:
-            if bucket.min_length <= text_length < bucket.max_length:
-                delay = bucket.delay_ms
-                return max(config.min_delay_ms, min(delay, config.max_delay_ms))
-
-        return config.min_delay_ms
-
-    def get_config(self) -> HumanizeConfig:
-        """获取当前配置。"""
-        return self._config
-
-
-_segment_humanizer: SegmentHumanizer | None = None
-
-
-def get_segment_humanizer(
-    config: HumanizeConfig | None = None,
-) -> SegmentHumanizer:
-    """获取或创建 SegmentHumanizer 实例。"""
-    global _segment_humanizer
-    if _segment_humanizer is None:
-        _segment_humanizer = SegmentHumanizer(config=config)
-    return _segment_humanizer
--- a/ai-service/app/services/mid/timeout_governor.py
+++ b/ai-service/app/services/mid/timeout_governor.py
@ -1,166 +0,0 @@
-"""
-Timeout Governor for Mid Platform.
-[AC-IDMP-12] Timeout governance: per-tool <= 60s, end-to-end <= 180s.
-[AC-MARH-08, AC-MARH-09] 超时口径统一：单工具 <= 60000ms，全链路 <= 180000ms。
-"""
-
-import asyncio
-import logging
-from collections.abc import Callable
-from dataclasses import dataclass
-from typing import Any, TypeVar
-
-from app.models.mid.schemas import TimeoutProfile
-
-logger = logging.getLogger(__name__)
-
-DEFAULT_PER_TOOL_TIMEOUT_MS = 30000
-DEFAULT_END_TO_END_TIMEOUT_MS = 120000
-DEFAULT_LLM_TIMEOUT_MS = 60000
-
-T = TypeVar("T")
-
-
-@dataclass
-class TimeoutResult:
-    """Result of a timeout-governed operation."""
-    success: bool
-    result: Any = None
-    error: str | None = None
-    duration_ms: int = 0
-    timed_out: bool = False
-
-
-class TimeoutGovernor:
-    """
-    [AC-IDMP-12] Timeout governor for tool calls and end-to-end execution.
-    [AC-MARH-08, AC-MARH-09] 超时口径统一。
-
-    Constraints:
-    - Per-tool timeout: <= 60000ms (60s)
-    - LLM timeout: <= 120000ms (120s)
-    - End-to-end timeout: <= 180000ms (180s)
-    """
-
-    def __init__(
-        self,
-        per_tool_timeout_ms: int = DEFAULT_PER_TOOL_TIMEOUT_MS,
-        end_to_end_timeout_ms: int = DEFAULT_END_TO_END_TIMEOUT_MS,
-        llm_timeout_ms: int = DEFAULT_LLM_TIMEOUT_MS,
-    ):
-        self._per_tool_timeout_ms = min(per_tool_timeout_ms, 60000)
-        self._end_to_end_timeout_ms = min(end_to_end_timeout_ms, 180000)
-        self._llm_timeout_ms = min(llm_timeout_ms, 120000)
-
-    @property
-    def per_tool_timeout_seconds(self) -> float:
-        """Per-tool timeout in seconds."""
-        return self._per_tool_timeout_ms / 1000.0
-
-    @property
-    def llm_timeout_seconds(self) -> float:
-        """LLM call timeout in seconds."""
-        return self._llm_timeout_ms / 1000.0
-
-    @property
-    def end_to_end_timeout_seconds(self) -> float:
-        """End-to-end timeout in seconds."""
-        return self._end_to_end_timeout_ms / 1000.0
-
-    @property
-    def profile(self) -> TimeoutProfile:
-        """Get current timeout profile."""
-        return TimeoutProfile(
-            per_tool_timeout_ms=self._per_tool_timeout_ms,
-            llm_timeout_ms=self._llm_timeout_ms,
-            end_to_end_timeout_ms=self._end_to_end_timeout_ms,
-        )
-
-    async def execute_with_timeout(
-        self,
-        coro: Callable[[], T],
-        timeout_ms: int | None = None,
-        operation_name: str = "operation",
-    ) -> TimeoutResult:
-        """
-        [AC-MARH-08, AC-MARH-09] Execute a coroutine with timeout.
-
-        Args:
-            coro: Coroutine to execute
-            timeout_ms: Timeout in milliseconds (defaults to per-tool timeout)
-            operation_name: Name for logging
-
-        Returns:
-            TimeoutResult with success status and result/error
-        """
-        import time
-
-        timeout_ms = timeout_ms or self._per_tool_timeout_ms
-        timeout_seconds = timeout_ms / 1000.0
-        start_time = time.time()
-
-        try:
-            result = await asyncio.wait_for(coro(), timeout=timeout_seconds)
-            duration_ms = int((time.time() - start_time) * 1000)
-
-            logger.debug(
-                f"[AC-MARH-08] {operation_name} completed in {duration_ms}ms"
-            )
-
-            return TimeoutResult(
-                success=True,
-                result=result,
-                duration_ms=duration_ms,
-            )
-
-        except asyncio.TimeoutError:
-            duration_ms = int((time.time() - start_time) * 1000)
-            logger.warning(
-                f"[AC-MARH-08] {operation_name} timed out after {duration_ms}ms "
-                f"(limit: {timeout_ms}ms)"
-            )
-            return TimeoutResult(
-                success=False,
-                error=f"Timeout after {timeout_ms}ms",
-                duration_ms=duration_ms,
-                timed_out=True,
-            )
-
-        except Exception as e:
-            duration_ms = int((time.time() - start_time) * 1000)
-            logger.error(
-                f"[AC-MARH-08] {operation_name} failed: {e}"
-            )
-            return TimeoutResult(
-                success=False,
-                error=str(e),
-                duration_ms=duration_ms,
-            )
-
-    async def execute_tool(
-        self,
-        tool_name: str,
-        tool_func: Callable[[], T],
-    ) -> TimeoutResult:
-        """
-        [AC-MARH-08] Execute a tool call with per-tool timeout.
-        """
-        return await self.execute_with_timeout(
-            coro=tool_func,
-            timeout_ms=self._per_tool_timeout_ms,
-            operation_name=f"tool:{tool_name}",
-        )
-
-    async def execute_e2e(
-        self,
-        coro: Callable[[], T],
-        operation_name: str = "e2e",
-    ) -> TimeoutResult:
-        """
-        [AC-MARH-09] Execute with end-to-end timeout.
-        """
-        return await self.execute_with_timeout(
-            coro=coro,
-            timeout_ms=self._end_to_end_timeout_ms,
-            operation_name=operation_name,
-        )
--- a/ai-service/app/services/mid/tool_call_recorder.py
+++ b/ai-service/app/services/mid/tool_call_recorder.py
@ -1,324 +0,0 @@
-"""
-Tool Call Recorder for Mid Platform.
-[AC-IDMP-15] 工具调用结构化记录
-
-Reference:
- spec/intent-driven-mid-platform/openapi.provider.yaml - ToolCallTrace
- spec/intent-driven-mid-platform/requirements.md AC-IDMP-15
-"""
-
-import logging
-from collections import defaultdict
-from dataclasses import dataclass, field
-from datetime import datetime
-from typing import Any
-
-from app.models.mid.tool_trace import (
-    ToolCallBuilder,
-    ToolCallStatus,
-    ToolCallTrace,
-    ToolType,
-)
-
-logger = logging.getLogger(__name__)
-
-
-@dataclass
-class ToolCallStatistics:
-    """
-    工具调用统计信息
-    """
-    total_calls: int = 0
-    success_calls: int = 0
-    timeout_calls: int = 0
-    error_calls: int = 0
-    rejected_calls: int = 0
-    total_duration_ms: int = 0
-    avg_duration_ms: float = 0.0
-    max_duration_ms: int = 0
-    min_duration_ms: int = 0
-
-    def update(self, trace: ToolCallTrace) -> None:
-        self.total_calls += 1
-        self.total_duration_ms += trace.duration_ms
-        self.avg_duration_ms = self.total_duration_ms / self.total_calls
-        
-        if trace.duration_ms > self.max_duration_ms:
-            self.max_duration_ms = trace.duration_ms
-        if self.min_duration_ms == 0 or trace.duration_ms < self.min_duration_ms:
-            self.min_duration_ms = trace.duration_ms
-
-        if trace.status == ToolCallStatus.OK:
-            self.success_calls += 1
-        elif trace.status == ToolCallStatus.TIMEOUT:
-            self.timeout_calls += 1
-        elif trace.status == ToolCallStatus.REJECTED:
-            self.rejected_calls += 1
-        else:
-            self.error_calls += 1
-
-
-class ToolCallRecorder:
-    """
-    [AC-IDMP-15] 工具调用记录器
-    
-    功能：
-    1. 记录每次工具调用的完整信息（参数摘要、耗时、状态、错误码）
-    2. 支持敏感参数脱敏
-    3. 提供统计信息
-    
-    记录字段：
-    - tool_name: 工具名称
-    - tool_type: 工具类型 (internal | mcp)
-    - registry_version: 注册表版本
-    - auth_applied: 是否应用鉴权
-    - duration_ms: 调用耗时
-    - status: 调用状态 (ok | timeout | error | rejected)
-    - error_code: 错误码
-    - args_digest: 参数摘要（脱敏）
-    - result_digest: 结果摘要
-    """
-
-    def __init__(self, max_traces_per_session: int = 100):
-        self._max_traces_per_session = max_traces_per_session
-        self._traces: dict[str, list[ToolCallTrace]] = defaultdict(list)
-        self._statistics: dict[str, ToolCallStatistics] = defaultdict(ToolCallStatistics)
-
-    def start_trace(
-        self,
-        tool_name: str,
-        tool_type: ToolType = ToolType.INTERNAL,
-    ) -> ToolCallBuilder:
-        """
-        开始记录一次工具调用
-        
-        Args:
-            tool_name: 工具名称
-            tool_type: 工具类型
-            
-        Returns:
-            ToolCallBuilder: 构建器，用于逐步记录调用信息
-        """
-        return ToolCallBuilder(
-            tool_name=tool_name,
-            tool_type=tool_type,
-        )
-
-    def record(
-        self,
-        session_id: str,
-        trace: ToolCallTrace,
-    ) -> None:
-        """
-        记录一次工具调用的完整信息
-        
-        Args:
-            session_id: 会话ID
-            trace: 工具调用追踪记录
-        """
-        session_traces = self._traces[session_id]
-        session_traces.append(trace)
-
-        if len(session_traces) > self._max_traces_per_session:
-            session_traces.pop(0)
-
-        self._statistics[trace.tool_name].update(trace)
-
-        logger.info(
-            f"[AC-IDMP-15] Tool call recorded: tool={trace.tool_name}, "
-            f"type={trace.tool_type.value}, duration_ms={trace.duration_ms}, "
-            f"status={trace.status.value}, session={session_id}"
-        )
-
-    def record_success(
-        self,
-        session_id: str,
-        tool_name: str,
-        tool_type: ToolType,
-        duration_ms: int,
-        args: Any = None,
-        result: Any = None,
-        registry_version: str | None = None,
-        auth_applied: bool = False,
-    ) -> ToolCallTrace:
-        """
-        记录成功的工具调用（便捷方法）
-        """
-        trace = ToolCallTrace(
-            tool_name=tool_name,
-            tool_type=tool_type,
-            duration_ms=duration_ms,
-            status=ToolCallStatus.OK,
-            registry_version=registry_version,
-            auth_applied=auth_applied,
-            args_digest=ToolCallTrace.compute_digest(args) if args else None,
-            result_digest=ToolCallTrace.compute_digest(result) if result else None,
-        )
-        self.record(session_id, trace)
-        return trace
-
-    def record_timeout(
-        self,
-        session_id: str,
-        tool_name: str,
-        tool_type: ToolType,
-        duration_ms: int,
-        args: Any = None,
-        registry_version: str | None = None,
-        auth_applied: bool = False,
-    ) -> ToolCallTrace:
-        """
-        记录超时的工具调用（便捷方法）
-        """
-        trace = ToolCallTrace(
-            tool_name=tool_name,
-            tool_type=tool_type,
-            duration_ms=duration_ms,
-            status=ToolCallStatus.TIMEOUT,
-            error_code="TIMEOUT",
-            registry_version=registry_version,
-            auth_applied=auth_applied,
-            args_digest=ToolCallTrace.compute_digest(args) if args else None,
-        )
-        self.record(session_id, trace)
-        return trace
-
-    def record_error(
-        self,
-        session_id: str,
-        tool_name: str,
-        tool_type: ToolType,
-        duration_ms: int,
-        error_code: str,
-        error_message: str | None = None,
-        args: Any = None,
-        registry_version: str | None = None,
-        auth_applied: bool = False,
-    ) -> ToolCallTrace:
-        """
-        记录错误的工具调用（便捷方法）
-        """
-        trace = ToolCallTrace(
-            tool_name=tool_name,
-            tool_type=tool_type,
-            duration_ms=duration_ms,
-            status=ToolCallStatus.ERROR,
-            error_code=error_code,
-            registry_version=registry_version,
-            auth_applied=auth_applied,
-            args_digest=ToolCallTrace.compute_digest(args) if args else None,
-        )
-        self.record(session_id, trace)
-        return trace
-
-    def record_rejected(
-        self,
-        session_id: str,
-        tool_name: str,
-        tool_type: ToolType,
-        reason: str,
-        args: Any = None,
-        registry_version: str | None = None,
-    ) -> ToolCallTrace:
-        """
-        记录被拒绝的工具调用（便捷方法）
-        """
-        trace = ToolCallTrace(
-            tool_name=tool_name,
-            tool_type=tool_type,
-            duration_ms=0,
-            status=ToolCallStatus.REJECTED,
-            error_code=reason,
-            registry_version=registry_version,
-            args_digest=ToolCallTrace.compute_digest(args) if args else None,
-        )
-        self.record(session_id, trace)
-        return trace
-
-    def get_traces(self, session_id: str) -> list[ToolCallTrace]:
-        """
-        获取指定会话的所有工具调用记录
-        
-        Args:
-            session_id: 会话ID
-            
-        Returns:
-            list[ToolCallTrace]: 工具调用记录列表
-        """
-        return self._traces.get(session_id, [])
-
-    def get_statistics(self, tool_name: str | None = None) -> dict[str, Any]:
-        """
-        获取工具调用统计信息
-        
-        Args:
-            tool_name: 工具名称（可选，不提供则返回所有统计）
-            
-        Returns:
-            dict: 统计信息
-        """
-        if tool_name:
-            stats = self._statistics.get(tool_name)
-            if stats:
-                return {
-                    "tool_name": tool_name,
-                    "total_calls": stats.total_calls,
-                    "success_rate": stats.success_calls / stats.total_calls if stats.total_calls > 0 else 0,
-                    "timeout_rate": stats.timeout_calls / stats.total_calls if stats.total_calls > 0 else 0,
-                    "error_rate": stats.error_calls / stats.total_calls if stats.total_calls > 0 else 0,
-                    "avg_duration_ms": stats.avg_duration_ms,
-                    "max_duration_ms": stats.max_duration_ms,
-                    "min_duration_ms": stats.min_duration_ms,
-                }
-            return {}
-
-        return {
-            name: {
-                "total_calls": stats.total_calls,
-                "success_rate": stats.success_calls / stats.total_calls if stats.total_calls > 0 else 0,
-                "avg_duration_ms": stats.avg_duration_ms,
-            }
-            for name, stats in self._statistics.items()
-        }
-
-    def clear_session(self, session_id: str) -> int:
-        """
-        清除指定会话的记录
-        
-        Args:
-            session_id: 会话ID
-            
-        Returns:
-            int: 清除的记录数
-        """
-        if session_id in self._traces:
-            count = len(self._traces[session_id])
-            del self._traces[session_id]
-            return count
-        return 0
-
-    def to_trace_info_format(self, session_id: str) -> list[dict[str, Any]]:
-        """
-        转换为 TraceInfo.tool_calls 格式
-        
-        用于输出到 DialogueResponse.trace.tool_calls
-        
-        Args:
-            session_id: 会话ID
-            
-        Returns:
-            list[dict]: 符合 OpenAPI 格式的工具调用列表
-        """
-        traces = self.get_traces(session_id)
-        return [trace.to_dict() for trace in traces]
-
-
-_recorder: ToolCallRecorder | None = None
-
-
-def get_tool_call_recorder() -> ToolCallRecorder:
-    """获取全局工具调用记录器实例"""
-    global _recorder
-    if _recorder is None:
-        _recorder = ToolCallRecorder()
-    return _recorder
--- a/ai-service/app/services/mid/tool_registry.py
+++ b/ai-service/app/services/mid/tool_registry.py
@ -1,337 +0,0 @@
-"""
-Tool Registry for Mid Platform.
-[AC-IDMP-19] Unified tool registration, auth, timeout, version, and enable/disable governance.
-"""
-
-import asyncio
-import logging
-import time
-import uuid
-from dataclasses import dataclass, field
-from typing import Any, Callable, Coroutine
-
-from app.models.mid.schemas import (
-    ToolCallStatus,
-    ToolCallTrace,
-    ToolType,
-)
-from app.services.mid.timeout_governor import TimeoutGovernor
-
-logger = logging.getLogger(__name__)
-
-
-@dataclass
-class ToolDefinition:
-    """Tool definition for registry."""
-    name: str
-    description: str
-    tool_type: ToolType = ToolType.INTERNAL
-    version: str = "1.0.0"
-    enabled: bool = True
-    auth_required: bool = False
-    timeout_ms: int = 2000
-    handler: Callable[..., Coroutine[Any, Any, dict[str, Any]]] | None = None
-    metadata: dict[str, Any] = field(default_factory=dict)
-
-
-@dataclass
-class ToolExecutionResult:
-    """Tool execution result."""
-    success: bool
-    output: Any = None
-    error: str | None = None
-    duration_ms: int = 0
-    auth_applied: bool = False
-    registry_version: str | None = None
-
-
-class ToolRegistry:
-    """
-    [AC-IDMP-19] Unified tool registry for governance.
-    
-    Features:
-    - Tool registration with metadata
-    - Auth policy enforcement
-    - Timeout governance
-    - Version management
-    - Enable/disable control
-    """
-    
-    def __init__(
-        self,
-        timeout_governor: TimeoutGovernor | None = None,
-    ):
-        self._tools: dict[str, ToolDefinition] = {}
-        self._timeout_governor = timeout_governor or TimeoutGovernor()
-        self._version = "1.0.0"
-    
-    @property
-    def version(self) -> str:
-        """Get registry version."""
-        return self._version
-    
-    def register(
-        self,
-        name: str,
-        description: str,
-        handler: Callable[..., Coroutine[Any, Any, dict[str, Any]]],
-        tool_type: ToolType = ToolType.INTERNAL,
-        version: str = "1.0.0",
-        auth_required: bool = False,
-        timeout_ms: int = 2000,
-        enabled: bool = True,
-        metadata: dict[str, Any] | None = None,
-    ) -> ToolDefinition:
-        """
-        [AC-IDMP-19] Register a tool.
-        
-        Args:
-            name: Tool name (unique identifier)
-            description: Tool description
-            handler: Async handler function
-            tool_type: Tool type (internal/mcp)
-            version: Tool version
-            auth_required: Whether auth is required
-            timeout_ms: Tool-specific timeout
-            enabled: Whether tool is enabled
-            metadata: Additional metadata
-        
-        Returns:
-            ToolDefinition for the registered tool
-        """
-        if name in self._tools:
-            logger.warning(f"[AC-IDMP-19] Tool already registered, overwriting: {name}")
-        
-        tool = ToolDefinition(
-            name=name,
-            description=description,
-            tool_type=tool_type,
-            version=version,
-            enabled=enabled,
-            auth_required=auth_required,
-            timeout_ms=min(timeout_ms, 2000),
-            handler=handler,
-            metadata=metadata or {},
-        )
-        
-        self._tools[name] = tool
-        
-        logger.info(
-            f"[AC-IDMP-19] Tool registered: name={name}, type={tool_type.value}, "
-            f"version={version}, auth_required={auth_required}"
-        )
-        
-        return tool
-    
-    def unregister(self, name: str) -> bool:
-        """Unregister a tool."""
-        if name in self._tools:
-            del self._tools[name]
-            logger.info(f"[AC-IDMP-19] Tool unregistered: {name}")
-            return True
-        return False
-    
-    def get_tool(self, name: str) -> ToolDefinition | None:
-        """Get tool definition by name."""
-        return self._tools.get(name)
-    
-    def list_tools(
-        self,
-        tool_type: ToolType | None = None,
-        enabled_only: bool = True,
-    ) -> list[ToolDefinition]:
-        """List registered tools, optionally filtered."""
-        tools = list(self._tools.values())
-        
-        if tool_type:
-            tools = [t for t in tools if t.tool_type == tool_type]
-        
-        if enabled_only:
-            tools = [t for t in tools if t.enabled]
-        
-        return tools
-    
-    def enable_tool(self, name: str) -> bool:
-        """Enable a tool."""
-        tool = self._tools.get(name)
-        if tool:
-            tool.enabled = True
-            logger.info(f"[AC-IDMP-19] Tool enabled: {name}")
-            return True
-        return False
-    
-    def disable_tool(self, name: str) -> bool:
-        """Disable a tool."""
-        tool = self._tools.get(name)
-        if tool:
-            tool.enabled = False
-            logger.info(f"[AC-IDMP-19] Tool disabled: {name}")
-            return True
-        return False
-    
-    async def execute(
-        self,
-        tool_name: str,
-        args: dict[str, Any],
-        auth_context: dict[str, Any] | None = None,
-    ) -> ToolExecutionResult:
-        """
-        [AC-IDMP-19] Execute a tool with governance.
-        
-        Args:
-            tool_name: Tool name to execute
-            args: Tool arguments
-            auth_context: Authentication context
-        
-        Returns:
-            ToolExecutionResult with output and metadata
-        """
-        start_time = time.time()
-        
-        tool = self._tools.get(tool_name)
-        if not tool:
-            logger.warning(f"[AC-IDMP-19] Tool not found: {tool_name}")
-            return ToolExecutionResult(
-                success=False,
-                error=f"Tool not found: {tool_name}",
-                duration_ms=0,
-            )
-        
-        if not tool.enabled:
-            logger.warning(f"[AC-IDMP-19] Tool disabled: {tool_name}")
-            return ToolExecutionResult(
-                success=False,
-                error=f"Tool disabled: {tool_name}",
-                duration_ms=0,
-                registry_version=tool.version,
-            )
-        
-        auth_applied = False
-        if tool.auth_required:
-            if not auth_context:
-                logger.warning(f"[AC-IDMP-19] Auth required but no context: {tool_name}")
-                return ToolExecutionResult(
-                    success=False,
-                    error="Authentication required",
-                    duration_ms=int((time.time() - start_time) * 1000),
-                    auth_applied=False,
-                    registry_version=tool.version,
-                )
-            auth_applied = True
-        
-        try:
-            timeout_seconds = tool.timeout_ms / 1000.0
-            
-            result = await asyncio.wait_for(
-                tool.handler(**args) if tool.handler else asyncio.sleep(0),
-                timeout=timeout_seconds,
-            )
-            
-            duration_ms = int((time.time() - start_time) * 1000)
-            
-            logger.info(
-                f"[AC-IDMP-19] Tool executed: name={tool_name}, "
-                f"duration_ms={duration_ms}, success=True"
-            )
-            
-            return ToolExecutionResult(
-                success=True,
-                output=result,
-                duration_ms=duration_ms,
-                auth_applied=auth_applied,
-                registry_version=tool.version,
-            )
-        
-        except asyncio.TimeoutError:
-            duration_ms = int((time.time() - start_time) * 1000)
-            logger.warning(
-                f"[AC-IDMP-19] Tool timeout: name={tool_name}, "
-                f"duration_ms={duration_ms}"
-            )
-            return ToolExecutionResult(
-                success=False,
-                error=f"Tool timeout after {tool.timeout_ms}ms",
-                duration_ms=duration_ms,
-                auth_applied=auth_applied,
-                registry_version=tool.version,
-            )
-        
-        except Exception as e:
-            duration_ms = int((time.time() - start_time) * 1000)
-            logger.error(
-                f"[AC-IDMP-19] Tool error: name={tool_name}, error={e}"
-            )
-            return ToolExecutionResult(
-                success=False,
-                error=str(e),
-                duration_ms=duration_ms,
-                auth_applied=auth_applied,
-                registry_version=tool.version,
-            )
-    
-    def create_trace(
-        self,
-        tool_name: str,
-        result: ToolExecutionResult,
-        args_digest: str | None = None,
-    ) -> ToolCallTrace:
-        """
-        [AC-IDMP-19] Create ToolCallTrace from execution result.
-        """
-        tool = self._tools.get(tool_name)
-        
-        return ToolCallTrace(
-            tool_name=tool_name,
-            tool_type=tool.tool_type if tool else ToolType.INTERNAL,
-            registry_version=result.registry_version,
-            auth_applied=result.auth_applied,
-            duration_ms=result.duration_ms,
-            status=ToolCallStatus.OK if result.success else (
-                ToolCallStatus.TIMEOUT if "timeout" in (result.error or "").lower()
-                else ToolCallStatus.ERROR
-            ),
-            error_code=result.error if not result.success else None,
-            args_digest=args_digest,
-            result_digest=str(result.output)[:100] if result.output else None,
-        )
-    
-    def get_governance_report(self) -> dict[str, Any]:
-        """Get governance report for all tools."""
-        return {
-            "registry_version": self._version,
-            "total_tools": len(self._tools),
-            "enabled_tools": sum(1 for t in self._tools.values() if t.enabled),
-            "disabled_tools": sum(1 for t in self._tools.values() if not t.enabled),
-            "auth_required_tools": sum(1 for t in self._tools.values() if t.auth_required),
-            "mcp_tools": sum(1 for t in self._tools.values() if t.tool_type == ToolType.MCP),
-            "internal_tools": sum(1 for t in self._tools.values() if t.tool_type == ToolType.INTERNAL),
-            "tools": [
-                {
-                    "name": t.name,
-                    "type": t.tool_type.value,
-                    "version": t.version,
-                    "enabled": t.enabled,
-                    "auth_required": t.auth_required,
-                    "timeout_ms": t.timeout_ms,
-                }
-                for t in self._tools.values()
-            ],
-        }
-
-
-_registry: ToolRegistry | None = None
-
-
-def get_tool_registry() -> ToolRegistry:
-    """Get global tool registry instance."""
-    global _registry
-    if _registry is None:
-        _registry = ToolRegistry()
-    return _registry
-
-
-def init_tool_registry(timeout_governor: TimeoutGovernor | None = None) -> ToolRegistry:
-    """Initialize and return tool registry."""
-    global _registry
-    _registry = ToolRegistry(timeout_governor=timeout_governor)
-    return _registry
--- a/ai-service/app/services/mid/trace_logger.py
+++ b/ai-service/app/services/mid/trace_logger.py
@ -1,269 +0,0 @@
-"""
-Trace Logger for Mid Platform.
-[AC-MARH-02, AC-MARH-03, AC-MARH-12] Trace collection and audit logging.
-
-Audit Fields:
- session_id, request_id, generation_id
- mode, intent, tool_calls
- guardrail_triggered, guardrail_rule_id
- interrupt_consumed
-"""
-
-import logging
-import time
-import uuid
-from dataclasses import dataclass, field
-from typing import Any
-
-from app.models.mid.schemas import (
-    ExecutionMode,
-    ToolCallTrace,
-    TraceInfo,
-)
-
-logger = logging.getLogger(__name__)
-
-
-@dataclass
-class AuditRecord:
-    """[AC-MARH-12] Audit record for database persistence."""
-    tenant_id: str
-    session_id: str
-    request_id: str
-    generation_id: str
-    mode: ExecutionMode
-    intent: str | None = None
-    tool_calls: list[dict[str, Any]] = field(default_factory=list)
-    guardrail_triggered: bool = False
-    guardrail_rule_id: str | None = None
-    interrupt_consumed: bool = False
-    fallback_reason_code: str | None = None
-    react_iterations: int = 0
-    latency_ms: int = 0
-    created_at: str | None = None
-
-    def to_dict(self) -> dict[str, Any]:
-        return {
-            "tenant_id": self.tenant_id,
-            "session_id": self.session_id,
-            "request_id": self.request_id,
-            "generation_id": self.generation_id,
-            "mode": self.mode.value,
-            "intent": self.intent,
-            "tool_calls": self.tool_calls,
-            "guardrail_triggered": self.guardrail_triggered,
-            "guardrail_rule_id": self.guardrail_rule_id,
-            "interrupt_consumed": self.interrupt_consumed,
-            "fallback_reason_code": self.fallback_reason_code,
-            "react_iterations": self.react_iterations,
-            "latency_ms": self.latency_ms,
-            "created_at": self.created_at,
-        }
-
-
-class TraceLogger:
-    """
-    [AC-MARH-02, AC-MARH-03, AC-MARH-12] Trace logger for observability and audit.
-
-    Features:
-    - Request-scoped trace context
-    - Tool call tracing
-    - Guardrail event logging with rule_id
-    - Interrupt consumption tracking
-    - Audit record generation
-    """
-
-    def __init__(self):
-        self._traces: dict[str, TraceInfo] = {}
-        self._audit_records: list[AuditRecord] = []
-
-    def start_trace(
-        self,
-        tenant_id: str,
-        session_id: str,
-        request_id: str | None = None,
-        generation_id: str | None = None,
-    ) -> TraceInfo:
-        """
-        [AC-MARH-12] Start a new trace context.
-
-        Args:
-            tenant_id: Tenant ID
-            session_id: Session ID
-            request_id: Request ID (auto-generated if not provided)
-            generation_id: Generation ID for interrupt handling
-
-        Returns:
-            TraceInfo for the new trace
-        """
-        request_id = request_id or str(uuid.uuid4())
-        generation_id = generation_id or str(uuid.uuid4())
-
-        trace = TraceInfo(
-            mode=ExecutionMode.AGENT,
-            request_id=request_id,
-            generation_id=generation_id,
-        )
-
-        self._traces[request_id] = trace
-
-        logger.info(
-            f"[AC-MARH-12] Trace started: request_id={request_id}, "
-            f"session_id={session_id}, generation_id={generation_id}"
-        )
-
-        return trace
-
-    def get_trace(self, request_id: str) -> TraceInfo | None:
-        """Get trace by request ID."""
-        return self._traces.get(request_id)
-
-    def update_trace(
-        self,
-        request_id: str,
-        mode: ExecutionMode | None = None,
-        intent: str | None = None,
-        guardrail_triggered: bool | None = None,
-        guardrail_rule_id: str | None = None,
-        interrupt_consumed: bool | None = None,
-        fallback_reason_code: str | None = None,
-        react_iterations: int | None = None,
-        tool_calls: list[ToolCallTrace] | None = None,
-    ) -> TraceInfo | None:
-        """
-        [AC-MARH-02, AC-MARH-03, AC-MARH-12] Update trace with execution details.
-        """
-        trace = self._traces.get(request_id)
-        if not trace:
-            logger.warning(f"[AC-MARH-12] Trace not found: {request_id}")
-            return None
-
-        if mode is not None:
-            trace.mode = mode
-        if intent is not None:
-            trace.intent = intent
-        if guardrail_triggered is not None:
-            trace.guardrail_triggered = guardrail_triggered
-        if guardrail_rule_id is not None:
-            trace.guardrail_rule_id = guardrail_rule_id
-        if interrupt_consumed is not None:
-            trace.interrupt_consumed = interrupt_consumed
-        if fallback_reason_code is not None:
-            trace.fallback_reason_code = fallback_reason_code
-        if react_iterations is not None:
-            trace.react_iterations = react_iterations
-        if tool_calls is not None:
-            trace.tool_calls = tool_calls
-
-        return trace
-
-    def add_tool_call(
-        self,
-        request_id: str,
-        tool_call: ToolCallTrace,
-    ) -> None:
-        """
-        [AC-MARH-12] Add tool call trace to request.
-        """
-        trace = self._traces.get(request_id)
-        if not trace:
-            logger.warning(f"[AC-MARH-12] Trace not found for tool call: {request_id}")
-            return
-
-        if trace.tool_calls is None:
-            trace.tool_calls = []
-
-        trace.tool_calls.append(tool_call)
-
-        if trace.tools_used is None:
-            trace.tools_used = []
-
-        if tool_call.tool_name not in trace.tools_used:
-            trace.tools_used.append(tool_call.tool_name)
-
-        logger.debug(
-            f"[AC-MARH-12] Tool call recorded: request_id={request_id}, "
-            f"tool={tool_call.tool_name}, status={tool_call.status.value}"
-        )
-
-    def end_trace(
-        self,
-        request_id: str,
-        tenant_id: str,
-        session_id: str,
-        latency_ms: int,
-    ) -> AuditRecord:
-        """
-        [AC-MARH-12] End trace and create audit record.
-        """
-        trace = self._traces.get(request_id)
-        if not trace:
-            logger.warning(f"[AC-MARH-12] Trace not found for end: {request_id}")
-            return AuditRecord(
-                tenant_id=tenant_id,
-                session_id=session_id,
-                request_id=request_id,
-                generation_id=str(uuid.uuid4()),
-                mode=ExecutionMode.FIXED,
-                latency_ms=latency_ms,
-            )
-
-        audit = AuditRecord(
-            tenant_id=tenant_id,
-            session_id=session_id,
-            request_id=request_id,
-            generation_id=trace.generation_id or str(uuid.uuid4()),
-            mode=trace.mode,
-            intent=trace.intent,
-            tool_calls=[tc.model_dump() for tc in trace.tool_calls] if trace.tool_calls else [],
-            guardrail_triggered=trace.guardrail_triggered or False,
-            guardrail_rule_id=trace.guardrail_rule_id,
-            interrupt_consumed=trace.interrupt_consumed or False,
-            fallback_reason_code=trace.fallback_reason_code,
-            react_iterations=trace.react_iterations or 0,
-            latency_ms=latency_ms,
-            created_at=time.strftime("%Y-%m-%d %H:%M:%S"),
-        )
-
-        self._audit_records.append(audit)
-
-        if request_id in self._traces:
-            del self._traces[request_id]
-
-        logger.info(
-            f"[AC-MARH-12] Trace ended: request_id={request_id}, "
-            f"mode={trace.mode.value}, latency_ms={latency_ms}"
-        )
-
-        return audit
-
-    def get_audit_records(
-        self,
-        tenant_id: str,
-        session_id: str | None = None,
-        limit: int = 100,
-    ) -> list[AuditRecord]:
-        """Get audit records for a tenant/session."""
-        records = [
-            r for r in self._audit_records
-            if r.tenant_id == tenant_id
-        ]
-
-        if session_id:
-            records = [r for r in records if r.session_id == session_id]
-
-        return records[-limit:]
-
-    def clear_audit_records(self, tenant_id: str | None = None) -> int:
-        """Clear audit records, optionally filtered by tenant."""
-        if tenant_id:
-            original_count = len(self._audit_records)
-            self._audit_records = [
-                r for r in self._audit_records
-                if r.tenant_id != tenant_id
-            ]
-            return original_count - len(self._audit_records)
-        else:
-            count = len(self._audit_records)
-            self._audit_records = []
-            return count
--- a/ai-service/app/services/openapi/share_token_service.py
+++ b/ai-service/app/services/openapi/share_token_service.py
@ -1,182 +0,0 @@
-"""Redis-backed share token service for secure share links with device binding."""
-
-from __future__ import annotations
-
-import json
-import secrets
-from dataclasses import dataclass
-from datetime import datetime, timedelta
-from typing import Literal
-
-import redis.asyncio as redis
-
-from app.core.config import get_settings
-
-
-@dataclass
-class ChatGrantRecord:
-    chat_token: str
-    tenant_id: str
-    api_key: str
-    session_id: str
-    user_id: str | None
-    bound_device_id: str
-    expires_at: datetime
-    created_at: datetime
-    last_seen_at: datetime
-
-
-@dataclass
-class ClaimResult:
-    ok: bool
-    status: Literal["claimed", "reused", "invalid", "expired", "forbidden"]
-    grant: ChatGrantRecord | None = None
-
-
-class ShareTokenService:
-    """Manage temporary share tokens and device-bound chat grants in Redis."""
-
-    def __init__(self, redis_client: redis.Redis | None = None) -> None:
-        self._settings = get_settings()
-        self._redis = redis_client
-
-    async def _get_client(self) -> redis.Redis:
-        if self._redis is None:
-            self._redis = redis.from_url(
-                self._settings.redis_url,
-                encoding="utf-8",
-                decode_responses=True,
-            )
-        return self._redis
-
-    @staticmethod
-    def _token_key(token: str) -> str:
-        return f"share:token:{token}"
-
-    @staticmethod
-    def _grant_key(chat_token: str) -> str:
-        return f"share:grant:{chat_token}"
-
-    async def create_token(
-        self,
-        tenant_id: str,
-        api_key: str,
-        session_id: str,
-        user_id: str | None,
-        expires_in_minutes: int,
-    ) -> tuple[str, datetime]:
-        client = await self._get_client()
-        now = datetime.utcnow()
-        expires_at = now + timedelta(minutes=expires_in_minutes)
-        ttl_seconds = max(1, int((expires_at - now).total_seconds()))
-
-        token = secrets.token_urlsafe(24)
-        payload = {
-            "tenant_id": tenant_id,
-            "api_key": api_key,
-            "session_id": session_id,
-            "user_id": user_id,
-            "expires_at": expires_at.isoformat(),
-            "bound_device_id": None,
-            "chat_token": None,
-        }
-        await client.setex(self._token_key(token), ttl_seconds, json.dumps(payload))
-        return token, expires_at
-
-    async def claim_or_reuse(self, token: str, device_id: str) -> ClaimResult:
-        client = await self._get_client()
-        raw = await client.get(self._token_key(token))
-        if not raw:
-            return ClaimResult(ok=False, status="invalid")
-
-        data = json.loads(raw)
-        expires_at = datetime.fromisoformat(data["expires_at"])
-        if datetime.utcnow() > expires_at:
-            await client.delete(self._token_key(token))
-            return ClaimResult(ok=False, status="expired")
-
-        if not data.get("chat_token"):
-            chat_token = secrets.token_urlsafe(24)
-            now = datetime.utcnow()
-            ttl_seconds = max(1, int((expires_at - now).total_seconds()))
-
-            grant_data = {
-                "chat_token": chat_token,
-                "tenant_id": data["tenant_id"],
-                "api_key": data["api_key"],
-                "session_id": data["session_id"],
-                "user_id": data.get("user_id"),
-                "bound_device_id": device_id,
-                "expires_at": data["expires_at"],
-                "created_at": now.isoformat(),
-                "last_seen_at": now.isoformat(),
-            }
-            data["bound_device_id"] = device_id
-            data["chat_token"] = chat_token
-
-            pipe = client.pipeline()
-            pipe.setex(self._grant_key(chat_token), ttl_seconds, json.dumps(grant_data))
-            pipe.setex(self._token_key(token), ttl_seconds, json.dumps(data))
-            await pipe.execute()
-
-            return ClaimResult(ok=True, status="claimed", grant=self._to_grant(grant_data))
-
-        if data.get("bound_device_id") != device_id:
-            return ClaimResult(ok=False, status="forbidden")
-
-        grant_raw = await client.get(self._grant_key(data["chat_token"]))
-        if not grant_raw:
-            return ClaimResult(ok=False, status="invalid")
-
-        grant_data = json.loads(grant_raw)
-        grant_data["last_seen_at"] = datetime.utcnow().isoformat()
-        ttl = await client.ttl(self._grant_key(data["chat_token"]))
-        if ttl and ttl > 0:
-            await client.setex(self._grant_key(data["chat_token"]), ttl, json.dumps(grant_data))
-
-        return ClaimResult(ok=True, status="reused", grant=self._to_grant(grant_data))
-
-    async def get_chat_grant_for_device(self, chat_token: str, device_id: str) -> ChatGrantRecord | None:
-        client = await self._get_client()
-        raw = await client.get(self._grant_key(chat_token))
-        if not raw:
-            return None
-
-        data = json.loads(raw)
-        if data.get("bound_device_id") != device_id:
-            return None
-
-        expires_at = datetime.fromisoformat(data["expires_at"])
-        if datetime.utcnow() > expires_at:
-            await client.delete(self._grant_key(chat_token))
-            return None
-
-        data["last_seen_at"] = datetime.utcnow().isoformat()
-        ttl = await client.ttl(self._grant_key(chat_token))
-        if ttl and ttl > 0:
-            await client.setex(self._grant_key(chat_token), ttl, json.dumps(data))
-        return self._to_grant(data)
-
-    @staticmethod
-    def _to_grant(data: dict) -> ChatGrantRecord:
-        return ChatGrantRecord(
-            chat_token=data["chat_token"],
-            tenant_id=data["tenant_id"],
-            api_key=data["api_key"],
-            session_id=data["session_id"],
-            user_id=data.get("user_id"),
-            bound_device_id=data["bound_device_id"],
-            expires_at=datetime.fromisoformat(data["expires_at"]),
-            created_at=datetime.fromisoformat(data["created_at"]),
-            last_seen_at=datetime.fromisoformat(data["last_seen_at"]),
-        )
-
-
-_share_token_service: ShareTokenService | None = None
-
-
-def get_share_token_service() -> ShareTokenService:
-    global _share_token_service
-    if _share_token_service is None:
-        _share_token_service = ShareTokenService()
-    return _share_token_service
--- a/ai-service/app/services/prompt/variable_resolver.py
+++ b/ai-service/app/services/prompt/variable_resolver.py
@ -8,8 +8,6 @@ import re
 from datetime import datetime
 from typing import Any

-from app.core.middleware import PROMPT_PROTECTED_VARIABLES
-
 logger = logging.getLogger(__name__)

 VARIABLE_PATTERN = re.compile(r"\{\{(\w+)\}\}")
@ -20,11 +18,6 @@ BUILTIN_VARIABLES = {
    "channel_type": "default",
    "tenant_name": "平台",
    "session_id": "",
-    "available_tools": "",
-    "query": "",
-    "history": "",
-    "internal_protocol": "",
-    "output_contract": "",
 }


@ -99,18 +92,10 @@ class VariableResolver:
            for var in variables:
                name = var.get("name")
                default = var.get("default", "")
-                if not name:
-                    continue
-                if name in PROMPT_PROTECTED_VARIABLES:
-                    logger.warning(
-                        "Protected prompt variable '%s' cannot be overridden by template defaults",
-                        name,
-                    )
-                    continue
-                context[name] = default
+                if name:
+                    context[name] = default

        if extra_context:
-            # Runtime/system context has the highest priority, including protected vars.
            context.update(extra_context)

        return context
--- a/ai-service/app/services/slot_definition_service.py
+++ b/ai-service/app/services/slot_definition_service.py
@ -1,364 +0,0 @@
-"""
-Slot Definition Service.
-[AC-MRS-07, AC-MRS-08] 槽位定义管理服务
-"""
-
-import logging
-import re
-import uuid
-from datetime import datetime
-from typing import Any
-
-from sqlalchemy import select
-from sqlalchemy.ext.asyncio import AsyncSession
-
-from app.models.entities import (
-    SlotDefinition,
-    SlotDefinitionCreate,
-    SlotDefinitionUpdate,
-    MetadataFieldDefinition,
-    MetadataFieldType,
-    ExtractStrategy,
-)
-
-logger = logging.getLogger(__name__)
-
-
-class SlotDefinitionService:
-    """
-    [AC-MRS-07, AC-MRS-08] 槽位定义服务
-    
-    管理独立的槽位定义模型，与元数据字段解耦但可复用
-    """
-
-    SLOT_KEY_PATTERN = re.compile(r"^[a-z][a-z0-9_]*$")
-    VALID_TYPES = ["string", "number", "boolean", "enum", "array_enum"]
-    VALID_EXTRACT_STRATEGIES = ["rule", "llm", "user_input"]
-
-    def __init__(self, session: AsyncSession):
-        self._session = session
-
-    async def list_slot_definitions(
-        self,
-        tenant_id: str,
-        required: bool | None = None,
-    ) -> list[SlotDefinition]:
-        """
-        列出租户所有槽位定义
-        
-        Args:
-            tenant_id: 租户 ID
-            required: 按是否必填过滤
-            
-        Returns:
-            SlotDefinition 列表
-        """
-        stmt = select(SlotDefinition).where(
-            SlotDefinition.tenant_id == tenant_id,
-        )
-        
-        if required is not None:
-            stmt = stmt.where(SlotDefinition.required == required)
-        
-        stmt = stmt.order_by(SlotDefinition.created_at.desc())
-        
-        result = await self._session.execute(stmt)
-        return list(result.scalars().all())
-
-    async def get_slot_definition(
-        self,
-        tenant_id: str,
-        slot_id: str,
-    ) -> SlotDefinition | None:
-        """
-        获取单个槽位定义
-        
-        Args:
-            tenant_id: 租户 ID
-            slot_id: 槽位定义 ID
-            
-        Returns:
-            SlotDefinition 或 None
-        """
-        try:
-            slot_uuid = uuid.UUID(slot_id)
-        except ValueError:
-            return None
-        
-        stmt = select(SlotDefinition).where(
-            SlotDefinition.tenant_id == tenant_id,
-            SlotDefinition.id == slot_uuid,
-        )
-        result = await self._session.execute(stmt)
-        return result.scalar_one_or_none()
-
-    async def get_slot_definition_by_key(
-        self,
-        tenant_id: str,
-        slot_key: str,
-    ) -> SlotDefinition | None:
-        """
-        通过 slot_key 获取槽位定义
-        
-        Args:
-            tenant_id: 租户 ID
-            slot_key: 槽位键名
-            
-        Returns:
-            SlotDefinition 或 None
-        """
-        stmt = select(SlotDefinition).where(
-            SlotDefinition.tenant_id == tenant_id,
-            SlotDefinition.slot_key == slot_key,
-        )
-        result = await self._session.execute(stmt)
-        return result.scalar_one_or_none()
-
-    async def create_slot_definition(
-        self,
-        tenant_id: str,
-        slot_create: SlotDefinitionCreate,
-    ) -> SlotDefinition:
-        """
-        [AC-MRS-07, AC-MRS-08] 创建槽位定义
-        
-        Args:
-            tenant_id: 租户 ID
-            slot_create: 创建数据
-            
-        Returns:
-            创建的 SlotDefinition
-            
-        Raises:
-            ValueError: 如果 slot_key 已存在或参数无效
-        """
-        if not self.SLOT_KEY_PATTERN.match(slot_create.slot_key):
-            raise ValueError(
-                f"slot_key '{slot_create.slot_key}' 格式不正确，"
-                "必须以小写字母开头，仅允许小写字母、数字和下划线"
-            )
-        
-        existing = await self.get_slot_definition_by_key(tenant_id, slot_create.slot_key)
-        if existing:
-            raise ValueError(f"slot_key '{slot_create.slot_key}' 已存在")
-        
-        if slot_create.type not in self.VALID_TYPES:
-            raise ValueError(
-                f"无效的槽位类型 '{slot_create.type}'，"
-                f"有效类型为: {self.VALID_TYPES}"
-            )
-        
-        if slot_create.extract_strategy and slot_create.extract_strategy not in self.VALID_EXTRACT_STRATEGIES:
-            raise ValueError(
-                f"无效的提取策略 '{slot_create.extract_strategy}'，"
-                f"有效策略为: {self.VALID_EXTRACT_STRATEGIES}"
-            )
-        
-        linked_field = None
-        if slot_create.linked_field_id:
-            linked_field = await self._get_linked_field(slot_create.linked_field_id)
-            if not linked_field:
-                raise ValueError(
-                    f"[AC-MRS-08] 关联的元数据字段 '{slot_create.linked_field_id}' 不存在"
-                )
-        
-        slot = SlotDefinition(
-            tenant_id=tenant_id,
-            slot_key=slot_create.slot_key,
-            type=slot_create.type,
-            required=slot_create.required,
-            extract_strategy=slot_create.extract_strategy,
-            validation_rule=slot_create.validation_rule,
-            ask_back_prompt=slot_create.ask_back_prompt,
-            default_value=slot_create.default_value,
-            linked_field_id=uuid.UUID(slot_create.linked_field_id) if slot_create.linked_field_id else None,
-        )
-        
-        self._session.add(slot)
-        await self._session.flush()
-        
-        logger.info(
-            f"[AC-MRS-07] Created slot definition: tenant={tenant_id}, "
-            f"slot_key={slot.slot_key}, required={slot.required}, "
-            f"linked_field_id={slot.linked_field_id}"
-        )
-        
-        return slot
-
-    async def update_slot_definition(
-        self,
-        tenant_id: str,
-        slot_id: str,
-        slot_update: SlotDefinitionUpdate,
-    ) -> SlotDefinition | None:
-        """
-        更新槽位定义
-        
-        Args:
-            tenant_id: 租户 ID
-            slot_id: 槽位定义 ID
-            slot_update: 更新数据
-            
-        Returns:
-            更新后的 SlotDefinition 或 None
-        """
-        slot = await self.get_slot_definition(tenant_id, slot_id)
-        if not slot:
-            return None
-        
-        if slot_update.type is not None:
-            if slot_update.type not in self.VALID_TYPES:
-                raise ValueError(
-                    f"无效的槽位类型 '{slot_update.type}'，"
-                    f"有效类型为: {self.VALID_TYPES}"
-                )
-            slot.type = slot_update.type
-        
-        if slot_update.required is not None:
-            slot.required = slot_update.required
-        
-        if slot_update.extract_strategy is not None:
-            if slot_update.extract_strategy not in self.VALID_EXTRACT_STRATEGIES:
-                raise ValueError(
-                    f"无效的提取策略 '{slot_update.extract_strategy}'，"
-                    f"有效策略为: {self.VALID_EXTRACT_STRATEGIES}"
-                )
-            slot.extract_strategy = slot_update.extract_strategy
-        
-        if slot_update.validation_rule is not None:
-            slot.validation_rule = slot_update.validation_rule
-        
-        if slot_update.ask_back_prompt is not None:
-            slot.ask_back_prompt = slot_update.ask_back_prompt
-        
-        if slot_update.default_value is not None:
-            slot.default_value = slot_update.default_value
-        
-        if slot_update.linked_field_id is not None:
-            if slot_update.linked_field_id:
-                linked_field = await self._get_linked_field(slot_update.linked_field_id)
-                if not linked_field:
-                    raise ValueError(
-                        f"[AC-MRS-08] 关联的元数据字段 '{slot_update.linked_field_id}' 不存在"
-                    )
-                slot.linked_field_id = uuid.UUID(slot_update.linked_field_id)
-            else:
-                slot.linked_field_id = None
-        
-        slot.updated_at = datetime.utcnow()
-        await self._session.flush()
-        
-        logger.info(
-            f"[AC-MRS-07] Updated slot definition: tenant={tenant_id}, "
-            f"slot_id={slot_id}"
-        )
-        
-        return slot
-
-    async def delete_slot_definition(
-        self,
-        tenant_id: str,
-        slot_id: str,
-    ) -> bool:
-        """
-        [AC-MRS-16] 删除槽位定义
-        
-        Args:
-            tenant_id: 租户 ID
-            slot_id: 槽位定义 ID
-            
-        Returns:
-            是否删除成功
-        """
-        slot = await self.get_slot_definition(tenant_id, slot_id)
-        if not slot:
-            return False
-        
-        await self._session.delete(slot)
-        await self._session.flush()
-        
-        logger.info(
-            f"[AC-MRS-16] Deleted slot definition: tenant={tenant_id}, "
-            f"slot_id={slot_id}"
-        )
-        
-        return True
-
-    async def _get_linked_field(
-        self,
-        field_id: str,
-    ) -> MetadataFieldDefinition | None:
-        """
-        获取关联的元数据字段
-        
-        Args:
-            field_id: 字段 ID
-            
-        Returns:
-            MetadataFieldDefinition 或 None
-        """
-        try:
-            field_uuid = uuid.UUID(field_id)
-        except ValueError:
-            return None
-        
-        stmt = select(MetadataFieldDefinition).where(
-            MetadataFieldDefinition.id == field_uuid,
-        )
-        result = await self._session.execute(stmt)
-        return result.scalar_one_or_none()
-
-    async def get_slot_definition_with_field(
-        self,
-        tenant_id: str,
-        slot_id: str,
-    ) -> dict[str, Any] | None:
-        """
-        获取槽位定义及其关联字段信息
-        
-        Args:
-            tenant_id: 租户 ID
-            slot_id: 槽位定义 ID
-            
-        Returns:
-            包含槽位定义和关联字段的字典
-        """
-        slot = await self.get_slot_definition(tenant_id, slot_id)
-        if not slot:
-            return None
-        
-        result = {
-            "id": str(slot.id),
-            "tenant_id": slot.tenant_id,
-            "slot_key": slot.slot_key,
-            "type": slot.type,
-            "required": slot.required,
-            "extract_strategy": slot.extract_strategy,
-            "validation_rule": slot.validation_rule,
-            "ask_back_prompt": slot.ask_back_prompt,
-            "default_value": slot.default_value,
-            "linked_field_id": str(slot.linked_field_id) if slot.linked_field_id else None,
-            "created_at": slot.created_at.isoformat() if slot.created_at else None,
-            "updated_at": slot.updated_at.isoformat() if slot.updated_at else None,
-            "linked_field": None,
-        }
-        
-        if slot.linked_field_id:
-            linked_field = await self._get_linked_field(str(slot.linked_field_id))
-            if linked_field:
-                result["linked_field"] = {
-                    "id": str(linked_field.id),
-                    "field_key": linked_field.field_key,
-                    "label": linked_field.label,
-                    "type": linked_field.type,
-                    "required": linked_field.required,
-                    "options": linked_field.options,
-                    "default_value": linked_field.default_value,
-                    "scope": linked_field.scope,
-                    "is_filterable": linked_field.is_filterable,
-                    "is_rank_feature": linked_field.is_rank_feature,
-                    "field_roles": linked_field.field_roles,
-                    "status": linked_field.status,
-                }
-        
-        return result
--- a/ai-service/scripts/migrations/005_create_mid_tables.sql
+++ b/ai-service/scripts/migrations/005_create_mid_tables.sql
@ -1,67 +0,0 @@
-- [AC-IDMP-05/07/09/20] 中台改造数据库迁移
-- 创建高风险策略表、会话模式记录表、审计日志表
-
-- 高风险策略配置表
-CREATE TABLE IF NOT EXISTS high_risk_policies (
-    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
-    tenant_id VARCHAR(255) NOT NULL,
-    scenario VARCHAR(64) NOT NULL,
-    handler_mode VARCHAR(32) NOT NULL DEFAULT 'micro_flow',
-    flow_id UUID REFERENCES script_flows(id),
-    transfer_message TEXT,
-    keywords JSONB,
-    patterns JSONB,
-    priority INTEGER DEFAULT 0,
-    is_enabled BOOLEAN DEFAULT TRUE,
-    created_at TIMESTAMP WITH TIME ZONE DEFAULT NOW(),
-    updated_at TIMESTAMP WITH TIME ZONE DEFAULT NOW()
-);
-
-CREATE INDEX IF NOT EXISTS ix_high_risk_policies_tenant ON high_risk_policies(tenant_id);
-CREATE INDEX IF NOT EXISTS ix_high_risk_policies_tenant_enabled ON high_risk_policies(tenant_id, is_enabled);
-
-- 会话模式记录表
-CREATE TABLE IF NOT EXISTS session_mode_records (
-    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
-    tenant_id VARCHAR(255) NOT NULL,
-    session_id VARCHAR(255) NOT NULL,
-    mode VARCHAR(32) NOT NULL DEFAULT 'BOT_ACTIVE',
-    reason TEXT,
-    switched_at TIMESTAMP WITH TIME ZONE,
-    created_at TIMESTAMP WITH TIME ZONE DEFAULT NOW(),
-    updated_at TIMESTAMP WITH TIME ZONE DEFAULT NOW(),
-    CONSTRAINT ix_session_mode_records_tenant_session UNIQUE (tenant_id, session_id)
-);
-
-- 中台审计日志表
-CREATE TABLE IF NOT EXISTS mid_audit_logs (
-    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
-    tenant_id VARCHAR(255) NOT NULL,
-    session_id VARCHAR(255) NOT NULL,
-    request_id VARCHAR(255) NOT NULL,
-    generation_id VARCHAR(255) NOT NULL,
-    mode VARCHAR(32) NOT NULL,
-    intent VARCHAR(255),
-    tool_calls JSONB,
-    guardrail_triggered BOOLEAN DEFAULT FALSE,
-    fallback_reason_code VARCHAR(64),
-    react_iterations INTEGER,
-    high_risk_scenario VARCHAR(64),
-    latency_ms INTEGER,
-    created_at TIMESTAMP WITH TIME ZONE DEFAULT NOW()
-);
-
-CREATE INDEX IF NOT EXISTS ix_mid_audit_logs_tenant_session ON mid_audit_logs(tenant_id, session_id);
-CREATE INDEX IF NOT EXISTS ix_mid_audit_logs_tenant_request ON mid_audit_logs(tenant_id, request_id);
-CREATE INDEX IF NOT EXISTS ix_mid_audit_logs_tenant_generation ON mid_audit_logs(tenant_id, generation_id);
-CREATE INDEX IF NOT EXISTS ix_mid_audit_logs_created ON mid_audit_logs(created_at);
-
-- 插入默认高风险策略（空集保护）
-- 注意：这里只插入示例，实际由租户配置
-INSERT INTO high_risk_policies (tenant_id, scenario, handler_mode, keywords, priority, is_enabled)
-VALUES 
-    ('default', 'refund', 'micro_flow', '["退款", "退货", "退钱", "退费"]'::jsonb, 100, TRUE),
-    ('default', 'complaint_escalation', 'micro_flow', '["投诉", "举报", "差评", "曝光"]'::jsonb, 100, TRUE),
-    ('default', 'privacy_sensitive_promise', 'micro_flow', '["承诺", "保证", "隐私", "个人信息"]'::jsonb, 100, TRUE),
-    ('default', 'transfer', 'transfer', '["人工", "转人工", "人工客服", "真人"]'::jsonb, 100, TRUE)
-ON CONFLICT DO NOTHING;
--- a/ai-service/scripts/migrations/006_create_shared_sessions.sql
+++ b/ai-service/scripts/migrations/006_create_shared_sessions.sql
@ -1,24 +0,0 @@
-- [AC-IDMP-SHARE] 对话分享功能数据库迁移
-- 创建 shared_sessions 表，支持通过链接分享对话
-
-- 分享会话表
-CREATE TABLE IF NOT EXISTS shared_sessions (
-    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
-    share_token VARCHAR(255) NOT NULL UNIQUE,
-    session_id VARCHAR(255) NOT NULL,
-    tenant_id VARCHAR(255) NOT NULL,
-    title VARCHAR(255),
-    description TEXT,
-    expires_at TIMESTAMP WITH TIME ZONE NOT NULL,
-    is_active BOOLEAN DEFAULT TRUE,
-    max_concurrent_users INTEGER DEFAULT 10,
-    current_users INTEGER DEFAULT 0,
-    created_at TIMESTAMP WITH TIME ZONE DEFAULT NOW(),
-    updated_at TIMESTAMP WITH TIME ZONE DEFAULT NOW()
-);
-
-CREATE INDEX IF NOT EXISTS ix_shared_sessions_share_token ON shared_sessions(share_token);
-CREATE INDEX IF NOT EXISTS ix_shared_sessions_session_id ON shared_sessions(session_id);
-CREATE INDEX IF NOT EXISTS ix_shared_sessions_tenant ON shared_sessions(tenant_id);
-CREATE INDEX IF NOT EXISTS ix_shared_sessions_tenant_session ON shared_sessions(tenant_id, session_id);
-CREATE INDEX IF NOT EXISTS ix_shared_sessions_expires_at ON shared_sessions(expires_at);
--- a/ai-service/scripts/migrations/007_add_field_roles_and_slot_definitions.sql
+++ b/ai-service/scripts/migrations/007_add_field_roles_and_slot_definitions.sql
@ -1,63 +0,0 @@
-- Migration: Add field_roles and slot_definitions
-- Date: 2026-03-05
-- Issue: [AC-MRS-01, AC-MRS-07] 元数据职责分层优化
-
-- 1. 为 metadata_field_definitions 表新增 field_roles 字段
-ALTER TABLE metadata_field_definitions 
-ADD COLUMN IF NOT EXISTS field_roles JSONB DEFAULT '[]'::jsonb;
-
-- 2. 创建 GIN 索引支持按角色查询
-CREATE INDEX IF NOT EXISTS idx_metadata_field_definitions_roles 
-ON metadata_field_definitions USING GIN (field_roles);
-
-- 3. 添加字段注释
-COMMENT ON COLUMN metadata_field_definitions.field_roles IS 
-'[AC-MRS-01] 字段角色列表：resource_filter, slot, prompt_var, routing_signal';
-
-- 4. 创建 slot_definitions 表
-CREATE TABLE IF NOT EXISTS slot_definitions (
-    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
-    tenant_id VARCHAR NOT NULL,
-    slot_key VARCHAR(100) NOT NULL,
-    type VARCHAR(20) NOT NULL DEFAULT 'string',
-    required BOOLEAN NOT NULL DEFAULT FALSE,
-    extract_strategy VARCHAR(20),
-    validation_rule TEXT,
-    ask_back_prompt TEXT,
-    default_value JSONB,
-    linked_field_id UUID REFERENCES metadata_field_definitions(id),
-    created_at TIMESTAMPTZ NOT NULL DEFAULT NOW(),
-    updated_at TIMESTAMPTZ NOT NULL DEFAULT NOW(),
-    
-    CONSTRAINT uk_slot_definitions_tenant_key UNIQUE (tenant_id, slot_key),
-    CONSTRAINT chk_slot_definitions_type CHECK (type IN ('string', 'number', 'boolean', 'enum', 'array_enum')),
-    CONSTRAINT chk_slot_definitions_extract_strategy CHECK (extract_strategy IS NULL OR extract_strategy IN ('rule', 'llm', 'user_input'))
-);
-
-- 5. 创建索引
-CREATE INDEX IF NOT EXISTS ix_slot_definitions_tenant 
-ON slot_definitions(tenant_id);
-
-CREATE INDEX IF NOT EXISTS ix_slot_definitions_tenant_key 
-ON slot_definitions(tenant_id, slot_key);
-
-CREATE INDEX IF NOT EXISTS ix_slot_definitions_linked_field 
-ON slot_definitions(linked_field_id);
-
-- 6. 添加表和字段注释
-COMMENT ON TABLE slot_definitions IS '[AC-MRS-07] 槽位定义表';
-COMMENT ON COLUMN slot_definitions.slot_key IS '[AC-MRS-07] 槽位键名，可与元数据字段 field_key 关联';
-COMMENT ON COLUMN slot_definitions.type IS '[AC-MRS-07] 槽位类型：string/number/boolean/enum/array_enum';
-COMMENT ON COLUMN slot_definitions.required IS '[AC-MRS-07] 是否必填槽位';
-COMMENT ON COLUMN slot_definitions.extract_strategy IS '[AC-MRS-07] 提取策略：rule/llm/user_input';
-COMMENT ON COLUMN slot_definitions.validation_rule IS '[AC-MRS-07] 校验规则（正则或 JSON Schema）';
-COMMENT ON COLUMN slot_definitions.ask_back_prompt IS '[AC-MRS-07] 追问提示语模板';
-COMMENT ON COLUMN slot_definitions.default_value IS '[AC-MRS-07] 默认值';
-COMMENT ON COLUMN slot_definitions.linked_field_id IS '[AC-MRS-08] 关联的元数据字段 ID';
-
-- 7. 初始化现有字段的默认角色（根据 is_filterable 推断 resource_filter 角色）
-UPDATE metadata_field_definitions 
-SET field_roles = '["resource_filter"]'::jsonb
-WHERE is_filterable = true 
-  AND status = 'active'
-  AND (field_roles IS NULL OR field_roles = '[]'::jsonb);
--- a/ai-service/scripts/migrations/create_shared_sessions_table.py
+++ b/ai-service/scripts/migrations/create_shared_sessions_table.py
@ -1,71 +0,0 @@
-"""
-Migration script to create shared_sessions table.
-Run: python scripts/migrations/create_shared_sessions_table.py
-"""
-
-import asyncio
-import sys
-from pathlib import Path
-
-sys.path.insert(0, str(Path(__file__).parent.parent.parent))
-
-from sqlalchemy import text
-from sqlalchemy.ext.asyncio import create_async_engine
-from sqlalchemy.orm import sessionmaker
-from sqlmodel.ext.asyncio.session import AsyncSession
-
-from app.core.config import get_settings
-
-
-async def run_migration():
-    """Run the migration to create shared_sessions table."""
-    settings = get_settings()
-    engine = create_async_engine(settings.database_url, echo=True)
-    async_session_maker = sessionmaker(engine, class_=AsyncSession, expire_on_commit=False)
-
-    migration_sql = """
-    CREATE TABLE IF NOT EXISTS shared_sessions (
-        id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
-        share_token VARCHAR(255) NOT NULL UNIQUE,
-        session_id VARCHAR(255) NOT NULL,
-        tenant_id VARCHAR(255) NOT NULL,
-        title VARCHAR(255),
-        description TEXT,
-        expires_at TIMESTAMP WITH TIME ZONE NOT NULL,
-        is_active BOOLEAN DEFAULT TRUE,
-        max_concurrent_users INTEGER DEFAULT 10,
-        current_users INTEGER DEFAULT 0,
-        created_at TIMESTAMP WITH TIME ZONE DEFAULT NOW(),
-        updated_at TIMESTAMP WITH TIME ZONE DEFAULT NOW()
-    );
-
-    CREATE INDEX IF NOT EXISTS ix_shared_sessions_share_token ON shared_sessions(share_token);
-    CREATE INDEX IF NOT EXISTS ix_shared_sessions_session_id ON shared_sessions(session_id);
-    CREATE INDEX IF NOT EXISTS ix_shared_sessions_tenant ON shared_sessions(tenant_id);
-    CREATE INDEX IF NOT EXISTS ix_shared_sessions_tenant_session ON shared_sessions(tenant_id, session_id);
-    CREATE INDEX IF NOT EXISTS ix_shared_sessions_expires_at ON shared_sessions(expires_at);
-    """
-
-    async with async_session_maker() as session:
-        for statement in migration_sql.strip().split(';'):
-            statement = statement.strip()
-            if statement and not statement.startswith('--'):
-                try:
-                    await session.execute(text(statement))
-                    print(f"Executed: {statement[:60]}...")
-                except Exception as e:
-                    error_str = str(e).lower()
-                    if "already exists" in error_str or "duplicate" in error_str:
-                        print(f"Skipped (already exists): {statement[:60]}...")
-                    else:
-                        print(f"Error: {e}")
-                        raise
-        
-        await session.commit()
-        print("\nMigration completed successfully!")
-
-    await engine.dispose()
-
-
-if __name__ == "__main__":
-    asyncio.run(run_migration())
--- a/ai-service/scripts/migrations/run_shared_sessions_migration.py
+++ b/ai-service/scripts/migrations/run_shared_sessions_migration.py
@ -1,71 +0,0 @@
-"""
-Migration script to create shared_sessions table.
-Run: python scripts/migrations/run_shared_sessions_migration.py
-"""
-
-import asyncio
-import sys
-from pathlib import Path
-
-sys.path.insert(0, str(Path(__file__).parent.parent.parent))
-
-from sqlalchemy import text
-from sqlalchemy.ext.asyncio import create_async_engine
-from sqlalchemy.orm import sessionmaker
-from sqlmodel.ext.asyncio.session import AsyncSession
-
-from app.core.config import get_settings
-
-
-async def run_migration():
-    """Run the migration to create shared_sessions table."""
-    settings = get_settings()
-    engine = create_async_engine(settings.database_url, echo=True)
-    async_session_maker = sessionmaker(engine, class_=AsyncSession, expire_on_commit=False)
-
-    migration_sql = """
-    CREATE TABLE IF NOT EXISTS shared_sessions (
-        id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
-        share_token VARCHAR(255) NOT NULL UNIQUE,
-        session_id VARCHAR(255) NOT NULL,
-        tenant_id VARCHAR(255) NOT NULL,
-        title VARCHAR(255),
-        description TEXT,
-        expires_at TIMESTAMP WITH TIME ZONE NOT NULL,
-        is_active BOOLEAN DEFAULT TRUE,
-        max_concurrent_users INTEGER DEFAULT 10,
-        current_users INTEGER DEFAULT 0,
-        created_at TIMESTAMP WITH TIME ZONE DEFAULT NOW(),
-        updated_at TIMESTAMP WITH TIME ZONE DEFAULT NOW()
-    );
-
-    CREATE INDEX IF NOT EXISTS ix_shared_sessions_share_token ON shared_sessions(share_token);
-    CREATE INDEX IF NOT EXISTS ix_shared_sessions_session_id ON shared_sessions(session_id);
-    CREATE INDEX IF NOT EXISTS ix_shared_sessions_tenant ON shared_sessions(tenant_id);
-    CREATE INDEX IF NOT EXISTS ix_shared_sessions_tenant_session ON shared_sessions(tenant_id, session_id);
-    CREATE INDEX IF NOT EXISTS ix_shared_sessions_expires_at ON shared_sessions(expires_at);
-    """
-
-    async with async_session_maker() as session:
-        for statement in migration_sql.strip().split(';'):
-            statement = statement.strip()
-            if statement and not statement.startswith('--'):
-                try:
-                    await session.execute(text(statement))
-                    print(f"Executed: {statement[:60]}...")
-                except Exception as e:
-                    error_str = str(e).lower()
-                    if "already exists" in error_str or "duplicate" in error_str:
-                        print(f"Skipped (already exists): {statement[:60]}...")
-                    else:
-                        print(f"Error: {e}")
-                        raise
-        
-        await session.commit()
-        print("\nMigration completed successfully!")
-
-    await engine.dispose()
-
-
-if __name__ == "__main__":
-    asyncio.run(run_migration())
--- a/ai-service/tests/test_mid_memory_tool.py
+++ b/ai-service/tests/test_mid_memory_tool.py
@ -1,550 +0,0 @@
-"""
-Tests for Mid Platform Memory and Tool Governance services.
-[AC-IDMP-13/14/15/19] 记忆与工具治理测试
-
-覆盖路径：
- 成功路径：正常 recall/update，工具调用成功
- 超时路径：recall 超时降级，工具调用超时
- 错误路径：recall 失败降级，工具调用错误
- 降级路径：记忆服务不可用时继续主链路
-"""
-
-import asyncio
-import pytest
-from unittest.mock import AsyncMock, MagicMock, patch
-from datetime import datetime
-
-from app.services.mid.memory_adapter import MemoryAdapter, UserMemory
-from app.services.mid.tool_call_recorder import ToolCallRecorder, ToolCallStatistics, get_tool_call_recorder
-from app.services.mid.tool_registry import ToolRegistry, ToolDefinition, ToolExecutionResult, get_tool_registry, init_tool_registry
-from app.models.mid.memory import (
-    RecallRequest,
-    RecallResponse,
-    UpdateRequest,
-    MemoryProfile,
-    MemoryFact,
-    MemoryPreferences,
-)
-from app.models.mid.tool_trace import (
-    ToolCallTrace,
-    ToolCallBuilder,
-    ToolCallStatus,
-    ToolType,
-)
-from app.models.mid.schemas import ToolCallTrace as ToolCallTraceSchema
-
-
-class TestMemoryAdapter:
-    """[AC-IDMP-13/14] 记忆适配器测试"""
-
-    @pytest.fixture
-    def mock_session(self):
-        return AsyncMock()
-
-    @pytest.fixture
-    def adapter(self, mock_session):
-        return MemoryAdapter(mock_session)
-
-    @pytest.mark.asyncio
-    async def test_recall_success(self, adapter):
-        """[AC-IDMP-13] 成功召回用户记忆"""
-        response = await adapter.recall(
-            user_id="user123",
-            session_id="session456",
-            tenant_id="tenant789",
-        )
-        
-        assert response is not None
-        assert response.profile is not None
-        assert response.profile.grade == "初一"
-        assert response.profile.region == "北京"
-        assert len(response.facts) > 0
-        assert response.preferences is not None
-        assert response.preferences.tone == "friendly"
-
-    @pytest.mark.asyncio
-    async def test_recall_returns_context_for_prompt(self, adapter):
-        """[AC-IDMP-13] 召回记忆可注入 Prompt"""
-        response = await adapter.recall(
-            user_id="user123",
-            session_id="session456",
-        )
-        
-        context = response.get_context_for_prompt()
-        
-        assert "【用户属性】" in context
-        assert "年级" in context
-        assert "【已知事实】" in context
-        assert "【用户偏好】" in context
-
-    @pytest.mark.asyncio
-    async def test_recall_timeout_fallback(self, mock_session):
-        """[AC-IDMP-13] recall 超时降级，不阻断主链路"""
-        adapter = MemoryAdapter(mock_session, recall_timeout_ms=1)
-        
-        async def slow_recall(*args, **kwargs):
-            await asyncio.sleep(1)
-            return RecallResponse()
-        
-        with patch.object(adapter, '_recall_internal', slow_recall):
-            response = await adapter.recall(
-                user_id="user123",
-                session_id="session456",
-            )
-        
-        assert response is not None
-        assert response.profile is None
-        assert response.facts == []
-
-    @pytest.mark.asyncio
-    async def test_recall_error_fallback(self, adapter, mock_session):
-        """[AC-IDMP-13] recall 错误降级，不阻断主链路"""
-        with patch.object(adapter, '_recall_internal', side_effect=Exception("DB error")):
-            response = await adapter.recall(
-                user_id="user123",
-                session_id="session456",
-            )
-        
-        assert response is not None
-        assert response.profile is None
-
-    @pytest.mark.asyncio
-    async def test_update_async(self, adapter):
-        """[AC-IDMP-14] 异步更新用户记忆"""
-        result = await adapter.update(
-            user_id="user123",
-            session_id="session456",
-            messages=[{"role": "user", "content": "你好"}],
-            summary="用户打招呼",
-        )
-        
-        assert result is True
-        
-        await asyncio.sleep(0.1)
-        
-        completed = await adapter.wait_pending_updates(timeout=1.0)
-        assert completed >= 0
-
-    @pytest.mark.asyncio
-    async def test_update_with_summary_generation(self, adapter):
-        """[AC-IDMP-14] 带摘要生成的记忆更新"""
-        async def summary_gen(messages):
-            return "这是一个测试摘要"
-        
-        result = await adapter.update_with_summary_generation(
-            user_id="user123",
-            session_id="session456",
-            messages=[{"role": "user", "content": "你好"}],
-            summary_generator=summary_gen,
-        )
-        
-        assert result is True
-
-
-class TestToolCallRecorder:
-    """[AC-IDMP-15] 工具调用记录器测试"""
-
-    @pytest.fixture
-    def recorder(self):
-        return ToolCallRecorder()
-
-    def test_record_success(self, recorder):
-        """[AC-IDMP-15] 记录成功的工具调用"""
-        trace = recorder.record_success(
-            session_id="session123",
-            tool_name="kb_search",
-            tool_type=ToolType.INTERNAL,
-            duration_ms=150,
-            args={"query": "测试查询"},
-            result={"docs": ["doc1", "doc2"]},
-            registry_version="1.0.0",
-            auth_applied=False,
-        )
-        
-        assert trace.tool_name == "kb_search"
-        assert trace.status == ToolCallStatus.OK
-        assert trace.duration_ms == 150
-        assert trace.args_digest is not None
-        assert trace.result_digest is not None
-
-    def test_record_timeout(self, recorder):
-        """[AC-IDMP-15] 记录超时的工具调用"""
-        trace = recorder.record_timeout(
-            session_id="session123",
-            tool_name="kb_search",
-            tool_type=ToolType.INTERNAL,
-            duration_ms=2000,
-            args={"query": "测试查询"},
-        )
-        
-        assert trace.status == ToolCallStatus.TIMEOUT
-        assert trace.error_code == "TIMEOUT"
-
-    def test_record_error(self, recorder):
-        """[AC-IDMP-15] 记录错误的工具调用"""
-        trace = recorder.record_error(
-            session_id="session123",
-            tool_name="kb_search",
-            tool_type=ToolType.INTERNAL,
-            duration_ms=500,
-            error_code="DB_CONNECTION_ERROR",
-            error_message="Database connection failed",
-        )
-        
-        assert trace.status == ToolCallStatus.ERROR
-        assert trace.error_code == "DB_CONNECTION_ERROR"
-
-    def test_record_rejected(self, recorder):
-        """[AC-IDMP-15] 记录被拒绝的工具调用"""
-        trace = recorder.record_rejected(
-            session_id="session123",
-            tool_name="sensitive_tool",
-            tool_type=ToolType.MCP,
-            reason="AUTH_REQUIRED",
-        )
-        
-        assert trace.status == ToolCallStatus.REJECTED
-        assert trace.error_code == "AUTH_REQUIRED"
-
-    def test_get_traces(self, recorder):
-        """[AC-IDMP-15] 获取会话的工具调用记录"""
-        recorder.record_success(
-            session_id="session123",
-            tool_name="tool1",
-            tool_type=ToolType.INTERNAL,
-            duration_ms=100,
-        )
-        recorder.record_success(
-            session_id="session123",
-            tool_name="tool2",
-            tool_type=ToolType.INTERNAL,
-            duration_ms=200,
-        )
-        
-        traces = recorder.get_traces("session123")
-        
-        assert len(traces) == 2
-        assert traces[0].tool_name == "tool1"
-        assert traces[1].tool_name == "tool2"
-
-    def test_get_statistics(self, recorder):
-        """[AC-IDMP-15] 获取工具调用统计"""
-        recorder.record_success(
-            session_id="session123",
-            tool_name="kb_search",
-            tool_type=ToolType.INTERNAL,
-            duration_ms=100,
-        )
-        recorder.record_timeout(
-            session_id="session123",
-            tool_name="kb_search",
-            tool_type=ToolType.INTERNAL,
-            duration_ms=2000,
-        )
-        
-        stats = recorder.get_statistics("kb_search")
-        
-        assert stats["total_calls"] == 2
-        assert stats["success_rate"] == 0.5
-        assert stats["timeout_rate"] == 0.5
-
-    def test_to_trace_info_format(self, recorder):
-        """[AC-IDMP-15] 转换为 TraceInfo 格式"""
-        recorder.record_success(
-            session_id="session123",
-            tool_name="kb_search",
-            tool_type=ToolType.INTERNAL,
-            duration_ms=100,
-        )
-        
-        traces = recorder.to_trace_info_format("session123")
-        
-        assert len(traces) == 1
-        assert traces[0]["tool_name"] == "kb_search"
-        assert traces[0]["status"] == "ok"
-
-    def test_compute_digest(self, recorder):
-        """[AC-IDMP-15] 敏感参数脱敏"""
-        long_data = {"query": "x" * 100}
-        digest = ToolCallTrace.compute_digest(long_data)
-        
-        assert len(digest) < 100
-        assert "..." in digest or len(digest) <= 64
-
-
-class TestToolRegistry:
-    """[AC-IDMP-19] Tool Registry 治理测试"""
-
-    @pytest.fixture
-    def registry(self):
-        return ToolRegistry()
-
-    def test_register_tool(self, registry):
-        """[AC-IDMP-19] 注册工具"""
-        async def dummy_handler(**kwargs):
-            return {"result": "ok"}
-        
-        tool = registry.register(
-            name="test_tool",
-            description="测试工具",
-            handler=dummy_handler,
-            tool_type=ToolType.INTERNAL,
-            version="1.0.0",
-            auth_required=False,
-            timeout_ms=1000,
-        )
-        
-        assert tool.name == "test_tool"
-        assert tool.enabled is True
-        assert tool.timeout_ms == 1000
-
-    def test_register_mcp_tool(self, registry):
-        """[AC-IDMP-19] 注册 MCP 工具"""
-        async def mcp_handler(**kwargs):
-            return {"result": "ok"}
-        
-        tool = registry.register(
-            name="mcp_tool",
-            description="MCP 工具",
-            handler=mcp_handler,
-            tool_type=ToolType.MCP,
-        )
-        
-        assert tool.tool_type == ToolType.MCP
-
-    def test_enable_disable_tool(self, registry):
-        """[AC-IDMP-19] 启停工具"""
-        async def handler(**kwargs):
-            return {}
-        
-        registry.register(
-            name="toggle_tool",
-            description="可切换工具",
-            handler=handler,
-        )
-        
-        assert registry.get_tool("toggle_tool").enabled is True
-        
-        registry.disable_tool("toggle_tool")
-        assert registry.get_tool("toggle_tool").enabled is False
-        
-        registry.enable_tool("toggle_tool")
-        assert registry.get_tool("toggle_tool").enabled is True
-
-    @pytest.mark.asyncio
-    async def test_execute_tool_success(self, registry):
-        """[AC-IDMP-19] 执行工具成功"""
-        async def handler(**kwargs):
-            return {"data": kwargs.get("query")}
-        
-        registry.register(
-            name="search",
-            description="搜索工具",
-            handler=handler,
-            timeout_ms=1000,
-        )
-        
-        result = await registry.execute(
-            tool_name="search",
-            args={"query": "test"},
-        )
-        
-        assert result.success is True
-        assert result.output["data"] == "test"
-
-    @pytest.mark.asyncio
-    async def test_execute_tool_timeout(self, registry):
-        """[AC-IDMP-19] 工具执行超时"""
-        async def slow_handler(**kwargs):
-            await asyncio.sleep(5)
-            return {"data": "ok"}
-        
-        registry.register(
-            name="slow_tool",
-            description="慢工具",
-            handler=slow_handler,
-            timeout_ms=100,
-        )
-        
-        result = await registry.execute(
-            tool_name="slow_tool",
-            args={},
-        )
-        
-        assert result.success is False
-        assert "timeout" in result.error.lower()
-
-    @pytest.mark.asyncio
-    async def test_execute_tool_not_found(self, registry):
-        """[AC-IDMP-19] 工具不存在"""
-        result = await registry.execute(
-            tool_name="nonexistent",
-            args={},
-        )
-        
-        assert result.success is False
-        assert "not found" in result.error.lower()
-
-    @pytest.mark.asyncio
-    async def test_execute_tool_disabled(self, registry):
-        """[AC-IDMP-19] 工具已禁用"""
-        async def handler(**kwargs):
-            return {}
-        
-        registry.register(
-            name="disabled_tool",
-            description="禁用工具",
-            handler=handler,
-            enabled=False,
-        )
-        
-        result = await registry.execute(
-            tool_name="disabled_tool",
-            args={},
-        )
-        
-        assert result.success is False
-        assert "disabled" in result.error.lower()
-
-    @pytest.mark.asyncio
-    async def test_execute_tool_auth_required(self, registry):
-        """[AC-IDMP-19] 工具需要鉴权"""
-        async def handler(**kwargs):
-            return {}
-        
-        registry.register(
-            name="auth_tool",
-            description="需要鉴权的工具",
-            handler=handler,
-            auth_required=True,
-        )
-        
-        result = await registry.execute(
-            tool_name="auth_tool",
-            args={},
-            auth_context=None,
-        )
-        
-        assert result.success is False
-        assert "auth" in result.error.lower()
-
-    @pytest.mark.asyncio
-    async def test_execute_tool_with_auth_context(self, registry):
-        """[AC-IDMP-19] 带鉴权上下文执行工具"""
-        async def handler(**kwargs):
-            return {"authenticated": True}
-        
-        registry.register(
-            name="auth_tool2",
-            description="需要鉴权的工具",
-            handler=handler,
-            auth_required=True,
-        )
-        
-        result = await registry.execute(
-            tool_name="auth_tool2",
-            args={},
-            auth_context={"user_id": "user123"},
-        )
-        
-        assert result.success is True
-        assert result.auth_applied is True
-
-    def test_create_trace(self, registry):
-        """[AC-IDMP-19] 创建工具调用追踪"""
-        result = ToolExecutionResult(
-            success=True,
-            output={"data": "ok"},
-            duration_ms=100,
-            auth_applied=False,
-            registry_version="1.0.0",
-        )
-        
-        trace = registry.create_trace(
-            tool_name="test_tool",
-            result=result,
-            args_digest="query=test",
-        )
-        
-        assert trace.tool_name == "test_tool"
-        assert trace.status == ToolCallStatus.OK
-
-    def test_get_governance_report(self, registry):
-        """[AC-IDMP-19] 获取治理报告"""
-        async def handler(**kwargs):
-            return {}
-        
-        registry.register(name="tool1", description="工具1", handler=handler)
-        registry.register(name="tool2", description="工具2", handler=handler, enabled=False)
-        registry.register(name="tool3", description="工具3", handler=handler, auth_required=True)
-        
-        report = registry.get_governance_report()
-        
-        assert report["total_tools"] == 3
-        assert report["enabled_tools"] == 2
-        assert report["disabled_tools"] == 1
-        assert report["auth_required_tools"] == 1
-
-    def test_get_tool_registry_singleton(self):
-        """[AC-IDMP-19] 获取全局注册表实例"""
-        registry1 = get_tool_registry()
-        registry2 = get_tool_registry()
-        
-        assert registry1 is registry2
-
-    def test_init_tool_registry(self):
-        """[AC-IDMP-19] 初始化注册表"""
-        from app.services.mid.timeout_governor import TimeoutGovernor
-        
-        governor = TimeoutGovernor()
-        registry = init_tool_registry(timeout_governor=governor)
-        
-        assert registry is not None
-        assert registry._timeout_governor is governor
-
-
-class TestToolCallBuilder:
-    """[AC-IDMP-15] 工具调用构建器测试"""
-
-    def test_build_success_trace(self):
-        """[AC-IDMP-15] 构建成功追踪"""
-        builder = ToolCallBuilder(
-            tool_name="test_tool",
-            tool_type=ToolType.INTERNAL,
-        )
-        
-        trace = builder.with_args({"query": "test"}).with_result({"data": "ok"}).build()
-        
-        assert trace.tool_name == "test_tool"
-        assert trace.status == ToolCallStatus.OK
-        assert trace.args_digest is not None
-        assert trace.result_digest is not None
-
-    def test_build_error_trace(self):
-        """[AC-IDMP-15] 构建错误追踪"""
-        builder = ToolCallBuilder(tool_name="test_tool")
-        
-        trace = builder.with_error(
-            Exception("Something went wrong"),
-            error_code="INTERNAL_ERROR",
-        ).build()
-        
-        assert trace.status == ToolCallStatus.ERROR
-        assert trace.error_code == "INTERNAL_ERROR"
-
-    def test_build_timeout_trace(self):
-        """[AC-IDMP-15] 构建超时追踪"""
-        builder = ToolCallBuilder(tool_name="test_tool")
-        
-        trace = builder.with_error(TimeoutError("Timeout")).build()
-        
-        assert trace.status == ToolCallStatus.TIMEOUT
-
-    def test_build_rejected_trace(self):
-        """[AC-IDMP-15] 构建拒绝追踪"""
-        builder = ToolCallBuilder(tool_name="test_tool")
-        
-        trace = builder.with_rejected("AUTH_DENIED").build()
-        
-        assert trace.status == ToolCallStatus.REJECTED
-        assert trace.error_code == "AUTH_DENIED"
--- a/ai-service/tests/test_mid_services.py
+++ b/ai-service/tests/test_mid_services.py
@ -1,337 +0,0 @@
-"""
-Tests for Mid Platform services.
-[AC-IDMP-05/07/08/09/18/20] 中台服务测试
-"""
-
-import pytest
-from unittest.mock import AsyncMock, MagicMock, patch
-from datetime import datetime
-
-from app.services.mid import (
-    HighRiskHandler,
-    HighRiskMatchResult,
-    PolicyRouter,
-    PolicyRouteResult,
-    TraceLogger,
-    MetricsCollector,
-    MetricsRecord,
-    SessionModeService,
-    DEFAULT_HIGH_RISK_SCENARIOS,
-)
-from app.models.entities import (
-    HighRiskPolicy,
-    HighRiskScenarioType,
-    SessionModeRecord,
-    MidAuditLog,
-)
-from app.models.mid import Mode, SessionMode
-
-
-class TestHighRiskHandler:
-    """[AC-IDMP-05/20] 高风险场景处理器测试"""
-
-    @pytest.fixture
-    def mock_session(self):
-        session = AsyncMock()
-        return session
-
-    @pytest.fixture
-    def handler(self, mock_session):
-        return HighRiskHandler(mock_session)
-
-    @pytest.mark.asyncio
-    async def test_get_active_scenario_set_returns_default_when_empty(self, handler, mock_session):
-        """[AC-IDMP-20] 空集保护：数据库无配置时返回默认最小集"""
-        mock_result = MagicMock()
-        mock_result.scalars.return_value.all.return_value = []
-        mock_session.execute.return_value = mock_result
-
-        scenarios = await handler.get_active_scenario_set("test_tenant")
-        
-        assert scenarios == DEFAULT_HIGH_RISK_SCENARIOS
-        assert "refund" in scenarios
-        assert "complaint_escalation" in scenarios
-        assert "privacy_sensitive_promise" in scenarios
-        assert "transfer" in scenarios
-
-    @pytest.mark.asyncio
-    async def test_detect_high_risk_refund_keyword(self, handler, mock_session):
-        """[AC-IDMP-05] 检测退款场景"""
-        mock_result = MagicMock()
-        mock_result.scalars.return_value.all.return_value = []
-        mock_session.execute.return_value = mock_result
-
-        result = await handler.detect_high_risk("test_tenant", "我要退款")
-        
-        assert result.matched is True
-        assert result.scenario == HighRiskScenarioType.REFUND.value
-        assert result.handler_mode == "micro_flow"
-
-    @pytest.mark.asyncio
-    async def test_detect_high_risk_transfer_keyword(self, handler, mock_session):
-        """[AC-IDMP-05] 检测转人工场景"""
-        mock_result = MagicMock()
-        mock_result.scalars.return_value.all.return_value = []
-        mock_session.execute.return_value = mock_result
-
-        result = await handler.detect_high_risk("test_tenant", "转人工")
-        
-        assert result.matched is True
-        assert result.scenario == HighRiskScenarioType.TRANSFER.value
-        assert result.handler_mode == "transfer"
-
-    @pytest.mark.asyncio
-    async def test_detect_high_risk_complaint_keyword(self, handler, mock_session):
-        """[AC-IDMP-05] 检测投诉升级场景"""
-        mock_result = MagicMock()
-        mock_result.scalars.return_value.all.return_value = []
-        mock_session.execute.return_value = mock_result
-
-        result = await handler.detect_high_risk("test_tenant", "我要投诉你们")
-        
-        assert result.matched is True
-        assert result.scenario == HighRiskScenarioType.COMPLAINT_ESCALATION.value
-
-    @pytest.mark.asyncio
-    async def test_detect_high_risk_privacy_keyword(self, handler, mock_session):
-        """[AC-IDMP-05] 检测隐私敏感承诺场景"""
-        mock_result = MagicMock()
-        mock_result.scalars.return_value.all.return_value = []
-        mock_session.execute.return_value = mock_result
-
-        result = await handler.detect_high_risk("test_tenant", "你能保证我的隐私安全吗")
-        
-        assert result.matched is True
-        assert result.scenario == HighRiskScenarioType.PRIVACY_SENSITIVE_PROMISE.value
-
-    @pytest.mark.asyncio
-    async def test_detect_high_risk_no_match(self, handler, mock_session):
-        """[AC-IDMP-05] 正常消息不触发高风险"""
-        mock_result = MagicMock()
-        mock_result.scalars.return_value.all.return_value = []
-        mock_session.execute.return_value = mock_result
-
-        result = await handler.detect_high_risk("test_tenant", "你好，请问有什么可以帮助我的")
-        
-        assert result.matched is False
-
-    @pytest.mark.asyncio
-    async def test_detect_high_risk_with_policy(self, handler, mock_session):
-        """[AC-IDMP-05] 使用数据库策略检测高风险"""
-        policy = HighRiskPolicy(
-            tenant_id="test_tenant",
-            scenario=HighRiskScenarioType.REFUND.value,
-            handler_mode="micro_flow",
-            keywords=["退货退款"],
-            priority=100,
-            is_enabled=True,
-        )
-        
-        mock_result = MagicMock()
-        mock_result.scalars.return_value.all.return_value = [policy]
-        mock_session.execute.return_value = mock_result
-
-        result = await handler.detect_high_risk("test_tenant", "我要退货退款")
-        
-        assert result.matched is True
-        assert result.scenario == HighRiskScenarioType.REFUND.value
-        assert result.matched_keyword == "退货退款"
-
-
-class TestPolicyRouter:
-    """[AC-IDMP-05] 策略路由器测试"""
-
-    @pytest.fixture
-    def mock_high_risk_handler(self):
-        return AsyncMock(spec=HighRiskHandler)
-
-    @pytest.fixture
-    def router(self, mock_high_risk_handler):
-        return PolicyRouter(mock_high_risk_handler)
-
-    @pytest.mark.asyncio
-    async def test_route_human_mode_returns_transfer(self, router, mock_high_risk_handler):
-        """[AC-IDMP-05] 人工模式返回 transfer"""
-        mock_high_risk_handler.detect_high_risk.return_value = HighRiskMatchResult(matched=False)
-
-        result = await router.route(
-            tenant_id="test_tenant",
-            user_message="你好",
-            session_mode="HUMAN_ACTIVE",
-        )
-        
-        assert result.mode == "transfer"
-
-    @pytest.mark.asyncio
-    async def test_route_high_risk_returns_micro_flow(self, router, mock_high_risk_handler):
-        """[AC-IDMP-05] 高风险场景返回 micro_flow 或 transfer"""
-        mock_high_risk_handler.detect_high_risk.return_value = HighRiskMatchResult(
-            matched=True,
-            scenario="refund",
-            handler_mode="micro_flow",
-        )
-
-        result = await router.route(
-            tenant_id="test_tenant",
-            user_message="我要退款",
-            session_mode="BOT_ACTIVE",
-        )
-        
-        assert result.mode == "micro_flow"
-        assert result.high_risk_scenario == "refund"
-
-    @pytest.mark.asyncio
-    async def test_route_low_confidence_returns_fixed(self, router, mock_high_risk_handler):
-        """[AC-IDMP-05] 低置信度返回 fixed"""
-        mock_high_risk_handler.detect_high_risk.return_value = HighRiskMatchResult(matched=False)
-
-        result = await router.route(
-            tenant_id="test_tenant",
-            user_message="你好",
-            session_mode="BOT_ACTIVE",
-            confidence=0.2,
-        )
-        
-        assert result.mode == "fixed"
-        assert result.fallback_reason_code == "low_confidence"
-
-    @pytest.mark.asyncio
-    async def test_route_normal_returns_agent(self, router, mock_high_risk_handler):
-        """[AC-IDMP-05] 正常场景返回 agent"""
-        mock_high_risk_handler.detect_high_risk.return_value = HighRiskMatchResult(matched=False)
-
-        result = await router.route(
-            tenant_id="test_tenant",
-            user_message="你好",
-            session_mode="BOT_ACTIVE",
-            confidence=0.8,
-        )
-        
-        assert result.mode == "agent"
-
-
-class TestMetricsCollector:
-    """[AC-IDMP-18] 指标采集器测试"""
-
-    @pytest.fixture
-    def collector(self):
-        return MetricsCollector()
-
-    def test_record_and_get_metrics(self, collector):
-        """[AC-IDMP-18] 记录并获取指标"""
-        record1 = MetricsRecord(
-            tenant_id="tenant1",
-            session_id="session1",
-            request_id="req1",
-            task_completed=True,
-            slots_filled=3,
-            slots_total=5,
-            was_transferred=False,
-            had_recall=True,
-            latency_ms=100,
-        )
-        
-        record2 = MetricsRecord(
-            tenant_id="tenant1",
-            session_id="session2",
-            request_id="req2",
-            task_completed=False,
-            slots_filled=2,
-            slots_total=5,
-            was_transferred=True,
-            had_recall=False,
-            latency_ms=200,
-        )
-        
-        collector.record(record1)
-        collector.record(record2)
-        
-        metrics = collector.get_metrics_snapshot("tenant1")
-        
-        assert metrics["task_completion_rate"] == 0.5
-        assert metrics["slot_completion_rate"] == 0.5
-        assert metrics["wrong_transfer_rate"] == 0.5
-        assert metrics["no_recall_rate"] == 0.5
-        assert metrics["avg_latency_ms"] == 150.0
-
-    def test_get_metrics_empty_tenant(self, collector):
-        """[AC-IDMP-18] 空租户返回零值指标"""
-        metrics = collector.get_metrics_snapshot("unknown_tenant")
-        
-        assert metrics["task_completion_rate"] == 0.0
-        assert metrics["slot_completion_rate"] == 0.0
-        assert metrics["wrong_transfer_rate"] == 0.0
-        assert metrics["no_recall_rate"] == 0.0
-        assert metrics["avg_latency_ms"] == 0.0
-
-
-class TestSessionModeService:
-    """[AC-IDMP-09] 会话模式服务测试"""
-
-    @pytest.fixture
-    def mock_session(self):
-        return AsyncMock()
-
-    @pytest.fixture
-    def service(self, mock_session):
-        return SessionModeService(mock_session)
-
-    @pytest.mark.asyncio
-    async def test_get_mode_returns_default(self, service, mock_session):
-        """[AC-IDMP-09] 获取默认模式"""
-        mock_result = MagicMock()
-        mock_result.scalar_one_or_none.return_value = None
-        mock_session.execute.return_value = mock_result
-
-        mode = await service.get_mode("tenant1", "session1")
-        
-        assert mode == "BOT_ACTIVE"
-
-    @pytest.mark.asyncio
-    async def test_switch_mode_creates_new(self, service, mock_session):
-        """[AC-IDMP-09] 切换模式创建新记录"""
-        mock_result = MagicMock()
-        mock_result.scalar_one_or_none.return_value = None
-        mock_session.execute.return_value = mock_result
-
-        result = await service.switch_mode(
-            tenant_id="tenant1",
-            session_id="session1",
-            mode="HUMAN_ACTIVE",
-            reason="user_request",
-        )
-        
-        assert result.mode == "HUMAN_ACTIVE"
-        assert result.reason == "user_request"
-        mock_session.add.assert_called_once()
-        mock_session.commit.assert_called_once()
-
-
-class TestTraceLogger:
-    """[AC-IDMP-07] Trace 日志服务测试"""
-
-    @pytest.fixture
-    def mock_session(self):
-        return AsyncMock()
-
-    @pytest.fixture
-    def logger(self, mock_session):
-        return TraceLogger(mock_session)
-
-    @pytest.mark.asyncio
-    async def test_log_creates_audit_record(self, logger, mock_session):
-        """[AC-IDMP-07] 记录审计日志"""
-        result = await logger.log(
-            tenant_id="tenant1",
-            session_id="session1",
-            request_id="req1",
-            generation_id="gen1",
-            mode="agent",
-            intent="greeting",
-            tool_calls=[{"tool": "search", "duration_ms": 100}],
-            guardrail_triggered=False,
-            latency_ms=500,
-        )
-        
-        mock_session.add.assert_called_once()
-        mock_session.commit.assert_called_once()
--- a/ai-service/tests/test_role_based_field_provider.py
+++ b/ai-service/tests/test_role_based_field_provider.py
@ -1,226 +0,0 @@
-"""
-Unit tests for RoleBasedFieldProvider service.
-[AC-MRS-04,05,10,11,12,13,14] 验证按角色查询字段定义功能
-"""
-
-import pytest
-from unittest.mock import AsyncMock, MagicMock, patch
-from sqlalchemy.ext.asyncio import AsyncSession
-
-from app.services.mid.role_based_field_provider import (
-    RoleBasedFieldProvider,
-    InvalidRoleError,
-)
-from app.models.entities import (
-    MetadataFieldDefinition,
-    MetadataFieldStatus,
-    SlotDefinition,
-)
-from app.schemas.metadata import VALID_FIELD_ROLES
-
-
-class TestRoleBasedFieldProvider:
-    """[AC-MRS-04,05,10] RoleBasedFieldProvider 测试"""
-
-    @pytest.fixture
-    def mock_session(self):
-        """Mock AsyncSession"""
-        session = MagicMock(spec=AsyncSession)
-        session.execute = AsyncMock()
-        return session
-
-    @pytest.fixture
-    def provider(self, mock_session):
-        """Create provider instance"""
-        return RoleBasedFieldProvider(mock_session)
-
-    def test_validate_role_valid(self, provider):
-        """[AC-MRS-04] 验证有效角色"""
-        for role in VALID_FIELD_ROLES:
-            result = provider._validate_role(role)
-            assert result == role
-
-    def test_validate_role_invalid(self, provider):
-        """[AC-MRS-05] 验证无效角色抛出异常"""
-        with pytest.raises(InvalidRoleError) as exc_info:
-            provider._validate_role("invalid_role")
-        
-        assert "Invalid role 'invalid_role'" in str(exc_info.value)
-        assert exc_info.value.valid_roles == VALID_FIELD_ROLES
-
-    @pytest.mark.asyncio
-    async def test_get_fields_by_role(self, provider, mock_session):
-        """[AC-MRS-04] 按角色获取字段定义"""
-        mock_field = MagicMock(spec=MetadataFieldDefinition)
-        mock_field.id = "test-id"
-        mock_field.field_key = "grade"
-        mock_field.label = "年级"
-        mock_field.field_roles = ["resource_filter", "slot"]
-        mock_field.status = MetadataFieldStatus.ACTIVE.value
-        
-        mock_result = MagicMock()
-        mock_result.scalars.return_value.all.return_value = [mock_field]
-        mock_session.execute.return_value = mock_result
-        
-        fields = await provider.get_fields_by_role(
-            "test-tenant",
-            "resource_filter"
-        )
-        
-        assert len(fields) == 1
-        assert fields[0].field_key == "grade"
-        assert "resource_filter" in fields[0].field_roles
-
-    @pytest.mark.asyncio
-    async def test_get_fields_by_role_invalid_role(self, provider):
-        """[AC-MRS-05] 无效角色返回 400 错误"""
-        with pytest.raises(InvalidRoleError):
-            await provider.get_fields_by_role(
-                "test-tenant",
-                "invalid_role"
-            )
-
-    @pytest.mark.asyncio
-    async def test_get_fields_by_role_include_deprecated(self, provider, mock_session):
-        """[AC-MRS-04] 包含已废弃字段"""
-        mock_active = MagicMock(spec=MetadataFieldDefinition)
-        mock_active.field_key = "active_field"
-        mock_active.status = MetadataFieldStatus.ACTIVE.value
-        
-        mock_deprecated = MagicMock(spec=MetadataFieldDefinition)
-        mock_deprecated.field_key = "deprecated_field"
-        mock_deprecated.status = MetadataFieldStatus.DEPRECATED.value
-        
-        mock_result = MagicMock()
-        mock_result.scalars.return_value.all.return_value = [mock_active, mock_deprecated]
-        mock_session.execute.return_value = mock_result
-        
-        fields = await provider.get_fields_by_role(
-            "test-tenant",
-            "resource_filter",
-            include_deprecated=True
-        )
-        
-        assert len(fields) == 2
-
-    @pytest.mark.asyncio
-    async def test_get_slot_definitions_by_role(self, provider, mock_session):
-        """[AC-MRS-10] 按角色获取槽位定义"""
-        mock_field = MagicMock(spec=MetadataFieldDefinition)
-        mock_field.id = MagicMock()
-        mock_field.id.__str__ = lambda self: "field-id-123"
-        mock_field.field_key = "grade"
-        mock_field.label = "年级"
-        mock_field.type = "string"
-        mock_field.required = True
-        mock_field.options = None
-        mock_field.default_value = None
-        mock_field.scope = ["kb_document"]
-        mock_field.is_filterable = True
-        mock_field.is_rank_feature = False
-        mock_field.field_roles = ["slot"]
-        mock_field.status = MetadataFieldStatus.ACTIVE.value
-        
-        mock_slot = MagicMock(spec=SlotDefinition)
-        mock_slot.id = MagicMock()
-        mock_slot.id.__str__ = lambda self: "slot-id-456"
-        mock_slot.tenant_id = "test-tenant"
-        mock_slot.slot_key = "grade"
-        mock_slot.type = "string"
-        mock_slot.required = True
-        mock_slot.extract_strategy = "llm"
-        mock_slot.validation_rule = None
-        mock_slot.ask_back_prompt = "请输入年级"
-        mock_slot.default_value = None
-        mock_slot.linked_field_id = mock_field.id
-        mock_slot.created_at = None
-        mock_slot.updated_at = None
-        
-        field_result = MagicMock()
-        field_result.scalars.return_value.all.return_value = [mock_field]
-        
-        slot_result = MagicMock()
-        slot_result.scalars.return_value.all.return_value = [mock_slot]
-        
-        mock_session.execute.side_effect = [field_result, slot_result]
-        
-        slots = await provider.get_slot_definitions_by_role("test-tenant", "slot")
-        
-        assert len(slots) >= 1
-
-    @pytest.mark.asyncio
-    async def test_get_resource_filter_fields(self, provider, mock_session):
-        """[AC-MRS-11] 获取资源过滤角色字段"""
-        mock_field = MagicMock(spec=MetadataFieldDefinition)
-        mock_field.field_key = "category"
-        mock_field.field_roles = ["resource_filter"]
-        
-        mock_result = MagicMock()
-        mock_result.scalars.return_value.all.return_value = [mock_field]
-        mock_session.execute.return_value = mock_result
-        
-        fields = await provider.get_resource_filter_fields("test-tenant")
-        
-        assert len(fields) == 1
-        assert "resource_filter" in fields[0].field_roles
-
-    @pytest.mark.asyncio
-    async def test_get_slot_fields(self, provider, mock_session):
-        """[AC-MRS-12] 获取槽位角色字段"""
-        mock_field = MagicMock(spec=MetadataFieldDefinition)
-        mock_field.field_key = "user_name"
-        mock_field.field_roles = ["slot"]
-        
-        mock_result = MagicMock()
-        mock_result.scalars.return_value.all.return_value = [mock_field]
-        mock_session.execute.return_value = mock_result
-        
-        fields = await provider.get_slot_fields("test-tenant")
-        
-        assert len(fields) == 1
-        assert "slot" in fields[0].field_roles
-
-    @pytest.mark.asyncio
-    async def test_get_routing_signal_fields(self, provider, mock_session):
-        """[AC-MRS-13] 获取路由信号角色字段"""
-        mock_field = MagicMock(spec=MetadataFieldDefinition)
-        mock_field.field_key = "priority"
-        mock_field.field_roles = ["routing_signal"]
-        
-        mock_result = MagicMock()
-        mock_result.scalars.return_value.all.return_value = [mock_field]
-        mock_session.execute.return_value = mock_result
-        
-        fields = await provider.get_routing_signal_fields("test-tenant")
-        
-        assert len(fields) == 1
-        assert "routing_signal" in fields[0].field_roles
-
-    @pytest.mark.asyncio
-    async def test_get_prompt_var_fields(self, provider, mock_session):
-        """[AC-MRS-14] 获取提示词变量角色字段"""
-        mock_field = MagicMock(spec=MetadataFieldDefinition)
-        mock_field.field_key = "user_name"
-        mock_field.field_roles = ["prompt_var"]
-        
-        mock_result = MagicMock()
-        mock_result.scalars.return_value.all.return_value = [mock_field]
-        mock_session.execute.return_value = mock_result
-        
-        fields = await provider.get_prompt_var_fields("test-tenant")
-        
-        assert len(fields) == 1
-        assert "prompt_var" in fields[0].field_roles
-
-
-class TestInvalidRoleError:
-    """[AC-MRS-05] InvalidRoleError 测试"""
-
-    def test_error_message(self):
-        """验证错误消息格式"""
-        error = InvalidRoleError("bad_role")
-        
-        assert error.role == "bad_role"
-        assert error.valid_roles == VALID_FIELD_ROLES
-        assert "Invalid role 'bad_role'" in str(error)
-        assert "resource_filter" in str(error)
--- a/ai-service/tests/test_slot_definition_service.py
+++ b/ai-service/tests/test_slot_definition_service.py
@ -1,333 +0,0 @@
-"""
-Unit tests for SlotDefinitionService.
-[AC-MRS-07,08,16] 验证槽位定义管理功能
-"""
-
-import uuid
-import pytest
-from unittest.mock import AsyncMock, MagicMock
-from sqlalchemy.ext.asyncio import AsyncSession
-
-from app.services.slot_definition_service import SlotDefinitionService
-from app.models.entities import (
-    SlotDefinition,
-    SlotDefinitionCreate,
-    SlotDefinitionUpdate,
-    MetadataFieldDefinition,
-)
-
-
-class TestSlotDefinitionService:
-    """[AC-MRS-07,08,16] SlotDefinitionService 测试"""
-
-    @pytest.fixture
-    def mock_session(self):
-        """Mock AsyncSession"""
-        session = MagicMock(spec=AsyncSession)
-        session.execute = AsyncMock()
-        session.add = MagicMock()
-        session.flush = AsyncMock()
-        session.delete = AsyncMock()
-        return session
-
-    @pytest.fixture
-    def service(self, mock_session):
-        """Create service instance"""
-        return SlotDefinitionService(mock_session)
-
-    @pytest.mark.asyncio
-    async def test_list_slot_definitions(self, service, mock_session):
-        """列出槽位定义"""
-        mock_slot = MagicMock(spec=SlotDefinition)
-        mock_slot.id = uuid.uuid4()
-        mock_slot.slot_key = "grade"
-        
-        mock_result = MagicMock()
-        mock_result.scalars.return_value.all.return_value = [mock_slot]
-        mock_session.execute.return_value = mock_result
-        
-        slots = await service.list_slot_definitions("test-tenant")
-        
-        assert len(slots) == 1
-        assert slots[0].slot_key == "grade"
-
-    @pytest.mark.asyncio
-    async def test_list_slot_definitions_filter_required(self, service, mock_session):
-        """按必填过滤槽位定义"""
-        mock_slot = MagicMock(spec=SlotDefinition)
-        mock_slot.required = True
-        
-        mock_result = MagicMock()
-        mock_result.scalars.return_value.all.return_value = [mock_slot]
-        mock_session.execute.return_value = mock_result
-        
-        slots = await service.list_slot_definitions("test-tenant", required=True)
-        
-        assert len(slots) == 1
-        assert slots[0].required is True
-
-    @pytest.mark.asyncio
-    async def test_get_slot_definition(self, service, mock_session):
-        """获取单个槽位定义"""
-        slot_id = uuid.uuid4()
-        mock_slot = MagicMock(spec=SlotDefinition)
-        mock_slot.id = slot_id
-        mock_slot.slot_key = "grade"
-        
-        mock_result = MagicMock()
-        mock_result.scalar_one_or_none.return_value = mock_slot
-        mock_session.execute.return_value = mock_result
-        
-        slot = await service.get_slot_definition("test-tenant", str(slot_id))
-        
-        assert slot is not None
-        assert slot.slot_key == "grade"
-
-    @pytest.mark.asyncio
-    async def test_get_slot_definition_not_found(self, service, mock_session):
-        """获取不存在的槽位定义"""
-        mock_result = MagicMock()
-        mock_result.scalar_one_or_none.return_value = None
-        mock_session.execute.return_value = mock_result
-        
-        slot = await service.get_slot_definition("test-tenant", str(uuid.uuid4()))
-        
-        assert slot is None
-
-    @pytest.mark.asyncio
-    async def test_get_slot_definition_by_key(self, service, mock_session):
-        """通过 slot_key 获取槽位定义"""
-        mock_slot = MagicMock(spec=SlotDefinition)
-        mock_slot.slot_key = "grade"
-        
-        mock_result = MagicMock()
-        mock_result.scalar_one_or_none.return_value = mock_slot
-        mock_session.execute.return_value = mock_result
-        
-        slot = await service.get_slot_definition_by_key("test-tenant", "grade")
-        
-        assert slot is not None
-        assert slot.slot_key == "grade"
-
-    @pytest.mark.asyncio
-    async def test_create_slot_definition(self, service, mock_session):
-        """[AC-MRS-07] 创建槽位定义"""
-        slot_create = SlotDefinitionCreate(
-            slot_key="grade",
-            type="string",
-            required=True,
-            extract_strategy="llm",
-            ask_back_prompt="请输入年级",
-        )
-        
-        mock_result = MagicMock()
-        mock_result.scalar_one_or_none.return_value = None
-        mock_session.execute.return_value = mock_result
-        
-        slot = await service.create_slot_definition("test-tenant", slot_create)
-        
-        assert slot is not None
-        mock_session.add.assert_called_once()
-        mock_session.flush.assert_called_once()
-
-    @pytest.mark.asyncio
-    async def test_create_slot_definition_invalid_key(self, service):
-        """[AC-MRS-07] 创建无效 slot_key 抛出异常"""
-        slot_create = SlotDefinitionCreate(
-            slot_key="InvalidKey",
-            type="string",
-            required=True,
-        )
-        
-        with pytest.raises(ValueError) as exc_info:
-            await service.create_slot_definition("test-tenant", slot_create)
-        
-        assert "格式不正确" in str(exc_info.value)
-
-    @pytest.mark.asyncio
-    async def test_create_slot_definition_duplicate_key(self, service, mock_session):
-        """[AC-MRS-07] 创建重复 slot_key 抛出异常"""
-        existing_slot = MagicMock(spec=SlotDefinition)
-        mock_result = MagicMock()
-        mock_result.scalar_one_or_none.return_value = existing_slot
-        mock_session.execute.return_value = mock_result
-        
-        slot_create = SlotDefinitionCreate(
-            slot_key="grade",
-            type="string",
-            required=True,
-        )
-        
-        with pytest.raises(ValueError) as exc_info:
-            await service.create_slot_definition("test-tenant", slot_create)
-        
-        assert "已存在" in str(exc_info.value)
-
-    @pytest.mark.asyncio
-    async def test_create_slot_definition_invalid_type(self, service, mock_session):
-        """[AC-MRS-07] 创建无效类型抛出异常"""
-        mock_result = MagicMock()
-        mock_result.scalar_one_or_none.return_value = None
-        mock_session.execute.return_value = mock_result
-        
-        slot_create = SlotDefinitionCreate(
-            slot_key="grade",
-            type="invalid_type",
-            required=True,
-        )
-        
-        with pytest.raises(ValueError) as exc_info:
-            await service.create_slot_definition("test-tenant", slot_create)
-        
-        assert "无效的槽位类型" in str(exc_info.value)
-
-    @pytest.mark.asyncio
-    async def test_create_slot_definition_with_linked_field(self, service, mock_session):
-        """[AC-MRS-08] 创建槽位定义并关联元数据字段"""
-        field_id = uuid.uuid4()
-        mock_field = MagicMock(spec=MetadataFieldDefinition)
-        mock_field.id = field_id
-        
-        slot_result = MagicMock()
-        slot_result.scalar_one_or_none.return_value = None
-        
-        field_result = MagicMock()
-        field_result.scalar_one_or_none.return_value = mock_field
-        
-        mock_session.execute.side_effect = [slot_result, field_result]
-        
-        slot_create = SlotDefinitionCreate(
-            slot_key="grade",
-            type="string",
-            required=True,
-            linked_field_id=str(field_id),
-        )
-        
-        slot = await service.create_slot_definition("test-tenant", slot_create)
-        
-        assert slot is not None
-        mock_session.add.assert_called_once()
-
-    @pytest.mark.asyncio
-    async def test_create_slot_definition_linked_field_not_found(self, service, mock_session):
-        """[AC-MRS-08] 关联字段不存在抛出异常"""
-        field_id = uuid.uuid4()
-        
-        slot_result = MagicMock()
-        slot_result.scalar_one_or_none.return_value = None
-        
-        field_result = MagicMock()
-        field_result.scalar_one_or_none.return_value = None
-        
-        mock_session.execute.side_effect = [slot_result, field_result]
-        
-        slot_create = SlotDefinitionCreate(
-            slot_key="grade",
-            type="string",
-            required=True,
-            linked_field_id=str(field_id),
-        )
-        
-        with pytest.raises(ValueError) as exc_info:
-            await service.create_slot_definition("test-tenant", slot_create)
-        
-        assert "关联的元数据字段" in str(exc_info.value)
-
-    @pytest.mark.asyncio
-    async def test_update_slot_definition(self, service, mock_session):
-        """更新槽位定义"""
-        slot_id = uuid.uuid4()
-        mock_slot = MagicMock(spec=SlotDefinition)
-        mock_slot.id = slot_id
-        mock_slot.slot_key = "grade"
-        mock_slot.type = "string"
-        mock_slot.required = False
-        mock_slot.extract_strategy = None
-        mock_slot.validation_rule = None
-        mock_slot.ask_back_prompt = None
-        mock_slot.default_value = None
-        mock_slot.linked_field_id = None
-        mock_slot.updated_at = None
-        
-        mock_result = MagicMock()
-        mock_result.scalar_one_or_none.return_value = mock_slot
-        mock_session.execute.return_value = mock_result
-        
-        slot_update = SlotDefinitionUpdate(
-            required=True,
-            ask_back_prompt="请输入年级",
-        )
-        
-        slot = await service.update_slot_definition("test-tenant", str(slot_id), slot_update)
-        
-        assert slot is not None
-        mock_session.flush.assert_called_once()
-
-    @pytest.mark.asyncio
-    async def test_update_slot_definition_not_found(self, service, mock_session):
-        """更新不存在的槽位定义"""
-        mock_result = MagicMock()
-        mock_result.scalar_one_or_none.return_value = None
-        mock_session.execute.return_value = mock_result
-        
-        slot_update = SlotDefinitionUpdate(required=True)
-        
-        slot = await service.update_slot_definition("test-tenant", str(uuid.uuid4()), slot_update)
-        
-        assert slot is None
-
-    @pytest.mark.asyncio
-    async def test_delete_slot_definition(self, service, mock_session):
-        """[AC-MRS-16] 删除槽位定义"""
-        slot_id = uuid.uuid4()
-        mock_slot = MagicMock(spec=SlotDefinition)
-        mock_slot.id = slot_id
-        
-        mock_result = MagicMock()
-        mock_result.scalar_one_or_none.return_value = mock_slot
-        mock_session.execute.return_value = mock_result
-        
-        success = await service.delete_slot_definition("test-tenant", str(slot_id))
-        
-        assert success is True
-        mock_session.delete.assert_called_once()
-        mock_session.flush.assert_called_once()
-
-    @pytest.mark.asyncio
-    async def test_delete_slot_definition_not_found(self, service, mock_session):
-        """[AC-MRS-16] 删除不存在的槽位定义"""
-        mock_result = MagicMock()
-        mock_result.scalar_one_or_none.return_value = None
-        mock_session.execute.return_value = mock_result
-        
-        success = await service.delete_slot_definition("test-tenant", str(uuid.uuid4()))
-        
-        assert success is False
-
-    @pytest.mark.asyncio
-    async def test_get_slot_definition_with_field(self, service, mock_session):
-        """获取槽位定义及关联字段信息"""
-        slot_id = uuid.uuid4()
-        mock_slot = MagicMock(spec=SlotDefinition)
-        mock_slot.id = slot_id
-        mock_slot.tenant_id = "test-tenant"
-        mock_slot.slot_key = "grade"
-        mock_slot.type = "string"
-        mock_slot.required = True
-        mock_slot.extract_strategy = "llm"
-        mock_slot.validation_rule = None
-        mock_slot.ask_back_prompt = "请输入年级"
-        mock_slot.default_value = None
-        mock_slot.linked_field_id = None
-        mock_slot.created_at = None
-        mock_slot.updated_at = None
-        
-        mock_result = MagicMock()
-        mock_result.scalar_one_or_none.return_value = mock_slot
-        mock_session.execute.return_value = mock_result
-        
-        result = await service.get_slot_definition_with_field("test-tenant", str(slot_id))
-        
-        assert result is not None
-        assert result["slot_key"] == "grade"
-        assert result["linked_field"] is None
--- a/docs/cache-and-persona-summary.md
+++ b/docs/cache-and-persona-summary.md
@ -1,357 +0,0 @@
-# 缓存机制与人设配置 - 实施总结
-
-## 实施概览
-
-本次优化为 AI 客服中台添加了两项核心功能：
-
-1. **FlowEngine 缓存机制**：两层缓存（L1 + L2）降低数据库负载
-2. **Prompt 人设配置**：增强内置变量支持拟人化对话
-
-## 一、FlowEngine 缓存机制
-
-### 实施内容
-
-#### 1. 创建 FlowCache 服务
- **文件**：`ai-service/app/services/cache/flow_cache.py`
- **功能**：
-  - L1 缓存（进程内存，5 分钟 TTL）
-  - L2 缓存（Redis，1 小时 TTL）
-  - 自动降级（Redis 故障时仍可用）
-  - 序列化/反序列化 FlowInstance
-
-#### 2. 集成到 FlowEngine
- **文件**：`ai-service/app/services/flow/engine.py`
- **修改点**：
-  - `__init__`：注入 FlowCache 实例
-  - `check_active_flow`：L1 → L2 → DB 三级查询
-  - `start`：创建流程后填充缓存
-  - `advance`：推进流程后更新缓存
-  - `_complete_instance`：完成流程后删除缓存
-  - `_cancel_instance`：取消流程后删除缓存
-
-#### 3. 单元测试
- **文件**：`ai-service/tests/test_flow_cache.py`
- **覆盖**：
-  - L1/L2 缓存命中
-  - 缓存失效
-  - 缓存过期
-  - 序列化/反序列化
-  - Redis 禁用场景
-
-#### 4. 使用文档
- **文件**：`docs/flow-cache-usage.md`
- **内容**：
-  - 架构设计
-  - 配置说明
-  - 使用示例
-  - 性能对比
-  - 监控指标
-  - 故障处理
-
-### 性能提升
-
-| 指标 | 无缓存 | 有缓存 | 提升 |
-|------|--------|--------|------|
-| 数据库查询 | 10,000 次 | 100 次 | **99% ↓** |
-| 平均响应时间 | 50ms | < 1ms | **50 倍 ↑** |
-| 并发支持 | 100 会话 | 1000+ 会话 | **10 倍 ↑** |
-
-### 关键代码
-
-```python
-# L1 + L2 缓存查询
-async def check_active_flow(tenant_id, session_id):
-    # L1: 进程内存
-    if local_cache_hit:
-        return instance
-
-    # L2: Redis
-    if redis_cache_hit:
-        populate_local_cache()
-        return instance
-
-    # L3: 数据库
-    instance = query_database()
-    if instance:
-        populate_local_cache()
-        populate_redis_cache()
-
-    return instance
-```
-
---
-
-## 二、Prompt 人设配置
-
-### 实施内容
-
-#### 1. 增强内置变量
- **文件**：`ai-service-admin/src/types/prompt-template.ts`
- **新增变量**：
-  - `persona_personality`：AI 性格特点
-  - `persona_tone`：AI 说话风格
-  - `brand_name`：品牌名称
-
-#### 2. 前端界面
- **已有功能**：
-  - Prompt 模板管理界面
-  - 变量管理器
-  - 模板预览
-  - 版本管理
-
-#### 3. 使用文档
- **文件**：`docs/prompt-persona-guide.md`
- **内容**：
-  - 人设变量列表
-  - 使用场景（客服、咨询、多渠道）
-  - 配置步骤
-  - 最佳实践
-  - 效果评估
-
-### 拟人化效果
-
-**无人设**：
-```
-用户：我想退货
-AI：请提供订单号。
-```
-
-**有人设**：
-```
-用户：我想退货
-小美：好的呢，我帮您处理退货。请问您的订单号是多少呀？
-```
-
-### 关键配置
-
-```json
-{
-  "persona_name": "小美",
-  "persona_personality": "热情、耐心、善解人意",
-  "persona_tone": "亲切自然，像朋友聊天一样，使用口语化表达",
-  "brand_name": "京东"
-}
-```
-
---
-
-## 三、文件清单
-
-### 新增文件
-
-```
-ai-service/
-├── app/services/cache/
-│   ├── __init__.py                    # 缓存模块导出
-│   └── flow_cache.py                  # FlowCache 实现（L1 + L2）
-└── tests/
-    └── test_flow_cache.py             # FlowCache 单元测试
-
-docs/
-├── flow-cache-usage.md                # 缓存机制使用文档
-└── prompt-persona-guide.md            # 人设配置指南
-```
-
-### 修改文件
-
-```
-ai-service/
-└── app/services/flow/
-    └── engine.py                      # 集成 FlowCache
-
-ai-service-admin/
-└── src/types/
-    └── prompt-template.ts             # 增强内置变量
-```
-
---
-
-## 四、部署清单
-
-### 1. 环境变量
-
-```bash
-# .env
-AI_SERVICE_REDIS_URL=redis://localhost:6379/0
-AI_SERVICE_REDIS_ENABLED=true
-```
-
-### 2. Redis 部署
-
-```bash
-# Docker
-docker run -d \
-  --name redis-cache \
-  -p 6379:6379 \
-  redis:7-alpine \
-  redis-server --appendonly yes --maxmemory 2gb
-
-# 验证
-redis-cli ping
-# 输出：PONG
-```
-
-### 3. 代码部署
-
-```bash
-# 后端
-cd ai-service
-pip install redis  # 已在 requirements.txt 中
-
-# 前端
-cd ai-service-admin
-npm install  # 无需额外依赖
-npm run build
-```
-
-### 4. 测试验证
-
-```bash
-# 单元测试
-cd ai-service
-pytest tests/test_flow_cache.py -v
-
-# 集成测试
-curl -X POST http://localhost:8080/api/v1/chat \
-  -H "Content-Type: application/json" \
-  -d '{"tenant_id": "test", "session_id": "test-001", "message": "你好"}'
-```
-
---
-
-## 五、监控指标
-
-### 缓存监控
-
-```python
-# 添加到 Prometheus 监控
-flow_cache_l1_hits_total
-flow_cache_l2_hits_total
-flow_cache_db_queries_total
-flow_cache_response_time_seconds
-```
-
-### 人设效果监控
-
-```python
-# 用户满意度
-user_satisfaction_score
-
-# 转人工率
-transfer_to_human_rate
-
-# 对话轮次
-conversation_turns_avg
-```
-
---
-
-## 六、后续优化建议
-
-### P0（上线前必须）
- [x] 添加 Redis 缓存
- [x] 优化 Prompt 人设变量
- [ ] 添加监控指标（Prometheus）
- [ ] 压力测试（1000 并发）
-
-### P1（上线后 1 个月）
- [ ] 添加熔断器（LLM API 故障降级）
- [ ] 流式响应（降低首字延迟）
- [ ] 多渠道适配（微信表情、Web 按钮）
- [ ] A/B 测试（不同人设策略对比）
-
-### P2（长期优化）
- [ ] LRU 淘汰策略（L1 缓存）
- [ ] 缓存预热（系统启动时）
- [ ] 批量查询（减少 Redis 往返）
- [ ] 话术质量评估（用户反馈）
-
---
-
-## 七、风险评估
-
-### 缓存机制风险
-
-| 风险 | 影响 | 缓解措施 | 状态 |
-|------|------|----------|------|
-| Redis 故障 | 性能下降 | 自动降级到数据库 | ✅ 已实现 |
-| 缓存数据不一致 | 流程状态错误 | 完成/取消时立即失效 | ✅ 已实现 |
-| L1 内存占用过高 | OOM | 降低 TTL 或添加 LRU | ⚠️ 待优化 |
-
-### 人设配置风险
-
-| 风险 | 影响 | 缓解措施 | 状态 |
-|------|------|----------|------|
-| 人设不稳定 | 用户体验差 | 添加约束和示例 | ✅ 已文档化 |
-| 不同渠道混乱 | 品牌形象受损 | 多模板或条件判断 | ✅ 已文档化 |
-| Prompt 注入攻击 | 安全风险 | 输入验证和过滤 | ⚠️ 待实现 |
-
---
-
-## 八、验收标准
-
-### 缓存机制
-
- [x] L1 缓存命中率 > 80%
- [x] L2 缓存命中率 > 15%
- [x] 数据库查询降低 > 90%
- [x] 平均响应时间 < 5ms
- [x] Redis 故障时系统仍可用
- [x] 单元测试覆盖率 > 90%
-
-### 人设配置
-
- [x] 支持 4 个核心人设变量
- [x] 前端界面支持变量管理
- [x] 提供 3 个场景示例
- [x] 提供完整使用文档
- [ ] A/B 测试验证效果提升
-
---
-
-## 九、总结
-
-### 已完成
-
-1. ✅ **FlowEngine 缓存机制**
-   - 两层缓存（L1 + L2）
-   - 自动降级
-   - 完整测试
-   - 使用文档
-
-2. ✅ **Prompt 人设配置**
-   - 增强内置变量
-   - 配置指南
-   - 场景示例
-
-### 核心价值
-
-1. **性能提升**：数据库负载降低 99%，响应时间提升 50 倍
-2. **拟人化增强**：支持性格、语气、品牌等人设配置
-3. **多渠道支持**：不同渠道可配置不同人设风格
-4. **高可用性**：Redis 故障时自动降级，系统仍可用
-
-### 建议上线策略
-
-1. **第一阶段**（灰度 10%）
-   - 启用 Redis 缓存
-   - 监控缓存命中率和响应时间
-   - 验证系统稳定性
-
-2. **第二阶段**（灰度 50%）
-   - 启用人设配置
-   - 小流量测试拟人化效果
-   - 收集用户反馈
-
-3. **第三阶段**（全量）
-   - 全量开启缓存和人设
-   - 持续监控和优化
-   - A/B 测试不同策略
-
---
-
-## 十、联系方式
-
-如有问题，请联系：
- **技术支持**：查看 `docs/flow-cache-usage.md` 和 `docs/prompt-persona-guide.md`
- **问题反馈**：提交 Issue 到项目仓库
--- a/docs/flow-cache-usage.md
+++ b/docs/flow-cache-usage.md
@ -1,353 +0,0 @@
-# FlowEngine 缓存机制使用文档
-
-## 概述
-
-为 FlowEngine 添加了 **两层缓存机制**（L1 + L2），大幅降低数据库查询压力，提升高并发场景下的性能。
-
-## 架构设计
-
-```
-┌─────────────────────────────────────────────────────────────┐
-│                      FlowEngine                              │
-│  ┌────────────────────────────────────<E29480><E29480><EFBFBD>─────────────────┐   │
-│  │  check_active_flow()                                  │   │
-│  │    ↓                                                  │   │
-│  │  L1 Cache (进程内存)                                  │   │
-│  │    ├─ Hit → 返回 FlowInstance                        │   │
-│  │    └─ Miss → 查询 L2                                 │   │
-│  │         ↓                                             │   │
-│  │       L2 Cache (Redis)                                │   │
-│  │         ├─ Hit → 返回 + 填充 L1                      │   │
-│  │         └─ Miss → 查询数据库 + 填充 L1 + L2         │   │
-│  └──────────────────────────────────────────────────────┘   │
-└─────────────────────────────────────────────────────────────┘
-```
-
-## 缓存层级
-
-### L1 缓存（进程内存）
- **存储位置**：Python 进程内存（dict）
- **TTL**：5 分钟（300 秒）
- **容量**：无限制（建议监控内存使用）
- **适用场景**：同一进程内的重复查询
- **优势**：零网络延迟，极快响应
-
-### L2 缓存（Redis）
- **存储位置**：Redis
- **TTL**：1 小时（3600 秒）
- **Key 格式**：`flow:{tenant_id}:{session_id}`
- **适用场景**：跨进程、跨实例共享
- **优势**：多实例共享，持久化
-
-## 缓存策略
-
-### 1. 读取流程（check_active_flow）
-
-```python
-async def check_active_flow(tenant_id, session_id):
-    # 1. 尝试 L1 缓存
-    if L1_hit:
-        return instance
-
-    # 2. 尝试 L2 缓存（Redis）
-    if L2_hit:
-        populate_L1()
-        return instance
-
-    # 3. 查询数据库
-    instance = query_database()
-    if instance:
-        populate_L1()
-        populate_L2()
-
-    return instance
-```
-
-### 2. 写入流程（start / advance）
-
-```python
-# 启动流程
-instance = FlowInstance(...)
-db.add(instance)
-db.flush()
-
-# 立即填充缓存
-cache.set(tenant_id, session_id, instance)  # L1 + L2
-```
-
-### 3. 失效流程（complete / cancel）
-
-```python
-# 流程完成或取消
-instance.status = COMPLETED
-db.flush()
-
-# 立即删除缓存
-cache.delete(tenant_id, session_id)  # L1 + L2
-```
-
-## 配置说明
-
-### 环境变量（.env）
-
-```bash
-# Redis 配置
-AI_SERVICE_REDIS_URL=redis://localhost:6379/0
-AI_SERVICE_REDIS_ENABLED=true
-
-# 缓存 TTL（可选，使用默认值）
-# L1 TTL: 300s (硬编码在 FlowCache 中)
-# L2 TTL: 3600s (硬编码在 FlowCache 中)
-```
-
-### 代码配置
-
-```python
-# app/core/config.py
-class Settings(BaseSettings):
-    redis_url: str = "redis://localhost:6379/0"
-    redis_enabled: bool = True
-```
-
-## 使用示例
-
-### 基本使用（自动启用）
-
-```python
-from app.services.flow.engine import FlowEngine
-from app.services.cache.flow_cache import get_flow_cache
-
-# FlowEngine 会自动使用缓存
-engine = FlowEngine(session=db_session, llm_client=llm)
-
-# 第一次查询：L1 Miss → L2 Miss → DB Query → 填充 L1 + L2
-instance = await engine.check_active_flow("tenant-001", "session-001")
-
-# 第二次查询（同一进程）：L1 Hit（< 1ms）
-instance = await engine.check_active_flow("tenant-001", "session-001")
-
-# 第三次查询（不同进程）：L1 Miss → L2 Hit（< 5ms）
-instance = await engine.check_active_flow("tenant-001", "session-001")
-```
-
-### 手动注入缓存（测试场景）
-
-```python
-from app.services.cache.flow_cache import FlowCache
-
-# 创建自定义缓存实例
-custom_cache = FlowCache(redis_client=mock_redis)
-
-# 注入到 FlowEngine
-engine = FlowEngine(
-    session=db_session,
-    llm_client=llm,
-    flow_cache=custom_cache,  # 自定义缓存
-)
-```
-
-### 禁用缓存（调试场景）
-
-```bash
-# 方法 1: 环境变量
-AI_SERVICE_REDIS_ENABLED=false
-
-# 方法 2: 代码
-cache = FlowCache()
-cache._enabled = False
-```
-
-## 性能对比
-
-### 无缓存（原始实现）
-
-```
-1000 并发会话，每个会话 10 次查询
- 总查询数：10,000 次
- 数据库负载：10,000 次 SELECT
- 平均响应时间：50ms（数据库查询）
- 总耗时：500s
-```
-
-### 有缓存（L1 + L2）
-
-```
-1000 并发会话，每个会话 10 次查询
- 总查询数：10,000 次
- L1 命中：9,000 次（90%）
- L2 命中：900 次（9%）
- 数据库查询：100 次（1%）
- 平均响应时间：< 1ms（L1）/ 5ms（L2）/ 50ms（DB）
- 总耗时：< 10s
-```
-
-**性能提升**：50 倍+
-
-## 监控指标
-
-### 关键指标
-
-1. **缓存命中率**
-   - L1 命中率：`L1_hits / total_queries`
-   - L2 命中率：`L2_hits / total_queries`
-   - 目标：L1 > 80%，L2 > 15%
-
-2. **响应时间**
-   - L1 响应时间：< 1ms
-   - L2 响应时间：< 5ms
-   - DB 响应时间：< 50ms
-
-3. **数据库负载**
-   - 查询次数：应降低 90%+
-   - 连接池使用率：应降低 80%+
-
-### 日志示例
-
-```log
-[FlowEngine] Cache hit: tenant=tenant-001, session=session-001
-[FlowEngine] Cache populated: tenant=tenant-001, session=session-001
-[FlowEngine] Cache invalidated on completion: tenant=tenant-001, session=session-001
-```
-
-## 故障处理
-
-### Redis 连接失败
-
-```python
-# 自动降级：缓存失效，直接查询数据库
-[FlowCache] Failed to connect to Redis: Connection refused
-# 系统继续正常运行，只是性能下降
-```
-
-### 缓存数据损坏
-
-```python
-# 自动降级：反序列化失败，查询数据库
-[FlowCache] Failed to get from cache: JSONDecodeError
-# 系统继续正常运行
-```
-
-### L1 缓存内存占用过高
-
-```python
-# 解决方案 1: 降低 L1 TTL
-FlowCache._local_cache_ttl = 60  # 从 300s 降到 60s
-
-# 解决方案 2: 添加 LRU 淘汰（未来优化）
-from cachetools import TTLCache
-FlowCache._local_cache = TTLCache(maxsize=1000, ttl=300)
-```
-
-## 最佳实践
-
-### 1. 生产环境配置
-
-```bash
-# 使用独立的 Redis 实例
-AI_SERVICE_REDIS_URL=redis://redis-cache:6379/0
-
-# 启用持久化（AOF）
-redis-server --appendonly yes
-
-# 设置最大内存（避免 OOM）
-redis-server --maxmemory 2gb --maxmemory-policy allkeys-lru
-```
-
-### 2. 多实例部署
-
-```yaml
-# docker-compose.yml
-services:
-  ai-service-1:
-    environment:
-      - AI_SERVICE_REDIS_URL=redis://redis:6379/0
-
-  ai-service-2:
-    environment:
-      - AI_SERVICE_REDIS_URL=redis://redis:6379/0
-
-  redis:
-    image: redis:7-alpine
-    command: redis-server --appendonly yes
-```
-
-### 3. 缓存预热（可选）
-
-```python
-# 系统启动时预热热点数据
-async def warmup_cache():
-    active_flows = await db.query(FlowInstance).filter(
-        FlowInstance.status == "active"
-    ).limit(1000).all()
-
-    for flow in active_flows:
-        await cache.set(flow.tenant_id, flow.session_id, flow)
-```
-
-## 测试
-
-### 运行单元测试
-
-```bash
-cd ai-service
-pytest tests/test_flow_cache.py -v
-```
-
-### 测试覆盖
-
- ✅ L1 缓存命中
- ✅ L2 缓存命中
- ✅ 缓存失效
- ✅ 缓存过期
- ✅ 序列化/反序列化
- ✅ Redis 禁用场景
- ✅ 错误处理
-
-## 未来优化
-
-### 1. 添加 LRU 淘汰策略
-
-```python
-from cachetools import TTLCache
-
-class FlowCache:
-    _local_cache = TTLCache(maxsize=1000, ttl=300)
-```
-
-### 2. 添加缓存统计
-
-```python
-class FlowCache:
-    def __init__(self):
-        self._stats = {
-            "l1_hits": 0,
-            "l2_hits": 0,
-            "db_queries": 0,
-        }
-
-    async def get_stats(self):
-        return self._stats
-```
-
-### 3. 支持批量查询
-
-```python
-async def get_many(
-    self,
-    keys: list[tuple[str, str]],
-) -> dict[tuple[str, str], FlowInstance]:
-    """批量查询多个会话的流程状态"""
-    pass
-```
-
-## 总结
-
-通过添加两层缓存机制，FlowEngine 的性能得到了显著提升：
-
- ✅ **数据库负载降低 90%+**
- ✅ **响应时间降低 50 倍+**
- ✅ **支持高并发场景**（1000+ 并发会话）
- ✅ **自动降级**（Redis 故障时仍可用）
- ✅ **多实例共享**（L2 缓存跨进程）
-
-**建议**：生产环境务必启用 Redis 缓存，并监控缓存命中率。
--- a/docs/progress/intent-driven-mid-platform-idmp-progress.md
+++ b/docs/progress/intent-driven-mid-platform-idmp-progress.md
@ -1,123 +0,0 @@
-# 意图驱动智能体中台改造（IDMP）- 进度文档
-
-```yaml
-context:
-  module: "intent-driven-mid-platform"
-  feature: "IDMP"
-  status: "🔄进行中"
-  version: "0.3.0"
-  active_ac_range: "AC-IDMP-13/14/15/19"
-
-spec_references:
-  requirements: "spec/intent-driven-mid-platform/requirements.md"
-  openapi_provider: "spec/intent-driven-mid-platform/openapi.provider.yaml"
-  openapi_deps: "spec/intent-driven-mid-platform/openapi.deps.yaml"
-  design: "spec/intent-driven-mid-platform/design.md"
-  tasks: "spec/intent-driven-mid-platform/tasks.md"
-  active_version: "0.2.0-0.3.0"
-
-overall_progress:
-  - [x] Phase 1: 环境准备与文档阅读 (100%) [完成]
-  - [x] Phase 2: 记忆服务增强 (100%) [AC-IDMP-13/14]
-  - [x] Phase 3: 工具调用治理 (100%) [AC-IDMP-15/19]
-  - [x] Phase 4: 单元测试与验证 (100%) [全部AC]
-
-current_phase:
-  goal: "实现记忆与工具治理（AC-IDMP-13/14/15/19）✅ 已完成"
-  sub_tasks:
-    - [x] 阅读规范文档 (✅)
-    - [x] 创建任务文档 (✅)
-    - [x] 创建 mid 模块目录结构 (✅)
-    - [x] 实现 AC-IDMP-13: 记忆召回服务 (✅)
-    - [x] 实现 AC-IDMP-14: 记忆更新服务 (✅)
-    - [x] 实现 AC-IDMP-15: 工具调用结构化记录 (✅)
-    - [x] 实现 AC-IDMP-19: Tool Registry 治理 (✅)
-    - [x] 编写单元测试 (✅ 31 tests passed)
-  
-  next_action:
-    immediate: "阶段完成，可继续实现其他 AC 或集成测试"
-    details:
-      file: "ai-service/app/services/mid/"
-      action: "已完成 AC-IDMP-13/14/15/19 实现，可集成到 dialogue 流程"
-      reference: "spec/intent-driven-mid-platform/openapi.provider.yaml"
-    constraints: "遵循现有代码风格，保持接口契约一致"
-
-technical_context:
-  module_structure: |
-    ai-service/app/
-    ├── models/mid/
-    │   ├── __init__.py          # 统一导出
-    │   ├── memory.py            # AC-IDMP-13/14 记忆模型
-    │   ├── tool_trace.py        # AC-IDMP-15 工具追踪模型
-    │   ├── tool_registry.py     # AC-IDMP-19 工具注册表模型
-    │   └── schemas.py           # 中台统一响应协议
-    └── services/mid/
-        ├── __init__.py           # 统一导出
-        ├── memory_adapter.py     # AC-IDMP-13/14 记忆适配器
-        ├── tool_call_recorder.py # AC-IDMP-15 工具调用记录器
-        └── tool_registry.py      # AC-IDMP-19 工具注册表
-
-  key_decisions:
-    - decision: "复用现有 MemoryService 架构，扩展 MemoryAdapter 实现分层记忆"
-      reason: "避免重复造轮子，保持代码一致性，同时支持 profile/facts/preferences 分层"
-      impact: "已实现三层记忆模型，支持 Prompt 注入"
-    - decision: "Tool Registry 使用内存注册表 + 可选数据库持久化"
-      reason: "高频调用的工具配置需要快速访问，同时支持动态配置更新"
-      impact: "已实现 register/execute/enable/disable/auth 等治理功能"
-    - decision: "工具调用记录使用结构化 Trace 模型"
-      reason: "符合 OpenAPI 契约定义，便于可观测性和审计"
-      impact: "已实现 ToolCallTrace 和 ToolCallBuilder，支持四类状态记录"
-
-  test_coverage:
-    - "成功路径：正常 recall/update，工具调用成功"
-    - "超时路径：recall 超时降级，工具调用超时"
-    - "错误路径：recall 失败降级，工具调用错误"
-    - "降级路径：记忆服务不可用时继续主链路"
-
-session_history:
-  - session: "Session #1 (2026-03-03)"
-    completed: ["阅读规范文档", "理解现有架构", "创建任务文档"]
-    changes: ["spec/intent-driven-mid-platform/tasks.md"]
-  - session: "Session #2 (2026-03-03)"
-    completed:
-      - "实现 MemoryAdapter (AC-IDMP-13/14)"
-      - "实现 ToolCallRecorder (AC-IDMP-15)"
-      - "实现 ToolRegistry (AC-IDMP-19)"
-      - "编写单元测试 31 个"
-    changes:
-      - "ai-service/app/models/mid/memory.py (新建)"
-      - "ai-service/app/models/mid/tool_trace.py (新建)"
-      - "ai-service/app/models/mid/tool_registry.py (新建)"
-      - "ai-service/app/models/mid/__init__.py (更新)"
-      - "ai-service/app/services/mid/memory_adapter.py (新建)"
-      - "ai-service/app/services/mid/tool_call_recorder.py (新建)"
-      - "ai-service/app/services/mid/tool_registry.py (更新)"
-      - "ai-service/app/services/mid/__init__.py (更新)"
-      - "ai-service/tests/test_mid_memory_tool.py (新建)"
-```
-
-## 变更文件清单
-
-| 文件路径 | 状态 | 关联 AC | 说明 |
-|---------|------|---------|------|
-| `models/mid/memory.py` | 新建 | AC-IDMP-13/14 | 记忆模型 (RecallRequest/Response/Profile/Fact/Preferences) |
-| `models/mid/tool_trace.py` | 新建 | AC-IDMP-15 | 工具追踪模型 |
-| `models/mid/tool_registry.py` | 新建 | AC-IDMP-19 | 工具注册表模型 |
-| `models/mid/__init__.py` | 更新 | - | 统一导出所有模型 |
-| `services/mid/memory_adapter.py` | 新建 | AC-IDMP-13/14 | 记忆适配器 |
-| `services/mid/tool_call_recorder.py` | 新建 | AC-IDMP-15 | 工具调用记录器 |
-| `services/mid/tool_registry.py` | 更新 | AC-IDMP-19 | 添加 get_tool_registry/init_tool_registry |
-| `services/mid/__init__.py` | 更新 | - | 导出新服务 |
-| `tests/test_mid_memory_tool.py` | 新建 | AC-IDMP-13/14/15/19 | 单元测试 (31 tests) |
-
-## 测试结果
-
-```
-======================== 31 passed, 1 warning in 3.64s ========================
-```
-
-覆盖路径：
- ✅ 成功路径：正常 recall/update，工具调用成功
- ✅ 超时路径：recall 超时降级，工具调用超时
- ✅ 错误路径：recall 失败降级，工具调用错误
- ✅ 降级路径：记忆服务不可用时继续主链路
--- a/docs/progress/intent-driven-script-backend-progress.md
+++ b/docs/progress/intent-driven-script-backend-progress.md
@ -1,94 +0,0 @@
---
-context:
-  module: "intent-driven-script"
-  feature: "IDS-Backend"
-  status: "✅已完成"
-  version: "1.0.0"
-  active_ac_range: "AC-IDS-01~13"
-  role: "backend"
-
-spec_references:
-  requirements: "spec/intent-driven-script/requirements.md"
-  design: "spec/intent-driven-script/design.md"
-  tasks: "spec/intent-driven-script/tasks.md"
-  openapi: "spec/intent-driven-script/openapi.admin.yaml"
-
-overall_progress:
-  - [x] Phase 1: 数据模型与 API 扩展 (100%)
-  - [x] Phase 2: 后端话术生成引擎 (100%)
-  - [x] Phase 4: 测试与验证 (100%)
-
-current_phase:
-  goal: "后端开发已完成"
-  sub_tasks:
-    - [x] Task 1.1: 扩展 Python 类型定义 ✅
-    - [x] Task 1.3: 更新 OpenAPI 契约 ✅ (已定义在 spec/)
-    - [x] Task 1.4: 验证 API 向后兼容性 ✅
-    - [x] Task 2.1: 创建 ScriptGenerator 模块 ✅
-    - [x] Task 2.2: 创建 TemplateEngine 模块 ✅
-    - [x] Task 2.3: 扩展 FlowEngine ✅
-    - [x] Task 2.4: 修改 FlowEngine.start() ✅
-    - [x] Task 2.5: 修改 FlowEngine.advance() ✅
-    - [x] Task 2.6: 实现 Fallback 机制 ✅
-    - [x] Phase 4: 单元测试 ✅ (32 tests passed)
-  
-  next_action:
-    immediate: "后端开发完成，等待前端联调"
-    details:
-      file: "N/A"
-      action: "通知前端进行联调"
-    constraints: "无"
-
-technical_context:
-  module_structure: "ai-service/app/services/flow/"
-  key_decisions:
-    - decision: "在现有 FlowStep SQLModel 中扩展字段"
-      reason: "steps 字段已经是 JSON 类型，可以直接扩展，无需数据库迁移"
-      impact: "保持向后兼容，现有流程无需修改"
-    - decision: "ScriptGenerator 和 TemplateEngine 作为独立模块"
-      reason: "遵循单一职责原则，便于测试和维护"
-      impact: "需要在 FlowEngine 中注入 LLM 客户端"
-    - decision: "LLM 调用超时设置为 2 秒"
-      reason: "对话系统需要快速响应，2 秒是可接受的上限"
-      impact: "超时后返回 fallback 话术"
-  code_snippets: |
-    # FlowStep 扩展字段
-    script_mode: str = Field(default=ScriptMode.FIXED.value)
-    intent: str | None = Field(default=None)
-    intent_description: str | None = Field(default=None)
-    script_constraints: list[str] | None = Field(default=None)
-    expected_variables: list[str] | None = Field(default=None)
-
-    # FlowEngine 话术生成
-    async def _generate_step_content(step, context, history) -> str:
-        script_mode = step.get("script_mode", "fixed")
-        if script_mode == "fixed":
-            return step.get("content", "")
-        elif script_mode == "flexible":
-            return await self._script_generator.generate(...)
-        elif script_mode == "template":
-            return await self._template_engine.fill_template(...)
-
-session_history:
-  - session: "Session #1 (2026-02-28)"
-    completed:
-      - Task 1.1: 扩展 Python 类型定义
-      - Task 2.1: 创建 ScriptGenerator 模块
-      - Task 2.2: 创建 TemplateEngine 模块
-      - Task 2.3-2.6: 扩展 FlowEngine
-      - Phase 4: 单元测试 (32 tests)
-    changes:
-      - ai-service/app/models/entities.py (新增 ScriptMode 枚举，扩展 FlowStep)
-      - ai-service/app/services/flow/script_generator.py (新建)
-      - ai-service/app/services/flow/template_engine.py (新建)
-      - ai-service/app/services/flow/engine.py (重构)
-      - ai-service/app/services/flow/__init__.py (更新导出)
-      - ai-service/tests/test_script_generator.py (新建)
-      - ai-service/tests/test_template_engine.py (新建)
-      - ai-service/tests/test_flow_engine_script_generation.py (新建)
-
-startup_guide:
-  - "Step 1: 读取本进度文档"
-  - "Step 2: 读取 spec_references 中的规范"
-  - "Step 3: 执行 next_action"
---
--- a/docs/progress/intent-driven-script-frontend-progress.md
+++ b/docs/progress/intent-driven-script-frontend-progress.md
@ -1,62 +0,0 @@
---
-context:
-  module: "intent-driven-script"
-  feature: "IDS-Frontend"
-  status: "✅已完成"
-  version: "1.0.0"
-  active_ac_range: "AC-IDS-07~10"
-  role: "frontend"
-
-spec_references:
-  requirements: "spec/intent-driven-script/requirements.md"
-  design: "spec/intent-driven-script/design.md"
-  tasks: "spec/intent-driven-script/tasks.md"
-  openapi: "spec/intent-driven-script/openapi.admin.yaml"
-
-overall_progress:
-  - [x] Phase 1: TypeScript 类型定义 (100%)
-  - [x] Phase 3: 前端配置界面 (100%)
-  - [x] Phase 4: 前后端联调测试 (100%)
-
-current_phase:
-  goal: "前端开发与联调测试全部完成"
-  sub_tasks:
-    - [x] Task 1.2: 扩展 TypeScript 类型定义
-    - [x] Task 3.1: 创建话术模式选择组件
-    - [x] Task 3.2: 创建意图配置表单
-    - [x] Task 3.3: 创建约束条件管理组件
-    - [x] Task 3.4: 创建模板话术配置组件
-    - [x] Task 3.5: 增强流程预览组件
-    - [x] Task 3.6: 前端表单校验与提交
-    - [x] Task 4.6: 前端组件测试
-    - [x] 前后端联调测试
-  
-  next_action: 
-    immediate: "前端任务全部完成"
-    details:
-      action: "所有功能已实现并验证通过"
-    constraints: "无"
-
-technical_context:
-  module_structure: "ai-service-admin/src/views/admin/script-flow/"
-  key_decisions: 
-    - "使用 el-radio-group 实现话术模式选择"
-    - "创建 ConstraintManager 组件管理约束条件"
-    - "在 FlowPreview 中增强显示意图信息"
-    - "根据话术模式动态显示不同的表单字段"
-  code_snippets: ""
-  files_modified:
-    - "ai-service-admin/src/types/script-flow.ts"
-    - "ai-service-admin/src/views/admin/script-flow/index.vue"
-    - "ai-service-admin/src/views/admin/script-flow/components/ConstraintManager.vue"
-    - "ai-service-admin/src/views/admin/script-flow/components/FlowPreview.vue"
-
-session_history:
-  - "2026-02-28: 完成所有前端任务，TypeScript编译通过，浏览器验证成功"
-  - "2026-02-28: 前后端联调测试成功，创建'意图驱动测试流程'验证灵活话术模式"
-
-startup_guide:
-  - "Step 1: 读取本进度文档"
-  - "Step 2: 读取 spec_references 中的规范"
-  - "Step 3: 与后端联调测试"
---
--- a/docs/progress/mid-agent-runtime-hardening-marh-progress.md
+++ b/docs/progress/mid-agent-runtime-hardening-marh-progress.md
@ -1,226 +0,0 @@
---
-context:
-  module: "mid-agent-runtime-hardening"
-  feature: "MARH"
-  status: "🔄进行中"
-  version: "0.1.0"
-  active_ac_range: "AC-MARH-01~12, AC-IDMP-05/20, AC-IDMP-13"
-
-spec_references:
-  requirements: "spec/mid-agent-runtime-hardening/requirements.md"
-  openapi_provider: "spec/mid-agent-runtime-hardening/openapi.provider.yaml"
-  openapi_deps: "spec/mid-agent-runtime-hardening/openapi.deps.yaml"
-  design: "spec/mid-agent-runtime-hardening/design.md"
-  tasks: "spec/mid-agent-runtime-hardening/tasks.md"
-  active_version: "0.1.0"
-
-overall_progress:
-  - "[x] Phase 1: 护栏与超时口径统一 (100%) [T-MARH-01~05]"
-  - "[x] Phase 2: 打断语义处理 (100%) [T-MARH-06~07]"
-  - "[x] Phase 3: KB 默认工具链 (100%) [T-MARH-08~09]"
-  - "[x] Phase 4: KB 动态检索工具 (100%) [T-MARH-13~16]"
-  - "[x] Phase 5: 拟人分段与观测闭环 (100%) [T-MARH-10~12]"
-  - "[x] Phase 6: 高风险检测工具 (100%) [T-MARH-17~21]"
-  - "[x] Phase 7: 记忆召回工具 (100%) [T-MARH-22~24]"
-
-current_phase:
-  goal: "memory_recall 工具已实现并集成到 Agent 主链路"
-  sub_tasks:
-    - "[x] T-MARH-01: 在 respond 主流程接入输出护栏强制执行 [AC-MARH-01]"
-    - "[x] T-MARH-02: 护栏触发信息写入 trace 与审计日志 [AC-MARH-02]"
-    - "[x] T-MARH-03: 统一 ReAct 循环上限到 3~5 [AC-MARH-07]"
-    - "[x] T-MARH-04: 统一单工具超时 <=2000ms [AC-MARH-08]"
-    - "[x] T-MARH-05: 统一全链路超时 <=8000ms 并降级 [AC-MARH-09]"
-    - "[x] T-MARH-06: 实现 interrupted_segments 重规划输入处理 [AC-MARH-03]"
-    - "[x] T-MARH-07: 实现中断异常兜底逻辑 [AC-MARH-04]"
-    - "[x] T-MARH-08: 在 Agent 模式接入默认 KB 检索工具调用 [AC-MARH-05]"
-    - "[x] T-MARH-09: 实现 KB 失败时可观测降级路径 [AC-MARH-06]"
-    - "[x] T-MARH-10: 实现分段策略组件（语义/长度切分）[AC-MARH-10]"
-    - "[x] T-MARH-11: 实现 delay 策略租户化配置 [AC-MARH-11]"
-    - "[x] T-MARH-12: 补齐运行时观测字段与统计 [AC-MARH-12]"
-    - "[x] T-MARH-13: 实现 MetadataFilterBuilder 组件 [AC-MARH-05]"
-    - "[x] T-MARH-14: 实现 kb_search_dynamic 工具并注册到 ToolRegistry [AC-MARH-05/06]"
-    - "[x] T-MARH-15: 在 Agent 主链路集成 kb_search_dynamic 工具 [AC-MARH-05]"
-    - "[x] T-MARH-16: 添加 KbSearchDynamicResult 数据模型 [AC-MARH-05/06]"
-    - "[x] T-MARH-17: 实现 HighRiskCheckTool 工具（元数据驱动）[AC-IDMP-05/20]"
-    - "[x] T-MARH-18: 添加 HighRiskCheckResult 数据模型 [AC-IDMP-05/20]"
-    - "[x] T-MARH-19: 注册 high_risk_check 工具到 ToolRegistry [AC-IDMP-05]"
-    - "[x] T-MARH-20: 在 dialogue 主链路集成 high_risk_check（高风险优先）[AC-IDMP-05/20]"
-    - "[x] T-MARH-21: 更新 policy_router 支持高风险检测结果 [AC-IDMP-05/20]"
-    - "[x] T-MARH-22: 实现 MemoryRecallTool 工具 [AC-IDMP-13]"
-    - "[x] T-MARH-23: 添加 MemoryRecallResult 数据模型 [AC-IDMP-13]"
-    - "[x] T-MARH-24: 在 Agent 主链路集成 memory_recall [AC-IDMP-13]"
-  
-  next_action:
-    immediate: "验证代码编译和语法检查"
-    details:
-      file: "ai-service/app/services/mid/memory_recall_tool.py:1"
-      action: "执行 py_compile / ruff check 验证代码质量"
-      reference: "spec/mid-agent-runtime-hardening/runtime-iteration-and-tools-tracking.md:AC-IDMP-13"
-    constraints: "验证 AC-IDMP-13 验收标准"
-
-technical_context:
-  module_structure: |
-    ai-service/app/
-    ├── api/mid/dialogue.py              # 主入口 respond_dialogue [AC-MARH-01~12, AC-IDMP-05/20, AC-IDMP-13]
-    ├── services/mid/
-    │   ├── agent_orchestrator.py        # ReAct 循环控制 [AC-MARH-07]
-    │   ├── timeout_governor.py          # 超时治理 [AC-MARH-08/09]
-    │   ├── trace_logger.py              # 追踪日志 [AC-MARH-02/03/12]
-    │   ├── output_guardrail_executor.py # 输出护栏执行器 [AC-MARH-01/02]
-    │   ├── interrupt_context_enricher.py# 中断上下文增强 [AC-MARH-03/04]
-    │   ├── default_kb_tool_runner.py    # KB 默认工具执行器 [AC-MARH-05/06]
-    │   ├── metadata_filter_builder.py   # 元数据过滤器构建器 [AC-MARH-05]
-    │   ├── kb_search_dynamic_tool.py    # KB 动态检索工具 [AC-MARH-05/06]
-    │   ├── high_risk_check_tool.py      # 高风险检测工具 [AC-IDMP-05/20]
-    │   ├── memory_recall_tool.py        # 记忆召回工具 [AC-IDMP-13] ★新增
-    │   ├── policy_router.py             # 策略路由器 [AC-IDMP-02/05/16/20]
-    │   ├── segment_humanizer.py         # 分段拟人化组件 [AC-MARH-10/11]
-    │   └── runtime_observer.py          # 运行时观测器 [AC-MARH-12]
-    ├── services/guardrail/
-    │   └── output_filter.py             # 输出护栏
-    └── models/mid/schemas.py            # 数据模型 [AC-MARH-05/11/12, AC-IDMP-05/20, AC-IDMP-13]
-  
-  key_decisions:
-    - decision: "复用现有 OutputFilter 组件，通过 OutputGuardrailExecutor 封装"
-      reason: "避免重复实现，保持代码一致性"
-      impact: "OutputGuardrailExecutor 在 dialogue.py 中注入并强制调用"
-    - decision: "全链路超时从 30000ms 调整为 8000ms"
-      reason: "AC-MARH-09 要求全链路 <=8000ms"
-      impact: "timeout_governor.py 的 DEFAULT_END_TO_END_TIMEOUT_MS 已调整为 8000"
-    - decision: "新增 InterruptContextEnricher 组件处理 interrupted_segments"
-      reason: "AC-MARH-03/04 要求打断语义可消费、可兜底"
-      impact: "新建组件文件，在 respond 流程中调用"
-    - decision: "新增 MetadataFilterBuilder 组件实现元数据驱动过滤"
-      reason: "支持动态参数生成，无需改代码即可生效"
-      impact: "复用现有元数据字段定义能力，基于字段配置动态装配过滤参数"
-    - decision: "新增 kb_search_dynamic 工具替代固定入参的 KB 检索"
-      reason: "AC-MARH-05 要求 Agent 默认基于 KB 事实回答"
-      impact: "工具注册到 ToolRegistry，在 Agent 模式下自动调用"
-    - decision: "新增 high_risk_check 工具实现元数据驱动的高风险检测"
-      reason: "AC-IDMP-05/20 要求高风险场景最小集可配置，支持多租户隔离"
-      impact: "工具从 HighRiskPolicy 表读取规则，支持关键词+正则匹配，高风险优先于普通意图路由"
-    - decision: "新增 memory_recall 工具实现短期可用记忆注入"
-      reason: "AC-IDMP-13 要求对话前读取用户可用记忆，减少重复追问"
-      impact: "工具读取 profile/facts/preferences/last_summary/slots，超时 <=1000ms，失败不阻断主链路"
-
-  code_snippets: |
-    # TraceInfo 新增字段 (schemas.py)
-    guardrail_triggered: bool | None
-    guardrail_rule_id: str | None
-    interrupt_consumed: bool | None
-    kb_tool_called: bool | None
-    kb_hit: bool | None
-    fallback_reason_code: str | None
-    react_iterations: int | None
-    timeout_profile: TimeoutProfile | None
-    segment_stats: SegmentStats | None
-    
-    # TimeoutProfile 更新 (schemas.py)
-    end_to_end_timeout_ms: int = Field(default=8000, le=8000)
-    
-    # KbSearchDynamicResult 新增 (schemas.py)
-    class KbSearchDynamicResultSchema(BaseModel):
-        success: bool
-        hits: list[KbSearchDynamicHit]
-        applied_filter: dict[str, Any]
-        missing_required_slots: list[MissingRequiredSlot]
-        filter_debug: dict[str, Any]
-        fallback_reason_code: str | None
-        duration_ms: int
-    
-    # HighRiskCheckResult 新增 (schemas.py)
-    class HighRiskCheckResult(BaseModel):
-        matched: bool
-        risk_scenario: HighRiskScenario | None
-        confidence: float
-        recommended_mode: ExecutionMode | None
-        rule_id: str | None
-        reason: str | None
-        fallback_reason_code: str | None
-        duration_ms: int
-        matched_text: str | None
-        matched_pattern: str | None
-    
-    # MemoryRecallResult 新增 (schemas.py)
-    class SlotSource(str, Enum):
-        USER_CONFIRMED = "user_confirmed"
-        RULE_EXTRACTED = "rule_extracted"
-        LLM_INFERRED = "llm_inferred"
-        DEFAULT = "default"
-    
-    class MemorySlot(BaseModel):
-        key: str
-        value: Any
-        source: SlotSource
-        confidence: float
-        updated_at: str | None
-    
-    class MemoryRecallResult(BaseModel):
-        profile: dict[str, Any]
-        facts: list[str]
-        preferences: dict[str, Any]
-        last_summary: str | None
-        slots: dict[str, MemorySlot]
-        missing_slots: list[str]
-        fallback_reason_code: str | None
-        duration_ms: int
-
-session_history:
-  - session: "Session #1 (2026-03-05)"
-    completed:
-      - "T-MARH-01~07: Phase 1 护栏与超时口径统一 + Phase 2 打断语义处理"
-    changes:
-      - "创建 output_guardrail_executor.py [AC-MARH-01/02]"
-      - "创建 interrupt_context_enricher.py [AC-MARH-03/04]"
-      - "更新 timeout_governor.py 超时配置 [AC-MARH-08/09]"
-      - "更新 agent_orchestrator.py ReAct 循环控制 [AC-MARH-07]"
-      - "更新 trace_logger.py 添加新字段 [AC-MARH-02/03/12]"
-      - "更新 schemas.py 添加 trace 字段和 SegmentStats"
-      - "更新 dialogue.py 集成护栏和中断处理"
-    verification:
-      - "py_compile: 所有文件编译通过"
-      - "ruff check: 仅 4 个 F841 未使用变量警告（不影响功能）"
-
-  - session: "Session #2 (2026-03-05)"
-    completed:
-      - "T-MARH-13~16: Phase 4 KB 动态检索工具（元数据驱动）"
-    changes:
-      - "创建 metadata_filter_builder.py [AC-MARH-05]"
-      - "创建 kb_search_dynamic_tool.py [AC-MARH-05/06]"
-      - "更新 schemas.py 添加 KbSearchDynamicResult 相关模型 [AC-MARH-05/06]"
-      - "更新 dialogue.py 注册 kb_search_dynamic 工具并集成到 Agent 主链路 [AC-MARH-05]"
-      - "更新 tasks.md 添加 Phase 4 任务"
-    verification:
-      - "待执行: py_compile / ruff check"
-
-  - session: "Session #3 (2026-03-05)"
-    completed:
-      - "T-MARH-17~21: Phase 6 高风险检测工具（元数据驱动）"
-    changes:
-      - "创建 high_risk_check_tool.py [AC-IDMP-05/20]"
-      - "更新 schemas.py 添加 HighRiskCheckResult 模型 [AC-IDMP-05/20]"
-      - "更新 dialogue.py 注册 high_risk_check 工具并集成到主链路 [AC-IDMP-05/20]"
-      - "更新 policy_router.py 添加 route_with_high_risk_check 方法 [AC-IDMP-05/20]"
-      - "更新 tasks.md 添加 Phase 5 任务"
-      - "更新进度文档"
-    verification:
-      - "待执行: py_compile / ruff check"
-
-  - session: "Session #4 (2026-03-05)"
-    completed:
-      - "T-MARH-22~24: Phase 7 记忆召回工具"
-    changes:
-      - "创建 memory_recall_tool.py [AC-IDMP-13]"
-      - "更新 schemas.py 添加 MemoryRecallResult, MemorySlot, SlotSource 模型 [AC-IDMP-13]"
-      - "更新 dialogue.py 注册 memory_recall 工具并集成到 Agent 主链路 [AC-IDMP-13]"
-      - "更新 runtime-iteration-and-tools-tracking.md 工具台账"
-      - "更新进度文档"
-    verification:
-      - "待执行: py_compile / ruff check"
-
-startup_guide:
-  - "Step 1: 读取本进度文档（了解当前位置与下一步）"
-  - "Step 2: 读取 spec/mid-agent-runtime-hardening/ 目录下的规范文件"
-  - "Step 3: 验证代码编译和语法检查"
-  - "Step 4: 执行联调测试验证 memory_recall 工具"
--- a/docs/prompt-persona-guide.md
+++ b/docs/prompt-persona-guide.md
@ -1,424 +0,0 @@
-# Prompt 模板人设配置指南
-
-## 概述
-
-Prompt 模板系统支持通过**内置变量**配置 AI 人设，实现拟人化对话。本文档介绍如何使用人设变量提升对话质量。
-
-## 人设变量列表
-
-### 核心人设变量
-
-| 变量名 | 描述 | 默认值 | 示例 |
-|--------|------|--------|------|
-| `{{persona_name}}` | AI 人设名称 | AI助手 | 小智、小美、客服小王 |
-| `{{persona_personality}}` | AI 性格特点 | 热情、耐心、专业 | 活泼开朗、沉稳专业、幽默风趣 |
-| `{{persona_tone}}` | AI 说话风格 | 亲切自然，使用口语化表达 | 正式严谨、轻松活泼、温柔体贴 |
-| `{{brand_name}}` | 品牌名称 | 我们公司 | 京东、淘宝、美团 |
-
-### 上下文变量
-
-| 变量名 | 描述 | 示例 |
-|--------|------|------|
-| `{{current_time}}` | 当前时间 | 2026-03-02 14:30:00 |
-| `{{channel_type}}` | 渠道类型 | web / wechat / phone / app |
-| `{{user_name}}` | 用户名称 | 张三 |
-| `{{context}}` | 检索上下文 | 知识库检索结果 |
-| `{{query}}` | 用户问题 | 如何退货？ |
-| `{{history}}` | 对话历史 | 最近 3 轮对话 |
-
-## 使用场景
-
-### 场景 1：客服对话（亲切型）
-
-**配置示例**：
-
-```
-场景：chat（对话场景）
-模板名称：客服对话 - 亲切型
-
-系统指令：
-你是{{brand_name}}的客服代表，名字叫{{persona_name}}。
-
-【你的性格】
-{{persona_personality}}
-
-【说话风格】
-{{persona_tone}}
-
-【当前时间】
-{{current_time}}
-
-【用户信息】
-用户名：{{user_name}}
-渠道：{{channel_type}}
-
-【对话历史】
-{{history}}
-
-【用户问题】
-{{query}}
-
-【知识库参考】
-{{context}}
-
-请根据以上信息，用自然、亲切的语气回答用户问题。
-```
-
-**变量配置**：
-
-```json
-{
-  "persona_name": "小美",
-  "persona_personality": "热情、耐心、善解人意",
-  "persona_tone": "亲切自然，像朋友聊天一样，使用口语化表达",
-  "brand_name": "京东"
-}
-```
-
-**效果对比**：
-
-❌ **无人设**：
-```
-用户：我想退货
-AI：请提供订单号。
-```
-
-✅ **有人设**：
-```
-用户：我想退货
-小美：好的呢，我帮您处理退货。请问您的订单号是多少呀？
-```
-
---
-
-### 场景 2：专业咨询（正式型）
-
-**配置示例**：
-
-```
-场景：qa（问答场景）
-模板名称：专业咨询 - 正式型
-
-系统指令：
-您是{{brand_name}}的专业顾问{{persona_name}}。
-
-【专业特点】
-{{persona_personality}}
-
-【沟通风格】
-{{persona_tone}}
-
-【用户咨询】
-{{query}}
-
-【参考资料】
-{{context}}
-
-请提供专业、准确的解答。
-```
-
-**变量配置**：
-
-```json
-{
-  "persona_name": "李顾问",
-  "persona_personality": "专业、严谨、权威",
-  "persona_tone": "正式专业，使用规范用语，避免口语化",
-  "brand_name": "某银行"
-}
-```
-
-**效果对比**：
-
-❌ **无人设**：
-```
-用户：贷款利率是多少？
-AI：年利率 4.5%。
-```
-
-✅ **有人设**：
-```
-用户：贷款利率是多少？
-李顾问：您好，根据您的咨询，我行当前个人住房贷款年利率为 4.5%（LPR + 基点）。具体利率会根据您的信用状况和贷款期限有所调整，建议您携带相关材料到网点详细咨询。
-```
-
---
-
-### 场景 3：多渠道适配
-
-**微信渠道（活泼型）**：
-
-```json
-{
-  "persona_name": "小智",
-  "persona_personality": "活泼、幽默、贴心",
-  "persona_tone": "轻松活泼，可以使用表情符号，语气亲切",
-  "channel_type": "wechat"
-}
-```
-
-**效果**：
-```
-小智：好哒！我帮您查一下订单状态哦 😊
-```
-
-**电话渠道（口语型）**：
-
-```json
-{
-  "persona_name": "客服小王",
-  "persona_personality": "耐心、清晰、友好",
-  "persona_tone": "口语化表达，避免书面语，适合电话场景",
-  "channel_type": "phone"
-}
-```
-
-**效果**：
-```
-客服小王：好的 您稍等 我帮您查一下订单状态
-```
-
-**Web 渠道（标准型）**：
-
-```json
-{
-  "persona_name": "在线客服",
-  "persona_personality": "专业、高效、友好",
-  "persona_tone": "标准客服用语，清晰简洁",
-  "channel_type": "web"
-}
-```
-
-**效果**：
-```
-在线客服：好的，我帮您查询订单状态，请稍候。
-```
-
-## 配置步骤
-
-### 1. 创建 Prompt 模板
-
-1. 进入管理后台 → **Prompt 模板管理**
-2. 点击 **新建模板**
-3. 填写基本信息：
-   - 模板名称：`客服对话 - 亲切型`
-   - 场景：`chat`（对话场景）
-   - 描述：`适用于日常客服对话，语气亲切自然`
-
-### 2. 编写系统指令
-
-在 **系统指令** 文本框中输入：
-
-```
-你是{{brand_name}}的客服代表，名字叫{{persona_name}}。
-
-【你的性格】
-{{persona_personality}}
-
-【说话风格】
-{{persona_tone}}
-
-【用户问题】
-{{query}}
-
-【知识库参考】
-{{context}}
-
-请用自然、亲切的语气回答用户问题。
-```
-
-### 3. 配置自定义变量
-
-点击 **自定义变量** 区域的 **添加变量**：
-
-| 变量名 | 描述 | 默认值 |
-|--------|------|--------|
-| `persona_name` | AI 名称 | 小美 |
-| `persona_personality` | 性格特点 | 热情、耐心、善解人意 |
-| `persona_tone` | 说话风格 | 亲切自然，像朋友聊天一样 |
-| `brand_name` | 品牌名称 | 京东 |
-
-### 4. 预览效果
-
-点击 **预览** 按钮，输入测试变量值，查看生成的 Prompt。
-
-### 5. 发布模板
-
-点击 **创建** → **发布** → 系统开始使用新模板。
-
-## 最佳实践
-
-### 1. 人设一致性
-
-**原则**：同一品牌/渠道的人设应保持一致。
-
-❌ **错误示例**（人设混乱）：
-```
-第一轮：小美（活泼）：好哒！我帮您查一下哦 😊
-第二轮：客服（正式）：您好，根据查询结果...
-```
-
-✅ **正确示例**（人设一致）：
-```
-第一轮：小美：好哒！我帮您查一下哦 😊
-第二轮：小美：查到啦！您的订单已经发货了呢~
-```
-
-### 2. 渠道差异化
-
-**原则**：不同渠道使用不同的人设风格。
-
-| 渠道 | 人设风格 | 特点 |
-|------|----------|------|
-| 微信 | 活泼亲切 | 可用表情、语气词 |
-| 电话 | 口语自然 | 避免书面语、标点 |
-| Web | 标准专业 | 清晰简洁、规范 |
-| App | 简洁高效 | 快速响应、直接 |
-
-### 3. 约束条件
-
-**在系统指令中添加约束**：
-
-```
-【必须遵守】
-✓ 使用{{persona_tone}}的语气
-✓ 体现{{persona_personality}}的性格
-✓ 回答长度控制在 50 字以内
-✓ 优先使用知识库{{context}}中的信息
-
-【禁止出现】
-✗ 机械重复（如"请问您的订单号是多少？请问您的订单号是多少？"）
-✗ 过度客套（如"非常感谢您的理解与支持，祝您生活愉快！"）
-✗ 生硬模板（如"尊敬的用户您好"）
-✗ 与{{persona_personality}}不符的表达
-```
-
-### 4. Few-shot 示例
-
-**添加参考示例**：
-
-```
-【参考示例】
-任务：获取订单号
-✓ 好的，请问您的订单号是多少呢？
-✓ 好的，麻烦您提供一下订单号，我帮您查询
-✗ 请提供订单号。（太生硬）
-```
-
-## 高级用法
-
-### 1. 动态人设切换
-
-**场景**：根据用户情绪动态调整人设。
-
-```python
-# 检测用户情绪
-if user_emotion == "angry":
-    persona_tone = "温柔体贴，表达同理心，安抚情绪"
-elif user_emotion == "happy":
-    persona_tone = "轻松活泼，分享喜悦"
-else:
-    persona_tone = "亲切自然，使用口语化表达"
-```
-
-### 2. 多语言人设
-
-**场景**：支持多语言客服。
-
-```json
-{
-  "persona_name": "Lily",
-  "persona_personality": "Friendly, Patient, Professional",
-  "persona_tone": "Natural and conversational, like talking to a friend",
-  "brand_name": "Amazon"
-}
-```
-
-### 3. 角色扮演
-
-**场景**：特定行业的专业角色。
-
-```json
-{
-  "persona_name": "Dr. Wang",
-  "persona_personality": "专业、权威、负责",
-  "persona_tone": "医生口吻，专业但不冷漠，关心患者",
-  "brand_name": "某医院"
-}
-```
-
-## 效果评估
-
-### 评估指标
-
-1. **用户满意度**：对话结束后的评分
-2. **转人工率**：AI 无法解决转人工的比例
-3. **对话轮次**：完成任务所需的对话轮数
-4. **用户留存**：用户是否愿意再次使用
-
-### A/B 测试
-
-**对照组**（无人设）：
- 用户满意度：3.2/5
- 转人工率：35%
- 平均对话轮次：8 轮
-
-**实验组**（有人设）：
- 用户满意度：4.5/5
- 转人工率：18%
- 平均对话轮次：5 轮
-
-**结论**：人设配置显著提升用户体验。
-
-## 常见问题
-
-### Q1：人设变量不生效？
-
-**原因**：变量未正确配置或模板未发布。
-
-**解决**：
-1. 检查变量名是否正确（区分大小写）
-2. 确认模板已发布
-3. 查看日志确认变量替换是否成功
-
-### Q2：人设风格不稳定？
-
-**原因**：Prompt 约束不够强。
-
-**解决**：
-1. 添加更多约束条件
-2. 提供 Few-shot 示例
-3. 增加负面示例（禁止出现的表达）
-
-### Q3：不同渠道如何配置不同人设？
-
-**方案 1**：创建多个模板
- `客服对话 - 微信渠道`
- `客服对话 - 电话渠道`
- `客服对话 - Web 渠道`
-
-**方案 2**：使用 `{{channel_type}}` 变量
-```
-【说话风格】
-{% if channel_type == 'wechat' %}
-轻松活泼，可以使用表情符号
-{% elif channel_type == 'phone' %}
-口语化表达，避免书面语
-{% else %}
-标准客服用语，清晰简洁
-{% endif %}
-```
-
-## 总结
-
-通过合理配置人设变量，可以显著提升 AI 客服的拟人化程度：
-
- ✅ **性格鲜明**：通过 `persona_personality` 定义性格
- ✅ **语气自然**：通过 `persona_tone` 控制说话风格
- ✅ **品牌一致**：通过 `brand_name` 统一品牌形象
- ✅ **渠道适配**：通过 `channel_type` 差异化表达
-
-**建议**：
-1. 为每个品牌/渠道创建专属人设
-2. 定期 A/B 测试优化人设配置
-3. 收集用户反馈持续改进
--- a/spec/intent-driven-mid-platform/design.md
+++ b/spec/intent-driven-mid-platform/design.md
@ -1,147 +0,0 @@
-# 意图驱动智能体中台改造（仅中台侧）设计文档
-
-## 1. 设计目标
-
-围绕 `requirements.md` 中 AC-IDMP-01~10，给出可实施的中台设计：
- 对外协议统一
- 决策模式可治理
- 高风险场景可强接管
- 工具链路可观测、可降级
-
---
-
-## 2. 总体架构
-
-1. 请求入口层
- `DialogueController`：接收渠道请求、基础校验、注入 request_id
-
-2. 策略路由层
- `PolicyRouter`：根据意图、风险、置信度、会话模式确定执行路径
-
-3. 执行层
- `AgentOrchestrator`：ReAct 循环与工具编排
- `MicroFlowExecutor`：高风险 SOP 微流程执行
- `FixedResponder`：固定回复
- `TransferResponder`：转人工兜底
-
-4. 依赖适配层
- `MetadataAdapter`：意图驱动检索链路
- `MemoryAdapter`：recall/update
-
-5. 护栏与输出层
- `OutputGuardrail`：合规与承诺边界校验
- `SegmentFormatter`：统一封装 `segments[] + trace`
-
-6. 可观测与审计
- `TraceLogger`：记录 mode、tool_calls、fallback_reason_code、generation_id
-
---
-
-## 3. 关键流程
-
-## 3.1 会话响应主流程
-1. 入参校验：`session_id/user_message/history`
-2. 读取会话模式（BOT_ACTIVE/HUMAN_ACTIVE）
-3. 召回记忆（可失败降级）
-4. `PolicyRouter` 决策模式
-5. 执行模式链路
-6. 输出护栏校验
-7. 格式化并返回 `segments[] + trace`
-8. 异步记录审计日志
-
-## 3.2 打断重入流程
-1. 同一 session 收到新请求，生成新 `generation_id`
-2. 执行层只处理最新 generation
-3. 旧 generation 结果写审计但不对外生效
-
-## 3.3 人工模式流程
-1. 模式接口将 session 置为 `HUMAN_ACTIVE`
-2. 会话响应接口直接走 `transfer` 结果或返回人工处理提示
-3. 仍接受消息上报，保持数据闭环
-
---
-
-## 4. 数据结构设计
-
-## 4.1 入参模型（核心）
- `session_id: string`
- `user_message: string`
- `history: Message[]`（仅已送达）
- `interrupted_segments?: Segment[]`
-
-## 4.2 出参模型（核心）
- `segments: Segment[]`
-  - `segment_id`
-  - `text`
-  - `delay_after`
- `trace`
-  - `mode`
-  - `intent`
-  - `tools_used`
-  - `fallback_reason_code`
-  - `guardrail_triggered`
-  - `request_id`
-  - `generation_id`
-
---
-
-## 5. 决策规则设计
-
-## 5.1 PolicyRouter 决策矩阵
- 高风险命中（退款/投诉/隐私承诺/转人工）=> `micro_flow` 或 `transfer`
- 低风险且意图清晰 => `agent`
- 工具不可用或置信度低 => `fixed`
- 人工模式开启 => `transfer`
-
-## 5.2 降级策略
- 工具超时（单工具 2s）=> 降级 `fixed`
- 连续关键工具失败 => 降级 `transfer`
- 护栏拦截 => 强制 `micro_flow` 或 `transfer`
-
---
-
-## 6. 检索与记忆策略
-
-## 6.1 Metadata 检索固定链路
-`intent -> target_kbs -> metadata_filter -> vector_search -> rerank`
-
-元数据最小过滤建议：
- `grade`
- `subject`
- `scene`
- `flow_step`
- `intent_type`
- `status=active`
-
-## 6.2 Memory 接入
- 对话前 `recall(user_id, session_id)`
- 对话后异步 `update(messages)`
- recall 失败不阻断主链路
-
---
-
-## 7. 可观测与治理
-
-必须采集：
- 会话维度：`session_id/request_id/generation_id`
- 决策维度：`mode/intent/fallback_reason_code`
- 工具维度：`tool_name/duration/result/error_code`
- 护栏维度：`guardrail_triggered/policy_code`
-
-建议告警：
- P95 时延超阈值
- tool timeout rate 超阈值
- fallback rate 激增
- transfer rate 异常上升
-
---
-
-## 8. 与 AC 的映射
-
- AC-IDMP-01/02：`SegmentFormatter + TraceLogger`
- AC-IDMP-03/04：`HistoryValidator + GenerationGuard`
- AC-IDMP-05/06：`PolicyRouter + FallbackManager`
- AC-IDMP-07：`TraceLogger`
- AC-IDMP-08：`MessageReportController`
- AC-IDMP-09：`SessionModeController`
- AC-IDMP-10：`MetadataAdapter`
--- a/spec/intent-driven-mid-platform/openapi.deps.yaml
+++ b/spec/intent-driven-mid-platform/openapi.deps.yaml
@ -1,161 +0,0 @@
-openapi: 3.0.3
-info:
-  title: Intent-Driven Mid Platform Dependency API
-  version: 0.3.0
-  x-contract-level: L1
-paths:
-  /deps/metadata/query:
-    post:
-      operationId: queryMetadata
-      summary: 查询元数据与知识内容
-      x-requirements:
-        - AC-IDMP-10
-      requestBody:
-        required: true
-        content:
-          application/json:
-            schema:
-              $ref: '#/components/schemas/MetadataQueryRequest'
-      responses:
-        '200':
-          description: 查询成功
-          content:
-            application/json:
-              schema:
-                $ref: '#/components/schemas/MetadataQueryResponse'
-
-  /deps/memory/recall:
-    post:
-      operationId: recallMemory
-      summary: 召回用户记忆
-      x-requirements:
-        - AC-IDMP-13
-      requestBody:
-        required: true
-        content:
-          application/json:
-            schema:
-              $ref: '#/components/schemas/RecallRequest'
-      responses:
-        '200':
-          description: 召回成功
-          content:
-            application/json:
-              schema:
-                $ref: '#/components/schemas/RecallResponse'
-
-  /deps/memory/update:
-    post:
-      operationId: updateMemory
-      summary: 更新用户记忆
-      x-requirements:
-        - AC-IDMP-14
-      requestBody:
-        required: true
-        content:
-          application/json:
-            schema:
-              $ref: '#/components/schemas/UpdateRequest'
-      responses:
-        '202':
-          description: 已接受异步更新
-
-components:
-  schemas:
-    MetadataQueryRequest:
-      type: object
-      required:
-        - intent
-        - target_kbs
-        - metadata_filter
-        - query
-      properties:
-        intent:
-          type: string
-        target_kbs:
-          type: array
-          items:
-            type: string
-        metadata_filter:
-          type: object
-          additionalProperties: true
-        query:
-          type: string
-
-    MetadataQueryResponse:
-      type: object
-      required:
-        - items
-      properties:
-        items:
-          type: array
-          items:
-            $ref: '#/components/schemas/MetadataItem'
-
-    MetadataItem:
-      type: object
-      required:
-        - id
-        - score
-        - content
-      properties:
-        id:
-          type: string
-        score:
-          type: number
-        content:
-          type: string
-        metadata:
-          type: object
-          additionalProperties: true
-
-    RecallRequest:
-      type: object
-      required:
-        - user_id
-        - session_id
-      properties:
-        user_id:
-          type: string
-        session_id:
-          type: string
-
-    RecallResponse:
-      type: object
-      properties:
-        profile:
-          type: object
-          description: 基础属性记忆（年级、地区、渠道等）
-          additionalProperties: true
-        facts:
-          type: array
-          description: 事实型记忆（已购课程、学习结论等）
-          items:
-            type: string
-        preferences:
-          type: object
-          description: 偏好记忆（语气偏好、关注科目等）
-          additionalProperties: true
-        last_summary:
-          type: string
-          description: 最近一次会话摘要
-
-    UpdateRequest:
-      type: object
-      required:
-        - user_id
-        - session_id
-        - messages
-      properties:
-        user_id:
-          type: string
-        session_id:
-          type: string
-        messages:
-          type: array
-          items:
-            type: object
-            additionalProperties: true
-        summary:
-          type: string
-          description: 本轮会话摘要（可选，由中台异步生成后回写）
--- a/spec/intent-driven-mid-platform/openapi.provider.yaml
+++ b/spec/intent-driven-mid-platform/openapi.provider.yaml
@ -1,338 +0,0 @@
-openapi: 3.0.3
-info:
-  title: Intent-Driven Mid Platform Provider API
-  version: 0.3.0
-  x-contract-level: L2
-paths:
-  /mid/dialogue/respond:
-    post:
-      operationId: respondDialogue
-      summary: 生成中台分段响应
-      x-requirements:
-        - AC-IDMP-01
-        - AC-IDMP-02
-        - AC-IDMP-03
-        - AC-IDMP-04
-        - AC-IDMP-05
-        - AC-IDMP-06
-        - AC-IDMP-07
-        - AC-IDMP-11
-        - AC-IDMP-12
-        - AC-IDMP-15
-        - AC-IDMP-16
-        - AC-IDMP-17
-        - AC-IDMP-18
-        - AC-IDMP-19
-        - AC-IDMP-20
-      requestBody:
-        required: true
-        content:
-          application/json:
-            schema:
-              $ref: '#/components/schemas/DialogueRequest'
-      responses:
-        '200':
-          description: 成功生成分段响应
-          content:
-            application/json:
-              schema:
-                $ref: '#/components/schemas/DialogueResponse'
-        '400':
-          description: 请求参数错误
-        '500':
-          description: 服务内部错误
-
-  /mid/messages/report:
-    post:
-      operationId: reportMessages
-      summary: 上报会话消息与事件
-      x-requirements:
-        - AC-IDMP-08
-      requestBody:
-        required: true
-        content:
-          application/json:
-            schema:
-              $ref: '#/components/schemas/MessageReportRequest'
-      responses:
-        '202':
-          description: 已接受异步处理
-        '400':
-          description: 请求参数错误
-
-  /mid/sessions/{sessionId}/mode:
-    post:
-      operationId: switchSessionMode
-      summary: 切换会话模式
-      x-requirements:
-        - AC-IDMP-09
-      parameters:
-        - name: sessionId
-          in: path
-          required: true
-          schema:
-            type: string
-      requestBody:
-        required: true
-        content:
-          application/json:
-            schema:
-              $ref: '#/components/schemas/SwitchModeRequest'
-      responses:
-        '200':
-          description: 模式切换成功
-          content:
-            application/json:
-              schema:
-                $ref: '#/components/schemas/SwitchModeResponse'
-        '400':
-          description: 请求参数错误
-
-components:
-  schemas:
-    DialogueRequest:
-      type: object
-      required:
-        - session_id
-        - user_message
-        - history
-      properties:
-        session_id:
-          type: string
-        user_id:
-          type: string
-          description: 用于记忆召回与更新的用户标识
-        user_message:
-          type: string
-          minLength: 1
-          maxLength: 2000
-        history:
-          type: array
-          description: 仅允许已送达历史
-          items:
-            $ref: '#/components/schemas/HistoryMessage'
-        interrupted_segments:
-          type: array
-          items:
-            $ref: '#/components/schemas/InterruptedSegment'
-        feature_flags:
-          $ref: '#/components/schemas/FeatureFlags'
-
-    FeatureFlags:
-      type: object
-      properties:
-        agent_enabled:
-          type: boolean
-          description: 会话级 Agent 灰度开关
-        rollback_to_legacy:
-          type: boolean
-          description: 强制回滚传统链路
-
-    HistoryMessage:
-      type: object
-      required:
-        - role
-        - content
-      properties:
-        role:
-          type: string
-          enum: [user, assistant, human]
-        content:
-          type: string
-
-    InterruptedSegment:
-      type: object
-      required:
-        - segment_id
-        - content
-      properties:
-        segment_id:
-          type: string
-        content:
-          type: string
-
-    DialogueResponse:
-      type: object
-      required:
-        - segments
-        - trace
-      properties:
-        segments:
-          type: array
-          items:
-            $ref: '#/components/schemas/Segment'
-        trace:
-          $ref: '#/components/schemas/TraceInfo'
-
-    Segment:
-      type: object
-      required:
-        - segment_id
-        - text
-        - delay_after
-      properties:
-        segment_id:
-          type: string
-        text:
-          type: string
-        delay_after:
-          type: integer
-          minimum: 0
-
-    TraceInfo:
-      type: object
-      required:
-        - mode
-      properties:
-        mode:
-          type: string
-          enum: [agent, micro_flow, fixed, transfer]
-        intent:
-          type: string
-        request_id:
-          type: string
-        generation_id:
-          type: string
-        guardrail_triggered:
-          type: boolean
-        fallback_reason_code:
-          type: string
-        react_iterations:
-          type: integer
-          minimum: 0
-          maximum: 5
-          description: ReAct 实际循环次数
-        timeout_profile:
-          $ref: '#/components/schemas/TimeoutProfile'
-        metrics_snapshot:
-          $ref: '#/components/schemas/MetricsSnapshot'
-        high_risk_policy_set:
-          type: array
-          description: 当前启用的高风险最小场景集
-          items:
-            type: string
-            enum: [refund, complaint_escalation, privacy_sensitive_promise, transfer]
-        tools_used:
-          type: array
-          items:
-            type: string
-        tool_calls:
-          type: array
-          items:
-            $ref: '#/components/schemas/ToolCallTrace'
-
-    TimeoutProfile:
-      type: object
-      properties:
-        per_tool_timeout_ms:
-          type: integer
-          maximum: 2000
-        end_to_end_timeout_ms:
-          type: integer
-          maximum: 8000
-
-    MetricsSnapshot:
-      type: object
-      properties:
-        task_completion_rate:
-          type: number
-          format: float
-        slot_completion_rate:
-          type: number
-          format: float
-        wrong_transfer_rate:
-          type: number
-          format: float
-        no_recall_rate:
-          type: number
-          format: float
-        avg_latency_ms:
-          type: number
-          format: float
-
-    ToolCallTrace:
-      type: object
-      required:
-        - tool_name
-        - duration_ms
-        - status
-      properties:
-        tool_name:
-          type: string
-        tool_type:
-          type: string
-          enum: [internal, mcp]
-        registry_version:
-          type: string
-        auth_applied:
-          type: boolean
-        duration_ms:
-          type: integer
-          minimum: 0
-        status:
-          type: string
-          enum: [ok, timeout, error, rejected]
-        error_code:
-          type: string
-        args_digest:
-          type: string
-        result_digest:
-          type: string
-
-    MessageReportRequest:
-      type: object
-      required:
-        - session_id
-        - messages
-      properties:
-        session_id:
-          type: string
-        messages:
-          type: array
-          items:
-            $ref: '#/components/schemas/ReportedMessage'
-
-    ReportedMessage:
-      type: object
-      required:
-        - role
-        - content
-        - source
-        - timestamp
-      properties:
-        role:
-          type: string
-          enum: [user, assistant, human, system]
-        content:
-          type: string
-        source:
-          type: string
-          enum: [bot, human, channel]
-        timestamp:
-          type: string
-          format: date-time
-        segment_id:
-          type: string
-
-    SwitchModeRequest:
-      type: object
-      required:
-        - mode
-      properties:
-        mode:
-          type: string
-          enum: [BOT_ACTIVE, HUMAN_ACTIVE]
-        reason:
-          type: string
-
-    SwitchModeResponse:
-      type: object
-      required:
-        - session_id
-        - mode
-      properties:
-        session_id:
-          type: string
-        mode:
-          type: string
-          enum: [BOT_ACTIVE, HUMAN_ACTIVE]
--- a/spec/intent-driven-mid-platform/requirements.md
+++ b/spec/intent-driven-mid-platform/requirements.md
@ -1,107 +0,0 @@
---
-feature_id: "IDMP"
-title: "意图驱动智能体中台改造（仅中台侧）需求规范"
-status: "draft"
-version: "0.3.0"
-active_version: "0.2.0-0.3.0"
-version_history:
-  - version: "0.3.0"
-    ac_range: "AC-IDMP-19~20"
-    description: "MCP/Tool Registry治理与高风险场景最小集显式化"
-  - version: "0.2.0"
-    ac_range: "AC-IDMP-11~18"
-    description: "ReAct约束、记忆分层、工具治理与灰度回滚补充"
-  - version: "0.1.0"
-    ac_range: "AC-IDMP-01~10"
-    description: "中台协议统一与智能编排首版"
-owners:
-  - "product"
-  - "backend"
-last_updated: "2026-03-03"
-source:
-  type: "conversation"
-  ref: "仅中台侧改造文档生成"
---
-
-# 意图驱动智能体中台改造（仅中台侧）（IDMP）
-
-## 1. 背景与目标
-
- 背景：当前中台能力与渠道协同语义不统一，打断重入、模式归因、护栏接管等关键路径缺少标准化契约。
- 目标：建立中台统一响应协议、决策编排与治理框架，在不改动渠道实现细节的前提下提升稳定性与可观测性。
- 非目标：不覆盖渠道发送实现、不覆盖管理端页面改造、不覆盖模型替换。
-
-## 2. 模块边界（Scope）
-
- 覆盖：中台会话响应、模式切换、消息上报、元数据检索、工具编排与记忆服务调用。
- 不覆盖：渠道端 SegmentDispatcher/InterruptManager/DeliveredHistoryStore 具体实现。
-
-## 3. 依赖盘点（Dependencies）
-
- 渠道侧服务（请求入口与响应消费）
- 元数据服务（KB/Intent/Flow/Prompt）
- 记忆服务（recall/update）
-
-## 4. 用户故事（User Stories）
-
- [US-IDMP-01] 作为渠道调用方，我希望中台返回统一 `segments[] + trace`，以便稳定消费并做打断重入。
- [US-IDMP-02] 作为平台治理方，我希望中台能基于策略选择 agent/micro_flow/fixed/transfer，以便平衡效果与合规。
- [US-IDMP-03] 作为运维方，我希望关键链路有完整可观测字段和指标，以便快速归因与回滚。
- [US-IDMP-04] 作为产品方，我希望高风险场景被微流程强接管，以避免不受控回复。
- [US-IDMP-05] 作为架构方，我希望 Agent 运行有明确循环与超时上限，以避免时延失控。
- [US-IDMP-06] 作为平台方，我希望记忆能力分层可管控，以提升连续会话质量。
- [US-IDMP-07] 作为平台治理方，我希望 MCP/Tool Registry 被纳入统一注册与策略治理，以便工具扩展可控。
- [US-IDMP-08] 作为业务方，我希望高风险场景最小集有明确口径，以便实施和验收一致。
-
-## 5. 验收标准（Acceptance Criteria, EARS）
-
-### v0.1.0
- [AC-IDMP-01] WHEN 渠道调用会话响应接口 THEN 中台 SHALL 返回 `segments[]`，每个分段包含 `segment_id/text/delay_after`。
- [AC-IDMP-02] WHEN 中台返回会话响应 THEN 中台 SHALL 同时返回 `trace.mode`，且其值属于 `agent|micro_flow|fixed|transfer`。
- [AC-IDMP-03] WHEN 请求包含已送达历史 THEN 中台 SHALL 仅基于已送达历史进行上下文推理，不依赖未送达内容。
- [AC-IDMP-04] WHEN 会话处于发送中发生用户新输入 THEN 中台 SHALL 支持打断重入并以最新请求代际结果为准。
- [AC-IDMP-05] WHEN 进入退款/投诉/隐私承诺/转人工等高风险场景 THEN 中台 SHALL 强制切换到 `micro_flow` 或 `transfer`。
- [AC-IDMP-06] WHEN Agent 工具调用超时或失败 THEN 中台 SHALL 按策略降级到 `fixed` 或 `transfer` 并给出 `fallback_reason_code`。
- [AC-IDMP-07] WHEN 中台执行一次响应 THEN 中台 SHALL 记录 `session_id/request_id/generation_id/mode/intent/tool_calls/guardrail_triggered` 等审计字段。
- [AC-IDMP-08] WHEN 渠道上报人工或机器人消息 THEN 中台 SHALL 接收并落库，保证会话闭环数据完整。
- [AC-IDMP-09] WHEN 渠道发起会话模式切换 THEN 中台 SHALL 更新会话模式状态并影响后续编排路径。
- [AC-IDMP-10] WHEN 执行元数据检索 THEN 中台 SHALL 执行 `intent -> target_kbs -> metadata_filter -> vector_search -> rerank` 固定链路。
-
-### v0.2.0
- [AC-IDMP-11] WHEN Agent 进入 ReAct 推理 THEN 中台 SHALL 限制循环次数在 3~5 次内，超限必须触发降级。
- [AC-IDMP-12] WHEN 发生工具调用 THEN 中台 SHALL 约束单工具超时 ≤2s 且全链路超时 ≤8s。
- [AC-IDMP-13] WHEN 进行记忆读取 THEN 中台 SHALL 在响应前执行 recall 并注入基础属性、事实记忆与偏好记忆。
- [AC-IDMP-14] WHEN 一轮会话完成 THEN 中台 SHALL 异步执行记忆更新（含会话摘要），且不阻塞主响应。
- [AC-IDMP-15] WHEN 中台调用工具 THEN 中台 SHALL 以结构化 `tool_call/tool_result` 记录参数摘要、耗时、结果与错误码。
- [AC-IDMP-16] WHEN 意图置信度低或关键信息缺失 THEN 中台 SHALL 回退到 `micro_flow` 或 `fixed`，不得继续自由 Agent 输出。
- [AC-IDMP-17] WHEN 系统进入灰度发布 THEN 中台 SHALL 支持按会话级开关启停 Agent 链路并支持回滚到传统链路。
- [AC-IDMP-18] WHEN 采集运行指标 THEN 中台 SHALL 提供任务达成率、槽位完整率、误转人工率、无召回率、平均时延等指标。
-
-### v0.3.0
- [AC-IDMP-19] WHEN 新增或调用工具能力（含 MCP 工具） THEN 中台 SHALL 通过统一 Tool Registry 完成注册、鉴权、超时策略、版本与启停治理，并将治理结果写入 trace/tool 日志。
- [AC-IDMP-20] WHEN 判定高风险场景接管范围 THEN 中台 SHALL 以“退款/投诉升级/隐私与敏感承诺/转人工”作为最小强接管场景集，且该集合可配置但不得缺省为空。
-
-## 6. 追踪映射（Traceability）
-
-| AC ID | Endpoint | 方法 | operationId | 备注 |
-|------|----------|------|-------------|------|
-| AC-IDMP-01 | /mid/dialogue/respond | POST | respondDialogue | 分段协议 |
-| AC-IDMP-02 | /mid/dialogue/respond | POST | respondDialogue | trace 模式 |
-| AC-IDMP-03 | /mid/dialogue/respond | POST | respondDialogue | 已送达历史 |
-| AC-IDMP-04 | /mid/dialogue/respond | POST | respondDialogue | 打断重入/代际 |
-| AC-IDMP-05 | /mid/dialogue/respond | POST | respondDialogue | 高风险接管 |
-| AC-IDMP-06 | /mid/dialogue/respond | POST | respondDialogue | 降级策略 |
-| AC-IDMP-07 | /mid/dialogue/respond | POST | respondDialogue | 审计观测 |
-| AC-IDMP-08 | /mid/messages/report | POST | reportMessages | 全量上报 |
-| AC-IDMP-09 | /mid/sessions/{sessionId}/mode | POST | switchSessionMode | 模式切换 |
-| AC-IDMP-10 | /deps/metadata/query | POST | queryMetadata | 元数据链路 |
-| AC-IDMP-11 | /mid/dialogue/respond | POST | respondDialogue | ReAct 循环上限 |
-| AC-IDMP-12 | /mid/dialogue/respond | POST | respondDialogue | 超时治理 |
-| AC-IDMP-13 | /deps/memory/recall | POST | recallMemory | 记忆读取 |
-| AC-IDMP-14 | /deps/memory/update | POST | updateMemory | 异步记忆更新 |
-| AC-IDMP-15 | /mid/dialogue/respond | POST | respondDialogue | 工具调用结构化记录 |
-| AC-IDMP-16 | /mid/dialogue/respond | POST | respondDialogue | 低置信度回退 |
-| AC-IDMP-17 | /mid/dialogue/respond | POST | respondDialogue | 灰度与回滚 |
-| AC-IDMP-18 | /mid/dialogue/respond | POST | respondDialogue | 运行指标输出 |
-| AC-IDMP-19 | /mid/dialogue/respond | POST | respondDialogue | MCP/Tool Registry 治理 |
-| AC-IDMP-20 | /mid/dialogue/respond | POST | respondDialogue | 高风险最小场景集 |
--- a/spec/intent-driven-mid-platform/scope.md
+++ b/spec/intent-driven-mid-platform/scope.md
@ -1,47 +0,0 @@
-# 意图驱动智能体中台改造 - 功能定界（仅中台侧）
-
-## 1. 模块边界（Module Scope）
-
-### 1.1 覆盖范围（In Scope）
- 中台会话编排：`policy_router` 决策 `agent | micro_flow | fixed | transfer`
- 中台输出协议：统一返回 `segments[] + trace`
- 中台工具编排：KB/Intent/Flow/Prompt/Memory 工具调用治理
- 中台记忆融合：会话前 recall、会话后异步更新
- 中台高风险微流程：退款/投诉/隐私承诺/转人工
- 中台可观测性：模式、工具调用、降级原因、护栏触发、请求代际
-
-### 1.2 不覆盖范围（Out of Scope）
- 渠道侧消息分段发送实现（队列、延迟、重试）
- 渠道侧中断管理实现（取消令牌、本地缓存）
- 前端/管理端页面改造
- 模型供应商切换与底层推理框架替换
-
---
-
-## 2. 依赖盘点（Dependencies）
-
- 渠道侧（Java）
-  - 提供：用户消息、已送达历史、可选中断片段
-  - 消费：中台返回 `segments[] + trace`
- 元数据与知识库服务
-  - 提供：KB 文档、intent 规则、flow 模板、prompt 模板
- 记忆服务
-  - 提供：用户长期记忆 recall / update 能力
-
---
-
-## 3. 依赖接口清单（Dependency Contracts）
-
-1. `POST /mid/dialogue/respond`（中台对渠道）
-2. `POST /mid/sessions/{sessionId}/mode`（会话模式切换）
-3. `POST /mid/messages/report`（全量消息上报）
-4. `POST /deps/metadata/query`（中台调用元数据服务）
-5. `POST /deps/memory/recall`、`POST /deps/memory/update`（中台调用记忆服务）
-
---
-
-## 4. 阶段目标
-
- Phase 1：协议统一与主链路可用
- Phase 2：高风险护栏与记忆增强
- Phase 3：稳定性治理与灰度回滚
--- a/spec/intent-driven-mid-platform/tasks.md
+++ b/spec/intent-driven-mid-platform/tasks.md
@ -1,117 +0,0 @@
-# 意图驱动智能体中台改造（IDMP）- 任务清单
-
-## 活跃版本：v0.2.0-0.3.0
-
---
-
-## Phase 1: 记忆服务增强（AC-IDMP-13/14）
-
-### Task-1.1: 记忆召回服务（AC-IDMP-13）⏳
-**验收标准**: WHEN 进行记忆读取 THEN 中台 SHALL 在响应前执行 recall 并注入基础属性、事实记忆与偏好记忆
-
-**子任务**:
- [ ] 1.1.1 创建 `services/mid/memory_adapter.py` - 记忆适配器
- [ ] 1.1.2 实现 `RecallRequest` / `RecallResponse` 数据模型
- [ ] 1.1.3 实现 `recall(user_id, session_id)` 方法，返回 profile/facts/preferences
- [ ] 1.1.4 集成到对话响应流程（响应前调用）
- [ ] 1.1.5 实现 recall 失败降级（不阻断主链路）
-
-**参考**: 
- `openapi.deps.yaml` - `/deps/memory/recall`
- `services/memory.py` - 现有 MemoryService
-
---
-
-### Task-1.2: 记忆更新服务（AC-IDMP-14）⏳
-**验收标准**: WHEN 一轮会话完成 THEN 中台 SHALL 异步执行记忆更新（含会话摘要），且不阻塞主响应
-
-**子任务**:
- [ ] 1.2.1 实现 `update(user_id, session_id, messages, summary)` 方法
- [ ] 1.2.2 创建异步任务队列（使用 asyncio.create_task）
- [ ] 1.2.3 实现会话摘要生成（可选，由中台异步生成后回写）
- [ ] 1.2.4 集成到对话响应流程（响应后异步调用）
- [ ] 1.2.5 添加错误处理与重试机制
-
-**参考**:
- `openapi.deps.yaml` - `/deps/memory/update`
-
---
-
-## Phase 2: 工具调用治理（AC-IDMP-15/19）
-
-### Task-2.1: 工具调用结构化记录（AC-IDMP-15）⏳
-**验收标准**: WHEN 中台调用工具 THEN 中台 SHALL 以结构化 `tool_call/tool_result` 记录参数摘要、耗时、结果与错误码
-
-**子任务**:
- [ ] 2.1.1 创建 `models/mid/tool_trace.py` - ToolCallTrace 数据模型
- [ ] 2.1.2 实现 `ToolCallRecorder` 服务
- [ ] 2.1.3 记录字段：tool_name, tool_type, registry_version, auth_applied, duration_ms, status, error_code, args_digest, result_digest
- [ ] 2.1.4 集成到 TraceInfo 输出
- [ ] 2.1.5 实现敏感参数脱敏（args_digest）
-
-**参考**:
- `openapi.provider.yaml` - `ToolCallTrace` schema
-
---
-
-### Task-2.2: Tool Registry 治理（AC-IDMP-19）⏳
-**验收标准**: WHEN 新增或调用工具能力（含 MCP 工具） THEN 中台 SHALL 通过统一 Tool Registry 完成注册、鉴权、超时策略、版本与启停治理，并将治理结果写入 trace/tool 日志
-
-**子任务**:
- [ ] 2.2.1 创建 `services/mid/tool_registry.py` - 工具注册表
- [ ] 2.2.2 实现 `ToolDefinition` 数据模型（name, type, version, timeout_ms, auth_required, is_enabled）
- [ ] 2.2.3 实现 `register_tool()` - 工具注册
- [ ] 2.2.4 实现 `authorize_tool()` - 工具鉴权
- [ ] 2.2.5 实现 `get_timeout_policy()` - 超时策略获取
- [ ] 2.2.6 实现 `is_tool_enabled()` - 启停状态检查
- [ ] 2.2.7 创建数据库表 `tool_registry`（可选，支持动态配置）
- [ ] 2.2.8 集成 MCP 工具支持
-
-**参考**:
- `openapi.provider.yaml` - `TraceInfo.tools_used`, `ToolCallTrace`
-
---
-
-## Phase 3: 测试与验证
-
-### Task-3.1: 单元测试 ⏳
-**覆盖路径**:
- [ ] 成功路径：正常 recall/update，工具调用成功
- [ ] 超时路径：recall 超时降级，工具调用超时
- [ ] 错误路径：recall 失败降级，工具调用错误
- [ ] 降级路径：记忆服务不可用时继续主链路
-
-**测试文件**:
- `tests/test_memory_adapter.py`
- `tests/test_tool_registry.py`
- `tests/test_tool_call_recorder.py`
-
---
-
-## 依赖关系
-
-```
-Task-1.1 (recall) ──┐
-                    ├──> Task-1.2 (update) ──> Task-3.1 (tests)
-Task-2.1 (trace) ───┘
-        │
-        └──> Task-2.2 (registry) ──> Task-3.1 (tests)
-```
-
---
-
-## 变更文件清单
-
-| 文件路径 | 状态 | 关联 AC |
-|---------|------|---------|
-| `services/mid/__init__.py` | 待创建 | - |
-| `services/mid/memory_adapter.py` | 待创建 | AC-IDMP-13/14 |
-| `services/mid/tool_registry.py` | 待创建 | AC-IDMP-19 |
-| `services/mid/tool_call_recorder.py` | 待创建 | AC-IDMP-15 |
-| `models/mid/__init__.py` | 待创建 | - |
-| `models/mid/memory.py` | 待创建 | AC-IDMP-13/14 |
-| `models/mid/tool_trace.py` | 待创建 | AC-IDMP-15 |
-| `models/mid/tool_registry.py` | 待创建 | AC-IDMP-19 |
-| `tests/test_memory_adapter.py` | 待创建 | AC-IDMP-13/14 |
-| `tests/test_tool_registry.py` | 待创建 | AC-IDMP-19 |
-| `tests/test_tool_call_recorder.py` | 待创建 | AC-IDMP-15 |
--- a/spec/intent-driven-script/metadata-intent-flow-operations.md
+++ b/spec/intent-driven-script/metadata-intent-flow-operations.md
@ -1,202 +0,0 @@
-# 元数据 + 意图识别 + 话术流程关联操作文档
-
-## 1. 目标
-
-本文用于指导运营/配置人员在管理界面完成以下闭环：
-1. 配置元数据字段
-2. 录入知识库文档并标注元数据
-3. 配置意图规则（触发路由）
-4. 配置话术流程（多轮推进）
-5. 打通“意图 -> 检索/流程 -> 话术生成”
-
-适用对象：产品、运营、实施、测试。
-
---
-
-## 2. 概念关系（先理解）
-
- **元数据（Metadata）**：用于“筛选与路由”的标签，如 `grade`、`subject`、`scene`。
- **意图规则（Intent Rule）**：用于判断“用户在说什么、系统该做什么”。
- **话术流程（Script Flow）**：用于处理“多轮引导”，例如收集年级 -> 收集薄弱点 -> 推荐课程。
-
-**执行链路**：
-
-用户消息 -> 意图匹配 -> 
- 命中 `response_type=rag`：进入知识库检索（带 metadata filter）
- 命中 `response_type=flow`：进入话术流程
- 命中 `fixed/transfer`：直接固定回复或转人工
-
---
-
-## 3. 第一步：配置元数据字段
-
-进入：`元数据配置` 页面 -> 新建字段。
-
-建议首批字段（教育场景）：
- `grade`（枚举）：初一/初二/初三/all
- `subject`（枚举）：语文/数学/英语/物理/化学/综合/all
- `scene`（枚举）：pain_point/transition/module_intro/faq/policy/closing
- `flow_step`（枚举）：step1/step2/step3/step4/step5/none
- `intent_type`（枚举）：ask_grade/ask_weak_point/module_recommend/next_action/faq_answer/compliance
- `audience`（枚举）：parent/student/all
- `priority`（数字）：1-10
- `status`（枚举）：draft/active/deprecated
-
-字段配置建议：
- `是否必填`：`grade/subject/scene/status` 建议必填
- `可过滤`：上述字段建议全部开启
- `排序特征`：仅 `priority` 建议开启
- `状态`：配置完成后由 `草稿 -> 激活`
-
-注意：字段标识只用小写字母/数字/下划线，避免后续过滤失败。
-
---
-
-## 4. 第二步：录入知识库文档并打元数据
-
-进入：`知识库文档` 页面 -> 新增文档。
-
-### 4.1 文档内容规范（你当前按行分块）
- 一行一个知识点
- 每行尽量只表达一个事实或一个建议
- 避免一行过长（建议 20~80 字）
-
-### 4.2 元数据填写规范
-示例（初一痛点文档）：
- `grade=初一`
- `subject=综合`
- `scene=pain_point`
- `flow_step=step2`
- `intent_type=ask_weak_point`
- `audience=parent`
- `status=active`
- `priority=8`
-
-### 4.3 无年级、仅学科的数据怎么填
- `grade=all`
- `subject=语文/数学/...`
- 其余按场景填写
-
-不建议因为“无年级”单独新建知识库，优先通过文档+metadata区分。
-
---
-
-## 5. 第三步：配置意图规则（路由入口）
-
-进入：`意图规则` 页面 -> 新建规则。
-
-### 5.1 必填项
- 名称
- `keywords`（关键词）和/或 `patterns`（正则）
- `response_type`：fixed / rag / flow / transfer
- 优先级
-
-### 5.2 推荐路由策略
- 课程咨询、薄弱点分析 -> `response_type=flow`
- 短问短答（价格、班型） -> `response_type=rag` 或 `fixed`
- 明确转人工 -> `response_type=transfer`
-
-### 5.3 metadata 关联
-在意图规则 metadata 中建议填写：
- `scene`
- `grade`（可选）
- `subject`（可选）
-
-用于后续检索过滤增强。
-
---
-
-## 6. 第四步：配置话术流程（与意图绑定）
-
-进入：`话术流程` 页面 -> 新建或编辑流程。
-
-### 6.1 你当前推荐流程（示例）
- Step1：确认年级
- Step2：年级特点 + 过渡到薄弱点
- Step3：确认薄弱科目/能力点
- Step4：针对薄弱点介绍课程模块
- Step5：给出下一步建议
-
-### 6.2 每步建议配置
- `script_mode = flexible`
- 配置 `intent` / `intent_description`
- 配置 `script_constraints`
- 配置 `fallback content`
- 配置 `expected_variables`（如 `grade`, `weak_points`）
-
-### 6.3 与知识库关联方式
- 在流程步骤中通过 `rag_config.tag_filter` 或上下文变量注入过滤（如 `grade`, `subject`, `scene`）
- Step2 常用 `scene=pain_point`
- Step4 常用 `scene=module_intro`
-
---
-
-## 7. 第五步：关联关系配置清单（上线前检查）
-
-### 7.1 意图 -> 流程
- 已有意图规则 `response_type=flow`
- `flow_id` 指向正确流程
-
-### 7.2 意图/上下文 -> 检索
- 命中 `response_type=rag` 或流程步骤需要RAG时
- metadata filter 至少包含 `grade/subject/scene` 之一
-
-### 7.3 文档 -> 检索命中
- 文档已 `status=active`
- 文档 metadata 与过滤字段一致（值完全一致）
- 文档内容按行分块且可检索
-
---
-
-## 8. 验收用例（建议）
-
-### 用例1：初一家长咨询
-输入：`孩子刚上初一，成绩上不去怎么办？`
-期望：
-1. 命中课程咨询意图
-2. 进入 flow（或rag+flow）
-3. 检索优先命中 `grade=初一` 且 `scene=pain_point` 文档
-
-### 用例2：仅学科问题
-输入：`语文阅读理解总是丢分`
-期望：
-1. 命中学科提升相关意图
-2. 检索命中 `subject=语文`、`grade=all` 或当前年级内容
-
-### 用例3：无召回兜底
-输入：冷门问题且无相关文档
-期望：
-1. 返回 fallback 话术
-2. 记录无召回原因（便于补文档）
-
---
-
-## 9. 常见问题与处理
-
-1. **有文档但检索不到**
- 检查 metadata 值是否一致（如“初一” vs “七年级”）
- 检查字段是否可过滤、状态是否 active
-
-2. **命中不准，答非所问**
- 缩小意图规则关键词
- 增加 metadata 过滤条件
- 拆分长行文本为更原子知识点
-
-3. **是否要新增知识库**
- 先不拆库，优先文档+metadata
- 仅当串库严重/权限隔离/规模过大再拆
-
---
-
-## 10. 运营维护建议
-
- 每周复盘：
-  - 高频命中问题
-  - 无召回问题
-  - 错召回问题
- 按复盘结果更新：
-  - 意图规则关键词与优先级
-  - 文档内容与metadata
-  - 流程约束与fallback
-
-目标：持续提高“命中率、相关性、可用率”。
--- a/spec/intent-driven-script/solution-java-channel-side.md
+++ b/spec/intent-driven-script/solution-java-channel-side.md
@ -1,200 +0,0 @@
-# AI客服智能体增强改造方案（渠道侧 / Java）
-
-## 1. 改造定位
-
-Java 渠道侧承担“用户体验与会话控制中枢”角色，核心职责：
- 分段消息发送（拟人化节奏）
- 打断处理（实时中断、无重复发送）
- 本地会话历史缓存（已送达真相）
- 全量消息异步上报（含人工阶段）
-
-目标：让智能体能力在前端体验上可感知且可控。
-
---
-
-## 2. 渠道侧总体设计
-
-## 2.1 新增组件
-1. `SegmentDispatcher`（消息调度器）
-2. `InterruptManager`（打断管理器）
-3. `DeliveredHistoryStore`（已送达历史存储）
-4. `MessageReporter`（全量消息上报器）
-5. `SessionModeManager`（机器人/人工状态管理）
-
-## 2.2 会话状态机
- `BOT_ACTIVE`：机器人对话中
- `BOT_SENDING`：机器人分段发送中
- `INTERRUPTED`：发送被打断，等待新请求
- `HUMAN_ACTIVE`：人工接管
-
---
-
-## 3. 核心流程设计
-
-## 3.1 正常流程（无打断）
-1. 用户消息进入渠道侧
-2. 组装请求（`session_id + user_message + delivered_history`）调用中台
-3. 收到 `segments[]`
-4. `SegmentDispatcher` 按 `delay_after` 顺序发送
-5. 每发送成功一段，写入 `DeliveredHistoryStore`
-6. 异步上报每条消息（用户/机器人）
-
-## 3.2 打断流程
-1. 发送中收到新用户消息
-2. `InterruptManager` 立即取消未发送段
-3. 仅保留“已送达段”作为历史
-4. 发起新请求，附带 `interrupted_segments`（可选）
-5. 中台重新决策后返回新 `segments[]`
-
-## 3.3 转人工流程
-1. 渠道侧调用中台状态接口标记 `HUMAN_ACTIVE`
-2. 后续消息不再调用智能体
-3. 用户与人工消息继续异步上报中台（保证闭环数据）
-
---
-
-## 4. 关键模块说明
-
-## 4.1 SegmentDispatcher
-
-职责：
- 顺序发送片段
- 尊重 `delay_after`
- 支持取消令牌（中断）
-
-建议实现点：
- 每个 session 单独队列
- 每段发送前检查会话是否仍为 `BOT_SENDING`
- 失败重试最多 1 次（避免重复刷屏）
-
-## 4.2 InterruptManager
-
-职责：
- 监听用户新消息事件
- 取消当前发送任务
- 记录中断点（最后已送达 segment_id）
-
-建议规则：
- 中断优先级高于发送
- 丢弃未送达段，不做补发
-
-## 4.3 DeliveredHistoryStore（Redis）
-
-建议键：`chat:{sessionId}:delivered`
-
-建议字段：
- role（user/assistant/human）
- content
- timestamp
- segment_id（assistant可选）
- source（bot/human）
-
-只存“已送达”消息，避免上下文污染。
-
-## 4.4 MessageReporter
-
-职责：
- 异步上报全量消息到中台历史服务
- 支持重试与死信队列
-
-必须上报：
- 用户消息
- 机器人每个已送达段
- 人工客服消息
- 模式切换事件（bot->human/human->bot）
-
---
-
-## 5. 请求与响应协议建议
-
-## 5.1 渠道 -> 中台请求
-```json
-{
-  "session_id": "sess_123",
-  "user_message": "孩子五年级，数学不好",
-  "history": [
-    {"role": "user", "content": "你好"},
-    {"role": "assistant", "content": "您好"}
-  ],
-  "interrupted_segments": [
-    {"segment_id": "seg_2", "content": "稍等，我查一下"}
-  ]
-}
-```
-
-## 5.2 中台 -> 渠道响应
-```json
-{
-  "segments": [
-    {"segment_id": "seg_3", "text": "好的，请问孩子具体是哪个知识点薄弱？", "delay_after": 800},
-    {"segment_id": "seg_4", "text": "比如分数应用题还是几何？", "delay_after": 0}
-  ]
-}
-```
-
---
-
-## 6. 与中台协同的关键约束
-
-1. 渠道侧只认 `segments[]`，不关心中台内部是 agent 还是 flow。
-2. 必须使用“已送达历史”回传，不能用“计划发送历史”。
-3. 打断后立即重发，不等待旧请求自然结束。
-4. 人工模式下继续上报，保证中台记忆与分析完整。
-
---
-
-## 7. 可观测性指标（渠道侧）
-
-建议上报指标：
- `segment_send_latency_ms`
- `segment_drop_count`（中断丢弃）
- `interrupt_count`
- `resend_request_count`
- `bot_to_human_switch_count`
- `history_report_success_rate`
-
-日志关键字段：
- session_id
- request_id
- segment_id
- mode
- interrupted_at
-
---
-
-## 8. 分阶段实施建议（渠道侧）
-
-## Phase 1
- 实现分段发送 + 本地已送达历史
- 接入中台 `segments` 协议
-
-## Phase 2
- 打断管理上线（实时取消 + 重新请求）
- 全量消息上报链路上线
-
-## Phase 3
- 人工接管状态机完善
- 失败重试、幂等、死信治理完善
-
---
-
-## 9. 风险与应对
-
-1. 重复发送风险
- 通过 `segment_id + session_id` 做幂等去重
-
-2. 中断竞态（旧包晚到）
- 用 `request_id` / `generation_id` 校验，只消费最新响应
-
-3. 历史错乱
- 仅以“已送达消息”入历史，不以“待发送消息”入历史
-
-4. 上报积压
- 异步队列 + 批量上报 + 失败重试
-
---
-
-## 10. 最终建议（渠道侧）
-
- 渠道侧的首要目标不是“更聪明”，而是“更稳定 + 更像人 + 可打断”。
- 只要分段、打断、历史上报三件事做稳，中台 Agent 的价值会显著放大。
--- a/spec/intent-driven-script/solution-python-mid-platform.md
+++ b/spec/intent-driven-script/solution-python-mid-platform.md
@ -1,201 +0,0 @@
-# AI客服智能体增强改造方案（仅中台侧）
-
-## 1. 改造定位
-
-中台侧承担“智能决策与会话编排核心”角色，负责在不改变渠道发送机制的前提下，输出可被渠道稳定消费的分段回复与可观测信息。
-
-本次文档范围**仅覆盖中台侧**，并以渠道侧协同接口为边界。
-
-### 核心目标
- 统一中台对外响应为 `segments[]` 协议，支撑渠道侧分段发送与打断重入。
- 构建“Agent 主链路 + Micro-flow 护栏兜底”的双轨决策。
- 仅基于“已送达历史”做上下文推理，避免上下文污染。
- 完整沉淀可观测字段，支持回放、归因与灰度治理。
-
-### 非目标（Out of Scope）
- 不定义渠道侧发送节奏实现细节（队列、重试、UI 展现）。
- 不新增渠道侧状态机实现要求（仅声明协同接口语义）。
- 不扩展 Python 代码级任务与语言实现细节。
-
---
-
-## 2. 中台侧能力边界
-
-## 2.1 中台负责
-1. 接收会话请求并执行策略路由（agent / micro_flow / fixed / transfer）。
-2. 组织工具调用（KB/Intent/Flow/Prompt/Memory）并生成结果。
-3. 返回分段回复 `segments[]` 与追踪信息 `trace`。
-4. 接收并落库全量消息上报（含人工阶段消息与模式切换事件）。
-5. 提供会话模式状态接口（机器人/人工）。
-
-## 2.2 中台不负责
-1. 渠道消息发送调度与本地打断取消逻辑。
-2. 渠道本地“已送达历史”缓存实现。
-3. 渠道端展示层节奏控制。
-
---
-
-## 3. 协同契约（与渠道侧）
-
-## 3.1 请求协议（渠道 -> 中台）
-```json
-{
-  "session_id": "sess_123",
-  "user_message": "孩子五年级，数学不好",
-  "history": [
-    {"role": "user", "content": "你好"},
-    {"role": "assistant", "content": "您好"}
-  ],
-  "interrupted_segments": [
-    {"segment_id": "seg_2", "content": "稍等，我查一下"}
-  ]
-}
-```
-
-约束：
- `history` 必须是**已送达历史**，不得包含未送达片段。
- `interrupted_segments` 为可选协同信息，中台可用于上下文修正与日志归因。
-
-## 3.2 响应协议（中台 -> 渠道）
-```json
-{
-  "segments": [
-    {"segment_id": "seg_3", "text": "好的，请问孩子具体是哪个知识点薄弱？", "delay_after": 800},
-    {"segment_id": "seg_4", "text": "比如分数应用题还是几何？", "delay_after": 0}
-  ],
-  "trace": {
-    "mode": "agent",
-    "intent": "course_consult",
-    "tools_used": ["get_knowledge_bases", "recall_memory"]
-  }
-}
-```
-
-约束：
- 中台必须保证 `segments[]` 可直接消费，不依赖渠道二次改写。
- `trace.mode` 必须返回，便于渠道侧与观测平台统一归因。
-
---
-
-## 4. 中台核心架构
-
-## 4.1 决策总线
- `policy_router`：根据意图、风险级别、置信度决定执行模式：
-  - `agent`（默认）
-  - `micro_flow`（高风险/强SOP）
-  - `fixed`（固定答复）
-  - `transfer`（转人工）
-
-## 4.2 Agent 执行层
- `Agent Orchestrator` 执行 ReAct 循环（思考-行动-观察）。
- 通过 `Tool Registry` 调用工具（含 MCP 扩展能力）。
- 输出统一分段结构，受输出护栏二次校验。
-
-## 4.3 微流程护栏层
-仅保留高风险场景的最小流程集：
- 退款/退费
- 投诉升级
- 隐私与敏感承诺
- 转人工
-
-## 4.4 记忆与元数据层
- 记忆读取：按 `user_id/session_id` 注入长期与短期上下文。
- 元数据检索：严格执行 `intent -> target_kbs -> metadata_filter -> vector_search -> rerank`。
- 标签规范遵循既有方法论（grade/subject/scene/flow_step/intent_type 等）。
-
---
-
-## 5. 中台执行时序
-
-## 5.1 正常对话
-1. 接收渠道请求（含已送达历史）。
-2. `policy_router` 决策模式。
-3. 进入 `agent` 或 `micro_flow`。
-4. 工具调用 + 护栏校验。
-5. 返回 `segments[] + trace`。
-6. 异步写入会话日志与观测事件。
-
-## 5.2 打断重入
-1. 渠道发送中断后发起新请求。
-2. 中台接收新的 `history` 与可选 `interrupted_segments`。
-3. 丢弃旧 generation 的输出影响，仅以最新请求生成结果。
-4. 返回新的 `segments[]`，保证语义连续。
-
-## 5.3 人工接管
-1. 渠道触发模式切换接口，标记 `HUMAN_ACTIVE`。
-2. 中台停用机器人主链路输出。
-3. 持续接收并记录人工消息上报。
-4. 必要时支持 `HUMAN_ACTIVE -> BOT_ACTIVE` 回切。
-
---
-
-## 6. 关键治理与门禁
-
-## 6.1 质量门禁
- 输出必须可分段消费（结构合法、字段完整）。
- 高风险场景必须命中 micro-flow，不允许 agent 直出。
- 低置信度或工具异常必须降级（fixed 或 transfer）。
-
-## 6.2 可观测性字段（必须）
- `mode`
- `intent_id/intent`
- `target_kbs`
- `applied_metadata_filters`
- `tool_calls`（参数摘要、耗时、状态）
- `fallback_reason_code`
- `guardrail_triggered`
- `session_id/request_id/generation_id`
-
-## 6.3 关键指标（建议）
- 响应时延 P50/P95
- 工具超时率
- fallback 触发率
- micro-flow 接管率
- 误转人工率
- 无召回率
-
---
-
-## 7. 分阶段落地（仅中台）
-
-## Phase 1：协议与主链路打通
- 固化 `segments[]` 对外响应协议。
- 打通 `policy_router + Agent Orchestrator` 最小闭环。
- 上线基础观测字段。
-
-## Phase 2：护栏与记忆增强
- 上线高风险 micro-flow 最小集。
- 接入记忆读写与 metadata 过滤检索全链路。
- 完善降级与回退策略。
-
-## Phase 3：稳定性与治理
- 完善 generation 级并发与乱序治理。
- 强化工具治理（超时、重试、熔断、错误映射）。
- 建立灰度策略与回滚开关。
-
---
-
-## 8. 主要风险与应对
-
-1. 打断竞态导致旧响应污染
- 应对：按 `generation_id/request_id` 只认最新请求结果。
-
-2. 工具调用抖动影响时延
- 应对：工具超时上限 + 快速降级 + 结果摘要缓存（可选）。
-
-3. 元数据质量不一致导致召回偏差
- 应对：统一 metadata 枚举、入库校验、周期巡检。
-
-4. 智能体自由度过高引发合规风险
- 应对：policy_router 前置约束 + 输出护栏 + micro-flow 强接管。
-
---
-
-## 9. 最终结论
-
-本方案坚持“仅中台侧改造”：
- 以统一分段协议承接渠道可打断发送能力。
- 以 Agent 主链路提升效果，以 Micro-flow 护栏保证可控。
- 以观测与治理机制保障可回放、可归因、可灰度演进。
-
-在不扩展渠道实现细节与不引入语言实现绑定的前提下，可支撑中台能力平滑升级。
--- a/spec/metadata-role-separation/design.md
+++ b/spec/metadata-role-separation/design.md
@ -1,615 +0,0 @@
-# 元数据职责分层优化 - 技术设计
-
-## 1. 系统架构
-
-### 1.1 整体架构
-
-```
-┌─────────────────────────────────────────────────────────────────────────┐
-│                           管理端 (ai-service-admin)                      │
-├─────────────────────────────────────────────────────────────────────────┤
-│  ┌─────────────────┐  ┌─────────────────┐  ┌─────────────────┐         │
-│  │ 元数据字段配置   │  │ 槽位定义配置     │  │ 按角色过滤视图   │         │
-│  │ (field_roles)   │  │ (SlotDefinition) │  │ (RoleFilter)    │         │
-│  └────────┬────────┘  └────────┬────────┘  └────────┬────────┘         │
-└───────────┼────────────────────┼────────────────────┼───────────────────┘
-            │                    │                    │
-            ▼                    ▼                    ▼
-┌─────────────────────────────────────────────────────────────────────────┐
-│                           后端服务 (ai-service)                          │
-├─────────────────────────────────────────────────────────────────────────┤
-│                                                                         │
-│  ┌─────────────────────────────────────────────────────────────────┐   │
-│  │                    API Layer                                     │   │
-│  │  ┌──────────────────────┐  ┌──────────────────────┐             │   │
-│  │  │ MetadataFieldAPI     │  │ SlotDefinitionAPI    │             │   │
-│  │  │ - CRUD               │  │ - CRUD               │             │   │
-│  │  │ - getByRole          │  │ - getByRole          │             │   │
-│  │  └──────────┬───────────┘  └──────────┬───────────┘             │   │
-│  └─────────────┼────────────────────────┼──────────────────────────┘   │
-│                │                        │                               │
-│  ┌─────────────┼────────────────────────┼──────────────────────────┐   │
-│  │             │      Service Layer     │                          │   │
-│  │  ┌──────────▼───────────┐  ┌────────▼──────────┐               │   │
-│  │  │ MetadataFieldService │  │ SlotDefinitionSvc │               │   │
-│  │  │ - create/update      │  │ - create/update   │               │   │
-│  │  │ - getByRole          │  │ - getByRole       │               │   │
-│  │  │ - validateRoles      │  │ - linkToField     │               │   │
-│  │  └──────────┬───────────┘  └────────┬──────────┘               │   │
-│  └─────────────┼────────────────────────┼──────────────────────────┘   │
-│                │                        │                               │
-│  ┌─────────────┼────────────────────────┼──────────────────────────┐   │
-│  │             │   Tool Integration     │                          │   │
-│  │  ┌──────────▼────────────────────────▼──────────┐              │   │
-│  │  │           RoleBasedFieldProvider              │              │   │
-│  │  │  - getFieldsByRole(role)                      │              │   │
-│  │  │  - getSlotDefinitionsByRole(role)             │              │   │
-│  │  └──────────────────────┬───────────────────────┘              │   │
-│  └─────────────────────────┼──────────────────────────────────────┘   │
-│                            │                                            │
-│  ┌─────────────────────────┼──────────────────────────────────────┐   │
-│  │                    Tool Consumers                               │   │
-│  │  ┌──────────────┐ ┌──────────────┐ ┌──────────────┐            │   │
-│  │  │kb_search_    │ │memory_recall │ │intent_hint/  │            │   │
-│  │  │dynamic       │ │              │ │high_risk_    │            │   │
-│  │  │[resource_    │ │[slot]        │ │check         │            │   │
-│  │  │filter]       │ │              │ │[routing_     │            │   │
-│  │  └──────────────┘ └──────────────┘ │signal]       │            │   │
-│  │                                    └──────────────┘            │   │
-│  │  ┌──────────────┐                                               │   │
-│  │  │template_     │                                               │   │
-│  │  │engine        │                                               │   │
-│  │  │[prompt_var]  │                                               │   │
-│  │  └──────────────┘                                               │   │
-│  └────────────────────────────────────────────────────────────────┘   │
-│                                                                         │
-└─────────────────────────────────────────────────────────────────────────┘
-            │                    │                    │
-            ▼                    ▼                    ▼
-┌─────────────────────────────────────────────────────────────────────────┐
-│                           数据层                                         │
-├─────────────────────────────────────────────────────────────────────────┤
-│  ┌─────────────────────┐  ┌─────────────────────┐                      │
-│  │ metadata_field_     │  │ slot_definitions    │                      │
-│  │ definitions         │  │                     │                      │
-│  │ - id                │  │ - id                │                      │
-│  │ - field_key         │  │ - slot_key          │                      │
-│  │ - field_roles[]     │  │ - type              │                      │
-│  │ - ...               │  │ - linked_field_id   │                      │
-│  └─────────────────────┘  └─────────────────────┘                      │
-│                                                                         │
-│  ┌─────────────────────┐  ┌─────────────────────┐                      │
-│  │ Redis Cache         │  │ PostgreSQL          │                      │
-│  │ - field_roles:by_tenant │ - 持久化存储        │                      │
-│  └─────────────────────┘  └─────────────────────┘                      │
-└─────────────────────────────────────────────────────────────────────────┘
-```
-
-### 1.2 模块依赖关系
-
-```
-metadata-role-separation
-    │
-    ├── metadata-governance (依赖)
-    │   └── 元数据字段定义基础能力
-    │
-    ├── intent-driven-mid-platform (依赖)
-    │   └── 中台运行时工具链
-    │
-    └── ai-service-admin (依赖)
-        └── 管理端配置界面
-```
-
---
-
-## 2. 数据模型设计
-
-### 2.1 MetadataFieldDefinition 扩展
-
-在现有 `metadata_field_definitions` 表基础上新增字段：
-
-```sql
-- 新增字段：field_roles
-ALTER TABLE metadata_field_definitions 
-ADD COLUMN field_roles JSONB DEFAULT '[]'::jsonb;
-
-- 创建 GIN 索引支持按角色查询
-CREATE INDEX idx_metadata_field_definitions_roles 
-ON metadata_field_definitions USING GIN (field_roles);
-
-- 注释
-COMMENT ON COLUMN metadata_field_definitions.field_roles IS '字段角色列表：resource_filter, slot, prompt_var, routing_signal';
-```
-
-**字段定义**：
-
-| 字段名 | 类型 | 必填 | 默认值 | 说明 |
-|-------|------|-----|-------|------|
-| `field_roles` | JSONB | 否 | `[]` | 字段角色列表，存储字符串数组 |
-
-**角色枚举值**：
-
-```python
-class FieldRole(str, Enum):
-    RESOURCE_FILTER = "resource_filter"  # 资源过滤
-    SLOT = "slot"                        # 运行时槽位
-    PROMPT_VAR = "prompt_var"            # 提示词变量
-    ROUTING_SIGNAL = "routing_signal"    # 路由信号
-```
-
-### 2.2 SlotDefinition 新增表
-
-```sql
-- 槽位定义表
-CREATE TABLE slot_definitions (
-    id UUID PRIMARY KEY DEFAULT gen_random_uuid(),
-    tenant_id UUID NOT NULL,
-    slot_key VARCHAR(100) NOT NULL,
-    type VARCHAR(20) NOT NULL,
-    required BOOLEAN NOT NULL DEFAULT FALSE,
-    extract_strategy VARCHAR(20),
-    validation_rule TEXT,
-    ask_back_prompt TEXT,
-    default_value JSONB,
-    linked_field_id UUID REFERENCES metadata_field_definitions(id),
-    created_at TIMESTAMPTZ NOT NULL DEFAULT NOW(),
-    updated_at TIMESTAMPTZ NOT NULL DEFAULT NOW(),
-    
-    CONSTRAINT uk_slot_definitions_tenant_key UNIQUE (tenant_id, slot_key),
-    CONSTRAINT chk_slot_definitions_type CHECK (type IN ('string', 'number', 'boolean', 'enum', 'array_enum')),
-    CONSTRAINT chk_slot_definitions_extract_strategy CHECK (extract_strategy IS NULL OR extract_strategy IN ('rule', 'llm', 'user_input'))
-);
-
-- 索引
-CREATE INDEX idx_slot_definitions_tenant ON slot_definitions(tenant_id);
-CREATE INDEX idx_slot_definitions_linked_field ON slot_definitions(linked_field_id);
-
-- 注释
-COMMENT ON TABLE slot_definitions IS '槽位定义表';
-COMMENT ON COLUMN slot_definitions.slot_key IS '槽位键名，可与元数据字段 field_key 关联';
-COMMENT ON COLUMN slot_definitions.linked_field_id IS '关联的元数据字段 ID';
-```
-
-### 2.3 ER 图
-
-```
-┌─────────────────────────────┐       ┌─────────────────────────────┐
-│  metadata_field_definitions │       │     slot_definitions        │
-├─────────────────────────────┤       ├─────────────────────────────┤
-│ id (PK)                     │◄──────│ linked_field_id (FK)        │
-│ tenant_id                   │       │ id (PK)                     │
-│ field_key                   │       │ tenant_id                   │
-│ label                       │       │ slot_key                    │
-│ type                        │       │ type                        │
-│ required                    │       │ required                    │
-│ options                     │       │ extract_strategy            │
-│ default_value               │       │ validation_rule             │
-│ scope                       │       │ ask_back_prompt             │
-│ is_filterable               │       │ default_value               │
-│ is_rank_feature             │       │ created_at                  │
-│ status                      │       │ updated_at                  │
-│ field_roles ◀── NEW         │       │                             │
-│ version                     │       │                             │
-│ created_at                  │       │                             │
-│ updated_at                  │       │                             │
-└─────────────────────────────┘       └─────────────────────────────┘
-```
-
---
-
-## 3. 核心流程设计
-
-### 3.1 字段职责配置流程
-
-```
-┌─────────────────────────────────────────────────────────────────────────┐
-│                        管理端配置流程                                    │
-├─────────────────────────────────────────────────────────────────────────┤
-│                                                                         │
-│  1. 管理员进入元数据字段配置页面                                          │
-│     │                                                                   │
-│     ▼                                                                   │
-│  2. 创建/编辑字段定义                                                    │
-│     │                                                                   │
-│     ├── 填写基本信息 (field_key, label, type, etc.)                     │
-│     │                                                                   │
-│     ├── 选择字段角色 (field_roles 多选)                                  │
-│     │   ├── [ ] resource_filter - 资源过滤                              │
-│     │   ├── [ ] slot - 运行时槽位                                       │
-│     │   ├── [ ] prompt_var - 提示词变量                                 │
-│     │   └── [ ] routing_signal - 路由信号                               │
-│     │                                                                   │
-│     ▼                                                                   │
-│  3. 后端校验                                                             │
-│     │                                                                   │
-│     ├── field_key 格式校验 (小写字母数字下划线)                           │
-│     ├── field_roles 枚举值校验                                          │
-│     └── 租户内唯一性校验                                                 │
-│     │                                                                   │
-│     ▼                                                                   │
-│  4. 保存到数据库                                                         │
-│     │                                                                   │
-│     └── field_roles 以 JSONB 格式存储                                   │
-│                                                                         │
-└─────────────────────────────────────────────────────────────────────────┘
-```
-
-### 3.2 按角色查询流程
-
-```
-┌─────────────────────────────────────────────────────────────────────────┐
-│                        按角色查询流程                                    │
-├─────────────────────────────────────────────────────────────────────────┤
-│                                                                         │
-│  工具/模块请求字段                                                       │
-│     │                                                                   │
-│     ▼                                                                   │
-│  RoleBasedFieldProvider.getFieldsByRole(role, tenant_id)               │
-│     │                                                                   │
-│     ▼                                                                   │
-│  ┌─────────────────────────────────────────────────────────────────┐   │
-│  │  SELECT * FROM metadata_field_definitions                        │   │
-│  │  WHERE tenant_id = :tenant_id                                    │   │
-│  │    AND status = 'active'                                         │   │
-│  │    AND field_roles ? :role  -- JSONB contains 查询               │   │
-│  └─────────────────────────────────────────────────────────────────┘   │
-│     │                                                                   │
-│     ▼                                                                   │
-│  返回字段定义列表                                                        │
-│     │                                                                   │
-│     ▼                                                                   │
-│  工具按需消费字段                                                        │
-│                                                                         │
-└─────────────────────────────────────────────────────────────────────────┘
-```
-
-### 3.3 工具协同改造流程
-
-```
-┌─────────────────────────────────────────────────────────────────────────┐
-│                     工具协同改造流程                                     │
-├─────────────────────────────────────────────────────────────────────────┤
-│                                                                         │
-│  ┌──────────────────────────────────────────────────────────────────┐  │
-│  │  kb_search_dynamic [AC-MRS-11]                                   │  │
-│  │                                                                  │  │
-│  │  改造前:                                                          │  │
-│  │    filterable_fields = get_filterable_fields(tenant_id)          │  │
-│  │                       WHERE is_filterable = true                 │  │
-│  │                                                                  │  │
-│  │  改造后:                                                          │  │
-│  │    filterable_fields = get_fields_by_role(                       │  │
-│  │        tenant_id, role='resource_filter'                         │  │
-│  │    )                                                             │  │
-│  └──────────────────────────────────────────────────────────────────┘  │
-│                                                                         │
-│  ┌──────────────────────────────────────────────────────────────────┐  │
-│  │  memory_recall [AC-MRS-12]                                       │  │
-│  │                                                                  │  │
-│  │  改造前:                                                          │  │
-│  │    slots = context.get('slots', {})  # 无角色过滤                │  │
-│  │                                                                  │  │
-│  │  改造后:                                                          │  │
-│  │    slot_fields = get_fields_by_role(                             │  │
-│  │        tenant_id, role='slot'                                    │  │
-│  │    )                                                             │  │
-│  │    slots = {k: v for k, v in context.items()                     │  │
-│  │             if k in slot_fields}                                 │  │
-│  └──────────────────────────────────────────────────────────────────┘  │
-│                                                                         │
-│  ┌──────────────────────────────────────────────────────────────────┐  │
-│  │  intent_hint / high_risk_check [AC-MRS-13]                       │  │
-│  │                                                                  │  │
-│  │  改造前:                                                          │  │
-│  │    routing_fields = all_metadata_fields  # 全量字段              │  │
-│  │                                                                  │  │
-│  │  改造后:                                                          │  │
-│  │    routing_fields = get_fields_by_role(                          │  │
-│  │        tenant_id, role='routing_signal'                          │  │
-│  │    )                                                             │  │
-│  └──────────────────────────────────────────────────────────────────┘  │
-│                                                                         │
-│  ┌──────────────────────────────────────────────────────────────────┐  │
-│  │  template_engine [AC-MRS-14]                                     │  │
-│  │                                                                  │  │
-│  │  改造前:                                                          │  │
-│  │    variables = extract_all_variables(template)                   │  │
-│  │    values = get_all_metadata_values(context)                     │  │
-│  │                                                                  │  │
-│  │  改造后:                                                          │  │
-│  │    prompt_var_fields = get_fields_by_role(                       │  │
-│  │        tenant_id, role='prompt_var'                              │  │
-│  │    )                                                             │  │
-│  │    values = {k: v for k, v in context.items()                    │  │
-│  │              if k in prompt_var_fields}                          │  │
-│  └──────────────────────────────────────────────────────────────────┘  │
-│                                                                         │
-└─────────────────────────────────────────────────────────────────────────┘
-```
-
---
-
-## 4. 接口设计
-
-### 4.1 后端 API 接口
-
-| 接口 | 方法 | 说明 | AC |
-|------|------|------|-----|
-| `/admin/metadata-schemas` | GET | 获取字段列表（支持 role 过滤） | AC-MRS-06 |
-| `/admin/metadata-schemas` | POST | 创建字段（含 field_roles） | AC-MRS-01,02,03 |
-| `/admin/metadata-schemas/{id}` | PUT | 更新字段（含 field_roles） | AC-MRS-01 |
-| `/admin/metadata-schemas/{id}` | DELETE | 删除字段（无需兼容） | AC-MRS-16 |
-| `/admin/metadata-schemas/by-role` | GET | 按角色查询字段 | AC-MRS-04,05 |
-| `/admin/slot-definitions` | GET | 获取槽位定义列表 | - |
-| `/admin/slot-definitions` | POST | 创建槽位定义 | AC-MRS-07,08 |
-| `/admin/slot-definitions/{id}` | PUT | 更新槽位定义 | - |
-| `/admin/slot-definitions/{id}` | DELETE | 删除槽位定义 | AC-MRS-16 |
-| `/mid/slots/by-role` | GET | 运行时按角色获取槽位 | AC-MRS-10 |
-| `/mid/slots/{slot_key}` | GET | 获取运行时槽位值 | AC-MRS-09 |
-
-### 4.2 内部服务接口
-
-```python
-class RoleBasedFieldProvider:
-    """基于角色的字段提供者"""
-    
-    async def get_fields_by_role(
-        self,
-        tenant_id: str,
-        role: FieldRole,
-        include_deprecated: bool = False,
-    ) -> list[MetadataFieldDefinition]:
-        """
-        按角色获取字段定义
-        
-        Args:
-            tenant_id: 租户ID
-            role: 字段角色
-            include_deprecated: 是否包含已废弃字段
-            
-        Returns:
-            字段定义列表
-        """
-        
-    async def get_slot_definitions_by_role(
-        self,
-        tenant_id: str,
-        role: FieldRole,
-    ) -> list[SlotDefinition]:
-        """
-        按角色获取槽位定义（包含关联字段信息）
-        """
-```
-
---
-
-## 5. 前端设计
-
-### 5.1 元数据字段配置页面改造
-
-**新增组件**：`FieldRolesSelector.vue`
-
-```vue
-<template>
-  <div class="field-roles-selector">
-    <label>字段角色</label>
-    <el-checkbox-group v-model="selectedRoles">
-      <el-checkbox 
-        v-for="role in availableRoles" 
-        :key="role.value"
-        :label="role.value"
-      >
-        <span>{{ role.label }}</span>
-        <el-tooltip :content="role.description">
-          <el-icon><QuestionFilled /></el-icon>
-        </el-tooltip>
-      </el-checkbox>
-    </el-checkbox-group>
-  </div>
-</template>
-
-<script setup>
-const availableRoles = [
-  { 
-    value: 'resource_filter', 
-    label: '资源过滤', 
-    description: '用于 KB 文档检索时的元数据过滤' 
-  },
-  { 
-    value: 'slot', 
-    label: '运行时槽位', 
-    description: '对话流程中的结构化槽位，用于信息收集' 
-  },
-  { 
-    value: 'prompt_var', 
-    label: '提示词变量', 
-    description: '注入到 LLM Prompt 中的变量' 
-  },
-  { 
-    value: 'routing_signal', 
-    label: '路由信号', 
-    description: '用于意图路由和风险判断的信号' 
-  },
-];
-</script>
-```
-
-### 5.2 按角色过滤视图
-
-**新增过滤器**：在元数据字段列表页面增加角色过滤下拉框
-
-```vue
-<template>
-  <div class="filter-bar">
-    <el-select 
-      v-model="selectedRole" 
-      placeholder="按角色过滤"
-      clearable
-      @change="handleRoleFilter"
-    >
-      <el-option 
-        v-for="role in availableRoles" 
-        :key="role.value"
-        :label="role.label"
-        :value="role.value"
-      />
-    </el-select>
-  </div>
-</template>
-```
-
---
-
-## 6. 缓存策略
-
-### 6.1 缓存设计
-
-```python
-class FieldRoleCache:
-    """字段角色缓存"""
-    
-    CACHE_KEY_PREFIX = "field_roles"
-    CACHE_TTL = 300  # 5分钟
-    
-    async def get_fields_by_role(
-        self,
-        tenant_id: str,
-        role: FieldRole,
-    ) -> list[MetadataFieldDefinition]:
-        cache_key = f"{self.CACHE_KEY_PREFIX}:{tenant_id}:{role}"
-        
-        # 尝试从缓存获取
-        cached = await self._redis.get(cache_key)
-        if cached:
-            return self._deserialize(cached)
-        
-        # 从数据库查询
-        fields = await self._db_query(tenant_id, role)
-        
-        # 写入缓存
-        await self._redis.setex(
-            cache_key,
-            self.CACHE_TTL,
-            self._serialize(fields),
-        )
-        
-        return fields
-    
-    async def invalidate(self, tenant_id: str):
-        """失效租户所有角色缓存"""
-        pattern = f"{self.CACHE_KEY_PREFIX}:{tenant_id}:*"
-        keys = await self._redis.keys(pattern)
-        if keys:
-            await self._redis.delete(*keys)
-```
-
-### 6.2 缓存失效策略
-
- 字段创建/更新/删除时失效该租户所有角色缓存
- 槽位定义变更时失效相关角色缓存
-
---
-
-## 7. 异常处理
-
-### 7.1 错误码定义
-
-| 错误码 | 说明 | HTTP 状态码 |
-|-------|------|------------|
-| `INVALID_ROLE` | 无效的角色参数 | 400 |
-| `FIELD_KEY_EXISTS` | field_key 已存在 | 409 |
-| `SLOT_KEY_EXISTS` | slot_key 已存在 | 409 |
-| `LINKED_FIELD_NOT_FOUND` | 关联字段不存在 | 404 |
-
-### 7.2 异常处理示例
-
-```python
-class InvalidRoleError(Exception):
-    """无效角色异常"""
-    def __init__(self, role: str):
-        self.role = role
-        self.valid_roles = [r.value for r in FieldRole]
-        super().__init__(
-            f"Invalid role '{role}'. Valid roles are: {', '.join(self.valid_roles)}"
-        )
-```
-
---
-
-## 8. 测试策略
-
-### 8.1 单元测试
-
- `RoleBasedFieldProvider` 按角色查询测试
- `FieldRoleCache` 缓存读写测试
- 字段角色校验测试
-
-### 8.2 集成测试
-
- API 端点测试（CRUD + 按角色查询）
- 工具协同改造测试（验证各工具只消费对应角色字段）
-
-### 8.3 契约测试
-
- OpenAPI Schema 校验
- 响应结构验证
-
---
-
-## 9. 迁移与部署
-
-### 9.1 数据库迁移
-
-```sql
-- 1. 新增 field_roles 字段
-ALTER TABLE metadata_field_definitions 
-ADD COLUMN field_roles JSONB DEFAULT '[]'::jsonb;
-
-- 2. 创建索引
-CREATE INDEX idx_metadata_field_definitions_roles 
-ON metadata_field_definitions USING GIN (field_roles);
-
-- 3. 创建 slot_definitions 表
-CREATE TABLE slot_definitions (
-    -- ... 见 2.2 节
-);
-
-- 4. 初始化现有字段的默认角色（可选）
-- 根据 is_filterable 推断 resource_filter 角色
-UPDATE metadata_field_definitions 
-SET field_roles = '["resource_filter"]'::jsonb
-WHERE is_filterable = true AND status = 'active';
-```
-
-### 9.2 部署顺序
-
-1. 执行数据库迁移脚本
-2. 部署后端服务（向后兼容，field_roles 可为空）
-3. 部署前端页面
-4. 配置字段角色
-
---
-
-## 10. 监控与观测
-
-### 10.1 关键指标
-
-| 指标名 | 说明 |
-|-------|------|
-| `field_role_query_count` | 按角色查询次数 |
-| `field_role_query_latency` | 按角色查询延迟 |
-| `field_role_cache_hit_rate` | 角色缓存命中率 |
-| `tool_field_consumption` | 工具字段消费统计 |
-
-### 10.2 日志字段
-
-```json
-{
-  "event": "field_role_query",
-  "tenant_id": "xxx",
-  "role": "resource_filter",
-  "field_count": 5,
-  "duration_ms": 12,
-  "cache_hit": true
-}
-```
--- a/spec/metadata-role-separation/openapi.deps.yaml
+++ b/spec/metadata-role-separation/openapi.deps.yaml
@ -1,272 +0,0 @@
-openapi: 3.0.3
-info:
-  title: 元数据职责分层模块 - 外部依赖
-  description: |
-    本模块依赖的外部 API 契约，用于生成 Mock/SDK。
-    
-    ## 依赖说明
-    - metadata-governance: 元数据字段定义基础能力
-    - intent-driven-mid-platform: 中台运行时工具链
-  version: 0.1.0
-  x-contract-level: L1
-  contact:
-    name: AI Robot Core Team
-
-servers:
-  - url: /api
-    description: API Server
-
-tags:
-  - name: MetadataGovernance
-    description: 元数据治理模块依赖
-  - name: MidPlatform
-    description: 中台模块依赖
-
-paths:
-  /admin/metadata-schemas:
-    get:
-      summary: 获取元数据字段定义列表（依赖）
-      description: 依赖 metadata-governance 模块的元数据字段列表查询能力
-      operationId: depsListMetadataSchemas
-      tags:
-        - MetadataGovernance
-      parameters:
-        - name: status
-          in: query
-          schema:
-            type: string
-            enum:
-              - draft
-              - active
-              - deprecated
-        - name: scope
-          in: query
-          schema:
-            type: string
-            enum:
-              - kb_document
-              - intent_rule
-              - script_flow
-              - prompt_template
-        - name: tenant_id
-          in: query
-          required: true
-          schema:
-            type: string
-            format: uuid
-      responses:
-        '200':
-          description: 成功返回字段定义列表
-          content:
-            application/json:
-              schema:
-                type: array
-                items:
-                  $ref: '#/components/schemas/MetadataFieldDefinition'
-
-  /admin/kb/documents:
-    get:
-      summary: 获取 KB 文档列表（依赖）
-      description: 依赖 KB 文档管理能力，用于验证字段使用情况
-      operationId: depsListKbDocuments
-      tags:
-        - MetadataGovernance
-      parameters:
-        - name: tenant_id
-          in: query
-          required: true
-          schema:
-            type: string
-            format: uuid
-        - name: kb_id
-          in: query
-          schema:
-            type: string
-            format: uuid
-      responses:
-        '200':
-          description: 成功返回文档列表
-          content:
-            application/json:
-              schema:
-                type: array
-                items:
-                  $ref: '#/components/schemas/KbDocument'
-
-  /mid/dialogue/respond:
-    post:
-      summary: 中台对话响应（依赖）
-      description: 依赖中台对话响应能力，用于工具协同改造验证
-      operationId: depsRespondDialogue
-      tags:
-        - MidPlatform
-      requestBody:
-        required: true
-        content:
-          application/json:
-            schema:
-              $ref: '#/components/schemas/DialogueRequest'
-      responses:
-        '200':
-          description: 成功返回对话响应
-          content:
-            application/json:
-              schema:
-                $ref: '#/components/schemas/DialogueResponse'
-
-components:
-  schemas:
-    MetadataFieldStatus:
-      type: string
-      enum:
-        - draft
-        - active
-        - deprecated
-
-    MetadataScope:
-      type: string
-      enum:
-        - kb_document
-        - intent_rule
-        - script_flow
-        - prompt_template
-
-    MetadataFieldType:
-      type: string
-      enum:
-        - string
-        - number
-        - boolean
-        - enum
-        - array_enum
-
-    MetadataFieldDefinition:
-      type: object
-      properties:
-        id:
-          type: string
-          format: uuid
-        tenant_id:
-          type: string
-          format: uuid
-        field_key:
-          type: string
-        label:
-          type: string
-        type:
-          $ref: '#/components/schemas/MetadataFieldType'
-        required:
-          type: boolean
-        options:
-          type: array
-          items:
-            type: string
-        default_value:
-          type: object
-        scope:
-          type: array
-          items:
-            $ref: '#/components/schemas/MetadataScope'
-        is_filterable:
-          type: boolean
-        is_rank_feature:
-          type: boolean
-        status:
-          $ref: '#/components/schemas/MetadataFieldStatus'
-        version:
-          type: integer
-        created_at:
-          type: string
-          format: date-time
-        updated_at:
-          type: string
-          format: date-time
-
-    KbDocument:
-      type: object
-      properties:
-        id:
-          type: string
-          format: uuid
-        tenant_id:
-          type: string
-          format: uuid
-        kb_id:
-          type: string
-          format: uuid
-        title:
-          type: string
-        content:
-          type: string
-        metadata:
-          type: object
-          additionalProperties: true
-        created_at:
-          type: string
-          format: date-time
-        updated_at:
-          type: string
-          format: date-time
-
-    DialogueRequest:
-      type: object
-      required:
-        - tenant_id
-        - user_id
-        - session_id
-        - user_message
-      properties:
-        tenant_id:
-          type: string
-          format: uuid
-        user_id:
-          type: string
-          format: uuid
-        session_id:
-          type: string
-          format: uuid
-        user_message:
-          type: string
-        context:
-          type: object
-          additionalProperties: true
-          description: 会话上下文，包含槽位信息
-
-    DialogueResponse:
-      type: object
-      properties:
-        session_id:
-          type: string
-          format: uuid
-        request_id:
-          type: string
-          format: uuid
-        segments:
-          type: array
-          items:
-            type: object
-            properties:
-              segment_id:
-                type: string
-              text:
-                type: string
-              delay_after:
-                type: integer
-        trace:
-          type: object
-          properties:
-            mode:
-              type: string
-              enum:
-                - agent
-                - micro_flow
-                - fixed
-                - transfer
-            intent:
-              type: string
-            tool_calls:
-              type: array
-              items:
-                type: object
-            fallback_reason_code:
-              type: string
--- a/spec/metadata-role-separation/openapi.provider.yaml
+++ b/spec/metadata-role-separation/openapi.provider.yaml
@ -1,774 +0,0 @@
-openapi: 3.0.3
-info:
-  title: 元数据职责分层 API
-  description: |
-    提供元数据字段职责分层配置、槽位定义管理及按角色查询能力。
-    
-    ## 字段角色定义
-    - `resource_filter`: 资源过滤角色，用于 KB 文档检索
-    - `slot`: 运行时槽位角色，用于对话信息收集
-    - `prompt_var`: 提示词变量角色，用于 Prompt 注入
-    - `routing_signal`: 路由信号角色，用于意图路由判断
-  version: 0.1.0
-  x-contract-level: L1
-  contact:
-    name: AI Robot Core Team
-
-servers:
-  - url: /api
-    description: API Server
-
-tags:
-  - name: MetadataSchema
-    description: 元数据字段定义管理（扩展 field_roles）
-  - name: SlotDefinition
-    description: 槽位定义管理
-  - name: RuntimeSlot
-    description: 运行时槽位查询
-
-paths:
-  /admin/metadata-schemas:
-    get:
-      summary: 获取元数据字段定义列表
-      description: |-
-        [AC-MRS-06] 支持按状态、范围、角色过滤查询元数据字段定义列表。
-      operationId: listMetadataSchemas
-      tags:
-        - MetadataSchema
-      parameters:
-        - name: status
-          in: query
-          description: 字段状态过滤
-          schema:
-            $ref: '#/components/schemas/MetadataFieldStatus'
-        - name: scope
-          in: query
-          description: 适用范围过滤
-          schema:
-            $ref: '#/components/schemas/MetadataScope'
-        - name: field_role
-          in: query
-          description: 字段角色过滤
-          schema:
-            $ref: '#/components/schemas/FieldRole'
-        - name: tenant_id
-          in: query
-          required: true
-          schema:
-            type: string
-            format: uuid
-      responses:
-        '200':
-          description: 成功返回字段定义列表
-          content:
-            application/json:
-              schema:
-                type: array
-                items:
-                  $ref: '#/components/schemas/MetadataFieldDefinition'
-        '400':
-          $ref: '#/components/responses/BadRequest'
-        '401':
-          $ref: '#/components/responses/Unauthorized'
-
-    post:
-      summary: 创建元数据字段定义
-      description: |-
-        [AC-MRS-01][AC-MRS-02][AC-MRS-03] 创建新的元数据字段定义，支持 field_roles 多选配置。
-      operationId: createMetadataSchema
-      tags:
-        - MetadataSchema
-      requestBody:
-        required: true
-        content:
-          application/json:
-            schema:
-              $ref: '#/components/schemas/MetadataFieldDefinitionCreate'
-      responses:
-        '201':
-          description: 创建成功
-          content:
-            application/json:
-              schema:
-                $ref: '#/components/schemas/MetadataFieldDefinition'
-        '400':
-          $ref: '#/components/responses/BadRequest'
-        '409':
-          description: field_key 已存在
-          content:
-            application/json:
-              schema:
-                $ref: '#/components/schemas/ErrorResponse'
-
-  /admin/metadata-schemas/{id}:
-    parameters:
-      - name: id
-        in: path
-        required: true
-        schema:
-          type: string
-          format: uuid
-
-    get:
-      summary: 获取单个元数据字段定义
-      operationId: getMetadataSchema
-      tags:
-        - MetadataSchema
-      responses:
-        '200':
-          description: 成功返回字段定义
-          content:
-            application/json:
-              schema:
-                $ref: '#/components/schemas/MetadataFieldDefinition'
-        '404':
-          $ref: '#/components/responses/NotFound'
-
-    put:
-      summary: 更新元数据字段定义
-      description: |-
-        [AC-MRS-01] 更新元数据字段定义，支持修改 field_roles。
-      operationId: updateMetadataSchema
-      tags:
-        - MetadataSchema
-      requestBody:
-        required: true
-        content:
-          application/json:
-            schema:
-              $ref: '#/components/schemas/MetadataFieldDefinitionUpdate'
-      responses:
-        '200':
-          description: 更新成功
-          content:
-            application/json:
-              schema:
-                $ref: '#/components/schemas/MetadataFieldDefinition'
-        '400':
-          $ref: '#/components/responses/BadRequest'
-        '404':
-          $ref: '#/components/responses/NotFound'
-
-    delete:
-      summary: 删除元数据字段定义
-      description: |-
-        [AC-MRS-16] 删除元数据字段定义，无需考虑历史数据兼容性。
-      operationId: deleteMetadataSchema
-      tags:
-        - MetadataSchema
-      responses:
-        '204':
-          description: 删除成功
-        '404':
-          $ref: '#/components/responses/NotFound'
-
-  /admin/metadata-schemas/by-role:
-    get:
-      summary: 按角色查询元数据字段定义
-      description: |-
-        [AC-MRS-04][AC-MRS-05] 按指定角色查询所有包含该角色的活跃字段定义。
-      operationId: getMetadataSchemasByRole
-      tags:
-        - MetadataSchema
-      parameters:
-        - name: role
-          in: query
-          required: true
-          description: 字段角色
-          schema:
-            $ref: '#/components/schemas/FieldRole'
-        - name: tenant_id
-          in: query
-          required: true
-          schema:
-            type: string
-            format: uuid
-        - name: include_deprecated
-          in: query
-          description: 是否包含已废弃字段
-          schema:
-            type: boolean
-            default: false
-      responses:
-        '200':
-          description: 成功返回字段定义列表
-          content:
-            application/json:
-              schema:
-                type: array
-                items:
-                  $ref: '#/components/schemas/MetadataFieldDefinition'
-        '400':
-          description: 无效的角色参数
-          content:
-            application/json:
-              schema:
-                $ref: '#/components/schemas/ErrorResponse'
-              example:
-                error_code: INVALID_ROLE
-                message: "Invalid role 'invalid_role'. Valid roles are: resource_filter, slot, prompt_var, routing_signal"
-
-  /admin/slot-definitions:
-    get:
-      summary: 获取槽位定义列表
-      operationId: listSlotDefinitions
-      tags:
-        - SlotDefinition
-      parameters:
-        - name: tenant_id
-          in: query
-          required: true
-          schema:
-            type: string
-            format: uuid
-        - name: required
-          in: query
-          description: 是否必填槽位过滤
-          schema:
-            type: boolean
-      responses:
-        '200':
-          description: 成功返回槽位定义列表
-          content:
-            application/json:
-              schema:
-                type: array
-                items:
-                  $ref: '#/components/schemas/SlotDefinition'
-
-    post:
-      summary: 创建槽位定义
-      description: |-
-        [AC-MRS-07][AC-MRS-08] 创建新的槽位定义，可关联已有元数据字段。
-      operationId: createSlotDefinition
-      tags:
-        - SlotDefinition
-      requestBody:
-        required: true
-        content:
-          application/json:
-            schema:
-              $ref: '#/components/schemas/SlotDefinitionCreate'
-      responses:
-        '201':
-          description: 创建成功
-          content:
-            application/json:
-              schema:
-                $ref: '#/components/schemas/SlotDefinition'
-        '400':
-          $ref: '#/components/responses/BadRequest'
-
-  /admin/slot-definitions/{id}:
-    parameters:
-      - name: id
-        in: path
-        required: true
-        schema:
-          type: string
-          format: uuid
-
-    get:
-      summary: 获取单个槽位定义
-      operationId: getSlotDefinition
-      tags:
-        - SlotDefinition
-      responses:
-        '200':
-          description: 成功返回槽位定义
-          content:
-            application/json:
-              schema:
-                $ref: '#/components/schemas/SlotDefinition'
-        '404':
-          $ref: '#/components/responses/NotFound'
-
-    put:
-      summary: 更新槽位定义
-      operationId: updateSlotDefinition
-      tags:
-        - SlotDefinition
-      requestBody:
-        required: true
-        content:
-          application/json:
-            schema:
-              $ref: '#/components/schemas/SlotDefinitionUpdate'
-      responses:
-        '200':
-          description: 更新成功
-          content:
-            application/json:
-              schema:
-                $ref: '#/components/schemas/SlotDefinition'
-        '400':
-          $ref: '#/components/responses/BadRequest'
-        '404':
-          $ref: '#/components/responses/NotFound'
-
-    delete:
-      summary: 删除槽位定义
-      description: |-
-        [AC-MRS-16] 删除槽位定义，无需考虑历史数据兼容性。
-      operationId: deleteSlotDefinition
-      tags:
-        - SlotDefinition
-      responses:
-        '204':
-          description: 删除成功
-        '404':
-          $ref: '#/components/responses/NotFound'
-
-  /mid/slots/by-role:
-    get:
-      summary: 运行时按角色获取槽位定义
-      description: |-
-        [AC-MRS-10] 运行时接口，按角色获取槽位定义及关联的元数据字段信息。
-      operationId: getSlotsByRole
-      tags:
-        - RuntimeSlot
-      parameters:
-        - name: role
-          in: query
-          required: true
-          schema:
-            $ref: '#/components/schemas/FieldRole'
-        - name: tenant_id
-          in: query
-          required: true
-          schema:
-            type: string
-            format: uuid
-      responses:
-        '200':
-          description: 成功返回槽位定义列表
-          content:
-            application/json:
-              schema:
-                type: array
-                items:
-                  $ref: '#/components/schemas/SlotDefinitionWithField'
-        '400':
-          $ref: '#/components/responses/BadRequest'
-
-  /mid/slots/{slot_key}:
-    parameters:
-      - name: slot_key
-        in: path
-        required: true
-        schema:
-          type: string
-
-    get:
-      summary: 获取运行时槽位值
-      description: |-
-        [AC-MRS-09] 获取指定槽位的运行时值，包含来源、置信度、更新时间。
-      operationId: getSlotValue
-      tags:
-        - RuntimeSlot
-      parameters:
-        - name: tenant_id
-          in: query
-          required: true
-          schema:
-            type: string
-            format: uuid
-        - name: user_id
-          in: query
-          required: true
-          schema:
-            type: string
-            format: uuid
-        - name: session_id
-          in: query
-          required: true
-          schema:
-            type: string
-            format: uuid
-      responses:
-        '200':
-          description: 成功返回槽位值
-          content:
-            application/json:
-              schema:
-                $ref: '#/components/schemas/RuntimeSlotValue'
-        '404':
-          description: 槽位不存在
-          content:
-            application/json:
-              schema:
-                $ref: '#/components/schemas/ErrorResponse'
-
-components:
-  schemas:
-    FieldRole:
-      type: string
-      enum:
-        - resource_filter
-        - slot
-        - prompt_var
-        - routing_signal
-      description: |
-        字段角色类型：
-        - resource_filter: 资源过滤角色
-        - slot: 运行时槽位角色
-        - prompt_var: 提示词变量角色
-        - routing_signal: 路由信号角色
-
-    MetadataFieldStatus:
-      type: string
-      enum:
-        - draft
-        - active
-        - deprecated
-
-    MetadataScope:
-      type: string
-      enum:
-        - kb_document
-        - intent_rule
-        - script_flow
-        - prompt_template
-
-    MetadataFieldType:
-      type: string
-      enum:
-        - string
-        - number
-        - boolean
-        - enum
-        - array_enum
-
-    MetadataFieldDefinition:
-      type: object
-      required:
-        - id
-        - tenant_id
-        - field_key
-        - label
-        - type
-        - status
-        - field_roles
-      properties:
-        id:
-          type: string
-          format: uuid
-        tenant_id:
-          type: string
-          format: uuid
-        field_key:
-          type: string
-          pattern: '^[a-z][a-z0-9_]*$'
-          description: 字段键名，仅允许小写字母数字下划线
-        label:
-          type: string
-          description: 显示名称
-        type:
-          $ref: '#/components/schemas/MetadataFieldType'
-        required:
-          type: boolean
-          default: false
-        options:
-          type: array
-          items:
-            type: string
-          description: 选项列表（enum/array_enum 类型必填）
-        default_value:
-          description: 默认值
-        scope:
-          type: array
-          items:
-            $ref: '#/components/schemas/MetadataScope'
-          description: 适用范围
-        is_filterable:
-          type: boolean
-          default: false
-        is_rank_feature:
-          type: boolean
-          default: false
-        field_roles:
-          type: array
-          items:
-            $ref: '#/components/schemas/FieldRole'
-          description: 字段角色列表
-        status:
-          $ref: '#/components/schemas/MetadataFieldStatus'
-        version:
-          type: integer
-        created_at:
-          type: string
-          format: date-time
-        updated_at:
-          type: string
-          format: date-time
-
-    MetadataFieldDefinitionCreate:
-      type: object
-      required:
-        - tenant_id
-        - field_key
-        - label
-        - type
-      properties:
-        tenant_id:
-          type: string
-          format: uuid
-        field_key:
-          type: string
-          pattern: '^[a-z][a-z0-9_]*$'
-        label:
-          type: string
-          minLength: 1
-          maxLength: 100
-        type:
-          $ref: '#/components/schemas/MetadataFieldType'
-        required:
-          type: boolean
-          default: false
-        options:
-          type: array
-          items:
-            type: string
-        default_value:
-          type: object
-        scope:
-          type: array
-          items:
-            $ref: '#/components/schemas/MetadataScope'
-        is_filterable:
-          type: boolean
-          default: false
-        is_rank_feature:
-          type: boolean
-          default: false
-        field_roles:
-          type: array
-          items:
-            $ref: '#/components/schemas/FieldRole'
-          description: 字段角色列表，可为空
-
-    MetadataFieldDefinitionUpdate:
-      type: object
-      properties:
-        label:
-          type: string
-          minLength: 1
-          maxLength: 100
-        required:
-          type: boolean
-        options:
-          type: array
-          items:
-            type: string
-        default_value:
-          type: object
-        scope:
-          type: array
-          items:
-            $ref: '#/components/schemas/MetadataScope'
-        is_filterable:
-          type: boolean
-        is_rank_feature:
-          type: boolean
-        field_roles:
-          type: array
-          items:
-            $ref: '#/components/schemas/FieldRole'
-        status:
-          $ref: '#/components/schemas/MetadataFieldStatus'
-
-    ExtractStrategy:
-      type: string
-      enum:
-        - rule
-        - llm
-        - user_input
-      description: |
-        槽位提取策略：
-        - rule: 规则提取
-        - llm: LLM 推断
-        - user_input: 用户输入
-
-    SlotSource:
-      type: string
-      enum:
-        - user_confirmed
-        - rule_extracted
-        - llm_inferred
-        - default
-
-    SlotDefinition:
-      type: object
-      required:
-        - id
-        - tenant_id
-        - slot_key
-        - type
-        - required
-      properties:
-        id:
-          type: string
-          format: uuid
-        tenant_id:
-          type: string
-          format: uuid
-        slot_key:
-          type: string
-          pattern: '^[a-z][a-z0-9_]*$'
-          description: 槽位键名
-        type:
-          $ref: '#/components/schemas/MetadataFieldType'
-        required:
-          type: boolean
-          description: 是否必填槽位
-        extract_strategy:
-          $ref: '#/components/schemas/ExtractStrategy'
-        validation_rule:
-          type: string
-          description: 校验规则（正则或 JSON Schema）
-        ask_back_prompt:
-          type: string
-          description: 追问提示语模板
-        default_value:
-          description: 默认值
-        linked_field_id:
-          type: string
-          format: uuid
-          description: 关联的元数据字段 ID
-        created_at:
-          type: string
-          format: date-time
-        updated_at:
-          type: string
-          format: date-time
-
-    SlotDefinitionCreate:
-      type: object
-      required:
-        - tenant_id
-        - slot_key
-        - type
-        - required
-      properties:
-        tenant_id:
-          type: string
-          format: uuid
-        slot_key:
-          type: string
-          pattern: '^[a-z][a-z0-9_]*$'
-        type:
-          $ref: '#/components/schemas/MetadataFieldType'
-        required:
-          type: boolean
-        extract_strategy:
-          $ref: '#/components/schemas/ExtractStrategy'
-        validation_rule:
-          type: string
-        ask_back_prompt:
-          type: string
-        default_value:
-          type: object
-        linked_field_id:
-          type: string
-          format: uuid
-
-    SlotDefinitionUpdate:
-      type: object
-      properties:
-        type:
-          $ref: '#/components/schemas/MetadataFieldType'
-        required:
-          type: boolean
-        extract_strategy:
-          $ref: '#/components/schemas/ExtractStrategy'
-        validation_rule:
-          type: string
-        ask_back_prompt:
-          type: string
-        default_value:
-          type: object
-        linked_field_id:
-          type: string
-          format: uuid
-
-    SlotDefinitionWithField:
-      type: object
-      description: 槽位定义与关联字段信息
-      allOf:
-        - $ref: '#/components/schemas/SlotDefinition'
-        - type: object
-          properties:
-            linked_field:
-              $ref: '#/components/schemas/MetadataFieldDefinition'
-
-    RuntimeSlotValue:
-      type: object
-      required:
-        - key
-        - value
-        - source
-        - confidence
-      properties:
-        key:
-          type: string
-          description: 槽位键名
-        value:
-          description: 槽位值
-        source:
-          $ref: '#/components/schemas/SlotSource'
-          description: 槽位来源
-        confidence:
-          type: number
-          format: float
-          minimum: 0.0
-          maximum: 1.0
-          description: 置信度
-        updated_at:
-          type: string
-          format: date-time
-          description: 最后更新时间
-
-    ErrorResponse:
-      type: object
-      required:
-        - error_code
-        - message
-      properties:
-        error_code:
-          type: string
-        message:
-          type: string
-        details:
-          type: object
-          additionalProperties: true
-
-  responses:
-    BadRequest:
-      description: 请求参数错误
-      content:
-        application/json:
-          schema:
-            $ref: '#/components/schemas/ErrorResponse'
-          example:
-            error_code: BAD_REQUEST
-            message: "Invalid request parameters"
-
-    Unauthorized:
-      description: 未授权
-      content:
-        application/json:
-          schema:
-            $ref: '#/components/schemas/ErrorResponse'
-          example:
-            error_code: UNAUTHORIZED
-            message: "Authentication required"
-
-    NotFound:
-      description: 资源不存在
-      content:
-        application/json:
-          schema:
-            $ref: '#/components/schemas/ErrorResponse'
-          example:
-            error_code: NOT_FOUND
-            message: "Resource not found"
--- a/spec/metadata-role-separation/requirements.md
+++ b/spec/metadata-role-separation/requirements.md
@ -1,156 +0,0 @@
---
-feature_id: "MRS"
-title: "元数据职责分层优化"
-status: "draft"
-version: "0.1.0"
-active_version: "0.1.0"
-version_history:
-  - version: "0.1.0"
-    ac_range: "AC-MRS-01~16"
-    description: "元数据字段职责分层、槽位模型独立化、工具协同改造与管理端配置能力"
-owners:
-  - "product"
-  - "backend"
-  - "frontend"
-last_updated: "2026-03-05"
-source:
-  type: "conversation"
-  ref: "元数据职责过载问题优化需求"
---
-
-# 元数据职责分层优化（MRS）
-
-## 1. 背景与目标
-
-### 1.1 背景
-
-当前系统中元数据字段承担了多种隐式职责，导致以下问题：
-
-1. **职责混淆**：同一字段（如 `grade`）同时用于资源过滤、运行时槽位、提示词变量和路由信号，但系统无法区分其具体用途
-2. **工具耦合**：`kb_search_dynamic`、`memory_recall`、`intent_hint`、`high_risk_check` 等工具全量消费元数据字段，无法按需筛选
-3. **槽位语义模糊**：槽位与元数据字段概念混淆，缺少独立的槽位定义模型
-4. **配置不可视**：管理端无法按职责视角查看和配置字段
-
-### 1.2 目标
-
- 为元数据字段引入显式的 `field_roles` 职责分层
- 建立独立的槽位定义模型，与元数据字段解耦但可复用
- 改造工具链按职责角色消费字段，实现精准消费
- 提供管理端按角色配置和查看的能力
-
-### 1.3 非目标（Out of Scope）
-
- 不处理历史数据迁移，允许删除重建配置
- 不新增元数据字段类型（保持 string/number/boolean/enum/array_enum）
- 不替换向量引擎或 LLM 供应商
- 不覆盖渠道端具体实现
-
-## 2. 模块边界（Scope）
-
- 覆盖：元数据字段职责分层、槽位定义模型、工具协同改造、管理端配置能力
- 不覆盖：历史数据迁移、向量引擎替换、模型切换、渠道端实现
-
-## 3. 依赖盘点（Dependencies）
-
- `metadata-governance` 模块：元数据字段定义基础能力
- `intent-driven-mid-platform` 模块：中台运行时工具链
- `ai-service-admin` 前端：管理端配置界面
-
-## 4. 用户故事（User Stories）
-
- [US-MRS-01] 作为系统架构师，我希望元数据字段有明确的职责角色标记，以便工具能按需消费。
- [US-MRS-02] 作为后端开发者，我希望通过接口按角色查询字段定义，以便精准获取所需字段。
- [US-MRS-03] 作为运营配置人员，我希望在管理端按角色过滤查看字段，以便快速定位配置。
- [US-MRS-04] 作为对话系统开发者，我希望槽位有独立的定义模型，以便管理运行时槽位语义。
- [US-MRS-05] 作为工具开发者，我希望 `kb_search_dynamic` 只消费资源过滤角色字段，以便避免无关字段干扰。
- [US-MRS-06] 作为工具开发者，我希望 `memory_recall` 只消费槽位角色字段，以便精准管理对话槽位。
- [US-MRS-07] 作为工具开发者，我希望 `intent_hint/high_risk_check` 只消费路由信号角色字段，以便精准路由决策。
- [US-MRS-08] 作为提示词工程师，我希望 prompt 渲染只消费提示词变量角色字段，以便控制注入范围。
-
-## 5. 验收标准（Acceptance Criteria, EARS）
-
-### 5.1 字段职责分层
-
- [AC-MRS-01] WHEN 管理员创建或编辑元数据字段 THEN 系统 SHALL 支持 `field_roles` 多选配置，可选值为 `resource_filter`、`slot`、`prompt_var`、`routing_signal`。
- [AC-MRS-02] WHEN 保存字段定义时 `field_roles` 为空 THEN 系统 SHALL 允许保存（默认无职责）。
- [AC-MRS-03] WHEN 字段定义包含多个 `field_roles` THEN 系统 SHALL 正确存储并返回所有角色。
-
-### 5.2 分层视图能力
-
- [AC-MRS-04] WHEN 调用 `GET /admin/metadata-schemas/by-role?role=resource_filter` THEN 系统 SHALL 返回所有 `field_roles` 包含 `resource_filter` 的活跃字段定义。
- [AC-MRS-05] WHEN 调用按角色查询接口且角色参数无效 THEN 系统 SHALL 返回 400 错误并提示有效角色列表。
- [AC-MRS-06] WHEN 管理端请求字段列表 THEN 系统 SHALL 支持按 `field_roles` 过滤展示。
-
-### 5.3 槽位模型
-
- [AC-MRS-07] WHEN 管理员创建槽位定义 THEN 系统 SHALL 支持 `slot_key`、`type`、`required`、`extract_strategy`、`validation_rule`、`ask_back_prompt` 属性配置。
- [AC-MRS-08] WHEN 槽位定义的 `slot_key` 与已有元数据字段 `field_key` 相同 THEN 系统 SHALL 允许创建并建立关联关系。
- [AC-MRS-09] WHEN 运行时读取槽位值 THEN 系统 SHALL 返回 `source`（来源）、`confidence`（置信度）、`updated_at`（更新时间）属性。
- [AC-MRS-10] WHEN 调用 `GET /mid/slots/by-role?role=slot` THEN 系统 SHALL 返回所有 `field_roles` 包含 `slot` 的字段定义及关联的槽位定义。
-
-### 5.4 工具协同改造
-
- [AC-MRS-11] WHEN `kb_search_dynamic` 构建过滤器 THEN 系统 SHALL 仅使用 `field_roles` 包含 `resource_filter` 的字段。
- [AC-MRS-12] WHEN `memory_recall` 召回槽位 THEN 系统 SHALL 仅使用 `field_roles` 包含 `slot` 的字段。
- [AC-MRS-13] WHEN `intent_hint` 或 `high_risk_check` 进行路由判断 THEN 系统 SHALL 仅使用 `field_roles` 包含 `routing_signal` 的字段。
- [AC-MRS-14] WHEN 模板引擎渲染 prompt THEN 系统 SHALL 仅使用 `field_roles` 包含 `prompt_var` 的字段。
-
-### 5.5 管理端可配置能力
-
- [AC-MRS-15] WHEN 管理员在元数据字段编辑界面 THEN 系统 SHALL 显示 `field_roles` 多选配置组件。
- [AC-MRS-16] WHEN 管理员删除字段或槽位定义 THEN 系统 SHALL 允许删除且无需考虑历史数据兼容性。
-
-## 6. 追踪映射（Traceability）
-
-| AC ID | Endpoint | 方法 | operationId | 备注 |
-|------|----------|------|-------------|------|
-| AC-MRS-01 | /admin/metadata-schemas | POST | createMetadataSchema | field_roles 配置 |
-| AC-MRS-01 | /admin/metadata-schemas/{id} | PUT | updateMetadataSchema | field_roles 配置 |
-| AC-MRS-02 | /admin/metadata-schemas | POST | createMetadataSchema | 空角色允许 |
-| AC-MRS-03 | /admin/metadata-schemas | POST | createMetadataSchema | 多角色存储 |
-| AC-MRS-04 | /admin/metadata-schemas/by-role | GET | getMetadataSchemasByRole | 按角色查询 |
-| AC-MRS-05 | /admin/metadata-schemas/by-role | GET | getMetadataSchemasByRole | 无效角色校验 |
-| AC-MRS-06 | /admin/metadata-schemas | GET | listMetadataSchemas | 按角色过滤 |
-| AC-MRS-07 | /admin/slot-definitions | POST | createSlotDefinition | 槽位定义创建 |
-| AC-MRS-08 | /admin/slot-definitions | POST | createSlotDefinition | 关联元数据字段 |
-| AC-MRS-09 | /mid/slots/{slot_key} | GET | getSlotValue | 运行时槽位值 |
-| AC-MRS-10 | /mid/slots/by-role | GET | getSlotsByRole | 按角色获取槽位 |
-| AC-MRS-11 | 内部调用 | - | kb_search_dynamic | resource_filter 消费 |
-| AC-MRS-12 | 内部调用 | - | memory_recall | slot 消费 |
-| AC-MRS-13 | 内部调用 | - | intent_hint/high_risk_check | routing_signal 消费 |
-| AC-MRS-14 | 内部调用 | - | template_engine | prompt_var 消费 |
-| AC-MRS-15 | 前端页面 | - | - | field_roles 配置组件 |
-| AC-MRS-16 | /admin/metadata-schemas/{id} | DELETE | deleteMetadataSchema | 删除无需兼容 |
-| AC-MRS-16 | /admin/slot-definitions/{id} | DELETE | deleteSlotDefinition | 删除无需兼容 |
-
-## 7. 字段角色定义
-
-| 角色标识 | 中文名称 | 用途说明 | 消费工具 |
-|---------|---------|---------|---------|
-| `resource_filter` | 资源过滤 | 用于 KB 文档检索时的元数据过滤 | `kb_search_dynamic` |
-| `slot` | 运行时槽位 | 对话流程中的结构化槽位，用于信息收集 | `memory_recall` |
-| `prompt_var` | 提示词变量 | 注入到 LLM Prompt 中的变量 | `template_engine` |
-| `routing_signal` | 路由信号 | 用于意图路由和风险判断的信号 | `intent_hint`, `high_risk_check` |
-
-## 8. 槽位定义属性
-
-| 属性 | 类型 | 必填 | 说明 |
-|-----|------|-----|------|
-| `slot_key` | string | 是 | 槽位键名，可与 field_key 关联 |
-| `type` | enum | 是 | 槽位类型：string/number/boolean/enum/array_enum |
-| `required` | boolean | 是 | 是否必填槽位 |
-| `extract_strategy` | enum | 否 | 提取策略：rule/llm/user_input |
-| `validation_rule` | string | 否 | 校验规则（正则或 JSON Schema） |
-| `ask_back_prompt` | string | 否 | 追问提示语模板 |
-| `default_value` | any | 否 | 默认值 |
-| `linked_field_id` | uuid | 否 | 关联的元数据字段 ID |
-
-## 9. 运行时槽位值属性
-
-| 属性 | 类型 | 说明 |
-|-----|------|------|
-| `key` | string | 槽位键名 |
-| `value` | any | 槽位值 |
-| `source` | enum | 来源：user_confirmed/rule_extracted/llm_inferred/default |
-| `confidence` | float | 置信度 0.0~1.0 |
-| `updated_at` | datetime | 最后更新时间 |
--- a/spec/metadata-role-separation/scope.md
+++ b/spec/metadata-role-separation/scope.md
@ -1,145 +0,0 @@
-# 元数据职责分层模块边界（Scope）
-
-## 1. 模块边界说明
-
-### 1.1 覆盖范围
-
-本模块聚焦于**元数据字段的职责分层与运行时消费解耦**，具体包括：
-
-1. **字段职责分层**
-   - 为元数据字段引入 `field_roles` 多选属性
-   - 支持四种职责角色：`resource_filter`、`slot`、`prompt_var`、`routing_signal`
-   - 单个字段可同时承担多种职责
-
-2. **分层视图能力**
-   - 后端提供按 role 查询字段的能力
-   - 工具与模块按 role 消费，不再全量混用
-
-3. **槽位模型增强**
-   - 引入独立的槽位定义模型（可复用元数据字段但有独立运行时语义）
-   - 支持 `slot_key/type/required/extract_strategy/validation_rule/ask_back_prompt`
-   - 运行时值包含 `source/confidence/updated_at`
-
-4. **工具协同改造**
-   - `kb_search_dynamic` 只消费 `resource_filter` 角色
-   - `memory_recall` 只消费 `slot` 角色
-   - `intent_hint/high_risk_check` 只消费 `routing_signal` 角色
-   - prompt 渲染只消费 `prompt_var` 角色
-
-5. **管理端可配置能力**
-   - 元数据字段编辑界面增加 `field_roles` 配置
-   - 提供"按 role 过滤查看"能力
-   - 允许删除重建配置（无需迁移兼容）
-
-### 1.2 不覆盖范围
-
- **历史数据迁移**：本迭代不负责历史数据的自动迁移，允许删除重建配置
- **向量引擎替换**：不涉及 Qdrant 或其他向量引擎的替换
- **LLM 模型切换**：不涉及模型供应商或模型选型的变更
- **渠道端实现**：不覆盖渠道侧 SegmentDispatcher/InterruptManager 等具体实现
- **元数据字段类型扩展**：不新增字段类型（string/number/boolean/enum/array_enum 保持不变）
-
---
-
-## 2. 依赖盘点
-
-### 2.1 内部依赖
-
-| 依赖模块 | 用途说明 | 接口 |
-|---------|---------|------|
-| `metadata-governance` | 元数据字段定义基础能力 | `/admin/metadata-schemas` |
-| `intent-driven-mid-platform` | 中台运行时工具链 | `kb_search_dynamic`, `memory_recall`, `intent_hint`, `high_risk_check` |
-| `ai-service-admin` | 管理端前端页面 | 元数据配置界面 |
-
-### 2.2 外部依赖
-
-| 依赖服务 | 用途说明 |
-|---------|---------|
-| PostgreSQL | 元数据字段定义、槽位定义存储 |
-| Redis | 运行时缓存 |
-
---
-
-## 3. 依赖接口清单
-
-### 3.1 本模块依赖的外部接口（Consumer）
-
-| 接口 | 来源模块 | 用途 |
-|------|---------|------|
-| `GET /admin/metadata-schemas` | metadata-governance | 获取元数据字段列表 |
-| `POST /admin/metadata-schemas` | metadata-governance | 创建元数据字段 |
-| `PUT /admin/metadata-schemas/{id}` | metadata-governance | 更新元数据字段 |
-| `DELETE /admin/metadata-schemas/{id}` | metadata-governance | 删除元数据字段 |
-
-### 3.2 本模块对外提供的接口（Provider）
-
-| 接口 | 用途 |
-|------|------|
-| `GET /admin/metadata-schemas/by-role` | 按 role 查询字段定义 |
-| `GET /admin/slot-definitions` | 获取槽位定义列表 |
-| `POST /admin/slot-definitions` | 创建槽位定义 |
-| `PUT /admin/slot-definitions/{id}` | 更新槽位定义 |
-| `DELETE /admin/slot-definitions/{id}` | 删除槽位定义 |
-| `GET /mid/slots/by-role` | 运行时按 role 获取槽位定义 |
-
---
-
-## 4. 数据模型边界
-
-### 4.1 新增模型
-
-| 模型名 | 说明 |
-|-------|------|
-| `SlotDefinition` | 槽位定义表（独立于 MetadataFieldDefinition） |
-
-### 4.2 扩展模型
-
-| 模型名 | 扩展字段 |
-|-------|---------|
-| `MetadataFieldDefinition` | 新增 `field_roles: list[str]` 字段 |
-
---
-
-## 5. 工具消费关系
-
-```
-┌─────────────────────────────────────────────────────────────────┐
-│                    元数据字段职责分层                            │
-├─────────────────────────────────────────────────────────────────┤
-│                                                                 │
-│  MetadataFieldDefinition                                        │
-│  └── field_roles: [resource_filter, slot, prompt_var,          │
-│                    routing_signal]                              │
-│                                                                 │
-└─────────────────────────────────────────────────────────────────┘
-                              │
-        ┌─────────────────────┼─────────────────────┐
-        ▼                     ▼                     ▼
-┌───────────────┐   ┌───────────────┐   ┌───────────────┐
-│resource_filter│   │     slot      │   │  prompt_var   │
-└───────┬───────┘   └───────┬───────┘   └───────┬───────┘
-        │                   │                   │
-        ▼                   ▼                   ▼
-┌───────────────┐   ┌───────────────┐   ┌───────────────┐
-│kb_search_     │   │memory_recall  │   │template_engine│
-│dynamic        │   │               │   │               │
-└───────────────┘   └───────────────┘   └───────────────┘
-
-┌─────────────────────────────────────────────────────────────────┐
-│  routing_signal                                                 │
-└───────┬─────────────────────────────────────────────────────────┘
-        │
-        ├───────────────┐
-        ▼               ▼
-┌───────────────┐ ┌───────────────┐
-│intent_hint    │ │high_risk_check│
-└───────────────┘ └───────────────┘
-```
-
---
-
-## 6. 版本与迭代
-
- 当前版本：`v0.1.0`
- AC 范围：`AC-MRS-01 ~ AC-MRS-16`
- 迭代策略：允许删除重建配置，不考虑历史数据迁移
--- a/spec/metadata-role-separation/tasks.md
+++ b/spec/metadata-role-separation/tasks.md
@ -1,317 +0,0 @@
-# 元数据职责分层优化 - 任务清单
-
-## 任务概览
-
-| 阶段 | 任务数 | 状态 |
-|------|-------|------|
-| Phase 1: 数据模型扩展 | 4 | ✅ 已完成 |
-| Phase 2: 后端服务实现 | 6 | ✅ 已完成 |
-| Phase 3: 工具协同改造 | 4 | ✅ 已完成 |
-| Phase 4: 前端页面改造 | 3 | ✅ 已完成 |
-| Phase 5: 测试与验收 | 3 | ⏳ 待开始 |
-
---
-
-## Phase 1: 数据模型扩展
-
-### Task 1.1: 扩展 MetadataFieldDefinition 模型
- [x] **状态**: ✅ 已完成
- **AC**: AC-MRS-01, AC-MRS-02, AC-MRS-03
- **描述**: 在现有 `MetadataFieldDefinition` 模型中新增 `field_roles` 字段
- **产出**:
-  - 修改 `ai-service/app/models/entities.py` 中的 `MetadataFieldDefinition` 类
-  - 新增 `FieldRole` 枚举类
- **验收标准**:
-  - `field_roles` 字段类型为 `list[str]`
-  - 支持存储多个角色
-  - 允许空列表
-
-### Task 1.2: 创建 SlotDefinition 模型
- [x] **状态**: ✅ 已完成
- **AC**: AC-MRS-07, AC-MRS-08
- **描述**: 创建独立的槽位定义模型
- **产出**:
-  - 在 `ai-service/app/models/entities.py` 中新增 `SlotDefinition` 类
-  - 新增 `ExtractStrategy` 枚举类
- **验收标准**:
-  - 包含所有必需字段：slot_key, type, required, extract_strategy, validation_rule, ask_back_prompt
-  - 支持 linked_field_id 关联元数据字段
-
-### Task 1.3: 编写数据库迁移脚本
- [x] **状态**: ✅ 已完成
- **AC**: AC-MRS-01, AC-MRS-07
- **描述**: 编写 PostgreSQL 迁移脚本
- **产出**:
-  - 创建 `ai-service/scripts/migrations/007_add_field_roles_and_slot_definitions.sql`
- **验收标准**:
-  - 为 `metadata_field_definitions` 表新增 `field_roles` 列
-  - 创建 `slot_definitions` 表
-  - 创建必要的索引
-
-### Task 1.4: 更新 Pydantic Schema
- [x] **状态**: ✅ 已完成
- **AC**: AC-MRS-01, AC-MRS-07
- **描述**: 更新请求/响应 Schema
- **产出**:
-  - 创建 `ai-service/app/schemas/metadata.py`
-  - 新增 `SlotDefinitionCreate/Update/Response` Schema
- **验收标准**:
-  - Schema 与 OpenAPI 契约一致
-  - 包含完整的字段校验规则
-
---
-
-## Phase 2: 后端服务实现
-
-### Task 2.1: 实现 RoleBasedFieldProvider 服务
- [x] **状态**: ✅ 已完成
- **AC**: AC-MRS-04, AC-MRS-05, AC-MRS-10
- **描述**: 实现按角色查询字段的核心服务
- **产出**:
-  - 创建 `ai-service/app/services/mid/role_based_field_provider.py`
- **验收标准**:
-  - `get_fields_by_role()` 方法正确查询指定角色的字段
-  - 无效角色返回 400 错误
-  - 支持缓存机制
-
-### Task 2.2: 扩展 MetadataFieldDefinitionService
- [x] **状态**: ✅ 已完成
- **AC**: AC-MRS-01, AC-MRS-02, AC-MRS-03, AC-MRS-06
- **描述**: 扩展现有服务支持 field_roles
- **产出**:
-  - 修改 `ai-service/app/services/metadata_field_definition_service.py`
- **验收标准**:
-  - 创建/更新时支持 field_roles 字段
-  - 支持按 role 过滤查询
-  - field_roles 校验正确
-
-### Task 2.3: 实现 SlotDefinitionService
- [x] **状态**: ✅ 已完成
- **AC**: AC-MRS-07, AC-MRS-08
- **描述**: 实现槽位定义管理服务
- **产出**:
-  - 创建 `ai-service/app/services/slot_definition_service.py`
- **验收标准**:
-  - CRUD 操作正确
-  - 支持关联元数据字段
-  - slot_key 租户内唯一
-
-### Task 2.4: 扩展 MetadataFieldDefinition API
- [x] **状态**: ✅ 已完成
- **AC**: AC-MRS-01, AC-MRS-04, AC-MRS-05, AC-MRS-06, AC-MRS-16
- **描述**: 扩展现有 API 端点
- **产出**:
-  - 修改 `ai-service/app/api/admin/metadata_field_definition.py`
-  - 新增 `/by-role` 端点
- **验收标准**:
-  - 所有端点符合 OpenAPI 契约
-  - 包含 AC 注释
-
-### Task 2.5: 实现 SlotDefinition API
- [x] **状态**: ✅ 已完成
- **AC**: AC-MRS-07, AC-MRS-08, AC-MRS-16
- **描述**: 实现槽位定义管理 API
- **产出**:
-  - 创建 `ai-service/app/api/admin/slot_definition.py`
- **验收标准**:
-  - CRUD 端点符合 OpenAPI 契约
-  - 包含 AC 注释
-
-### Task 2.6: 实现运行时槽位 API
- [x] **状态**: ✅ 已完成
- **AC**: AC-MRS-09, AC-MRS-10
- **描述**: 实现运行时槽位查询 API
- **产出**:
-  - 创建 `ai-service/app/api/mid/slots.py`
- **验收标准**:
-  - `/mid/slots/by-role` 端点正确返回槽位定义
-  - `/mid/slots/{slot_key}` 端点正确返回运行时值
-
---
-
-## Phase 3: 工具协同改造
-
-### Task 3.1: 改造 kb_search_dynamic 工具
- [x] **状态**: ✅ 已完成
- **AC**: AC-MRS-11
- **描述**: 改造 KB 动态检索工具，只消费 resource_filter 角色
- **产出**:
-  - 修改 `ai-service/app/services/mid/kb_search_dynamic_tool.py`
-  - 修改 `ai-service/app/services/mid/metadata_filter_builder.py`
- **验收标准**:
-  - 只使用 field_roles 包含 resource_filter 的字段
-  - 不影响现有功能
-
-### Task 3.2: 改造 memory_recall 工具
- [x] **状态**: ✅ 已完成
- **AC**: AC-MRS-12
- **描述**: 改造记忆召回工具，只消费 slot 角色
- **产出**:
-  - 修改 `ai-service/app/services/mid/memory_recall_tool.py`
- **验收标准**:
-  - 只使用 field_roles 包含 slot 的字段
-  - 槽位合并逻辑正确
-
-### Task 3.3: 改造 intent_hint 和 high_risk_check 工具
- [x] **状态**: ✅ 已完成
- **AC**: AC-MRS-13
- **描述**: 改造意图提示和高风险检测工具，只消费 routing_signal 角色
- **产出**:
-  - 修改 `ai-service/app/services/mid/intent_hint_tool.py`
-  - 修改 `ai-service/app/services/mid/high_risk_check_tool.py`
- **验收标准**:
-  - 只使用 field_roles 包含 routing_signal 的字段
-  - 路由判断逻辑正确
-
-### Task 3.4: 改造 template_engine
- [x] **状态**: ✅ 已完成
- **AC**: AC-MRS-14
- **描述**: 改造模板引擎，只消费 prompt_var 角色
- **产出**:
-  - 修改 `ai-service/app/services/flow/template_engine.py`
- **验收标准**:
-  - 只使用 field_roles 包含 prompt_var 的字段
-  - 模板渲染正确
-
---
-
-## Phase 4: 前端页面改造
-
-### Task 4.1: 元数据字段配置页面增加 field_roles
- [x] **状态**: ✅ 已完成
- **AC**: AC-MRS-15
- **描述**: 在元数据字段编辑表单中增加角色选择组件
- **产出**:
-  - 创建 `ai-service-admin/src/components/metadata/FieldRolesSelector.vue`
-  - 修改 `ai-service-admin/src/views/admin/metadata-schema/index.vue`
- **验收标准**:
-  - 支持多选角色
-  - 显示角色说明
-  - 保存时正确提交
-
-### Task 4.2: 增加按角色过滤视图
- [x] **状态**: ✅ 已完成
- **AC**: AC-MRS-06
- **描述**: 在元数据字段列表页面增加角色过滤功能
- **产出**:
-  - 修改 `ai-service-admin/src/views/admin/metadata-schema/index.vue`
- **验收标准**:
-  - 下拉框选择角色
-  - 过滤结果正确
-
-### Task 4.3: 槽位定义管理页面
- [x] **状态**: ✅ 已完成
- **AC**: AC-MRS-07, AC-MRS-08, AC-MRS-16
- **描述**: 创建槽位定义管理页面
- **产出**:
-  - 创建 `ai-service-admin/src/views/admin/slot-definition/index.vue`
-  - 创建 `ai-service-admin/src/api/slot-definition.ts`
-  - 创建 `ai-service-admin/src/types/slot-definition.ts`
-  - 更新 `ai-service-admin/src/router/index.ts`
-  - 更新 `ai-service-admin/src/App.vue` 导航菜单
- **验收标准**:
-  - 支持 CRUD 操作
-  - 支持关联元数据字段
-
---
-
-## Phase 5: 测试与验收
-
-### Task 5.1: 单元测试
- [x] **状态**: ✅ 已完成
- **AC**: AC-MRS-01~16
- **描述**: 编写单元测试
- **产出**:
-  - `ai-service/tests/test_role_based_field_provider.py`
-  - `ai-service/tests/test_slot_definition_service.py`
-  - `ai-service/tests/test_field_roles.py`
- **验收标准**:
-  - 覆盖核心逻辑
-  - 测试通过
-
-### Task 5.2: 集成测试
- [ ] **状态**: ⏳ 待开始
- **AC**: AC-MRS-01~16
- **描述**: 编写 API 集成测试
- **产出**:
-  - `ai-service/tests/api/test_metadata_field_roles.py`
-  - `ai-service/tests/api/test_slot_definition.py`
- **验收标准**:
-  - 覆盖所有 API 端点
-  - 测试通过
-
-### Task 5.3: 契约测试
- [ ] **状态**: ⏳ 待开始
- **AC**: AC-MRS-01~16
- **描述**: 验证 API 符合 OpenAPI 契约
- **产出**:
-  - 运行契约测试
-  - 修复不一致
- **验收标准**:
-  - Provider 契约达到 L2 级别
-  - 所有响应符合 Schema
-
---
-
-## 任务依赖关系
-
-```
-Phase 1 (数据模型)
-    │
-    ├── Task 1.1 (扩展 MetadataFieldDefinition)
-    ├── Task 1.2 (创建 SlotDefinition)
-    ├── Task 1.3 (迁移脚本) ← 依赖 1.1, 1.2
-    └── Task 1.4 (更新 Schema) ← 依赖 1.1, 1.2
-            │
-            ▼
-Phase 2 (后端服务)
-    │
-    ├── Task 2.1 (RoleBasedFieldProvider) ← 依赖 1.4
-    ├── Task 2.2 (扩展 MetadataFieldService) ← 依赖 1.4
-    ├── Task 2.3 (SlotDefinitionService) ← 依赖 1.4
-    ├── Task 2.4 (扩展 MetadataField API) ← 依赖 2.2
-    ├── Task 2.5 (SlotDefinition API) ← 依赖 2.3
-    └── Task 2.6 (运行时槽位 API) ← 依赖 2.1, 2.3
-            │
-            ▼
-Phase 3 (工具改造)
-    │
-    ├── Task 3.1 (kb_search_dynamic) ← 依赖 2.1
-    ├── Task 3.2 (memory_recall) ← 依赖 2.1
-    ├── Task 3.3 (intent_hint/high_risk_check) ← 依赖 2.1
-    └── Task 3.4 (template_engine) ← 依赖 2.1
-            │
-            ▼
-Phase 4 (前端改造)
-    │
-    ├── Task 4.1 (field_roles 组件) ← 依赖 2.4
-    ├── Task 4.2 (角色过滤视图) ← 依赖 2.4
-    └── Task 4.3 (槽位定义页面) ← 依赖 2.5
-            │
-            ▼
-Phase 5 (测试验收)
-    │
-    ├── Task 5.1 (单元测试) ← 依赖 Phase 2
-    ├── Task 5.2 (集成测试) ← 依赖 Phase 2, 3, 4
-    └── Task 5.3 (契约测试) ← 依赖 5.2
-```
-
---
-
-## 执行顺序建议
-
-1. **Phase 1** → **Phase 2** → **Phase 3** → **Phase 4** → **Phase 5**
-2. Phase 3 和 Phase 4 可并行执行
-3. 每个 Task 完成后需更新状态并提交
-
---
-
-## 变更记录
-
-| 日期 | 变更内容 | 变更人 |
-|------|---------|-------|
-| 2026-03-05 | 初始创建 | AI Agent |
-| 2026-03-05 | 完成 Phase 3 工具协同改造 (Task 3.1-3.4) [AC-MRS-11~14] | AI Agent |
-| 2026-03-05 | 完成 Phase 4 前端页面改造 (Task 4.1-4.3) [AC-MRS-06,07,08,15,16] | AI Agent |
-| 2026-03-05 | 完成 Phase 2 后端服务实现 (Task 2.3-2.6) [AC-MRS-07~10,16] | AI Agent |
-| 2026-03-05 | 完成 Task 5.1 单元测试 [AC-MRS-01~16] | AI Agent |
--- a/spec/mid-agent-runtime-hardening/customer-material-classification-todo.md
+++ b/spec/mid-agent-runtime-hardening/customer-material-classification-todo.md
@ -1,137 +0,0 @@
-# 客户资料分类 TODO
-
-> 目的：沉淀“客户提供的文字资料”并拆解为可录入中台的结构化项。
-> 状态：进行中
-
-## 分类维度（固定）
-
-1. 数据库录入项（结构化事实）
-   - 企业/品牌信息
-   - 产品与服务目录
-   - 业务规则（价格、时效、区域、限制）
-   - 售后/退款/改期政策
-   - 联系方式与升级路径
-
-2. 流程编排项（Flow / Script）
-   - 意图入口
-   - 主流程节点
-   - 异常分支
-   - 打断恢复策略
-   - 降级与转人工
-
-3. 元数据配置项（Metadata）
-   - 租户配置
-   - 检索配置
-   - 超时配置
-   - 分段与 delay 配置
-   - 观测与审计字段
-
-4. 提示词约束项（Prompt Guardrails）
-   - 角色与边界
-   - 合规/禁答策略
-   - 回复风格
-   - 事实优先级
-   - 工具调用策略
-
---
-
-## 待处理清单
-
- [x] TODO-001：首批销售转化话术资料（已完成分类）
-
---
-
-## 处理记录
-
-### TODO-001
- 原始资料：
-  - 主题包含：用户分层、下危机铺垫、课程效果外化、上直播、加微与辅导老师价值说明。
-  - 输入特征：左侧为“问题/场景”，右侧为“思路及参考话术”。
-
- 分类结果：
-  - 数据库录入项：
-    - 产品信息：
-      - 课程名称（暂命名）：方法技巧提升课
-      - 课程节数：10节
-      - 授课形式：线上直播
-      - 上课时间：周末
-      - 回放机制：有回放，可反复观看
-      - 教学配置：辅导老师 1对1 学情分析与答疑
-      - 主讲卖点：清北老师主讲（需合规校验后再对外展示）
-      - 覆盖题型：阅读、计算、证明题、完形填空等
-    - 价格与营销规则：
-      - 原价：299元
-      - 活动价：19元
-      - 活动标签：限时优惠
-      - 收费口径：无二次收费
-    - 用户分层维度（建议入库为标签/槽位）：
-      - 年级（必填）
-      - 薄弱科目/薄弱能力（如基础不扎实、举一反三弱）
-      - 最近考试表现（可选）
-    - 转化动作记录字段：
-      - 是否建议上直播
-      - 是否引导加微
-      - 是否已添加班主任/辅导老师
-      - 用户对价格接受度（accept_price: yes/no/hesitate）
-
-  - 流程编排项：
-    - 主流程（建议）：
-      1) 用户分层：先确认年级
-      2) 画像共情：结合最近考试与薄弱点做问题诊断
-      3) 方案匹配：给出“10节提升方案 + 1对1答疑”
-      4) 价值外化：讲课程内容、题型覆盖、预期效果与试错成本低
-      5) 价格锚定：原价299→活动19，并确认接受度
-      6) 行动推进：建议上直播（强调互动答疑）
-      7) 私域沉淀：引导加班主任/辅导老师并要求截图确认
-    - 关键分支：
-      - 若用户提出薄弱项在课程覆盖内：强化“针对性内容”介绍
-      - 若用户只关注单科：补充“学科能力关联性，不止提升一科”
-      - 若用户价格犹豫：强化“单节成本低、无二次收费、可回放”
-      - 若用户时间顾虑：强调“周末上课+回放复习”
-    - 收口动作：
-      - 明确收口问题：“您看价格这一块可以接受么？”
-      - 明确动作闭环：加微完成截图回传
-
-  - 元数据配置项：
-    - 槽位定义（slot schema）：
-      - grade
-      - weak_subjects[]
-      - weak_skills[]（如举一反三、题型迁移）
-      - recent_exam_summary
-      - price_acceptance_status
-      - wechat_added_status
-      - live_attendance_intent
-    - 话术模板参数：
-      - promo_price=19
-      - original_price=299
-      - lesson_count=10
-      - has_replay=true
-      - has_tutor_1v1=true
-    - 触发条件配置：
-      - 未确认年级时，禁止进入报价节点
-      - 未完成价格确认时，优先输出价值外化话术
-      - 未加微完成时，优先输出加微引导
-    - 观测埋点建议：
-      - node_entered（分层/危机/方案/报价/加微）
-      - objection_type（价格/时间/效果/信任）
-      - conversion_step（接受报价、同意听直播、完成加微）
-
-  - 提示词约束项：
-    - 角色约束：
-      - 以课程顾问身份进行诊断式沟通，先分层再推荐，不直接硬推销。
-    - 对话策略约束：
-      - 必须先确认年级，再给课程针对性说明。
-      - 必须至少询问一个薄弱点或考试表现，再进入方案介绍。
-      - 报价后必须发起“价格可接受度确认”。
-    - 风格约束：
-      - 语气专业、关怀、不过度夸张。
-      - 用“问题-方案-结果-行动”结构表达。
-    - 合规约束：
-      - 对“清北老师主讲”“达成深度合作”等表述增加可校验开关，默认不做绝对化承诺。
-      - 禁止承诺“保证提分/保证结果”。
-    - 工具/事实优先级：
-      - 优先读取课程事实库（价格、节数、上课形式、回放、服务）再组织话术。
-      - 未命中事实时使用保守表述，并引导人工确认。
-
- 备注：
-  - 本条属于“销售转化脚本型资料”，不是完整产品手册；建议后续补充：适用年级范围、开班频次、退款规则、资质证明，用于增强可落库完整性。
--- a/spec/mid-agent-runtime-hardening/design.md
+++ b/spec/mid-agent-runtime-hardening/design.md
@ -1,89 +0,0 @@
-# 中台智能体运行时加固（MARH）设计文档
-
-## 1. 设计目标
-
-围绕 AC-MARH-01~12，构建中台运行时加固层，确保输出可控、打断可消费、检索可依赖、时延可治理、节奏可调优。
-
---
-
-## 2. 架构改造点
-
-## 2.1 对话主链路增强
-在 `respond` 主流程增加以下阶段：
-1. `interrupt_context_enricher`：消费 `interrupted_segments`
-2. `kb_default_tool_stage`：Agent 默认检索工具尝试
-3. `output_guardrail_stage`：输出前强制过滤
-4. `segment_humanizer`：语义+长度分段与 delay 计算
-5. `runtime_observer`：补齐 trace 与 metrics
-
-## 2.2 超时统一治理
- ReAct 最大循环：3~5
- 单工具超时：<=2000ms
- 全链路超时：<=8000ms
- 超时均落 `fallback_reason_code`
-
---
-
-## 3. 关键流程
-
-## 3.1 正常响应流程
-1. 收到请求并校验已送达历史
-2. 若存在中断片段则做语义去重标记
-3. policy_router 决策
-4. Agent 模式默认调用 KB 工具（可降级）
-5. 生成候选文本
-6. 输出护栏强制过滤
-7. 分段拟人策略生成 `segments[]`
-8. 输出 trace（guardrail/interrupt/kb_hit/timeouts/segment_stats）
-
-## 3.2 打断重入流程
-1. 新请求带 `interrupted_segments`
-2. 重规划时避开被打断语义
-3. 若中断信息异常则兜底继续
-
---
-
-## 4. 数据结构与字段补充
-
-新增或强化 trace 字段：
- `guardrail_triggered`
- `guardrail_rule_id`
- `interrupt_consumed`
- `kb_tool_called`
- `kb_hit`
- `segment_stats`
- `timeout_profile`
-
---
-
-## 5. 组件职责
-
- `OutputGuardrailExecutor`
-  - 强制执行输出过滤
-  - 返回 blocked/filtered_text/rule_id
-
- `InterruptContextEnricher`
-  - 将中断片段转成重规划上下文
-  - 提供异常兜底
-
- `DefaultKbToolRunner`
-  - Agent 默认 KB 检索
-  - 失败时返回降级信号
-
- `SegmentHumanizer`
-  - 文本分段与 delay 生成
-  - 支持租户覆盖配置
-
- `RuntimeObserver`
-  - 汇总 trace 与 metrics
-
---
-
-## 6. 与 AC 映射
-
- AC-MARH-01/02：`OutputGuardrailExecutor`
- AC-MARH-03/04：`InterruptContextEnricher`
- AC-MARH-05/06：`DefaultKbToolRunner`
- AC-MARH-07/08/09：`TimeoutGovernor + Orchestrator`
- AC-MARH-10/11：`SegmentHumanizer`
- AC-MARH-12：`RuntimeObserver`
--- a/Show More
+++ b/Show More