MCP Call Recording Server

An MCP (Model Context Protocol) server that provides semantic search over VTT transcript files using AI-powered structured summaries. This server enables business stakeholders to query client call transcripts using natural language through Claude Desktop and Microsoft Copilot 365 Studio.

Now powered by OpenAI GPT-4-turbo for best-in-class summary quality and structured extraction.

Architecture Diagram (generated using Gemini Nano Banana)

Data Flow

This is improved version of data flow after we identified improvements we can make at the embedding levels. <img width="736" height="427" alt="image" src="https://github.com/user-attachments/assets/abd7d122-c602-4bed-a89f-1b239432525b" />

Features

Automatic Indexing: Monitors a directory for VTT transcript files and automatically indexes them in the background
AI-Powered Summaries: Uses OpenAI GPT-4-turbo to generate perfect structured summaries with CALL TYPE, PARTICIPANTS, COMPANY/COMPANIES, KEY TOPICS, ACTION ITEMS, and DECISIONS MADE
Advanced Embeddings: OpenAI text-embedding-3-small (1536 dimensions) for superior semantic search quality
Natural Language Queries: Ask questions like "Summarize the Bank of America sales call" or "What were the action items from the Capital One call?"
100% Consistent Format: GPT-4-turbo maintains perfect structure even for long transcripts (860+ lines, 286+ segments)
All Participants Captured: Never miss a meeting attendee - all speakers are identified correctly
No Hallucinations: Correctly shows "Unknown" for missing information instead of fabricating data
Single Tool Interface: Simple query_transcripts tool that handles all queries (pure semantic search)

The server uses:

OpenAI GPT-4-turbo: Best-in-class LLM for generating structured summaries (~$0.01-0.05 per transcript)
OpenAI Embeddings: text-embedding-3-small for 1536-dimensional semantic search (~$0.0001 per transcript)
Chroma: Vector database; the Node.js client connects to a ChromaDB server running at http://localhost:8000. Persisted data is stored in a directory you configure when starting the Chroma server (see CHROMADB_SETUP.md).
File Watcher (chokidar): Automatically detects and indexes new, changed, or deleted VTT files
MCP Protocol: Standard protocol for AI assistant integration (stdio transport)

Prerequisites

Node.js 18+
OpenAI API key (get one at https://platform.openai.com/api-keys)
$5+ OpenAI credit (covers ~200 transcripts)
Directory containing VTT transcript files
ChromaDB server running on http://localhost:8000 (see CHROMADB_SETUP.md)

Installation

Clone or download this repository.

Install dependencies:

   npm install

Build the TypeScript code:

   npm run build

Create a .env file in the project root:

   OPENAI_API_KEY=sk-proj-your-key-here
   VTT_DIRECTORY=/path/to/vtt/transcript/files
   CHROMA_DB_PATH=./chroma_db

See ENV_SETUP.md for detailed environment setup instructions.

Start the ChromaDB server (in a separate terminal) before running the MCP server—see CHROMADB_SETUP.md.

Configuration

Environment Variables

The MCP server reads configuration from environment variables (e.g. from a .env file in the project root):

| Variable | Required | Description | |--------------------|----------|-----------------------------------------------------------------------------| | OPENAI_API_KEY | Yes | Your OpenAI API key for GPT-4-turbo summaries and embeddings | | VTT_DIRECTORY | Yes | Path to the directory containing VTT transcript files | | CHROMA_DB_PATH | No | Path for Chroma server data (default: ./chroma_db) |

Cost: ~$0.02 per transcript on average (varies with length). Your $5 credit covers approximately 200 transcripts.

Claude Desktop Setup

Edit your Claude Desktop configuration file:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json
Linux: ~/.config/claude/claude_desktop_config.json

Add the MCP server (use the absolute path to dist/index.js):

   {
     "mcpServers": {
       "call-recording": {
         "command": "node",
         "args": ["/absolute/path/to/MCP_Call_Recording/dist/index.js"]
       }
     }
   }

Example config is in config/claude-desktop.json.

Restart Claude Desktop.

The server will load .env, index existing VTT files in VTT_DIRECTORY, and watch for new, changed, or deleted files.

Microsoft Copilot 365 Studio Setup

The server currently uses stdio transport. For Copilot Studio you would need to run it as an HTTP MCP server (e.g. add an HTTP transport in src/index.ts or run behind an adapter).

For reference, config/copilot-studio.json illustrates a possible structure (name, description, transport type, environment variables). Set VTT_DIRECTORY, CHROMA_DB_PATH, and OPENAI_API_KEY in your deployment environment.

Usage

Once configured, you can ask Claude Desktop (or Copilot Studio once HTTP is set up) questions about your call transcripts, for example:

"What were the main risks discussed in the last call with Bank of America?"
"Identify the top risk identified in last client call with Bank of America with Sales"
"What decisions were made in calls with Acme Corp this month?"
"Summarize the key points from the call with TechCorp on January 15th"

The server will:

Generate an embedding for your question
Run a semantic search in the vector database (no metadata filtering)
Return a formatted answer with relevant segments, metadata (client, date, speaker), and relevance scores

Tool: `query_transcripts`

question (required): Natural language question about the transcripts.
limit (optional, default: 10): Maximum number of results to return.
minScore (optional, default: 0.0): Minimum relevance score (0–1). Segments below this are excluded.

How It Works

Automatic Indexing

On startup the server:

Scans VTT_DIRECTORY for existing .vtt files
Parses each file into segments with timestamps
Extracts metadata (client name, date, participants, call type) from filenames and VTT content
Generates AI-powered structured summary using OpenAI GPT-4-turbo
Creates semantic embedding of the entire summary using OpenAI text-embedding-3-small (1536 dimensions)
Stores summary and embedding in ChromaDB for semantic search
Uses the same pipeline when the file watcher detects new or changed files

File Watching

The server watches the VTT directory with chokidar:

New files: Indexed automatically
Changed files: Re-indexed (existing segments for that file are removed first)
Deleted files: Segments for that file are removed from the database

Query Processing

For each query:

The question is embedded using OpenAI text-embedding-3-small (same model used for indexing)
Chroma returns the closest transcript summaries by embedding similarity (cosine distance → converted to a 0–1 score)
Results are filtered by minScore, sorted by score, and formatted with summary text and metadata

Reindexing

If you need to refresh the index for one file or the whole directory:

Single file (force reindex one VTT file):

  npm run reindex -- path/to/file.vtt
  # or: tsx reindex-file.ts path/to/file.vtt

All files in VTT_DIRECTORY (force reindex everything):

  npm run reindex-all
  # or: tsx reindex-all.ts

Both scripts use your .env (e.g. VTT_DIRECTORY, CHROMA_DB_PATH, OPENAI_API_KEY). The Chroma server must be running.

Note: Re-indexing costs OpenAI API credits (~$0.02 per transcript).

File Structure

MCP_Call_Recording/
├── src/
│   ├── index.ts                 # Entry point: init services, index existing files, start file watcher, start MCP server
│   ├── server.ts                # MCP server setup and tool registration (query_transcripts)
│   ├── tools/
│   │   └── query.ts             # query_transcripts tool (embedding + vector search + format answer)
│   ├── services/
│   │   ├── vttParser.ts         # VTT file parsing
│   │   ├── summaryService.ts    # OpenAI GPT-4-turbo for structured summaries
│   │   ├── embeddingService.ts  # OpenAI text-embedding-3-small (1536-dim)
│   │   ├── vectorDb.ts          # Chroma client (connects to http://localhost:8000)
│   │   ├── metadataExtractor.ts # Metadata from filename and VTT content
│   │   ├── indexer.ts           # Index one file or directory into Chroma
│   │   └── fileWatcher.ts       # chokidar-based file watcher
│   ├── types/
│   │   └── transcript.ts        # TypeScript interfaces
│   └── utils/
│       └── chunking.ts          # Legacy chunking utilities (now using full-transcript summaries)
├── config/
│   ├── claude-desktop.json      # Example Claude Desktop MCP config
│   └── copilot-studio.json      # Example structure for Copilot Studio (HTTP not implemented)
├── reindex-file.ts             # Script to reindex a single VTT file
├── reindex-all.ts              # Script to reindex all VTT files in VTT_DIRECTORY
├── check-embeddings.sql        # Optional: SQL for inspecting Chroma SQLite DB (chroma_db)
├── start_chroma.sh             # Helper to start Chroma server (see CHROMADB_SETUP.md)
├── start_chroma.py
├── package.json
├── tsconfig.json
├── CHROMADB_SETUP.md
└── README.md

Development

Run (production build)

npm run build
npm start

Development mode (tsx, no build step)

npm run dev

Watch (rebuild on change)

npm run watch

Reindex

npm run reindex -- vtt_files/SomeFile.vtt
npm run reindex-all

VTT File Format

The server expects WebVTT files (.vtt extension). Example:

WEBVTT

00:00:00.000 --> 00:00:05.000
Hello, this is a transcript segment.

00:00:05.000 --> 00:00:10.000
<v Speaker Name>This segment has a speaker identifier.</v>

Metadata Extraction

Metadata is derived from:

Filename pattern: {ClientName}_{Date}_{Type}.vtt

Example: BankOfAmerica_2026-01-15_Sales.vtt

VTT headers: NOTE comments or other header metadata
File modification time: Fallback when no date is found in filename or content

Troubleshooting

Server won't start

Ensure all required environment variables are set in .env (especially OPENAI_API_KEY and VTT_DIRECTORY).
Verify your OpenAI API key is valid and has credits.
Ensure the ChromaDB server is running at http://localhost:8000 (see CHROMADB_SETUP.md).
In Claude Desktop config, use the absolute path to dist/index.js.
Check stderr/logs for errors.

Files not being indexed

Confirm VTT_DIRECTORY points to the correct directory and files have .vtt extension.
Check file permissions and stderr for indexing errors.
For a single file, try: npm run reindex -- path/to/file.vtt.

Poor or empty search results

Ensure transcripts are valid VTT and were indexed (watch startup logs or use reindex scripts).
Lower minScore (e.g. 0.0) to see more results; the tool default is 0.0.
Check OpenAI API usage to confirm embeddings are being generated.

OpenAI API Issues

Rate limits: OpenAI has rate limits. If indexing many files, they're processed sequentially.
Cost monitoring: Check your usage at https://platform.openai.com/usage
Budget alerts: Set limits at https://platform.openai.com/settings/organization/billing/limits

Security Considerations

API Key Security: Keep your OpenAI API key secure. Never commit .env files to version control.
Cost Control: Set usage limits in your OpenAI account to prevent unexpected charges.
Data Privacy: Transcripts are sent to OpenAI for processing. Ensure compliance with your data policies.
Validate and constrain file paths to avoid directory traversal.
Sanitize or limit user query input as needed.
Consider rate limiting and access control for production or HTTP deployment.

Migration from Ollama

If you're upgrading from the previous Ollama-based version, see docs/OPENAI_MIGRATION.md for complete migration instructions.

License

MIT

MCP Call Recording Server

MCP Call Recording Server

Architecture Diagram (generated using Gemini Nano Banana)

Data Flow

Features

The server uses:

Prerequisites

Installation

Configuration

Environment Variables

Claude Desktop Setup

Microsoft Copilot 365 Studio Setup

Usage

Tool: `query_transcripts`

How It Works

Automatic Indexing

File Watching

Query Processing

Reindexing

File Structure

Development

Run (production build)

Development mode (tsx, no build step)

Watch (rebuild on change)

Reindex

VTT File Format

Metadata Extraction

Troubleshooting

Server won't start

Files not being indexed

Poor or empty search results

OpenAI API Issues

Security Considerations

Migration from Ollama

License

Related MCP servers

MCP servers by category

MCP Call Recording Server

MCP Call Recording Server

Architecture Diagram (generated using Gemini Nano Banana)

Data Flow

Features

The server uses:

Prerequisites

Installation

Configuration

Environment Variables

Claude Desktop Setup

Microsoft Copilot 365 Studio Setup

Usage

Tool: query_transcripts

How It Works

Automatic Indexing

File Watching

Query Processing

Reindexing

File Structure

Development

Run (production build)

Development mode (tsx, no build step)

Watch (rebuild on change)

Reindex

VTT File Format

Metadata Extraction

Troubleshooting

Server won't start

Files not being indexed

Poor or empty search results

OpenAI API Issues

Security Considerations

Migration from Ollama

License

Related MCP servers

MCP servers by category

Tool: `query_transcripts`