CLDK over MCP

The cocoa plugin drives CLDK in-process: a coding agent shells out to it via Bash heredocs. That works when the analysis lives inside your agent, but it doesn’t have to. Wrap the same CLDK calls as fixed tools and publish them over the Model Context Protocol, and any MCP host (Claude Desktop, an MCP-aware IDE, or a different agent framework entirely) can consume get_callers, get_callees, and reachability over a standard wire protocol (no code execution required on the host side).

The mental model is unchanged: one analysis facade over your code, exposed across the wire. Callers become a tool call and reachability a networkx graph query, each backed by real static analysis.

flowchart LR
    A([MCP host]) <-->|MCP| B[CLDK MCP server]
    B <--> C[CLDK analysis]
    C --> D[Call graph]
    C --> E[Symbol table]
    C --> F[Class structure]

What you’ll need

pip install cldk mcp networkx and a project to analyze. We use Apache Commons CLI (project_path="commons-cli"), the recurring sample across these docs. The server is Java-first because Java has the richest call-graph support; the same shape works for Python.

The server, end to end

Build one analysis object at startup. Callers, callees, and reachability all require the project analyzed at call_graph level (the default symbol_table level won’t populate call edges). Build it once, outside any tool, so every request reuses it.
Decorate plain functions with @mcp.tool(). FastMCP reads each function’s type hints and docstring to generate the tool’s input schema and description automatically, so you never hand-write a JSON schema.
Run the server over stdio so an MCP host can launch and talk to it.

1. Build the analysis facade

1
import os
2
import networkx as nx
3
from mcp.server.fastmcp import FastMCP
4
from cldk import CLDK
5
from cldk.analysis import AnalysisLevel
6

7
mcp = FastMCP("cldk")
8

9
APP = os.environ.get("JAVA_APP_PATH", "commons-cli")
10

11
# Java facade at call-graph depth so callers/callees/reachability work.
12
# Built once at import time and reused across every tool call.
13
analysis = CLDK(language="java").analysis(
14
    project_path=APP,
15
    analysis_level=AnalysisLevel.call_graph,
16
)
17

18
# The call graph is a networkx.DiGraph (edges point caller -> callee).
19
CALL_GRAPH = analysis.get_call_graph()

2. Define the tools

Each tool is a normal function. FastMCP turns its signature and docstring into the MCP tool schema, so the host knows exactly which arguments to fill in.

21
@mcp.tool()
22
def get_method_body(qualified_class_name: str, qualified_method_name: str) -> str:
23
    """Return the source code of a method, given its fully qualified class
24
    name and method signature (e.g. 'create(String)')."""
25
    method = analysis.get_method(qualified_class_name, qualified_method_name)
26
    return method.code if method else "method not found"
27

28

29
@mcp.tool()
30
def get_callers(target_class_name: str, target_method_declaration: str) -> dict:
31
    """Return every method that calls the target method
32
    (impact analysis / 'who calls this?')."""
33
    return analysis.get_callers(target_class_name, target_method_declaration)
34

35

36
@mcp.tool()
37
def get_callees(source_class_name: str, source_method_declaration: str) -> dict:
38
    """Return every method invoked by the source method
39
    (what this method depends on)."""
40
    return analysis.get_callees(source_class_name, source_method_declaration)

3. The reachability tool

Reachability is a deterministic graph query over the call graph CLDK hands you. That is the whole point: the host doesn’t reason about whether a sink is reachable, it looks it up with networkx.has_path.

42
def _find_node(class_name: str, method_decl: str):
43
    """Locate a method's node in the call graph by matching its metadata."""
44
    for node, data in CALL_GRAPH.nodes(data=True):
45
        if class_name in str(data) and method_decl in str(data):
46
            return node
47
    return None
48

49

50
@mcp.tool()
51
def is_reachable(
52
    source_class_name: str,
53
    source_method_declaration: str,
54
    sink_class_name: str,
55
    sink_method_declaration: str,
56
) -> dict:
57
    """Return whether the sink method is reachable from the source method
58
    along call-graph edges. Use to confirm or refute whether vulnerable code
59
    can actually be invoked."""
60
    src = _find_node(source_class_name, source_method_declaration)
61
    sink = _find_node(sink_class_name, sink_method_declaration)
62
    if src is None or sink is None:
63
        return {"reachable": False, "reason": "endpoint not found in call graph"}
64
    return {"reachable": nx.has_path(CALL_GRAPH, src, sink)}
65

66

67
if __name__ == "__main__":
68
    # Speak MCP over stdio so a host can launch this server.
69
    mcp.run(transport="stdio")

Register it with a host

An MCP host launches the server as a subprocess and discovers its tools. For a Claude Desktop-style config, point the host at the script and pass the project path through the environment:

{
  "mcpServers": {
    "cldk": {
      "command": "python",
      "args": ["cldk_mcp_server.py"],
      "env": { "JAVA_APP_PATH": "commons-cli" }
    }
  }
}

Once connected, the host sees four tools (get_method_body, get_callers, get_callees, is_reachable) and a model on that host can chain them exactly like cocoa does. A reachability question, for instance, resolves to a single tool call:

is_reachable(
  source_class_name="org.apache.commons.cli.CLI",
  source_method_declaration="main(String[])",
  sink_class_name="org.apache.commons.cli.CommandLine",
  sink_method_declaration="execute(String)")
# -> {"reachable": false, "reason": "no path in call graph"}

Ground truth in, no hallucinated taint path out: the same value proposition as the in-process loop, now available to any MCP host.

Language coverage

The server above is Java. Swapping languages is a one-line change to the facade: the tools stay identical.

Java
Python

analysis = CLDK(language="java").analysis(
    project_path="commons-cli",
    analysis_level=AnalysisLevel.call_graph,
)

analysis = CLDK(language="python").analysis(
    project_path="my_pkg",
    analysis_level=AnalysisLevel.call_graph,
)

Where to go next

cocoa The in-process version: a Claude Code plugin that runs CLDK via Bash.

Core concepts What call graphs, reachability, and analysis levels actually mean.

Common tasks Copy-paste snippets for the individual analysis calls these tools wrap.

Java API reference Every method on the Java analysis facade you can expose as a tool.