Go-to-Definition

Terms used in this page

CachedBuild: a group of Rust HashMaps created from a successful build on disk. The maps store things like node id -> node info and source id -> file path. It is a snapshot, not the live editor buffer.
build_version: an integer stored with CachedBuild that records which text version produced that snapshot.
text_cache: an in-memory HashMap<uri, (version, text)> with the latest editor-buffer content.
Clean: the in-memory editor text for a file is at the same version as the last cached successful build for that file (text_version <= build_version).
Dirty: the in-memory editor text is newer than the cached build (text_version > build_version). This usually means unsaved edits or edits not yet reflected in a successful rebuild.
AST offset path: direct declaration lookup using AST src byte offsets (goto_declaration_cached).
Tree-sitter path: live-buffer parsing and name/scope-based resolution (goto_definition_ts).

Why this design exists

Go-to-definition must work in two very different editor states:

Clean state: the buffer matches the last successful build.
Dirty state: the buffer has unsaved or not-yet-built edits.

In the clean state, AST byte offsets are reliable and fast.
In the dirty state, offsets can drift, so name/scope-based resolution is safer.

This behavior was introduced to address the dirty-buffer reliability issue reported in #2.

Runtime flow

At request time, textDocument/definition checks whether the file is clean or dirty, then picks the resolution order.

End-to-end request path

Determine source of truth (clean vs dirty)

The server compares current text version in text_cache to the cached build version.
If text is newer, the file is treated as dirty and AST byte offsets are considered potentially stale.

Resolve a candidate from live syntax

For dirty files (and as a fallback for clean files), the server parses the live buffer and builds cursor context (identifier, function scope, contract scope), then resolves scope using cached declaration maps and inheritance order.

Typical context shape:

identifier "totalSupply"
  -> function_definition "mint"
  -> contract_declaration "Token"

Normalize to declaration location

After semantic resolution, the server maps the target to concrete declaration ranges in source, including same-file and cross-file targets. When several matches exist, it prefers the best scoped/overload-compatible target.

Validate and apply fallback policy

Tree-sitter targets are validated against the cursor identifier text.
If validation fails in dirty mode, the server falls back to AST-by-name resolution; in clean mode, it tries AST-by-offset first and then tree-sitter fallback.

Once this sequence succeeds, the server returns a single Location; otherwise it returns null.

Formatting race guard (related reliability fix)

A separate race was fixed so goto/hover read correct text after formatting:

on_change only writes to text_cache when version is not older than what is already cached.
Formatting updates text_cache immediately before returning edits.

Current Behavior Summary

Dirty buffers: tree-sitter path first (validated), then AST-by-name fallback. goto_declaration_by_name is only used in the dirty path.
Clean buffers: AST-by-offset path first, then tree-sitter fallback (validated). goto_declaration_by_name is not used in the clean path.
Tree-sitter results are validated against cursor identifier text before return.
Parameter and local declaration navigation is supported by tree-sitter declaration scanning.
Content hash guard: on_change skips a rebuild entirely when the saved text hash matches CachedBuild.content_hash from the last successful build. This prevents stale build_version drift in format-on-save loops.

Qualifier goto

When the cursor is on the qualifier segment of a qualified type path (e.g., Pool in Pool.State), resolve_qualifier_goto() intercepts before the normal goto_bytes() path and navigates to the container declaration (the contract/library/interface).

How it works:

Detects the cursor is on a multi-segment IdentifierPath's first nameLocations entry.
Follows referencedDeclaration to the declaration node (e.g., the struct).
Reads the declaration's scope field to find the container's node ID.
Emits a Location pointing to the container's name_location.

Helper: find_node_info() is a reusable utility that looks up a NodeInfo by NodeId across all files in the nodes map.

Reused Infrastructure

Component	From	Used For
`CompletionCache`	completion.rs	Scope chain, type resolution, inheritance
`scope_declarations`	completion.rs	Declaration → scope node mappings
`scope_parent`	completion.rs	Upward scope walk
`scope_ranges`	completion.rs	Innermost scope at cursor position
`name_to_type`	completion.rs	`SymbolName` → `TypeIdentifier` for resolution
`linearized_base_contracts`	completion.rs	C3 inheritance resolution
`name_to_node_id`	completion.rs	`SymbolName` (contract name) → scope ID
`text_cache`	lsp.rs	Live buffer content
`CachedBuild.content_hash`	lsp.rs / goto.rs	Skip rebuild when saved text is identical to last build
tree-sitter parser	goto.rs	Parse live buffer and find declaration ranges

Test Coverage

The goal of tests here is to protect behavior, not specific implementation details. Even if internals change, these guarantees should still hold:

Cursor context is extracted correctly We must reliably detect identifier, function, and contract context from live syntax. Why it matters: wrong context routes resolution to the wrong scope. Representative tests:
- test_cursor_context_state_var
- test_cursor_context_top_level
- test_cursor_context_short_param
Declaration discovery finds valid symbol kinds The declaration scanner must return the expected declaration nodes (state vars, params, enum values, etc.). Why it matters: if symbol extraction misses a kind, go-to-definition silently fails for that language feature. Representative tests:
- test_find_declarations
- test_find_declarations_enum_value
- test_cursor_context_short_param
Ambiguity resolution picks the intended target When multiple declarations share a name, selection logic must prefer the correct scope/container. Why it matters: users should not jump to unrelated symbols with the same name. Representative tests:
- test_find_declarations_multiple_contracts
- test_find_best_declaration_same_contract

This suite is intentionally tied to these guarantees so refactors can change function names or structure without weakening behavior.