Agent skill
ast-grep
Use for ast-grep: ast-grep run, sg scan, sg test, sg new, new rule, sgconfig.yml, inline-rules, stdin, json, optional chaining, rule catalog, meta variables, pattern objects, nthChild stopBy, range field, metadata url, caseInsensitive glob, severity off, include metadata, rule order, kind pattern, positive rule, kind esquery, debug-query, static analysis, tree-sitter parser, pattern yaml api, search rewrite lint analyze, textual structural, ast cst, named unnamed, kind field, ambiguous pattern, effective selector, meta variable detection, lazy multi, strictness smart, relaxed signature, string fix, fix config, expandEnd, replace substring, toCase separatedBy, rewriter, rewrite joinBy, find patch, barrel import, ruleDirs testConfigs, libraryPath languageSymbol, dynamic injected, custom language, TREE_SITTER_LIBDIR, language injection, styled components, language alias, languageGlobs, expandoChar, napi parse, python api, programmatic API.
Install this agent skill to your Project
npx add-skill https://github.com/vinhnx/VTCode/tree/main/vtcode-core/src/skills/assets/samples/ast-grep
Metadata
Additional technical details for this skill
- short description
- Ast-grep project workflows
SKILL.md
Ast-Grep
Use this skill for ast-grep project setup, rule authoring, rule debugging, and CLI workflows that go beyond a single structural query.
Routing
- Prefer
unified_searchwithaction="structural"andworkflow="scan"for read-only project scans. - Prefer
unified_searchwithaction="structural"andworkflow="test"for read-only ast-grep rule tests. - Prefer structural
debug_queryon the public tool surface before falling back to rawast-grep run --debug-query. - Stay on the public structural surface first when the task is only running project checks and reporting findings.
- Use
unified_execonly when the public structural surface cannot express the requested ast-grep flow.
Quick Start
- In VT Code, prefer
vtcode dependencies install ast-grepbefore suggesting system package managers. - External install routes such as Homebrew, Cargo, npm, pip, MacPorts, or Nix are fallback options when the user explicitly wants a system-managed install.
- After installation, validate availability with
ast-grep --help. - On Linux, prefer the full
ast-grepbinary name oversgbecausesgmay already refer tosetgroups. - When running CLI patterns with shell metavariables like
$PROP, use single quotes so the shell does not expand them before ast-grep sees the pattern. - A good first rewrite example is optional chaining, for example rewriting
$PROP && $PROP()to$PROP?.().
Command Overview
ast-grep run: ad-hoc query execution and one-off rewrites.ast-grep scan: project rule scanning.ast-grep new: scaffold and rule generation.ast-grep test: rule-test execution.ast-grep lsp: editor integration via language server.
Built-In Languages
- ast-grep ships many built-in languages. Common aliases include
bash,c,cc/cpp,cs,css,ex,go/golang,html,java,js/javascript/jsx,json,kt,lua,php,py/python,rb,rs/rust,swift,ts/typescript,tsx, andyml. --lang <alias>and YAMLlanguage: <alias>use those built-in aliases. File-system scans infer language from built-in extensions unless the project overrides them.- In VT Code, public structural
langis passed through to ast-grep. VT Code also normalizes and infers a local subset it can pre-parse itself: Rust, Python, JavaScript, TypeScript, TSX, Go, and Java. - That local subset includes common ast-grep aliases and extensions such as
golang,jsx,cjs,mjs,cts,mts,py3, andpyi. - Use
languageGlobswhen the repository needs a different extension mapping than ast-grep’s built-in defaults.
How Ast-Grep Works
- ast-grep accepts several query formats: pattern queries, YAML rules, and programmatic API usage.
- The core pipeline is parse first, match second. Tree-Sitter builds the syntax tree, then ast-grep’s Rust matcher finds the target nodes.
- The main usage scenarios are search, rewrite, lint, and analyze.
- ast-grep processes many files in parallel and is built to use multiple CPU cores on larger codebases.
- In VT Code, the public structural surface is the read-only entry point for query, scan, and test. Use the bundled skill when the task is about YAML authoring, rewrite/apply flows, or API-level ast-grep work.
Project Scaffolding
- A scan-ready ast-grep project needs workspace
sgconfig.ymlplus at least one rule directory, usuallyrules/. rule-tests/andutils/are optional scaffolding thatast-grep newcan create for rule tests and reusable utility rules.- If the repository already has
sgconfig.ymlandrules/, prefer working with the existing layout instead of recreating scaffolding. - Use
ast-grep newwhen the repository does not have ast-grep scaffolding yet. - Use
ast-grep new rulewhen the scaffold exists and the task is creating a new rule plus optional test case.
sgconfig.yml
sgconfig.ymlis the project-level ast-grep config file, not a rule file. Treat it like the repository root for rule discovery, tests, parser overrides, and embedded-language behavior.ruleDirsis required and is resolved relative to the directory containingsgconfig.yml.testConfigsis optional and configures ast-grep test discovery. Each entry needstestDir;snapshotDiris optional and otherwise defaults to__snapshots__under thattestDir.utilDirsdeclares directories for global utility rules shared across multiple rule files.languageGlobsremaps files to parsers and takes precedence over ast-grep’s default extension mapping, which is useful for similar-language reuse like TS -> TSX or C -> Cpp.customLanguagesregisters project-local parsers.libraryPathcan be one relative library path or a target-triple map,extensionsis required,expandoCharis optional, andlanguageSymboldefaults totree_sitter_{name}.languageInjectionsis experimental. Each entry needshostLanguage,rule, andinjected.- Use dynamic
injectedcandidates when the rule captures$LANGand the embedded language must be chosen from a list such ascss,scss, orless. - Raw ast-grep project discovery walks upward from the current working directory until it finds
sgconfig.yml, and--config <file>overrides that discovery with an explicit root config path. ast-grep scanrequires project config and will error if nosgconfig.ymlis found.ast-grep runcan still search without project config, though it also benefits from discovered config for things likecustomLanguagesandlanguageGlobs.ast-grep scan --inspect summaryis the quickest way to confirm which project directory and config file ast-grep actually selected during discovery.- ast-grep also recognizes a home-directory
sgconfig.ymlas a global fallback config. XDG config directories are not part of this behavior. - Keep
sgconfig.ymlauthoring on the skill path. VT Code’s public structural tool can consume an existing config throughconfig_path, but it does not expose these top-level schema fields directly.
Rule Catalog
- Use the ast-grep catalog as inspiration when the user wants existing example rules, not as something to copy blindly.
- Start from examples in the same language family when possible.
- Read catalog markers as hints about rule complexity:
- simple pattern examples are good starting points
Fixmeans the example includes a rewrite pathconstraints,labels,utils,transform, andrewritersmean the example depends on more advanced rule features
- When adapting a catalog example, translate it to the current repository’s language, style, and safety constraints instead of preserving the example verbatim.
- Prefer the bundled skill workflow when the user asks to explain, adapt, or combine catalog examples.
Rust Catalog Highlights
- Avoid duplicated exports: a Rust lint-style rule can detect
pub use foo::Bar;in the same source file that already exposespub mod foo;. Treat this as API-surface cleanup, not a mechanical rewrite. - Beware
chars().enumerate(): the Rust catalog rewrite from$A.chars().enumerate()to$A.char_indices()is valid when the code needs byte offsets instead of character indexes. - Count
usizedigits without allocation: the catalog rewrite from$NUM.to_string().chars().count()to$NUM.checked_ilog10().unwrap_or(0) + 1is a good Rust-specific performance cleanup when the target is known to be an integer digit count. - Unsafe function without unsafe block: the Rust catalog’s
function_itemrule that requiresunsafemodifiers but rejects bodies containingunsafe_blockis a good review rule for redundantunsafemarkers. It is diagnostic-oriented and should usually stay a scan rule, not an automatic rewrite. - Rewrite
indoc!macro: the catalog example that removesindoc! { r#"..."# }wrappers is a rewrite-oriented example. Keep it on the CLI skill path because the replacement is formatting-sensitive and should be reviewed interactively before broad apply. - Adapt these rules to the repository’s Rust style before using them directly. In VT Code, preserve existing lint policy, public API conventions, and the project’s bias against unnecessary rewrites.
TypeScript Catalog Highlights
- TypeScript vs TSX matters: keep
.tsand.tsxrules separate unless the repository intentionally parses.tsas TSX throughlanguageGlobs. Do not assume one pattern works unchanged across both parsers. - Find import file without extension: good scan rule for ESM codebases that require explicit local file extensions on static or dynamic imports. It is policy-dependent, so only use it where the runtime or bundler actually requires explicit extensions.
- XState v4 to v5 migration: strong example of multi-rule YAML with
utils,transform, and multi-document configs. Keep this sort of migration on the CLI skill path and review the generated diff instead of treating it as a one-line rewrite. - No
awaitinsidePromise.all([...]): good rewrite rule when the awaited expression is directly inside the array literal. Keep the rewrite narrow so it does not change intentionally sequential logic hidden behind helper calls. - No console except allowed cases: good scan rule for client-facing TypeScript, but it is repository-policy dependent. Adapt the allowed methods and environments before enabling it broadly.
- Find import usage or identifiers: these examples are useful for repository analysis and dependency cleanup, not just linting. They are often better treated as search/report rules than rewrite rules.
- Switch Chai
shouldtoexpect: a useful migration example, but it is test-framework-specific and should be applied only where Chai is actually in use. - Speed up barrel imports: strong
rewriters/transform.rewriteexample for splitting one import into many direct imports. Keep it on the CLI skill path because path conventions, default-vs-named exports, and formatting policy vary by repository. - Missing Angular
@Component()decorator: good example of labels plus pattern-objectcontextandselector. Keep framework-specific rules tied to actual framework usage in the repository. - Logical assignment operators: a compact rewrite example for
$A = $A || $Bto$A ||= $B, but only apply it where the project’s JS target and lint policy allow ES2021 operators. - Adapt TypeScript catalog rules to the repository’s module system, framework stack, transpilation target, and lint policy before using them directly.
TSX Catalog Highlights
- TSX vs TypeScript matters for parsing: JSX-bearing patterns should stay on the TSX parser unless the repository intentionally routes
.tsthrough TSX withlanguageGlobs. - Unnecessary
useState<T>primitives: good cleanup rewrite foruseState<string|number|boolean>($A)when the initializer already gives TypeScript enough information to infer the state type. - Avoid
&&short-circuit in JSX: good React-facing rewrite from{cond && <View />}to{cond ? <View /> : null}when the left side can evaluate to renderable falsy values like0. - Rewrite MobX component style: useful migration example when
observer(() => ...)hides React hook linting from tooling. Keep it on the CLI skill path because naming, export shape, and component conventions vary by repository. - Avoid unnecessary React hooks: good diagnostic rule for
use*functions that do not actually call hooks. Treat it as a review rule first, because renaming or de-hooking can be API-affecting. - Reverse React Compiler: clearly rewrite-oriented and intentionally opinionated. Keep it on the CLI skill path and only use it when the user explicitly wants that de-memoization behavior.
- Avoid nested links: good accessibility and correctness scan rule for JSX trees.
- Rename SVG attributes: strong TSX rewrite example for hyphenated SVG attribute names such as
stroke-linecaptostrokeLinecap. Keep it reviewable because generated markup can be formatting-sensitive. - Adapt TSX catalog rules to the repository’s React version, JSX runtime, lint rules, framework conventions, and browser-support target before using them directly.
YAML Catalog Highlights
- YAML scan rules are useful for configuration-policy checks where the repository needs to flag specific keys or values rather than rewrite source code.
- The catalog host/port example is a simple message-oriented rule that matches either
host: $HOSTorport: $PORTand attaches a diagnostic. Treat it as a starting point for config validation, not a complete policy by itself. - For YAML rules, be explicit about whether the repository cares about the key name, the value, or both. If both matter together, move from separate
anypatterns to a more structured rule before relying on the result. - Keep YAML config checks repository-specific. Hard-coded values like
8000are only useful when they reflect an actual project policy.
Ruby Catalog Highlights
- Rails
*_filterto*_action: useful migration rewrite for older Rails controllers. Keep it on the CLI skill path because framework version, controller style, and review expectations vary by repository. - Prefer symbol over proc: good Ruby cleanup rewrite for cases like
.select { |v| v.even? }to.select(&:even?), but only where the shorthand remains readable and matches local Ruby style. - Path traversal detection in Rails: good security-oriented scan rule for
Rails.root.join,File.join, orsend_filefed by variables. Treat it as a review hint, not a proof of exploitability, because the surrounding validation path still matters. - Adapt Ruby catalog rules to the repository’s Rails version, Ruby style guide, and security posture before using them directly.
Python Catalog Highlights
- OpenAI SDK migration: useful multi-rule migration example for legacy
openaiPython client code, but keep it on the CLI skill path because imports, client lifetime, response shapes, and surrounding application logic often need repository-specific review. - Prefer generator expressions: good example of narrowing a rewrite to contexts like
any(...),all(...), orsum(...)where generator expressions are clearly valid. Do not generalize it to every list comprehension. - Walrus operator in
ifstatements: useful paired-rule rewrite example, but only apply it where the repository targets Python 3.8+ and the style guide accepts assignment expressions. - Remove async function: strong
rewritersexample for strippingasyncand innerawait, but treat it as high-risk migration work because it changes call semantics and often requires broader control-flow review. - Pytest fixture refactors: good example of
utils-driven context matching for fixture rename or type-hint updates. Keep it tied to real pytest usage so similarly named non-test code is not swept in. Optional[T]toT | Noneand recursive union rewrites: useful typing-modernization examples, but only where the repository targets Python 3.10+ and static typing policy actually prefers PEP 604 unions.- SQLAlchemy
mapped_columnto annotatedMapped[...]: useful ORM migration example, but keep it on the CLI skill path because ORM version, model style, and nullable semantics need review. - Adapt Python catalog rules to the repository’s Python version floor, framework stack, typing policy, async model, and migration scope before using them directly.
Kotlin Catalog Highlights
- Clean-architecture import checks: good scan-rule example for enforcing architectural boundaries with
filesplus import-path constraints. Treat it as repository-policy enforcement rather than a universal Kotlin rule. - The Kotlin catalog example is diagnostic-oriented, not rewrite-oriented. Keep it on the scan path because import-boundary violations usually need design review instead of blind mutation.
- File-scoped package constraints are the point of the example: adapt the
filesglob and package regexes to the repository’s actual module layout before relying on the result. - Adapt Kotlin catalog rules to the repository’s package naming, architecture boundaries, Android-vs-server structure, and lint ownership before using them directly.
Java Catalog Highlights
- Unused local variable detection: useful educational example for
hasplus orderedallplusprecedes, but prefer the project’s established linter or IDE for real unused-variable enforcement because Java variable scopes are broader than the sample rule covers. - Field declarations of type
String: good structural scan example showing whyfield_declarationplushas: { field: type }is more robust than a naive pattern when modifiers and annotations are present. - The Java catalog examples are primarily search/diagnostic material, not high-confidence autofix rules. Keep them review-oriented unless the repository explicitly wants ast-grep-based cleanup instead of compiler or linter diagnostics.
- Adapt Java catalog rules to the repository’s package conventions, annotation usage, style tooling, and existing static-analysis stack before using them directly.
HTML Catalog Highlights
- HTML parser reuse for framework templates: useful when Vue, Svelte, Astro, or similar files are mostly HTML, but keep parser caveats in mind because framework-specific control flow or frontmatter may require a custom language instead.
- Ant Design Vue
visibletoopen: good framework-specific attribute rewrite using enclosing-tag checks plus constraints. Keep it on the CLI skill path because framework version and component set must be confirmed first. - i18n key extraction: useful template rewrite example for wrapping static text while skipping mustache expressions, but keep it reviewable because real projects usually need key naming, dictionary updates, and whitespace policy beyond the raw rewrite.
- Adapt HTML catalog rules to the repository’s template framework, parser limitations, i18n workflow, and component-library version before using them directly.
Go Catalog Highlights
- Problematic
deferwith nested function calls: strong Go-specific scan example for catching cases where deferred arguments are evaluated immediately. Treat it as a correctness and test-reliability review rule, and prefer a manual rewrite to a closure-wrappeddefer func() { ... }()when the repository agrees with that style. - Function declarations by name pattern: good example of using
kindplushasplusregexwhen a meta-variable pattern cannot express the naming constraint directly, such asTest.*. - Contextual matching for function calls: good illustration of Go parser ambiguity between function calls and type conversions. Use contextual patterns with
selector: call_expressionwhen naive call patterns do not behave as expected. - Package-import detection: useful search/scan rule for dependency auditing, compliance checks, or migration prep. Adapt the import regex to the repository’s actual dependency boundaries instead of hard-coding example packages.
- Problematic JSON tags with
-,: high-signal security-oriented scan rule for Go struct tags. Treat matches as actionable review items because the example represents a real unmarshaling footgun, not just a style preference. - Adapt Go catalog rules to the repository’s Go version, test conventions, package layout, security posture, and existing static-analysis tooling before using them directly.
Cpp Catalog Highlights
- Reuse Cpp rules for C only when the repository intentionally parses C sources as Cpp via
languageGlobs; do not assume mixed C/C++ projects want that parser tradeoff by default. - Format-string vulnerability rewrite: strong security-oriented example for
fprintf/sprintf-style calls missing an explicit format string. Keep it reviewable because real C/C++ codebases may prefer safer API migrations over mechanical"%s"insertion in some contexts. - Struct inheritance matching: useful example of AST-shaped pattern authoring in C++, especially when a shorter surface pattern produces an
ERRORnode. Use the full structural form or fall back to a YAML rule when the syntax is too incomplete to parse reliably. - Adapt Cpp catalog rules to the repository’s C-vs-C++ parser choice, security posture, libc usage, and coding-standard expectations before using them directly.
C Catalog Highlights
- Parsing C as Cpp can reduce duplicated rule authoring, but only use that route when the repository intentionally opts into the parser tradeoff with
languageGlobs; do not blur C and C++ semantics by default. - Match function calls in C with contextual patterns: good example of tree-sitter-c ambiguity around fragments like
test($A). Usecontextplusselector: call_expressionwhen a plain call pattern under-parses or turns intomacro_type_specifier. - Rewrite method-style calls to function calls: useful migration example for C codebases that emulate methods with structs or function pointers, but keep it on the CLI skill path because it changes calling conventions and may affect ownership, pointer semantics, or naming policy.
- Yoda-condition rewrite: clearly style-driven and repository-policy-sensitive. Treat it as optional rewrite material only where the project explicitly prefers constant-on-the-left comparisons.
- Adapt C catalog rules to the repository’s parser choice, macro usage, pointer conventions, coding style, and safety policy before using them directly.
JavaScript API Highlights
- Use
@ast-grep/napionly when rule YAML or VT Code’s public structural tool is not enough. The programmatic API is the right escalation path for computed replacements, ordered-match logic, cross-node inspection, or edit orchestration that would be awkward in pure rule syntax. - Core objects are
SgRootandSgNode:parse(Lang.<X>, source)creates the tree,root()returns the root node, andfind/findAll/ traversal / refinement / edit APIs live onSgNode. Matcherinputs can be pattern strings, numeric kind ids, orNapiConfigobjects. Prefer patterns orNapiConfigunless there is a concrete reason to drop to raw kind ids.getMatchandgetMultipleMatchesexpose captured metavariables, butreplacedoes not interpolate metavariables for you. Build replacement strings explicitly in JavaScript from matched nodes before callingcommitEdits.- Keep VT Code’s boundary clear: prefer the public structural tool or CLI path for ordinary query/scan/test/rewrite flows, and only drop to NAPI when the task is genuinely programmatic.
registerDynamicLanguageand extra language packages exist, but that path is still experimental. Prefer established parsers and repo-native tooling unless dynamic-language support is actually needed.
Python API Highlights
- Use
ast-grep-pywhen the task needs programmatic AST traversal or computed edits but a Python host environment is a better fit than JavaScript or Rust. As with NAPI, prefer it only after rule YAML or VT Code’s structural/CLI path stops being a good fit. - Core objects are again
SgRootandSgNode:SgRoot(source, language)parses the source,root()returns the root node, and search, refinement, traversal, and edit APIs live onSgNode. findandfind_allsupport either direct rule keyword arguments or a config object. Prefer keyword-rule searches for simple cases and config objects when constraints or utility rules make the query more expressive.get_match,get_multiple_matches, and__getitem__expose captured metavariables.__getitem__is useful when you want a stricter access pattern and are willing to let missing captures raise instead of returningNone.replaceandcommit_editsgenerate source edits, but they do not interpolate metavariables for you. Build replacement text explicitly from matched nodes before applying edits.- Keep VT Code’s boundary clear here too: use Python API only for genuinely programmatic transformations, not as a default substitute for public structural queries or ordinary CLI rewrites.
NAPI Performance Highlights
- NAPI is not automatically faster than host-language traversal. Performance usually comes from reducing Rust↔JavaScript FFI crossings and letting ast-grep do more work per boundary crossing.
- Prefer
parseAsyncoverparsewhen many parse jobs can benefit from Node’s libuv thread pool and the main JS thread is already busy. - Prefer
findAllover manual recursive traversal in JavaScript. One bulk Rust-side search is usually cheaper than repeatedkind(),children(), and recursion calls across the FFI boundary. - Prefer
findInFileswhen scanning many files and you can use its file-path-oriented search model. It avoids unnecessary round-tripping source strings through JavaScript and can parallelize work in Rust threads. findInFileshas a callback-completion caveat: its returned promise can resolve before all callbacks run. If completion ordering matters, track callback counts explicitly before treating the scan as finished.- Apply these performance tips only when scale justifies them. For small inputs or one-off transformations, simpler synchronous code is often the better tradeoff.
Rule Essentials
- Start rule files with
id,language, and rootrule. - Treat the root
ruleas a rule object that matches one target AST node per result. - A rule object still needs a positive anchor. In practice, start with
patternorkind;regexis a filter, not a sufficient root rule by itself. - Atomic rules are
pattern,kind,regex,nthChild, andrange. - Use atomic fields such as
pattern,kind,regex,nthChild, andrangefor direct node checks. - Use relational fields such as
inside,has,follows, andprecedeswhen the match depends on surrounding nodes. - Use composite fields such as
all,any,not, andmatchesto combine sub-rules or reuse utility rules. - Rule object fields are effectively unordered and conjunctive; if matching becomes order-sensitive, rewrite the logic with an explicit
allsequence instead of assuming YAML key order matters. languagecontrols how patterns parse. Syntax that is valid in one language can fail in another.
Rule Cheat Sheet
- Atomic rules check properties of one node. Start here when a single syntax shape is enough.
pattern,kind, andregexare the common atomic fields.patterncan also be an object withcontext,selector, and optionalstrictness.- Pattern objects are for invalid, incomplete, or ambiguous snippets.
contextis required;selectorpicks the real target node inside that context;strictnesstunes how literally the pattern matches. - Use pattern objects when the bare snippet would parse as the wrong node kind, such as JavaScript class fields or Go/C call expressions inside ambiguous fragments.
kindis usually a plain node kind name, but ast-grep also supports a limited ESQuery-style syntax for somekindselectors.- Separate
kindandpatternchecks do not change how the pattern is parsed. If parse shape is the problem, switch to one pattern object withcontextandselector. regexmatches the whole node text. Reach fornthChildwhen position among named siblings matters andrangewhen the match must be limited to a known source span.- Regex syntax follows Rust
regex, not PCRE. Do not assume look-around or backreferences are available, and usually pairregexwithkindorpatternso the expensive text check only runs on the right node shapes. nthChildaccepts a number, anAn+Bstring, or an object withposition,reverse, andofRule. Counting is 1-based and only considers named siblings.rangematches by source position with 0-basedlineandcolumn;startis inclusive andendis exclusive.- Relational rules describe structure around the target node. Use
inside,has,follows, andprecedeswhen the match depends on ancestors, descendants, or neighboring nodes. - Read relational rules as: target node relates to surrounding node. The top-level rule still matches the target; the relational subrule matches the surrounding node that filters it.
- Relational subrules can themselves use
pattern,kind, composites, and captures. Those captures can still be referenced later infix, which is a practical way to extract surrounding syntax while keeping the target node as the match. - Add relational
fieldwhen the surrounding node matters by semantic role, not just by shape.fieldonly applies toinsideandhas. - Add
stopBywhen ancestor or sibling traversal must continue past the nearest boundary instead of stopping early. The default isneighbor,endsearches to the boundary, and a rule object stop is inclusive. insidemeans the target is somewhere under a matching ancestor,hasmeans the target node contains a matching descendant,followsmeans the target comes after a matching sibling or prior node, andprecedesmeans it comes before one.- Composite rules combine checks for the same target node. Use
allfor explicit conjunction,anyfor alternatives,notfor exclusions, andmatchesto delegate to a utility rule. allandanystill operate on one target node. They combine sub-rules, not multiple matched nodes.allis the ordered composite. Use it when later checks depend on captures established by earlierpatternmatches, because YAML rule-object field order is not guaranteed.anyis for alternatives, not for "collect all matching nodes". If one node cannot satisfy every branch at once,allis the wrong operator even if the surrounding structure feels plural.- Nested composites still evaluate one node at a time. A rule like
has: { all: [...] }means "has one child satisfying every listed rule", not "has one child for each listed rule". - When you need "a node that has X child and has Y child", write
all: [ { has: ... }, { has: ... } ]on the outer target instead of putting incompatible checks inside one nestedall. - Utility rules keep repeated logic out of the main rule body. Use file-local
utilsfor one config file and global utility-rule files when multiple rules in the project need the same building block. - Local utility rules live under the current file's
utilsmap, are only visible in that one config file, inherit the file's language, and cannot define their own separateconstraints. - Global utility rules live in separate files discovered through
utilDirs. They must declareidandlanguage, and can only use the utility-safe fields:id,language,rule,constraints, and localutils. - Local utility names must be unique inside one file. A local utility can shadow a global utility with the same name, so check the nearest file first when
matchesseems to resolve unexpectedly. - Utility rules can call other utility rules through
matches, including recursive structural tricks like nested-parentheses matching. Avoid cyclicmatchesdependency graphs, because ast-grep does not allow recursive cycles there. - Self-reference through relational structure such as
insideorhasis different from cyclicmatchesreuse and is allowed when the AST traversal still makes progress. - Switch from a single
patternto a rule object when you need positional constraints, role-sensitive matching, reusable sub-rules, or several structural conditions on one node. - Rule-object fields are logically equivalent to an
allacross those fields, but not to an orderedall. Keep explicitallwhen capture order matters; use rule-object style when the checks are independent and flatter indentation helps readability.
Config Cheat Sheet
- Basic info keys define the rule itself. Use
idfor the unique rule name,languagefor the parser target,urlfor rule documentation, andmetadatafor custom project data that VT Code should preserve with the rule. - One YAML file can hold multiple rules when you separate documents with
---. - Finding keys define what gets matched.
ruleis the core matcher,constraintsnarrows meta-variable captures, andutilsholds reusable helper rules that you call throughmatches. utilscan be purely local to the current file or can supplement global utility-rule files loaded throughutilDirs. Keep shared building blocks global only when multiple rule files genuinely need them.constraintsruns afterrulematched, only targets single meta variables like$ARG, and is a poor fit insidenot.- Patching keys define reusable fixes. Use
transformto derive new meta-variables before replacement,fixfor either a string replacement or atemplateobject withexpandStart/expandEnd, andrewriterswhen the transformation is too complex for one inlinefix. - Linting keys define what scan results report. Use
severity,message,note, andlabelsfor diagnostics, thenfilesandignoresto scope where the rule applies. - Severity levels are
error,warning,info,hint, andoff.hintis the default severity in ast-grep project scans. errorfindings make rawast-grep scanexit non-zero; VT Code normalizes that CLI behavior into structured findings on the public scan path instead of surfacing a tool error.severity: offdisables the rule during scanning.notesupports Markdown but cannot interpolate meta variables.- Source suppression uses
ast-grep-ignorecomments.ast-grep-ignoresuppresses all rules for the same line or following lineast-grep-ignore: rule-idsuppresses one rule- comma-separated rule ids suppress multiple specific rules
- next-line suppression only works when there is no preceding AST node on that same comment line
- File-level suppression requires the suppression comment on the first line plus an empty second line.
unused-suppressionis a built-in hint-style rule with autofix for stale ignore directives, but it only appears in fullscanruns when ast-grep is not filtering or disabling rules through CLI narrowing flags.labelskeys must come from meta variables already defined by the rule orconstraints.filessupports either plain globs or object entries. Use object syntax when you need options likecaseInsensitiveglob matching.ignoresruns beforefiles. Both are relative to thesgconfig.ymldirectory, and the glob should not start with./.- Rule-level
ignoresis different from CLI--no-ignore: the CLI flag changes global ignore-file behavior, while YAMLignoresonly filters files for that rule. - JSON output only includes rule
metadatawhen the ast-grep run enabled metadata output, for example via--include-metadata. - Keep config authoring on the ast-grep skill path. VT Code’s public structural tool runs read-only query/scan/test workflows; it does not expose rule-YAML authoring fields directly.
Transformation Objects
transformbuilds new strings from captured meta variables beforefixruns.- Each
transformentry introduces a new variable name without a leading$. Inside the transform object,sourcestill points at an existing capture or prior transform result using the normal$VARform. - Later transforms can consume variables created by earlier transforms, so transform order matters when you are stacking multiple string operations.
replaceuses a Rust regex over one meta variable.sourcemust be$VARstyle,replaceis the regex,byis the replacement text, and regex capture groups can be reused inby.- Regex capture groups are only available inside the
replacefield of thereplacetransform and can only be referenced from thebyfield of that same transform. Regularregexrules do not expose those capture groups. substringslices a meta variable by character index with inclusivestartCharand exclusiveendChar. Negative indexes count from the end, and slicing is based on Unicode characters rather than raw bytes.substringbehaves like Python string slicing, so omit either bound when the slice should stay open-ended.convertchanges identifier-style casing throughtoCase. Common outputs arelowerCase,upperCase,capitalize,camelCase,snakeCase,kebabCase, andpascalCase.- Use
separatedByto control howconvertsplits words before rebuilding the target case. Supported separators include dash, dot, space, slash, underscore, andCaseChange. CaseChangesplits at transitions such asastGrep,ASTGrep, orXMLHttpRequest, which matters when converting mixed acronym identifiers.- Use
replacetransforms for conditional punctuation or whitespace when a multi-capture may be empty. The common pattern is derivingMAYBE_COMMAor similar from$$$ARGSso the extra separator only appears when matches exist. - String-form transforms such as
replace(...),substring(...),convert(...), andrewrite(...)are valid shorthand in newer ast-grep versions, but keep examples explicit when debugging. - String-form transforms require ast-grep 0.38.3 or newer. Prefer object form when compatibility or debugging clarity matters.
Rewriters
rewritersis an experimental feature for advanced multi-node rewrites. Prefer ast-grep’s API instead when the YAML starts carrying too much control flow or state.- A rewriter is a smaller rule object with only
id,rule,constraints,transform,utils, andfix.id,rule, andfixare required. - Rewriters are only usable through the
rewritetransformation. They are not standalone scan/report rules. - Meta variables captured inside one rewriter do not leak to sibling rewriters or the outer rule.
- Rewriter-local
transformvariables andutilsare also scoped to that one rewriter. - A rewriter transform can call other rewriters from the same
rewriterslist when the rewrite pipeline needs multiple internal passes.
Pattern Syntax
- Pattern code must be valid code that tree-sitter can parse.
- Patterns match syntax trees, so a query can match nested expressions instead of only top-level text.
$VARmatches one named AST node.$$$ARGSmatches zero or more AST nodes in places like arguments, parameters, or statements.- Reusing the same captured name means both occurrences must match the same syntax.
- Prefixing a meta variable with
_disables capture, so repeated$_Xoccurrences do not need to match the same content. $$VARcaptures unnamed nodes when named-node matching is too narrow.- If a short snippet is ambiguous, move to an object-style pattern with more
contextplusselectorinstead of guessing.
Pattern Parsing Deep Dive
- Pattern creation has four stages: preprocess meta-variable text when the language needs a custom
expandoChar, parse the snippet, choose the effective node, then detect meta variables inside that effective node. - Invalid pattern code usually fails because a meta variable is standing in for syntax that the parser treats as an operator or keyword. Patterns like
$LEFT $OP $RIGHTor{ $KIND foo() {} }should become rule objects using parseable code pluskind,regex,has, or other rule fields. - Incomplete or ambiguous snippets can appear to work only because tree-sitter recovered from an error. Treat that as best-effort behavior, not a stable contract across ast-grep upgrades, and prefer valid
contextplusselector. - The default effective node is the leaf node or the innermost node with more than one child. Override it with
selectorwhen the real match should be a statement instead of the inner expression, especially forfollowsandprecedes. - Meta variables are detected only when the whole AST node text matches meta-variable syntax. Mixed text like
obj.on$EVENT, lowercase names like$jq, or string-content fragments do not become meta variables. $$VARcaptures unnamed nodes such as operators when the grammar exposes them only as anonymous tokens.$$$ARGSis lazy: it stops before the next node that satisfies the rest of the pattern.- When pattern behavior is surprising, inspect the parsed tree and effective node first. In VT Code, start with public structural
debug_query; in the Playground, use the pattern view for the same questions.
Pattern Core Concepts
- ast-grep is structural search, not plain text search.
patternmatches syntax-tree shape, whileregexis the escape hatch when node text itself matters. - Tree-Sitter gives ast-grep a concrete syntax tree, not a stripped-down abstract syntax tree. That CST detail is why punctuation and modifiers can still matter even when matching stays syntax-aware.
- Named nodes carry a
kind; unnamed nodes are punctuation or literal tokens. Meta variables match named nodes by default, and$$VARis the opt-in when unnamed nodes matter. kindbelongs to the node itself.fieldbelongs to the parent-child relationship, so use relationalhasorinsidewithfieldwhen role matters more than raw node kind.- A node is significant to ast-grep when it is named or has a
field. Trivial nodes can still matter for exact matching, so do not assume every important token has its own named node.
Match Algorithm
- The default strictness is
smart. Every node you spell out in the pattern is respected, but unnamed nodes in the target code can be skipped. - Unnamed nodes written in the pattern are not skipped. A shorter pattern like
function $A() {}can matchasync function, whileasync function $A() {}requiresasyncto be present. - Use strictness to tune what ast-grep may skip during matching.
cst: skip nothingsmart: default, skip unnamed nodes in code onlyast: skip unnamed nodes on both sidesrelaxed: also skip commentssignature: ignore text and compare mostly named-node kinds
- This explains why quote differences can disappear under
ast, comments can disappear underrelaxed, and even different callee text can match undersignature. - In VT Code, read-only structural queries already expose
strictness. Use the bundled skill when the task is choosing between levels, or when the user needs raw CLI--strictnessor YAML pattern-objectstrictness.
Custom Languages
- Use custom language support when the parser exists in tree-sitter form but ast-grep does not ship it as a built-in language.
- The basic workflow is:
- install
tree-sitterCLI and obtain the grammar - compile the parser as a shared library
- register it in workspace
sgconfig.ymlundercustomLanguages
- install
- Prefer
tree-sitter build --output <lib>to compile the dynamic library. If the installed tree-sitter is too old forbuild, useTREE_SITTER_LIBDIRwithtree-sitter testas the fallback path. - Reusing a parser library built by Neovim is valid when it already matches the grammar/version you need.
- Register
libraryPath,extensions, and optionalexpandoCharinsgconfig.yml.expandoCharmatters when$VARis not valid syntax in the target language and must be rewritten to a parser-friendly prefix. - Use
tree-sitter parse <file>to inspect parser output when the custom grammar or file association is unclear. - VT Code’s public structural queries can use a custom language only after the local ast-grep project config is in place. The setup, compilation, and debugging work stays on the bundled ast-grep skill path.
Language Injection
- ast-grep can search embedded languages inside a host document. Built-in injection already covers HTML with CSS in
<style>and JavaScript in<script>. - Use
languageInjectionsinsgconfig.ymlwhen the embedded language is project-specific, such as CSS inside styled-components or GraphQL inside tagged template literals. - A
languageInjectionsentry needshostLanguage, arule, andinjected. Theruleshould capture the embedded subregion with a meta variable such as$CONTENT. - Typical patterns are
styled.$TAG\$CONTENT`` for CSS-in-JS andgraphql\$CONTENT`` for GraphQL template literals. - ast-grep parses the extracted subregion with the injected language, not the parent document language. That is why CSS patterns can match inside JavaScript once injection is configured.
- Use
languageGlobswhen the whole file should be parsed as a different or superset language. UselanguageInjectionswhen only a nested region inside the file changes language. - In VT Code, read-only structural query / scan / test can consume existing injection config. Designing or debugging
languageInjectionsitself stays on the bundled ast-grep skill path.
FAQ Highlights
- If a pattern fragment fails, the usual fix is to provide more valid
contextand then narrow the real target withselector. This is the standard workaround for subnodes like JSON pairs or class fields that are not standalone code. - If a rule behaves strangely, reduce it to the smallest repro, confirm whether it is matching an expression or a statement, and use
allto make rule order explicit when later checks depend on earlier meta-variable captures. - CLI and Playground can disagree because parser versions and text encodings differ. In VT Code, prefer the public structural
debug_queryflow first, then compare the parsed AST or CST before assuming the rule is wrong. - Meta variables must occupy one whole AST node.
use$HOOKand similar prefix/suffix patterns will not work; capture the full node and narrow it withconstraints.regexinstead. Use$$VARfor unnamed nodes, and remember that$$$MULTIis lazy. - Do not combine separate
kindandpatternrules to force a different parse shape. Use one pattern object withcontextandselectorso the parser sees the intended node kind. - ast-grep rules are single-language. Share coverage across related languages by parsing both with the superset via
languageGlobs, or keep separate rules when the AST differences matter. - ast-grep does not provide scope, type, control-flow, data-flow, taint, or constant-propagation analysis. If the task needs those, switch tools instead of stretching rule syntax.
Find & Patch
- ast-grep rewrites are still find first, patch second.
ruleplus optionalconstraintsfinds the target,transformderives replacement strings, andfixpatches the final text. - The simple workflow rewrites one matched node at a time. When one node must expand into multiple outputs, use
rewritersplustransform.rewriteinstead of forcing everything into one inlinefix. transform.rewritelets you run sub-rules over a matched meta-variable, generate one fix per sub-node, and join the results withjoinBy.transform.rewriteis still experimental. It rewrites descendants of the captured source, prevents overlapping rewriter matches, prefers higher-level AST matches first, and for one node only applies the first matching rewriter in the declared order.- This is the right model for list-style rewrites such as exploding a barrel import into multiple single imports, and it is the canonical example for rewriter usage.
- Keep this declarative workflow on the ast-grep skill path. VT Code’s public structural surface stays read-only and does not expose rewrite/apply behavior.
Rewrite Essentials
- Use
ast-grep run --pattern ... --rewrite ...for one-off rewrites. - Use YAML
fixin rule files for reusable rewrites that should live with the rule. - Use
--interactiveto review rewrite hunks before applying them. - Use
--update-allor-Uonly when the user clearly wants non-interactive apply behavior. - Meta variables captured in
patterncan be reused infix. - String
fixis raw replacement text, not a parsed Tree-Sitter pattern. Meta variables can appear anywhere in the replacement string. fixindentation is preserved relative to the matched source location, so multiline rewrites must be authored with deliberate indentation.- Non-matched meta variables become empty strings in rewritten output.
- If appended uppercase text would be parsed as part of a meta variable name, use transforms instead of writing
$VARNamedirectly. - Use
transform.rewritewhen a matched list must be rewritten element-by-element before the outerfixruns. - Use
joinByto control how rewritten list items are stitched together, for example newline-joined imports in a barrel-import rewrite. - Use
FixConfigwhen replacing only the matched node is not enough, especially for deleting list items or key-value pairs that also need a surrounding comma removed. - In
FixConfig,templateis the replacement text andexpandStart/expandEndwiden the rewritten range to consume commas, brackets, or other surrounding trivia outside the target node. - Keep advanced
transformandrewritersin the skill-driven CLI workflow.
Run Command Basics
ast-grep -p 'foo()'andast-grep run -p 'foo()'are equivalent.runis the default subcommand.ast-grep rundefaults to searching.when no path is provided and can search multiple paths in one invocation.--globsincludes or excludes paths and overrides ignore-file behavior. Prefix a glob with!to exclude, and let later globs win when multiple patterns match.--no-ignorechanges which ignore sources ast-grep respects. The supported categories arehidden,dot,exclude,global,parent, andvcs.--followmakes ast-grep traverse symlinks. Expect loop or broken-link errors to surface directly from the CLI when the filesystem is invalid.
Scan Command Basics
ast-grep scandefaults to searching.when no path is provided and can search multiple paths in one invocation.--config <file>points scan at a projectsgconfig.ymlroot. It is the default scan mode in VT Code’s publicworkflow="scan"surface.--rule <file>runs one YAML rule file without project setup and conflicts with--config.--inline-rules '...'runs one or more inline YAML rules without creating a file on disk. Separate multiple rules with YAML---. It conflicts with--rule.--filter <regex>narrows project-config scan to matching rule ids and conflicts with--rule.--include-metadataonly affects JSON output and is already enabled on VT Code’s public scan path so normalized findings can carry rule metadata.
Test Command Basics
ast-grep testvalidates rule tests from the ast-grep project config.- Rule test files are YAML with
id,valid, andinvalid.validcases should produce no issue;invalidcases should produce at least one issue. - Ast-grep’s test output distinguishes four outcomes:
reported: invalid code correctly reportsvalidated: valid code correctly stays quietnoisy: valid code reported unexpectedlymissing: invalid code was not reported
--config <file>points test execution at a specific ast-grep root config.--test-dir <dir>narrows where test YAML files are discovered.--snapshot-dir <dir>changes the snapshot directory name from the default__snapshots__.--filter <glob>narrows which rule test cases run.--skip-snapshot-testschecks test validity without snapshot-output assertions. VT Code exposes this one on the publicworkflow="test"path.--include-offincludesseverity: offrules during test runs.--update-allgenerates or refreshes snapshot baselines, usually under__snapshots__/.--interactiveis for selective snapshot updates after rule or test changes.- Snapshot tests cover output details such as spans, labels, or message rendering in addition to simple valid/invalid matching, so
--skip-snapshot-testsis useful while a rule is still evolving.
Other Commands
ast-grep new [project|rule|test|util]scaffolds a project or individual items. Common flags are--lang,--yes,--base-dir, and an optional itemNAME.ast-grep lspstarts the language server and accepts an optional--config <file>.ast-grep completions [shell]generates shell completion scripts forbash,elvish,fish,powershell, orzsh.ast-grep helpandast-grep --helpare the authoritative command-discovery entry points when the exact subcommand or flags are in doubt.
CLI Modes
--interactiveis for reviewing rewrite hunks one-by-one; ast-grep’s interactive controls arey,n,e, andq.--json=pretty|stream|compactis for raw ast-grep JSON output when the user needs native ast-grep payloads or shell pipelines.prettyis the default if a style is not specified. Prefer VT Code’s normalized structural results when those are sufficient.- Raw ast-grep JSON match objects include fields such as
text,range,file,lines, optionalreplacement, optionalreplacementOffsets, and optionalmetaVariables. Scan-mode rule matches add fields likeruleId,severity,message, and optionalnote. - ast-grep JSON positions are zero-based for line, column, and byte offsets. Keep that convention in mind when translating payloads into editor-facing or user-facing locations.
--json=streamemits one JSON object per line and is the better fit for large pipelines;prettyandcompactemit one JSON array and are easier to inspect but less streaming-friendly.--json=<STYLE>must use the equals-sign form.--json streamis parsed as plain--jsonplus an extra positional argument, not as--json=stream.--stdinis for piping code into ast-grep. It conflicts with--interactive.ast-grep run --stdinrequires an explicit--langbecause stdin has no file extension for language inference.ast-grep scan --stdinonly works with one single rule via--rule/-r.--stdinonly activates when the flag is present and ast-grep is not running in a TTY.--heading=auto|always|neveronly changes the human-readable text layout. It does not matter when VT Code is already consuming structured JSON.--color=auto|always|ansi|neveronly controls terminal coloring. VT Code’s public structural query forces plain output with--color=never.--format=github|sarifis for CI/reporting pipelines, not VT Code’s normalized public scan result shape.--report-style=rich|medium|shortonly changes ast-grep’s human-readable diagnostics.--error,--warning,--info,--hint, and--offoverride rule severities for one scan run. These flags belong on the CLI skill path, not VT Code’s public structural surface.--inspect entityis the direct CLI way to inspect each rule’s final enabled severity, including overrides and project-config effects.unused-suppressioncan also have its severity overridden on the CLI, but that is still CLI-only behavior outside VT Code’s public structural surface.--inspect=summary|entityemits file and rule discovery diagnostics to stderr without changing the actual match results.--threads <NUM>controls approximate parallelism.0keeps ast-grep’s default heuristics.-C/--contextshows symmetric surrounding lines.-A/--afterand-B/--beforeare asymmetric alternatives and conflict with--context.ast-grep runexits0when at least one match is found and1when no matches are found. VT Code normalizes that no-match case to an emptymatchesarray on the public structural query path.ast-grep scanexits1when at least one error-severity rule matches and0when no rules match. VT Code normalizes that error-finding case to structuredfindingsinstead of surfacing a tool error.
API Escalation
- Do not force complex transformations into rule syntax when the task needs arbitrary AST inspection or computed replacements.
- Escalate to ast-grep’s library API when the task needs conditional replacement logic, counting or ordering matched nodes, per-node patch generation, or replacement text computed from matched content and surrounding nodes.
- Node.js NAPI is the main experimental API surface today. The common entry points are
parse,kind, andpattern, and the main objects areSgRootandSgNode. - In NAPI,
parse(Lang.<X>, source)returnsSgRoot,root()returnsSgNode, and traversal/search APIs likefind,findAll,field,parent,children,matches,inside,has,replace, andcommitEditslive onSgNode. NapiConfigis the programmatic equivalent of rule YAML forfind/findAll, andFindConfigis the config shape for file-based searching.- Python bindings expose the same general model with
SgRoot(src, language)plusSgNodemethods for rule checks, traversal, searching, and edit generation. - JS language-specific objects like
js.parse(...)are deprecated; prefer the unified NAPI functions withLang.JavaScript. - Rust
ast_grep_coreis the lowest-level and most efficient option, but also the heaviest lift. - Applying ast-grep
fixthrough the JS/Python APIs is still experimental, so prefer generating explicit patches in code when reliability matters. - If the target language has no suitable JS/Python parser path for the desired automation, prefer a Rust implementation or another repo-native AST approach instead of overcomplicating ast-grep rules.
Use unified_exec For
ast-grep --helpast-grep newast-grep new ruleast-grep scan -r <rule.yml> <path>ast-grep scan --rule <file> <path>ast-grep scan --inline-rules '...' <path>ast-grep run --pattern <pattern> --rewrite <rewrite>ast-grep run --jsonast-grep run --stdin --lang <lang>ast-grep run --no-ignore hidden --followast-grep run --inspect summary --threads 4ast-grep run --context 2ast-grep scan --format sarif --report-style shortast-grep scan --error=rule-idast-grep scan --stdin --rule <rule.yml>ast-grep test --config sgconfig.yml --filter 'rust/*'ast-grep test --test-dir rule-tests --snapshot-dir __snapshots__ast-grep test --include-off --update-allast-grep lspast-grep completionsast-grep new project --base-dir . --yesast-grep new rule my-rule --lang rustast-grep help- ast-grep GitHub Action setup
- ast-grep programmatic API experiments and library examples
- System package-manager install commands when the user explicitly wants them
sg new- Rewrite or apply flows
- Interactive ast-grep flags
- Raw ast-grep color control such as
--color never transform,rewrite,joinBy, orrewriters- Non-trivial
sgconfig.ymlauthoring or debugging - Rule authoring tasks that need direct ast-grep CLI iteration beyond public scan/test
Read More
- Read references/project-workflows.md when you need the boundary between public scan/test support and skill-driven CLI work, or when you need a quick reminder of ast-grep pattern and rule essentials.
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
cmd-review
Review the current diff or selected files (usage: /review [--last-diff|--target <expr>|--file <path>|files...] [--style <style>])
cmd-command
Run a terminal command (usage: /command <program> [args...])
skill-installer
Install VT Code skills into ~/.agents/skills from a curated list or a GitHub repo path. Use when a user asks to list installable skills, install a curated skill, or install a skill from another repo, including private repos.
cmd-analyze
Perform comprehensive codebase analysis and generate reports (usage: /analyze [full|security|performance])
skill-creator
Guide for creating effective skills. This skill should be used when users want to create a new skill (or update an existing skill) that extends VT Code's capabilities with specialized knowledge, workflows, or tool integrations.
kirby-i18n-workflows
Manages Kirby multi-language workflows, translations, and localized labels. Use when dealing with languages, translation keys, placeholders, or importing/exporting translations.
Didn't find tool you were looking for?