Advanced Commands - LLM Checker

Commands Requiring sql.js

The following commands require the optional sql.js package for SQLite database access:

npm install sql.js

sync

Downloads the full Ollama model catalog and stores it in a local SQLite database. Run this before using `search` or `smart-recommend`.

llm-checker sync
llm-checker sync --force          # Force full sync
llm-checker sync --incremental    # Only sync new/updated models
llm-checker sync --quiet          # Suppress progress output

Flag	Description
`-f, --force`	Force full resync even if recent data exists
`--incremental`	Only sync new and updated models
`-q, --quiet`	Suppress all progress output

smart-recommend

Advanced recommendations using the full scoring engine and the local SQLite model database. Reports the best, fastest, and highest-quality picks, each with a score breakdown.

llm-checker smart-recommend
llm-checker smart-recommend --use-case reasoning
llm-checker smart-recommend --use-case coding --limit 5 --target-tps 30

Flag	Description
`-u, --use-case`	Optimize for: `general`, `coding`, `chat`, `reasoning`, `creative`, `fast`
`-l, --limit`	Maximum number of recommendations (default: `5`)
`--target-tps`	Target tokens per second (default: `20`)
`--target-context`	Target context length (default: `8192`)
`--include-vision`	Include multimodal models
`--include-embeddings`	Include embedding models
`-j, --json`	Output as JSON
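
Because smart-recommend reads the database that sync populates, the two commands are often chained. The following is a minimal sketch assuming a POSIX shell. The function name `refresh_and_recommend` and the `LLM_CHECKER` override variable are hypothetical conveniences, not part of the CLI; only flags documented above are passed.

```shell
# Hypothetical wrapper: refresh the local catalog, then ask for recommendations.
# LLM_CHECKER is an assumed override so the binary can be swapped out (for a stub, say).
LLM_CHECKER="${LLM_CHECKER:-llm-checker}"

refresh_and_recommend() {
  use_case="${1:-general}"
  limit="${2:-5}"

  # Pull only new and updated models, without progress output.
  "$LLM_CHECKER" sync --incremental --quiet || return 1

  # Query the synced database and emit JSON for downstream tooling.
  "$LLM_CHECKER" smart-recommend --use-case "$use_case" --limit "$limit" --json
}
```

Calling `refresh_and_recommend coding 3` would run the sync first and stop early if it fails, so recommendations are never computed against a database that could not be refreshed.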

Hardware Simulation

simulate

Simulates a hardware profile to show which LLM models would be compatible with a different system, without access to that hardware. Supports both preset profiles and fully custom hardware configurations.

# List all available preset profiles
llm-checker simulate --list

# Use a preset profile
llm-checker simulate --profile rtx4090
llm-checker simulate --profile m4pro24 --use-case coding

# Custom hardware configuration
llm-checker simulate --gpu "RTX 5060" --ram 32 --cpu "AMD Ryzen 7 5700X"
llm-checker simulate --gpu "RTX 4090" --ram 64
llm-checker simulate --ram 16

Flag	Description
`-p, --profile <name>`	Preset hardware profile to simulate (e.g., `rtx4090`, `m4pro24`, `h100`)
`-l, --list`	List all available hardware profiles
`--gpu <model>`	Custom GPU model (e.g., `"RTX 5060"`, `"RX 7800 XT"`, `"Apple M4 Pro"`)
`--ram <gb>`	Custom RAM in GB
`--cpu <model>`	Custom CPU model
`--vram <gb>`	Override GPU VRAM in GB (auto-detected from GPU model if omitted, requires `--gpu`)
`-u, --use-case <case>`	Use case for scoring (default: `general`)
`--optimize <profile>`	Optimization profile (default: `balanced`)
`--limit <number>`	Number of models to show (default: `1`)
`--no-verbose`	Disable progress output

Run `llm-checker simulate --list` to see all built-in profiles, including laptop, desktop, workstation, and data center tiers.
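
When weighing a hardware purchase, it can help to run the same use case against several presets in one pass. This is a sketch assuming a POSIX shell. The helper name `compare_profiles` and the `LLM_CHECKER` override variable are hypothetical, and the `--limit 3` value is an arbitrary choice for readability.

```shell
# Hypothetical helper: run one use case against several preset profiles.
LLM_CHECKER="${LLM_CHECKER:-llm-checker}"

# Usage: compare_profiles <use-case> <profile> [<profile> ...]
compare_profiles() {
  use_case="$1"
  shift
  for profile in "$@"; do
    echo "== $profile =="
    # A small limit and no progress output keep the reports easy to compare.
    "$LLM_CHECKER" simulate --profile "$profile" --use-case "$use_case" --limit 3 --no-verbose
  done
}
```

For example, `compare_profiles coding rtx4090 m4pro24 h100` would print one short report per profile, using the preset names mentioned in the flag table.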

Catalog & Discovery

list-models

Lists models from the Ollama model database, with filtering options. This command does not require sql.js; it uses the built-in curated catalog.

llm-checker list-models
llm-checker list-models --category coding
llm-checker list-models --popular
llm-checker list-models --size small
llm-checker list-models --json

Flag	Description
`-c, --category <category>`	Filter by category: `coding`, `talking`, `reading`, `reasoning`, `multimodal`, `creative`, `general`
`-s, --size <size>`	Filter by size: `small`, `medium`, `large`, or specific (e.g., `"7b"`, `"13b"`)
`-p, --popular`	Show only popular models (>100k pulls)
`-r, --recent`	Show only recently updated models (last 30 days)
`--limit <number>`	Limit number of results (default: `50`)
`--full`	Show full details including variants and tags
`--json`	Output as JSON

ollama

Manages the Ollama integration and checks its availability.

llm-checker ollama
llm-checker ollama --list
llm-checker ollama --running
llm-checker ollama --compatible
llm-checker ollama --recommendations

Flag	Description
`-l, --list`	List installed models with compatibility scores
`-r, --running`	Show running models with performance data
`-c, --compatible`	Show only hardware-compatible installed models
`--recommendations`	Show installation recommendations

Hardware Tools

gpu-plan

Multi-GPU placement advisor. Computes single-GPU and pooled VRAM envelopes, recommends a placement strategy, and prints ready-to-paste environment variables.

llm-checker gpu-plan
llm-checker gpu-plan --model-size 14    # Validate a 14GB model
llm-checker gpu-plan --json

Flag	Description
`--model-size`	Target model size in GB to validate against the plan
`-j, --json`	Output plan as JSON

Output includes:

Detected GPU count and total VRAM/unified memory
Single-GPU safe model size envelope
Pooled (multi-GPU) safe model size envelope
Placement strategy with rationale
Recommended env vars (CUDA_VISIBLE_DEVICES, etc.)
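
To make the two envelopes concrete, here is an illustrative calculation in POSIX shell. It is a sketch under stated assumptions, not the formula gpu-plan uses: the 85% headroom factor, the whole-GB integer math, and the function name `vram_envelopes` are all hypothetical.

```shell
# Illustrative arithmetic only. The 85% headroom factor is an assumption,
# not the value gpu-plan actually applies.
HEADROOM_PCT=85

# Usage: vram_envelopes <vram_gb_gpu0> [<vram_gb_gpu1> ...]
# Prints "<single_gpu_gb> <pooled_gb>" using whole-GB integer math.
vram_envelopes() {
  max=0
  sum=0
  for gb in "$@"; do
    sum=$((sum + gb))
    if [ "$gb" -gt "$max" ]; then
      max=$gb
    fi
  done
  # Single-GPU envelope: the largest card. Pooled envelope: all cards combined.
  echo "$((max * HEADROOM_PCT / 100)) $((sum * HEADROOM_PCT / 100))"
}
```

Under these assumptions, `vram_envelopes 24 24` prints `20 40`: a 14GB model would fit on a single card, while a 30GB model would need both GPUs pooled.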

verify-context

Verifies the practical context window limit for a local Ollama model by combining the declared context from model metadata with a hardware memory budget estimate.

llm-checker verify-context
llm-checker verify-context --model qwen2.5-coder:14b
llm-checker verify-context --model llama3.2:3b --target 32768

Flag	Description
`-m, --model`	Model to verify (default: first installed model)
`-t, --target`	Target context tokens to validate against (default: `8192`)
`-j, --json`	Output as JSON

Output includes:

Declared context window from model metadata
Estimated memory-safe context limit
Recommended runtime context value
Pass/warn/fail status with per-check breakdown
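
The way a declared limit and a memory estimate combine can be sketched as follows. This is an assumption about the general approach, not the tool's implementation: it takes the recommended context to be the smaller of the two limits, and the pass/warn/fail rules and the function name `context_verdict` are hypothetical.

```shell
# Sketch under assumptions: the recommended context is the smaller of the declared
# and memory-safe limits, and the status rules below are illustrative only.

# Usage: context_verdict <declared_tokens> <memory_safe_tokens> <target_tokens>
# Prints "<recommended_tokens> <status>".
context_verdict() {
  declared=$1
  safe=$2
  target=$3

  if [ "$declared" -lt "$safe" ]; then
    recommended=$declared
  else
    recommended=$safe
  fi

  if [ "$target" -le "$recommended" ]; then
    status=pass
  elif [ "$target" -le "$declared" ]; then
    status=warn    # the model declares support, but memory is the bottleneck
  else
    status=fail    # the target exceeds what the model itself declares
  fi

  echo "$recommended $status"
}
```

With a declared window of 131072 tokens and a memory-safe estimate of 16384, `context_verdict 131072 16384 32768` prints `16384 warn` under these rules.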

amd-guard

AMD/Windows reliability guard with actionable mitigation hints. Checks ROCm availability, detects common driver issues, and provides a fix list.

llm-checker amd-guard
llm-checker amd-guard --json

Flag	Description
`-j, --json`	Output report as JSON

Output includes:

Platform and primary backend detection
ROCm availability and detection method
Per-check status (pass/warn/fail)
Actionable recommendations for AMD/Windows setups

toolcheck

Tool-calling compatibility tester. Sends a standardized add_numbers tool-calling prompt to local Ollama models and scores the response.

llm-checker toolcheck
llm-checker toolcheck --model qwen2.5-coder:14b
llm-checker toolcheck --all

Flag	Description
`-m, --model`	Test a specific model by name
`--all`	Test all installed models (default: tests first installed model)
`--timeout`	Per-model timeout in ms (default: `45000`)
`-j, --json`	Output as JSON

Status levels: SUPPORTED, PARTIAL, UNSUPPORTED

MCP Setup

mcp-setup

Prints or applies the Claude Code MCP configuration for LLM Checker.

llm-checker mcp-setup
llm-checker mcp-setup --apply      # Run claude mcp add automatically
llm-checker mcp-setup --npx        # Use npx instead of global binary
llm-checker mcp-setup --json       # Output setup details as JSON

Flag	Description
`--name`	MCP server name in Claude (default: `llm-checker`)
`--npx`	Use `npx llm-checker-mcp` instead of global `llm-checker-mcp`
`--apply`	Execute `claude mcp add ...` automatically
`-j, --json`	Output setup details as JSON

Enterprise Policy Commands

policy init

Generates a policy.yaml template for enterprise governance.

llm-checker policy init
llm-checker policy init --file ./my-policy.yaml
llm-checker policy init --file ./my-policy.yaml --force

Flag	Description
`-f, --file`	Output path for the policy file (default: `policy.yaml`)
`--force`	Overwrite an existing file

policy validate

Validates a policy file against the v1 schema. Exits non-zero on schema errors.

llm-checker policy validate
llm-checker policy validate --file ./my-policy.yaml
llm-checker policy validate --json

Flag	Description
`-f, --file`	Policy file to validate (default: `policy.yaml`)
`-j, --json`	Output validation result as JSON
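
Because validation exits non-zero on schema errors, it can gate a script or pre-merge hook. This is a minimal sketch assuming a POSIX shell. The function name `validate_policy_or_fail`, the messages it prints, and the `LLM_CHECKER` override variable are hypothetical.

```shell
# Hypothetical gate built on the documented non-zero exit code.
LLM_CHECKER="${LLM_CHECKER:-llm-checker}"

# Usage: validate_policy_or_fail [<policy-file>]
validate_policy_or_fail() {
  policy_file="${1:-policy.yaml}"
  if "$LLM_CHECKER" policy validate --file "$policy_file"; then
    echo "policy OK: $policy_file"
  else
    # Report on stderr and propagate the failure to the caller.
    echo "policy INVALID: $policy_file" >&2
    return 1
  fi
}
```

A hook could then run `validate_policy_or_fail ./my-policy.yaml || exit 1` before any step that depends on the policy.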

Audit Export

audit export

Evaluates policy compliance against model candidates from `check` or `recommend`, then exports machine-readable reports in JSON, CSV, or SARIF format.

# Single-format JSON report
llm-checker audit export \
  --policy ./policy.yaml \
  --command check \
  --format json \
  --out ./reports/check-policy.json

# All configured formats
llm-checker audit export \
  --policy ./policy.yaml \
  --command check \
  --format all \
  --out-dir ./reports

Flag	Description
`--policy`	Required. Policy file path
`--command`	Evaluation source: `check` or `recommend` (default: `check`)
`--format`	Report format: `json`, `csv`, `sarif`, or `all` (default: `json`)
`--out`	Output file path (single-format only)
`--out-dir`	Output directory when `--out` is omitted (default: `audit-reports`)
`-u, --use-case`	Use case for `check` mode (default: `general`)
`-c, --category`	Category hint for `recommend` mode
`--optimize`	Optimization profile for recommend mode
`--runtime`	Runtime for check mode
`--include-cloud`	Include cloud models in check-mode analysis
`--max-size`	Maximum model size filter
`--min-size`	Minimum model size filter
`-l, --limit`	Model analysis limit for check mode (default: `25`)
`--no-verbose`	Disable verbose progress

Integration Examples

llm-checker audit export \
  --policy ./policy.yaml \
  --command check \
  --format json \
  --out ./reports/policy-report.json

GitHub Actions Policy Gate

name: Policy Gate
on: [pull_request]

jobs:
  policy-gate:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-node@v4
        with:
          node-version: 20
      - run: npm ci
      - run: node bin/enhanced_cli.js check --policy ./policy.yaml --runtime ollama --no-verbose
      - if: always()
        run: node bin/enhanced_cli.js audit export --policy ./policy.yaml --command check --format all --runtime ollama --no-verbose --out-dir ./policy-reports
      - if: always()
        uses: actions/upload-artifact@v4
        with:
          name: policy-audit-reports
          path: ./policy-reports

When `--format all` is used, the export honors the `reporting.formats` list in your `policy.yaml`. If that list is empty, it defaults to `json`, `csv`, and `sarif`.
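
The selection rule for report formats can be sketched as a small POSIX shell function. This restates the behavior described above and is not the tool's actual code. The function name `resolve_formats` is hypothetical, and the entries of `reporting.formats` are assumed to be passed in as plain arguments.

```shell
# Sketch of the format selection rule, not the tool's actual code.
# Usage: resolve_formats <format_flag> [<reporting.formats entries> ...]
resolve_formats() {
  format_flag=$1
  shift

  if [ "$format_flag" != "all" ]; then
    echo "$format_flag"          # a single explicit format is used as given
  elif [ "$#" -eq 0 ]; then
    echo "json csv sarif"        # empty policy list: fall back to every format
  else
    echo "$*"                    # otherwise honor the policy's list
  fi
}
```

For instance, `resolve_formats all json sarif` prints `json sarif`, while `resolve_formats all` with no policy entries prints `json csv sarif`.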
