
Conversation

@rolandwalker (Contributor) commented Oct 25, 2025

Description

  • truncate text/binary sample-data fields to 1024 characters (or fewer where a smaller budget seems warranted); see the truncation sketch after this list
  • drop entire tables from the schema representation when the representation is very large
  • to improve latency, cache the sample data and the schema representation, keying both on the dbname so that switching databases invalidates the cached entries; a caching sketch follows below
  • add a separate progress message while generating sample data
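
A minimal sketch of the field-level truncation, assuming hypothetical names (`truncate_field`, the `…(truncated)` marker); the helpers in this PR may be shaped differently:

```python
TARGET_SIZE = 1024  # characters; see the note on target_size below

def truncate_field(value: str, target_size: int = TARGET_SIZE) -> str:
    """Clamp one text/binary sample-data field, marking the cut."""
    if len(value) <= target_size:
        return value
    marker = "…(truncated)"
    return value[: target_size - len(marker)] + marker
```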

The target_size values are chosen somewhat arbitrarily; it would be nice to let the user control them.
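
For the caching item above, a minimal sketch assuming hypothetical function names; `functools.lru_cache` keys on its arguments, so passing the dbname means a database switch simply misses the old entry rather than reusing stale context:

```python
from functools import lru_cache

@lru_cache(maxsize=8)
def schema_representation(dbname: str) -> str:
    # Placeholder for the expensive introspection work; the real code
    # would query the catalog of the connected database.
    return f"-- schema for {dbname} --"

@lru_cache(maxsize=8)
def sample_data(dbname: str) -> str:
    # Placeholder for sampling rows from each table.
    return f"-- sample rows for {dbname} --"
```

An explicit `schema_representation.cache_clear()` is also available if the schema changes under the same dbname.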

We could also apply a final size limit to the prompt string itself, though meaning-preserving truncation at that late stage is harder, as the sketch below illustrates.
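
For illustration only, a naive final clamp (hypothetical name and budget), which shows why this is harder: a character budget applied to the finished prompt can sever a table or sample row mid-way, losing meaning rather than trimming it:

```python
def clamp_prompt(prompt: str, budget: int = 16_384) -> str:
    # Naive tail truncation: keeps the head of the prompt but may cut
    # mid-table or mid-row, unlike the field/table-level limits above.
    return prompt if len(prompt) <= budget else prompt[:budget]
```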

Addresses #1348.

Untested at the time of writing.

Checklist

  • I've added this contribution to the changelog.md.
  • I've added my name to the AUTHORS file (or it's already there).
  • I ran `uv run ruff check && uv run ruff format && uv run mypy --install-types .` to lint and format the code.

@rolandwalker self-assigned this Oct 25, 2025
@rolandwalker force-pushed the RW/conserve-llm-tokens-and-cache branch from 8cf2200 to a4eefea on October 25, 2025 22:40
@rolandwalker changed the title from "Reduce size of LLM prompts" to "Reduce size of LLM prompts + cache context" on Oct 27, 2025
@rolandwalker changed the title from "Reduce size of LLM prompts + cache context" to "Reduce size of LLM prompts + cache per-schema context" on Oct 27, 2025
@rolandwalker force-pushed the RW/conserve-llm-tokens-and-cache branch 3 times, most recently from 6858160 to 6ed2c19 on November 1, 2025 14:11
@rolandwalker force-pushed the RW/conserve-llm-tokens-and-cache branch from 6ed2c19 to 42c18e7 on November 1, 2025 14:25
