Best Practices for Caching Generative AI Outputs in Dash Apps Without Breaking Callback Logic

Hello

I am building a Dash app that uses a backend generative AI model to create charts, text summaries, and layout elements based on user input. To improve performance, I added a caching layer (using flask_caching), but I am running into issues where the callback logic becomes inconsistent: Dash doesn't reflect updated outputs when a user prompt changes slightly, likely due to stale cache keys or callback memoization confusion.

Has anyone implemented caching for AI-generated content within Dash without interfering with the reactivity of Dash callbacks? I am trying to prevent unnecessary regeneration from the AI when prompts are similar, but I still want the app to remain responsive. Additionally, I want to avoid full layout refreshes, since parts of the UI are static.
I have checked the Performance and Background Callbacks guides in the Dash for Python documentation for reference.
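
For concreteness, here is a stripped-down sketch of what I am doing now (component IDs simplified; `generate_ai_output` is a placeholder for my actual model call):

```python
from dash import Dash, Input, Output, dcc, html
from flask_caching import Cache

app = Dash(__name__)
cache = Cache(app.server, config={
    "CACHE_TYPE": "SimpleCache",
    "CACHE_DEFAULT_TIMEOUT": 3600,
})

app.layout = html.Div([
    dcc.Input(id="prompt", type="text", debounce=True),
    html.Div(id="ai-output"),
])

def normalize(prompt: str) -> str:
    # Collapse case and whitespace so cosmetically different prompts
    # share one cache entry, while real edits produce a new key.
    return " ".join(prompt.lower().split())

@cache.memoize()
def generate_cached(normalized_prompt: str):
    # Memoized on the normalized prompt only; generate_ai_output is a
    # placeholder for the actual (expensive) model call.
    return generate_ai_output(normalized_prompt)

@app.callback(Output("ai-output", "children"), Input("prompt", "value"))
def update_output(prompt):
    if not prompt:
        return "Enter a prompt."
    return generate_cached(normalize(prompt))
```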

Part of the context here is that generative AI responses are expensive and non-deterministic, which introduces new caching challenges in Dash. I would love any architectural suggestions or code examples that solve this elegantly.

Thank you!

Hey @tasosad, welcome to the forums.

Are you trying to cache the user input on the Dash side? What is the aim: to return the already available LLM answer from a previous, similar question and skip the LLM call?

Why don’t you do the caching on the LLM side?
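
If it helps, this is roughly what I mean; a minimal sketch where the cache lives next to the model call, and `call_llm` is just a stand-in for whatever client you actually use:

```python
import hashlib
import threading

_llm_cache: dict[str, str] = {}
_lock = threading.Lock()

def _key(prompt: str) -> str:
    # Normalize before hashing so cosmetic differences hit the same entry.
    return hashlib.sha256(" ".join(prompt.lower().split()).encode()).hexdigest()

def generate(prompt: str) -> str:
    k = _key(prompt)
    with _lock:
        if k in _llm_cache:
            return _llm_cache[k]
    result = call_llm(prompt)  # illustrative: your actual model/client call
    with _lock:
        _llm_cache[k] = result
    return result
```

The Dash callback then calls `generate(prompt)` directly and never memoizes anything itself, so callback reactivity stays intact; the only thing a cache hit skips is the expensive model call.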