Speeding up plotting hundreds of thousands of separate plots

jyolo · January 31, 2022, 5:26pm

I’m plotting thousands of plots within a pandas dataframe row-wise. For example, my data looks like this:

df1

Trace	Layout
trace_1	layout_1
…	…
trace_n	layout_n

df2 = df1.apply(lambda x: go.Figure(data=x['trace'], layout=x['layout']), axis=1)

Trace	Layout	Figure
trace_1	layout_1	Figure_1
…	…	…
trace_nn	layout_n	figure_n

Are there any best practices to apply in this situation? My code takes ~20 minutes to run with 150k+ records to plot. It can do about 126 plots per second, seems to be the same across differing processor speeds.

Any ideas appreciated!

kmhurchla · February 4, 2022, 11:52am

hi Jyolo,
Seems like a lot of duplicative rendering, at least with figures. As I understand from the fundamentals docs, graph objects are essentially JSON text that get rendered to Javascript under the (Python) hood. Someone correct me if I’m off… That makes me question whether storing them in a dataframe could cause the traces, layouts, and figures behave any differently (I don’t know myself). Impressive to me really that it works at all!

I’m curious what the data types look like. What’s this return?

df2.dtypes

The data viz best practice side of me however wants to find you a better UX human computer interaction design solution than scanning 150+ plots! If you’ll indulge sharing a little more about the use case and what led you to doing it this way, is it a comparative situation, or something else you’re looking for? Maybe other solutions would be less computing/memory intensive also.

Other ideas would be tuning and comparing your log files for performance:

Can you make use of subplots, and would that be faster or slower than distinct figures? The low level code there might yield some ideas.

Is being explicit with attributes slower or faster than letting Plotly render the defaults (on all those figures…).

jyolo · April 11, 2022, 3:59pm

Hey @kmhurchla !

Thanks for the response. The solution was in your first paragraph. I was needlessly calling go.Figure() on each plot. I just needed the dict created! removing the call took out all that extra overhead - exactly what I needed.

Hope this helps anyone else!

kmhurchla · April 11, 2022, 4:21pm

So glad my questions helped you track down a faster way to make this many plots! I’m always surprised by how many different ways there can be to get the same result, each with its tradeoffs in different situations. It can be so helpful just to hear how someone else would approach the same problem. Happy plotting

Topic		Replies	Views
Subplot with 100 figures 📊 Plotly Python	1	1275	December 19, 2016
Graph_obj.py slow performance when .append_trace on large amount of subplots 📊 Plotly Python	2	1354	June 8, 2016
Plotting large number of graphs 📊 Plotly Python	0	544	March 6, 2020
Speeding up `add_trace()` and `show()` calls? 📊 Plotly Python	2	583	January 12, 2023
Plotting large number of plots of a single page Dash Python	3	1789	September 5, 2018

Speeding up plotting hundreds of thousands of separate plots

Related topics