Plotly express line slow

borian · December 29, 2020, 11:45am

Thanks for providing such a powerful tool!
I have a data frame of size 7486 rows x 960 columns. I am plotting the data with facet rows of 8 and columns of 2. Each plot has 3 categories and each category a group of 20 time series. Thus, each of the 16 contains 60 time series. I am plotting the data frame with plotly express lines, see snippet below. As renderer, I am using WebGL.
My impression is that plotly express line is rather slow. It uses a couple of minutes. Is this to be expected for this amount of data? Once it is rendered, the figures are responsive, though.
Thanks for any advice.

fig = px.line(error_per_type, facet_col=“eval_type”, facet_row=“data_set”, color=“est_type”, line_group=“run”, hover_name=“est_type”, render_mode=fig_renderer)
fig.update_traces(mode=“lines+markers”, line=dict(width=1), marker=dict(size=3), connectgaps=True)
fig.update_layout(height=fig_height, legend_title_text=“”)

borian · December 30, 2020, 8:48am

With respect to the large figure above, I have the problem to save the plots at static images, PNG format. If configured with kaleido engine, I get a crash, because of size. However, if I open the saved HTML thereof and save it from there as a static image, I get a PNG file without crash. Any advice on this?

fig.write_image(fig_file, format = img_format, engine = img_engine)

borian · December 30, 2020, 1:53pm

Example of the facet plot:

borian · December 31, 2020, 5:57am

And a couple of minutes means more than 15min to render the above plot on a decent computer.

windrose · December 31, 2020, 8:45am

Have you tried to install orca? Perhaps that saves static images faster?

I don’t have experience with subplots, but ~ 10.000 data points are saved in PDF format in seconds. However, export using scattergl results in bad resolution which is why I omit the gl before final export.

borian · January 4, 2021, 8:52am

Thank you for your advice. I made an attempt to use orca, but so far, it did not run out of the box. However, my primary issue is not the static image generation, but the rendering itself. The rendering itself take a lot of time - 15min or more - even before show or write_image.

borian · January 4, 2021, 12:10pm

Instead of plotly express line, I could use scatter, which seems to be much faster. However, with scatter, it is not possible to plot groups as with line_group. Thus, it results in one series with “jumping” lines and it is not possible to distinguish between individual runs anymore. How could line_group considered in scatter?

Example with scatter:

fig = px.scatter(error_per_type, y=“value”, facet_col=“eval_type”, facet_row=“data_set”, color=“est_type”, hover_name=“est_type”, render_mode=fig_renderer)
fig.update_traces(mode=“lines+markers”, line=dict(width=1), marker=dict(size=3), connectgaps=True)
fig.update_layout(height=fig_height, legend_title_text=“”)

borian · January 4, 2021, 12:57pm

I did a profiling of line and scatter. For the same pandas data frame, line needs about 23min whereas scatter about 5min, see code snippets above. Scatter spends 56% of the time in the function plotly.express._chart_types.scatter, whereas line 79% in the function pandas.core.algorithms.unique. So, for some reason, line is doing a lot of access to the pandas data frame. Anybody an idea how to improve on that?

Here, the profiling:

borian · January 6, 2021, 12:55pm

Still looking for advice to speed up plotly express line, if possible. - Might it help to rearrange the pandas data frame? Because of differing timestamps in each run, there are a lot of NaNs in the data frame. Might this slow down the plotting?

tschlich · May 5, 2021, 3:34am

Would like to give this a bump and hopefully get some fresh eyes – I have a couple hundred parameterized functions and I’m trying to figure out the cleanest way to visualize them.

I made an np.linspace, and using line_group slows down the plotting by several minutes – the exact same dataframe takes but a few seconds to render without the line_group option set.

edit: seems as though my line_group column wasnt grouped how i thought it was, instead creating a grouping for every row. With the correct groupings it seems to be working fine.

Isaac7499 · December 16, 2021, 9:47am

Thanks for sharing this information. It was very useful.

Topic		Replies	Views
Improving performance when rendering animated scatterplot using Plotly Express 📊 Plotly Python	0	285	July 15, 2020
Go figure updates slowly 📊 Plotly Python question	0	343	February 2, 2023
Speeding up plotting large timeseries (x5) 📊 Plotly Python	0	404	November 27, 2020
Slow line plot in some instances with large number of points 📊 Plotly Python	0	623	February 23, 2022
go.Figure slow with lots of data 📊 Plotly Python	8	14429	March 4, 2022

Plotly express line slow

Related topics