Fast Interactive Visualizations

jakogau · June 5, 2023, 3:14am

Hi there,

currently I working on cytometry dataset processing. briefly saying, a high dimensional dataset (100k samples) with 10 channels (features). there will be a number of dual-channel scatter plots. I would like to view it on a channel A and channel B and find if there is a cluster on the top right part. if yes, I would like to select the cluster and than view this cluster on other dual-channel scatter plots. that’s it. I checked the example below. however, I wonder if there is a way to reuse all the plots rather than to generate them on each selection?

function reference: Part 3. Interactive Graphing and Crossfiltering | Dash for Python Documentation | Plotly

Skiks · June 5, 2023, 8:42am

Hello @jakogau !
Welcome on the forum!

I’m not sure what you mean, but did you check the Patch class introduced couple of month ago:

You can use it to update only a part of your figure, like only your data points (x and y) keeping all other parameters as is.

jakogau · June 6, 2023, 4:46pm

Hello,

thanks for your reply. to clarify, my concern is that, rendering 12 scatter plots with at least 10k samples each really takes time. it’s around 10s in my implementation. I would like to quickly select some points on the first plot and show only them on others, which means I don’t need other calculation etc.

my previous solution is fetching the figure (json) and selection index and updating layout and returning the figure json obj, which means each time I do selection, I need to wait 10 seconds (perhaps).

your recommendation of Patch really helps. the performance gets really better. though think it’s quite not convenience to use this class.

Thanks again for your help

Jako

Skiks · June 6, 2023, 7:13pm

Hi @jakogau !
If performance is an issue, you can try to use Scattergl() (if you didn’t):
https://plotly.com/python/line-and-scatter/#large-data-sets

Topic		Replies	Views
Proper way to plot large datasets Dash Python	10	74545	August 23, 2023
How can I improve the response time of a scatter plot updated via events? 📊 Plotly Python	0	393	November 28, 2019
How to make panning plots, zooming etc. faster on dataset with about 200k records 📊 Plotly Python	1	720	May 3, 2022
Plotly.js performance in dashboard with multiple plots plotly.js	3	1577	February 27, 2019
Performance considerations w.r.t. highly interactive plots (scatterplots) and large data sets 📊 Plotly Python	1	1722	April 17, 2020

Fast Interactive Visualizations

Related topics