CSV file has been rearranged

majudd · November 8, 2022, 4:33pm

I have been plotting data from a .csv file where the y-axis data is listed in a column of 1051 rows, and basically the x-axis is the index. Now the developers that I am writing the dashboard for have changed their .csv format so that the y-axis values are all in a single row, with each value in it’s own column, for a total of 1051 columns (in addition to a few other columns). The 1051 columns are arranged from lowest x-axis value to highest x-axis value. After I get this into a df, I can’t figure out how to plot it. Can anyone help me?

AIMPED · November 8, 2022, 4:46pm

Hi @majudd,

maybe @adamschroeder’s response can help you restructuring your DataFrame:

majudd · November 8, 2022, 5:16pm

Hmm, that’s interesting. I am now looking into melt, that is new to me. But so far, I am not seeing how this can help me to plot a curve with 1051 values. I will play around with it though!

jinnyzor · November 8, 2022, 5:19pm

You may be able to use some transpose functionality from pandas.

https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.transpose.html

majudd · November 8, 2022, 5:20pm

My column names are ‘x-1’, ‘x-2’, all the way up to ‘x-1051’, then the values in each column are the y-axis values I want to plot.

majudd · November 8, 2022, 5:22pm

Thanks, I will look into transpose!

AIMPED · November 8, 2022, 5:40pm

What kind of plot we are talking about? Would just switching the x and y arguments do the trick? As I understand you have 1051 columns and 1 row.

so assuming you want to do a scatter plot:

px.scatter(x=[*range(1, 1052)], y=df.iloc[0])

But I guess I do not fully understand the problem you are facing.

majudd · November 8, 2022, 6:22pm

I want to do a line plot. It was easy to do when all of the y data was in a single column, and the x data was just the index.

majudd · November 9, 2022, 6:54pm

I have one column, called “score”. This is my y-value. I have hundreds of curves in this one column, each 1051 rows long, listed one after the other. Each curve is in the correct order, from x=1 to x=1051 (listed in another column). There is a time stamp in another column that helps me to separate one curve from another, if I want to plot them as separate lines (each curve has a unique time stamp). I also plot everything together in a box plots, with the timestamp on the x axis and the score as the y-axis. Now my data has been rearranged by the people I am working for to have one row per time stamp, and all of the scores in 1051 columns in the same row. This is saving space, the .csv file I use to pull in the data is much smaller. But I am having trouble figuring out how to pull in all of the data for the box plot and how to plot individual curves for each time stamp.

Topic		Replies	Views
Allocate different data into different columns in plotly python Dash Python	3	259	July 6, 2022
Y-axis unordered on CSV import (Plotly Express) 📊 Plotly Python question	3	402	October 25, 2022
Plot data from csv file with changeable columns using Plotly 📊 Plotly Python	0	450	July 17, 2020
How to upload a csv file and render a bar plot Dash Python	1	1714	June 8, 2020
How to create a simple bar animation with column name in x axis and column data in y axis? 📊 Plotly Python	2	3684	June 14, 2020

CSV file has been rearranged

Related topics