Disable Base64 encoding on Audio files

suvigya · December 21, 2023, 1:53pm

I’m trying to create a speech-to-text app using Dash and a locally downloaded Whisper model (OpenAI). Whenever I get an audio file from the user, it’s converted to a Base64-encoded string, which isn’t compatible with Whisper.

My questions -

Is there a way I can save the uploaded file as-is, in mp3 or wav format, rather than dealing with the encoded version?
Or, is there a way I can get audio attributes like channels, sampling rate, etc. to recreate the audio file from the encoded string?

Thanks in advance!

AIMPED · December 21, 2023, 1:57pm

Hi @suvigya, welcome to the forums.

I guess you are uploading the audio file with a dcc.Upload(), right?

You could convert it back to whatever it was before. A quick Google search returned this, for example.
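
A minimal sketch of what that conversion might look like, assuming the `contents` property of dcc.Upload arrives as a data URL of the form `data:<mime type>;base64,<payload>`. The helper name `save_upload` and the `uploads` folder are made up for illustration and are not part of Dash:

```python
import base64
from pathlib import Path


def save_upload(contents: str, filename: str, dest_dir: str = "uploads") -> Path:
    """Decode a dcc.Upload `contents` string and write the original bytes to disk.

    `save_upload` and `dest_dir` are hypothetical names used for this sketch.
    """
    # Split "data:audio/mpeg;base64,<payload>" into its header and payload.
    header, _, payload = contents.partition(",")
    if not header.endswith(";base64"):
        raise ValueError("expected a base64-encoded data URL")

    # Decoding gives back the exact bytes of the uploaded file.
    data = base64.b64decode(payload)

    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)

    # Keep only the final path component of the user-supplied file name.
    path = dest / Path(filename).name
    path.write_bytes(data)
    return path
```

Inside a callback that takes the upload's `contents` as Input and `filename` as State, something like `path = save_upload(contents, filename)` should recreate the mp3 or wav file unchanged, and `str(path)` can then be passed to Whisper's `model.transcribe()`. Since the decoded bytes are the original file, the channel count and sampling rate should not need to be recovered separately.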

suvigya · December 21, 2023, 4:03pm

@AIMPED Thanks for your prompt response. Yes, I am using dcc.Upload() to get the user's upload. I had to adapt the code to regenerate the uploaded file depending on its format, but the link helped.

Thanks again!
