Help with Pandas pd.read_html

Eduardo · December 19, 2020, 4:15pm

Hi

I need some help with Pandas pd.read_html

I have this code that takes the Income Statement table from a specific Company ticker

import pandas as pd
from bs4 import BeautifulSoup
import requests


def getfinancialsdfy(ticker, price, com_name):  

    urlfinancials = 'https://www.marketwatch.com/investing/stock/'+ticker+'/financials'

    tables = pd.read_html(urlfinancials)
    tables_a = pd.DataFrame(tables[4])
    display(tables_a)


ejecutar = getfinancialsdfy("AA",22.1, "Alcoa Corp.")

It’s working fine except that it duplicates the text in the first text column:

Something that is not shown in the original page table:

Anybody knows why and how to solve this issue?

Thanks!

Topic		Replies	Views
Dash Table Display App Not Working Dash Python	0	647	July 4, 2019
Datatable display problem in dash 1.8 Dash Python	3	386	January 31, 2020
Embedding data into html.Details Dash Python	2	797	April 2, 2021
Dash table updates empty rows Dash Python	0	712	January 29, 2021
Datatable: Update dataframes with submit button Dash Python	8	836	September 9, 2024

Help with Pandas pd.read_html

Related topics