Progress bar implementation or itterative text updates to an html.Div as the porcess executes

vnavdulov · April 25, 2024, 4:12am

Has anyone had success with the implementation of depicting progress in layouts? I want the text from the function to display in the modal as the embedding process is happening. Since it takes a while with a large batch of documents, I don’t want the user to sit there and wonder if the process is still happening and not know when it’s going to end, however, I can’t return anything until the process ends. Ideally I would like to display the progress bar and the print statements below in a modal. I am not an advanced developer, so any help would be very useful. Thank you!

class CustomEmbeddings:
    def __init__(self, model_id='e5-small-v2'):
        print("Initializing the embedding model...")
        self.model, self.tokenizer = self.load_model_and_tokenizer(model_id)
        self.device = 'cuda' if torch.cuda.is_available() else 'cpu'
        self.model.to(self.device)
        print(f"Model moved to device: {self.device}")

    def load_model_and_tokenizer(self, model_id):
        project_root = os.path.dirname(os.path.dirname(__file__))
        model_dir = os.path.join(project_root, 'embeddings', 'models')
        print(f"Loading model and tokenizer from {model_dir} for model ID: {model_id}...")
        model = AutoModel.from_pretrained(os.path.join(model_dir, model_id))
        tokenizer = AutoTokenizer.from_pretrained(os.path.join(model_dir, model_id))
        print("Model and tokenizer successfully loaded.")
        return model, tokenizer

    def embed_documents(self, texts):
        print("Starting the embedding process...")
        self.model.eval()
        embeddings = []
        with torch.no_grad():
            for i, text in enumerate(texts):
                tokens = self.tokenizer(text.page_content, return_tensors="pt", padding=True, truncation=True, max_length=512).to(
                    self.device)
                output = self.model(**tokens)
                sum_embeddings = (output.last_hidden_state * tokens['attention_mask'].unsqueeze(-1)).sum(1)
                normalized_embeddings = sum_embeddings / tokens['attention_mask'].sum(1, keepdim=True)
                embeddings.append(normalized_embeddings.cpu().numpy().tolist()[0])
                if (i + 1) % 10 == 0 or i == len(texts) - 1:
                    print(f"Processed {i + 1}/{len(texts)} texts")
        print("Embedding process completed.")
        return embeddings

    def embed_query(self, query):
        return self.embed_documents([query])[0]

AIMPED · April 25, 2024, 4:33am

Hey @vnavdulov, does this help?

vnavdulov · April 25, 2024, 3:26pm

Hello @AIMPED , thank you for pointing me in that direction. I was thinking of trying that out but I would like the messages generated in my function to appear in the modal that progress happens. Although even just having the progress bar would be very helpful as well. I’ll start there

AIMPED · April 25, 2024, 4:38pm

I think this might be even more interesting. I was blinded by the progress-bar and the modal…

vnavdulov · April 25, 2024, 9:31pm

@AIMPED That’s perfect! Just what I was looking for. Thank you!

Topic		Replies	Views
Show a progress bar in a modal Dash Python show-and-tell	2	2793	October 5, 2023
Progress bar from tqdm Dash Python	15	13257	April 17, 2021
How to display progress in server side call backs(dash-extension) Dash Python	0	338	June 28, 2022
Custom progress information on each loop iteration in app.callback Dash Python	13	4565	April 14, 2021
How is the blue topic progress bar in this forum implemented? Dash Python	2	465	November 17, 2019

Progress bar implementation or itterative text updates to an html.Div as the porcess executes

Related topics