Newest 'summarization' Questions

Stack Overflow


367 questions

Best practices

0 votes

2 replies

63 views

Recommended way to create abstracted text embeddings from large text data?

I would like to use an LLM encoder model to create vector embeddings for certain texts in my dataset. The texts are written as technical problem descriptions by experts who are trying to repair a ...

Alles Klar

asked Dec 8, 2025 at 12:32

0 votes

1 answer

141 views

'NoneType' object has no attribute 'encode' when loading tokenizer

An error occurs when trying to load the Pegasus model for text summarization: from transformers import pipeline, set_seed pipe = pipeline("summarization", model="google/pegasus-cnn_dailymail&...

coolhand

2,109

asked Jan 28, 2025 at 22:29

1 vote

0 answers

34 views

Is it possible to replicate a Power BI matrix in DAX to find the maximum value in a column?

I have a matrix visual in a Power BI report that finds the percentage of people in a quintile who are not completing a qualification, reported over three academic years. I need to pull out the ...

David

asked Jul 30, 2024 at 10:37

1 vote

0 answers

38 views

Summarize comments and replace duplicates with the first occurrence if the meaning of the comments is the same

Context: I am doing an NLP project to analyze the comments column in a data frame. I want to replace duplicates with the first occurrence if the meaning of the comments is the same. I want to compare all ...

Bhuvaneshwari D Raman Effect

asked Jul 12, 2024 at 3:18

0 votes

1 answer

198 views

BadRequestError when summarizing with MapReduceDocumentsChain as a tool within AgentExecutor in LangChain

I am trying to combine three models within LangChain so that an OpenAI tools-calling agent can call the correct model based on a question. I made StructuredTools from the chains and made sure ...

turkishelehant

asked Jun 5, 2024 at 15:40

1 vote

1 answer

118 views

How do non-LLM models compare to LLMs for Abstractive Summaries of HTML content?

I'm interested in utilizing an NLP model to provide short (one sentence in length) abstractive summaries of web pages, providing the model a set of commonly occurring HTML content from each web page (...

Max Chis

asked Mar 23, 2024 at 14:12

0 votes

1 answer

499 views

Increase summary length using MS Azure-AI services

Recently, I have been using Azure AI cognitive services to summarize text with its document summarization and conversation summarization features. But the summary length using both document summarization ...

JaS

asked Feb 21, 2024 at 5:45

0 votes

1 answer

161 views

NLP - summarize each subtitle of an article

I am very new to NLP. I'm trying to build a simple text summarization model that takes 1-2 important sentences from each subtitle in an article. For example, in the image I want to take 1 sentence from &...

Zkant R.

asked Jan 14, 2024 at 6:35

1 vote

1 answer

615 views

How to find positional embeddings from BartTokenizer?

The objective is to add token embeddings (customized, obtained using a different model) and the positional embeddings. Is there a way I can find the positional embeddings along with the token embeddings ...

New_user

asked Jan 11, 2024 at 13:13

1 vote

1 answer

1k views

Summarization and Topic Extraction with LLMs (private) and LangChain or LlamaIndex using flan-t5-small

Has anyone used LangChain or LlamaIndex imports to deal with single documents that amount to >512 tokens? Yes, I know there are other approaches to dealing with it, but it is difficult to find ...

Ja4H3ad

asked Dec 19, 2023 at 2:26

0 votes

0 answers

754 views

Max length error while using a Hugging Face Transformers model for SHAP explanation

I am using SHAP to explain the output of a pretrained model. It works for documents with a token length of less than 1024. It throws the error below if I provide a sequence with token ...

Simran

asked Nov 6, 2023 at 7:06

1 vote

1 answer

169 views

Speed up PyTextRank for summarizing a document

I need to summarize documents with spacy-pytextrank. What is the best approach to make it faster without increasing the resources of the machine? I was thinking of parallelizing the computation using ...

Ire00

asked Oct 31, 2023 at 16:12

2 votes

0 answers

141 views

Stack size errors when fine-tuning T5 with XSum using PyTorch

I am trying to fine-tune t5-small with the xsum dataset using PyTorch on Windows 10 (CUDA 12.1). Unfortunately, the Trainer (or Seq2SeqTrainer) class from bitsandbytes is not available for Windows, so it was ...

celsowm

asked Oct 15, 2023 at 20:27

1 vote

3 answers

154 views

Get unique values from rows with comma separated values based on a specific category column in R

Let's say I have: group X Y Z A cat, dog dog, fox A fox, chicken dog, fox, chicken A B fox, dog B fox B ...

Gabriel G.

asked Oct 6, 2023 at 17:34

1 vote

1 answer

103 views

Detecting additions/removals from the string difference between two texts

I have two versions of a short text, e.g.: old = "(a) The provisions of this article apply to machinery of class 6." new = "(a) The provisions of this article apply to machinery of ...

user456789

asked Aug 21, 2023 at 14:42
