Skip to main content
Stack Overflow
  1. About
  2. For Teams
Filter by
Sorted by
Tagged with
1 vote
0 answers
62 views

I'm trying in vain to make NLTK show a Frequency Distribution Plot using FreqDist.plot(). I've followed the steps from the first chapter of the NLTK book (scroll down to "3.1 Frequency ...
0 votes
1 answer
396 views

I'm attempting to perform Part-of-Speech (POS) tagging in a Jupyter Notebook using the NLTK package. The averaged_perceptron_tagger resource cannot be located, according to the LookupError I keep ...
3 votes
1 answer
99 views

Below code tokenises the text and identifies the grammar of each tokenised word. import nltk from nltk.tokenize import sent_tokenize, word_tokenize from nltk.corpus import wordnet as wn #nltk....
Ali's user avatar
  • 31
0 votes
1 answer
91 views

I am attempting to create a Python transform that requires me to import nltk. When I import nltk, later on I get: Resource [93mpunkt_tab[0m not found. Please use the NLTK Downloader to obtain the ...
0 votes
1 answer
41 views

I have the following trained classifier: classifier = nltk.NaiveBayesClassifier.train(features[:train_count]) When I try to use it to classify(): result = classifier.classify(feature) and feature is ...
0 votes
2 answers
207 views

Repeated Lookup error eventhough NLTK is downloaded: Resource [93mpunkt_tab[0m not found. Please use the NLTK Downloader to obtain the resource: 31m>>> import nltk nltk.download('...
Ellster's user avatar
0 votes
1 answer
119 views

Sometimes, a Python library depends on additional data, such as ML models. This could be a model from transformers, spacy, nltkand so on. Typically there is a command to download such a model: python -...
jdm's user avatar
  • 10.4k
1 vote
1 answer
93 views

How do I get rid of non-printing (escaped) characters from the output of the nltk.word_tokenize method? I am working through the book 'Natural Language Processing with Python' and am following the ...
0 votes
0 answers
61 views

When tokenizing paragraphs to sentences in the Russian language, I am observing the special case when the sequence is not treated as the end of the sentence. The case is with the я. at the end of the ...
pepr's user avatar
  • 21.1k
0 votes
0 answers
49 views

When tokenizing paragraphs in the Czech language, I am observing that some abbreviations are not treated as abbreviations. The paragraph is stored in the file as one long line. The nltk is of the ...
pepr's user avatar
  • 21.1k
1 vote
0 answers
34 views

Title: ValueError: could not convert string to float when training GaussianNB for SMS Spam Detection Body: I'm building an SMS spam detection tool and encountering an error while predicting with a ...
-1 votes
2 answers
2k views

I’m trying to perform Named Entity Recognition (NER) using NLTK, SpaCy, and a dataset in PyCharm. However, I’m encountering an error related to a missing resource (punkt_tab) when tokenizing text. ...
0 votes
0 answers
192 views

I have been trying to use nltk's entity chunker, and tried different approaches but I keep getting the error: LookupError Traceback (most recent call last) ... ...
0 votes
1 answer
64 views

I have installed nltk through cmd on my windows 10 but when I try to import it in IDLE it does not work. Can someone please help? [This is the image of the installationThis is the image of the error ...
Sahil Patel's user avatar
2 votes
2 answers
117 views

I was trying to understand how BLEU score works and noticed that if I had to compute the n-gram precisions and have multiple reference sentences, it makes more sense to turn everything into sets to ...
Yuirike's user avatar
  • 41

15 30 50 per page
1
2 3 4 5
...
473

AltStyle によって変換されたページ (->オリジナル) /