Web Scraping data using python

Asked 10 years, 2 months ago

Viewed 477 times

I m just started learning web scraping using Python. My aim is to web scrape the Realtime news for Bajaj Auto Ltd. from http://money.rediff.com/companies/Bajaj-Auto-Ltd/10540026.

The problem: I'm unable to extract the contents(i.e news).

from urllib.request import urlopen
from bs4 import BeautifulSoup
url = 'http://money.rediff.com/companies/Bajaj-Auto-Ltd/10540026'
data = urlopen(url)
soup = BeautifulSoup(data)
te=soup.find('a',attrs={'target':'_jbpinter'})
lis=te.find_all_next('a',attrs={'target':'_jbpinter'})
#print(lis)
for li in lis:
 print(li.find('a').contents[0])

I m getting the error "AttributeError: 'NoneType' object has no attribute 'contents'" And I does not get the desired result.

Any input will be appreciated.

Improve this question

edited Feb 14, 2021 at 14:37

DisappointedByUnaccountableMod's user avatar

DisappointedByUnaccountableMod

6,8444 gold badges21 silver badges23 bronze badges

asked Nov 4, 2015 at 16:37

Nks's user avatar

Nks

52 bronze badges

looks like it can't find what you think is there. try printing li and see if there is actually an a in there

R Nar
– R Nar

2015年11月04日 16:44:41 +00:00
Commented Nov 4, 2015 at 16:44

Add a comment |

1 Answer 1

Sorted by: Reset to default

You are trying to get the a tag twice.

Replace

for li in lis:
 print(li.find('a').contents[0])

with

for li in lis:
 print(li.get_text())

and you get this output:

Need Different Rates For Different Products: Rahul Bajaj on GST
Reforms irrespective of Bihar results: Bajaj
Auto shares in focus; Tata Motors up over 5%
We believe new Avenger will stimulate the market: Bajaj Auto's Eric Vas
BHP Billiton pins future of Indonesian coal mine on new...

Improve this answer

answered Nov 4, 2015 at 16:52

dstudeba's user avatar

dstudeba

9,0583 gold badges34 silver badges42 bronze badges

Comments

Your Answer

Draft saved

Draft discarded

Sign up or log in

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

By clicking "Post Your Answer", you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.

lang-py

CollectivesTM on Stack Overflow

Web Scraping data using python

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Hot Network Questions

CollectivesTM on Stack Overflow

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Related