Missing html in response using python requests and beautifulsoup4

Asked 9 years, 8 months ago

Viewed 3k times

When I view the page source in my browser, the html I am after appears there. However, when I make a request using Python requests, the html doesn't appear in the response.

The url I'm trying to scrape is http://dota2lounge.com/match?m=13362, and the specific html I am after in the page is:

<div class="full">
 <a class="button" onclick="ChoseEvent(13362,'Whole Match',false)">Match</a>
 <a class="button" onclick="ChoseEvent(13392,'1st Game','1462327200')">1st Game</a>
 <a class="button" onclick="ChoseEvent(13424,'2nd Game','1462327200')">2nd Game</a>
 <br><div id="toma" class="full" style="background: #444;line-height: 2.5rem;border: 1px solid #333;text-align: center;">Whole Match</div>
</div>

I'd like to get the 'onclick' values of the buttons. So far I've tried:

import requests
from bs4 import BeautifulSoup as bs

r = requests.get('http://dota2lounge.com/match?m=13268')
soup = bs(r.content, 'lxml')
buttons = soup.find_all('a', class_='button')

This doesn't work. Inspecting r.content doesn't show the html either.

asked May 4, 2016 at 7:57

Peter

Try soup.find_all('a', 'button'). Note that class_='button' is the correct keyword spelling, since class is a reserved word in Python; the positional form is equivalent.

– Jeremie Ges

Commented May 4, 2016 at 8:02

2 Answers

It looks like the elements you want are added by JavaScript, which isn't run when you make the request in Python. Check out this question.
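One way to confirm this diagnosis is to check whether the buttons exist in the raw response at all. The sketch below is only an illustration: it uses the standard library's html.parser so it runs without extra packages (BeautifulSoup's find_all would do the same job), and the OnclickCollector class, the collect_onclicks helper and both sample strings are hypothetical stand-ins for r.text. The first sample is markup copied from the question; the second assumes a server response in which the buttons have not been injected.

```python
from html.parser import HTMLParser


class OnclickCollector(HTMLParser):
    """Collect the onclick attribute of every <a class="button"> tag."""

    def __init__(self):
        super().__init__()
        self.onclicks = []

    def handle_starttag(self, tag, attrs):
        attr_map = dict(attrs)
        classes = (attr_map.get("class") or "").split()
        if tag == "a" and "button" in classes and attr_map.get("onclick"):
            self.onclicks.append(attr_map["onclick"])


def collect_onclicks(html_text):
    """Return the onclick values of all button links found in html_text."""
    parser = OnclickCollector()
    parser.feed(html_text)
    parser.close()
    return parser.onclicks


# Markup as seen in the browser (copied from the question).
rendered = """
<div class="full">
 <a class="button" onclick="ChoseEvent(13362,'Whole Match',false)">Match</a>
 <a class="button" onclick="ChoseEvent(13392,'1st Game','1462327200')">1st Game</a>
</div>
"""

# Hypothetical raw response in which the buttons are missing.
raw = '<div class="full"></div>'

print(collect_onclicks(rendered))  # two onclick strings
print(collect_onclicks(raw))       # empty list
```

If the same check on the real response text returns an empty list, the parser is not at fault: the markup simply is not in what the server sent.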

If you're just scraping this once (i.e. you just want the data and you're not trying to build a bot to play the game for you), the quickest option is often to create a .htm file containing only links to every page you want to scrape (put each link in an <a> tag; you don't even need link text). Then you can use a tool like DownThemAll in Firefox to save a local copy of every page with the proper formatting.
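The links file described above can be generated rather than typed by hand. This is a minimal sketch under stated assumptions: build_links_page is a hypothetical helper, links.htm is an arbitrary file name, and the match ids are placeholders for whichever pages you actually need.

```python
from html import escape


def build_links_page(urls):
    """Return a minimal HTML page with one empty <a> tag per URL."""
    links = "\n".join(
        '<a href="{}"></a>'.format(escape(url, quote=True)) for url in urls
    )
    return "<!DOCTYPE html>\n<html>\n<body>\n{}\n</body>\n</html>\n".format(links)


# Hypothetical match ids; replace with the pages you want to save.
match_ids = [13362, 13392, 13424]
urls = ["http://dota2lounge.com/match?m={}".format(m) for m in match_ids]

page = build_links_page(urls)

# Write the page next to the script so it can be opened in the browser.
with open("links.htm", "w", encoding="utf-8") as f:
    f.write(page)
```

Opening links.htm in the browser then gives the download tool one link per page to fetch.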

edited May 23, 2017 at 12:31

Community Bot

answered May 6, 2016 at 1:47

Joseph

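Try this:

from bs4 import BeautifulSoup

# r is the requests response from the question
soup = BeautifulSoup(r.text, "html.parser")
for link in soup.find_all('a'):
    print(link.get('onclick'))  # None for links without an onclick attribute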
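Once an onclick string has been obtained, its arguments can be pulled apart with a regular expression. The sketch below is an assumption-laden illustration: parse_chose_event and the CHOSE_EVENT pattern are hypothetical names, and the pattern only covers the two call shapes shown in the question. It does not help if the markup is missing from the response in the first place.

```python
import re

# Matches calls shaped like ChoseEvent(13392,'1st Game','1462327200')
# or ChoseEvent(13362,'Whole Match',false), as shown in the question.
CHOSE_EVENT = re.compile(
    r"ChoseEvent\((\d+),\s*'([^']*)',\s*(?:'([^']*)'|(\w+))\)"
)


def parse_chose_event(onclick):
    """Return (event_id, label, third_argument), or None if there is no match."""
    if not onclick:
        return None
    match = CHOSE_EVENT.search(onclick)
    if match is None:
        return None
    event_id, label, quoted, bare = match.groups()
    # The third argument is either a quoted string or a bare word like false.
    third = quoted if quoted is not None else bare
    return int(event_id), label, third


print(parse_chose_event("ChoseEvent(13392,'1st Game','1462327200')"))
print(parse_chose_event("ChoseEvent(13362,'Whole Match',false)"))
print(parse_chose_event(None))
```

The None guard matters because links without an onclick attribute yield None rather than a string.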

answered May 4, 2016 at 9:10

Suraj

2 Comments

Peter, over a year ago

Thanks, but I tried your suggested parser and that didn't work. If I look at the text of the requests response, I still can't see the html there. Are there any reasons it would be rendered in my browser but not in the Python request?

2016-05-04T09:54:31.023Z

Suraj, over a year ago

I didn't find your html section in the page source. When I tried this code on the dota2lounge.com/match?m=13362 url, it found 2 onclick selectTeam($(this), 'a') functions there.

2016-05-04T10:43:39.227Z
