Questions tagged as 'beautifulsoup'

2
answers

Load different URLs without closing the browser

With this code I load different URLs in the browser to extract their source, but I close the browser after each read. Can I load the next URL in the list without closing it? from selenium import webdriver from bs4 import BeautifulSoup delFicher...
asked by 03.08.2017 / 14:53
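One possible approach to the question above, as a minimal sketch: create the driver once, call `get()` for each URL, and quit only after the last one. The helper name `fetch_sources`, the placeholder URLs, and the choice of Firefox are assumptions for illustration; the asker's original code is truncated and is not reproduced here.

```python
from typing import Dict, Iterable


def fetch_sources(driver, urls: Iterable[str]) -> Dict[str, str]:
    """Load each URL in the same browser session and return {url: page source}."""
    sources = {}
    for url in urls:
        driver.get(url)  # navigates the already-open browser to the next URL
        sources[url] = driver.page_source
    return sources


def main() -> None:
    # Imported here so the helper above can be exercised without Selenium installed.
    from selenium import webdriver

    urls = ["https://example.com/", "https://example.org/"]  # placeholder list
    driver = webdriver.Firefox()  # assumption: Firefox and geckodriver are available
    try:
        for url, html in fetch_sources(driver, urls).items():
            print(url, len(html))
    finally:
        driver.quit()  # close the browser once, after the last URL
```

Each returned source string can then be passed to `BeautifulSoup(html, "html.parser")` as usual; the `try`/`finally` ensures the browser is closed even if one of the pages fails to load.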
1
answer

Problems importing libraries in Python

I'm following a BeautifulSoup tutorial in Python (version 2.7) with the following code: from bs4 import BeautifulSoup import urllib2 f = open ('C:\Python27\project\FFootball_DiamondMine\outfileESPN.txt','w') errorFile = open ('C:\Python27\proj...
asked by 16.03.2017 / 02:20
1
answer

Get text inside comment tags using BeautifulSoup

I'm scraping an HTML document using BeautifulSoup4, and I need to extract text that is inside comments. Suppose I want to get commented text like this: <!-- este es el texto --> It can be obtained (skipping the other statements) like this: te...
asked by 10.09.2018 / 11:21
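For the comment-extraction question above, a commonly used idiom is to filter text nodes by the `Comment` class from `bs4`. This is only a sketch: the sample HTML is made up around the comment quoted in the question, and the asker's own (truncated) approach may differ.

```python
from bs4 import BeautifulSoup, Comment

# Made-up document containing the comment quoted in the question.
html = "<div><!-- este es el texto --><p>visible</p></div>"
soup = BeautifulSoup(html, "html.parser")

# Comment is a subclass of NavigableString, so a callable filter on
# `string` can keep only the text nodes that are HTML comments.
comments = soup.find_all(string=lambda text: isinstance(text, Comment))

# Strip the padding spaces that sat inside the <!-- ... --> delimiters.
comment_texts = [comment.strip() for comment in comments]
print(comment_texts)
```

The visible paragraph text is not included in the result, because ordinary text nodes are not instances of `Comment`.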
1
answer

Search for all tags that begin with a given string

I'm scraping a page where I'm looking for a tag, and I'm doing it with BeautifulSoup. I want all the links whose href starts the same way but ends differently. Example: for url in soup.find_all('a', href=("/es/nds/*******")): Where the...
asked by 13.01.2018 / 12:58
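For the prefix-matching question above, `find_all` accepts a compiled regular expression as an attribute filter, so an anchored pattern can stand in for the wildcard. A minimal sketch, assuming the prefix `/es/nds/` from the question; the sample links are invented for illustration.

```python
import re

from bs4 import BeautifulSoup

# Invented sample markup; only the "/es/nds/" prefix comes from the question.
html = """
<a href="/es/nds/uno">uno</a>
<a href="/es/nds/dos">dos</a>
<a href="/en/other">other</a>
<a>no href</a>
"""
soup = BeautifulSoup(html, "html.parser")

prefix = "/es/nds/"
# BeautifulSoup applies the pattern with search(), so "^" is needed to
# anchor the match at the start of the href value.
links = soup.find_all("a", href=re.compile("^" + re.escape(prefix)))
hrefs = [a["href"] for a in links]
print(hrefs)
```

Anchors without an `href` attribute are skipped automatically, and `re.escape` keeps any special characters in the prefix from being read as regex syntax.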
1
answer

Error installing BeautifulSoup using pip

I'm trying to install the BeautifulSoup library with pip for Python 3.6: $ pip install beautifulSoup But it throws an error: (myvenv) eduardorr21@eduardoreyes21 ~/Documents/WebScraping $ pip install beautifulSoup beautifulSoup...
asked by 08.04.2018 / 08:23
1
answer

Error 'NoneType' object is not callable when trying to select item by id

I'm trying to reach a DOM element that has a certain id. The problem is that I am using the .getElementsById() method of AdvancedHTMLParser, which is documented in its guide. Once I extract the HTML from the URL,...
asked by 27.12.2017 / 19:47
0
answers

BeautifulSoup loop

I am trying to select a series of elements within a web page, six to be exact. With this code I get the elements I want, but also many elements I do not want after the sixth. How can I write a loop that selects only the first 6 element...
asked by 03.05.2018 / 02:53
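For the question above about keeping only the first six matches, one option that avoids a hand-written loop is the `limit` argument of `find_all`. A minimal sketch: the `li` tag and the `item` class are hypothetical, since the asker's code and page are not shown.

```python
from bs4 import BeautifulSoup

# Hypothetical page with ten matching elements.
items = "".join(f"<li class='item'>item {i}</li>" for i in range(1, 11))
soup = BeautifulSoup("<ul>" + items + "</ul>", "html.parser")

# limit=6 makes find_all stop after the sixth match, in document order.
first_six = soup.find_all("li", class_="item", limit=6)
texts = [li.get_text() for li in first_six]
print(texts)
```

Slicing an unrestricted result, as in `soup.find_all("li", class_="item")[:6]`, gives the same elements; `limit` simply stops the search earlier.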
2
answers

Select an element of a website using BeautifulSoup

link Hello, I am trying to extract the information from that page with Python and BeautifulSoup. So far I have managed to extract and filter the yellow box at the bottom, but the box above it is impossible to extract t...
asked by 01.05.2018 / 01:00
1
answer

Analyze HTML in Python with BeautifulSoup

Objective: I'm trying to list specific names from the website https://www.screwfix.com/c/tools/angle-grinders/cat830694 . For example, get Titan TTB281GRD from the title of the link in this part: <div id=...
asked by 13.12.2017 / 00:10
0
answers

How do I parse data from an HTML file in real time?

I'm trying to read data from a local HTML file. A script makes the value of one of the tags in this HTML increase, and I want to obtain that value. I am parsing the HTML file with the BeautifulSoup library but it always reads the initial ht...
asked by 18.11.2018 / 20:19