I am attempting to scrape text from different elements within a website. I have had no luck scraping any text so far. attached is the HTML and a picture of the exact elements I would like to extract from the website. I want to scrape the date/time, the name of the teams "Juventus - AC Milan", and the odds. I am not asking for it to be done for me I just don't know why my code isn't getting text from the buttons.
I'm using urllib3 and beautifulsoup4. I got the header from a forum and just changed the chrome version to mine. How do I know what needs to be inside my header? Attached is the code I have tried:
import urllib3
from bs4 import BeautifulSoup
url = "https://beta.goal3.xyz/sports/soccer"
# Define the headers
headers = {
'User-Agent': 'Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.11 (KHTML, like Gecko) Chrome/113.0.5672.127 Safari/537.11',
'Accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8',
'Accept-Charset': 'ISO-8859-1,utf-8;q=0.7,*;q=0.3',
'Accept-Encoding': 'none',
'Accept-Language': 'en-US,en;q=0.8',
'Connection': 'keep-alive'
}
# Create an instance of the PoolManager with the specified headers
http = urllib3.PoolManager(headers=headers)
# Send a GET request to the website
response = http.request('GET', url)
# Check if the request was successful
if response.status == 200:
# Parse the HTML content using BeautifulSoup
soup = BeautifulSoup(response.data, 'html.parser')
# Find the element with the specified tag and class
element = soup.find('div', class_='flex justify-between flex-1 w-full')
if element:
# Locate the elements with the specified tag and class within the found element
buttons = element.find_all('button', class_='group outcome')
# Iterate over the found elements and print the text attribute of the span tags
for button in buttons:
span = button.find('span')
if span:
print(span.text)
else:
print("Element not found.")
else:
print("Failed to retrieve the website content.")
source https://stackoverflow.com/questions/76359803/getting-text-attributes-from-different-elements-on-a-website-using-python
Comments
Post a Comment