What is the method to retrieve website links using python?

answered 2023-07-09 06:53:02 +0000

lakamha
21 ●3 ●3

There are several ways to retrieve website links using Python. Here are two common methods:

Using the requests and BeautifulSoup libraries:

import requests
from bs4 import BeautifulSoup

url = 'https://www.example.com'
html = requests.get(url).text
soup = BeautifulSoup(html, 'html.parser')

# Find all links on the page
links = []
for link in soup.find_all('a'):
    links.append(link.get('href'))

print(links)

Using the Selenium library:

from selenium import webdriver

url = 'https://www.example.com'
driver = webdriver.Chrome('path/to/chromedriver.exe')
driver.get(url)

# Find all links on the page
links = []
for link in driver.find_elements_by_tag_name('a'):
    links.append(link.get_attribute('href'))

print(links)

driver.quit()

Note that using the requests and BeautifulSoup method requires less setup and browser overhead compared to the Selenium method, but is limited to static web pages. If you need to scrape dynamic web pages that involve user interaction or require JavaScript rendering, Selenium is a better choice.

edit flag offensive delete link

add a comment

Your Answer

Please start posting anonymously - your entry will be published after you log in or create a new account. This space is reserved only for answers. If you would like to engage in a discussion, please instead post a comment under the question or an answer that you would like to discuss

Add Answer

What is the method to retrieve website links using python?

1 Answer

Your Answer

Question Tools

Stats

Related questions

What is the method to retrieve website links using python? edit

1 Answer