Scrape  News

Newspaper Web Scraping

Introduction

La Gaceta is a popular newspaper in the Cotopaxi province, in the country of Ecuador.

La Gaceta publishes news of a broad bunch of topics. The topics are about politics, sports, economy and so on.

This newspaper also splits the notices per town in the province to target the audience.

The goal of this work was scraped the news to get the title and the story for each new.

All the scraping process followed the rules of the robots.txt file

Data obtained:

    The newspaper is available in the next link: https://lagaceta.com.ec/

  1. News title.
  2. News body.
Importance of the project:
  • Save the news to have a historical register.
  • Automatization of data collection.
Outcomes:
  • Python script to scrape the site.
  • Each new is saved like a txt file in a folder according to the date.

Sebastián

Sarasti

Follow me on my social media channels to know more about my projects.

Follow Us

Get In Touch

Pujilí, Cotopaxi, Ecuador

sebitas.alejo@hotmail.com

© Sebastián Sarasti Zambonino. All Rights Reserved.

Designed by HTML Codex

Edited by Sebastián Sarasti and Angel Bastidas