#webscraping #pythontutorial #dataanalysis #selenium #flightdata #pythonprojects #automation #dataengineering #datascience #mysql #graphalgorithm #matplotlib
This end-to-end data engineering project showcases the process of web scraping, storing, and analyzing flight data from Ryanair. Using Python and Selenium, we scrape flight information from ryanair.at, first storing it in a CSV file (Part 1) and then migrating it to a MySQL database (Part 2).
Once enough data is collected, we analyze the routes to find the most popular destinations using weighted degree centrality with the networkx library. Finally, we visualize all routes using matplotlib, creating an insightful view of the Ryanair flight network.
GitHub: https://github.com/TetyanaHvozda/webscraping_ryanair/tree/part_2
Find all parts of the project below:
Web Scrape Flights from Ryanair to CSV
GitHub: https://github.com/TetyanaHvozda/webscraping_ryanair/blob/main/part_1
Web Scrape Flights from Ryanair to MySQL DB
https://github.com/TetyanaHvozda/webscraping_ryanair/tree/part_2
Flight Network Analysis & Visualization
https://github.com/TetyanaHvozda/webscraping_ryanair/tree/main
コメント