Web Scraping

(4 sessions, 6 hours)

Language: English/廣東話

$3,800

推廣價 $2,800

WORKSHOP OUTLINE

Session 1

what is web scraping?

  1. Introduction of Web Scraping with examples.

  2. Understand structure of HTML.

  3. Introduction of JavaScript.

  4. Python Web Scraping workflow.

  5. Python libraries for Web Scraping (e.g. BeautifulSoup).

  1. 介紹網絡數據採集 (Web Scraping) 並透過例子說明。

  2. 了解 HTML 的結構。

  3. 認識 JavaScript。

  4. Python Web Scraping 流程介紹。

  5. 認識 Web Scraping 的 Python libraries,例如: BeautifulSoup。

Session 2

process the scrapped data

  1. Crawl a selected website and scrap the data.

  2. Data storing - CSV and MySQL.

  3. Work with Python library - PyMySQL.

  4. Database management good practices.

  5. Data cleaning and document encoding to increase readability.

  1. 爬行並採集綱頁數據。

  2. 儲存數據到 CSV 和 MySQL。

  3. 利用 Python library - PyMySQL。

  4. 數據庫管理技巧分享。

  5. 過濾數據和進行文檔編碼,以取得更容易理解的結果。

session 3

crawl some websites!

  1. Stocks market website.

  2. Properties market website.

  3. Jobs market website.

  4. Travel website.

  5. Workflow analysis and review.

  1. 股票市埸網站。

  2. 物業市埸網站。

  3. 工作市埸網站。

  4. 旅遊網站。

  5. 流程分析和回顧。

Session 4

data mining and analysis

  1. Concept and Examples of Data Mining and Analysis.

  2. Python libraries for Data Mining and Analysis.

  3. Insight and experience sharing by Data Specialist.

  4. Summary of the workshop.

  1. Data Mining and Analysis 的例子和概念。

  2. 介紹 Data Mining and Analysis 常用的 Python libraries。

  3. 數據專家的經驗和心得分享。

  4. 工作坊總結。