Skip to content

yousenwang/web-crawler-104-sap

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

web-crawler-104-sap

Introduction

This project is about web crawling. It will use a keyword (e.g. SAP) to search hiring positions and their companies on 104 human resource bank and store them as two .csv file respectively.

Users Friendly Steps

  1. python web_crawler_final.py

  2. python company_count.py

Files Descriptions

  1. web_crawler_final.py

The program will load html scripts using beautiful soup package. Then, it will extract SAP positions' data and then search for the hiring companies web page to get companies' data.

  1. company_count.py

This is optional. It will rank the companies by the number of opening positions in the .csv file.

Note: save the .csv file as csv utf-8 in order to work.

Author

Ethan Wang

[email protected]

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages