Skip to content

A Simple spider that use to crawl the Coursera video and pdf links and downloader script

Notifications You must be signed in to change notification settings

chuanleixu/coursera_spider

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 

Repository files navigation

#Coursera Spider

  • coursera.py you can use it to get top 100 movie name which is listed by douban website

##1. Install

git clone [email protected]:Andrew-liu/Coursera_Spider.git

##2. Use It simply

$ python coursera.py pkuco-001 #最后一个参数为课程地址

##3. Example Output

Coursera git:(master) ✗ python coursera.py pkuco-001
Input your Email > User_name
Input your Password >
登陆成功...
读取网页成功...
正则匹配结束...
下载链接的长度 56
https://class.coursera.org/pkuco-001/lecture/download.mp4?lecture_id=17
...
https://class.coursera.org/pkuco-001/lecture/download.mp4?lecture_id=139
下载pdf的长度 56
抓取Coursera课程下载链接和pdf链接成功

##4. Download Script

下载脚本运行

$python downloadshell.py coursera.pdf #最后一个参数为连接保存文件

更多详细内容请查看Python-Coursera抓站小结

About

A Simple spider that use to crawl the Coursera video and pdf links and downloader script

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%