[go: nahoru, domu]

Skip to content

A web crawler capable of downloading all academic papers on Science Robotics within 5 mins based on Scrapy Framework.

License

Notifications You must be signed in to change notification settings

lvliangxiong/scirob-paper-download

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Science Robotics Paper Download

This is a web crawler which can download all the papers on Sci Rob in 5 mins.

The technique architecture are based on Scrapy and Python.

Prerequisites

In order to use this crawler, you should make sure you have the access to the paper resources on science robotics website (right IP).

Finally, a pdf combination script (pdfcom.py) is provided for combining the papers belongs to one issue into one single pdf, also a pdf file containing all the downloaded pdfs are generated. Papers in the combined pdf file are arranged in their original order on the web.

Documentation

A detailed tutorial of scrapy and this demo are given at here.

About

A web crawler capable of downloading all academic papers on Science Robotics within 5 mins based on Scrapy Framework.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages