mirror of https://github.com/awesomedata/awesome-public-datasets.git
synced 2024-04-18 07:30:58 +08:00
Removing lots of sites that return HTTP 200 from the whitelist
parent f944ea2785
commit cb9ac6f1db
@@ -4,7 +4,7 @@ rvm:
 before_script:
 - gem install awesome_bot
 script:
-- site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,data.gov.be,census.gov/acs/www/data_documentation/data_release_info/,europeansocialsurvey.org/data/
-- whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,data.ohouston.org,ntrl.ntis.gov,networkdata.ics.uci.edu,sinda.crn2.inpe.br,archive.ics.uci.edu,openflights.org,www.data.gov.bc.ca
-- site503=labrosa.ee.columbia.edu/millionsong,datamob.org,research.microsoft.com
+- site404=www.datawrangling.com,getglue-data.s3.amazonaws.com,archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,lib.stat.cmu.edu,http://www.oecd.org/document/0,census.gov/acs/www/data_documentation/data_release_info/
+- whtlist=travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,www.cmr.osu.edu,gutenberg.org,donnees.gouv.qc.ca,data.rio.rj.gov.br,ntrl.ntis.gov,openflights.org,www.data.gov.bc.ca
+- site503=datamob.org,research.microsoft.com
 - awesome_bot README.rst --allow-dupe --allow-redirect --white-list $site404,$whtlist,$site503 --set-timeout=5
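The three variables in this hunk are long comma-separated lists, so the entries that were dropped are hard to spot by eye. The following is a minimal sketch, not part of the commit, that compares the old and new value of each variable and lists what was removed. The names `OLD`, `NEW`, and `removed_entries` are hypothetical helpers introduced here for illustration; the strings are copied from the hunk.

```python
# Old and new values of the three shell variables, copied from the hunk.
# OLD, NEW and removed_entries are illustrative names, not part of the repo.
OLD = {
    "site404": "www.datawrangling.com,getglue-data.s3.amazonaws.com,"
               "archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,"
               "lib.stat.cmu.edu,http://www.oecd.org/document/0,data.gov.be,"
               "census.gov/acs/www/data_documentation/data_release_info/,"
               "europeansocialsurvey.org/data/",
    "whtlist": "travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,"
               "137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,"
               "www.cmr.osu.edu,missionlocal.org,gutenberg.org,donnees.gouv.qc.ca,"
               "data.rio.rj.gov.br,data.ohouston.org,ntrl.ntis.gov,"
               "networkdata.ics.uci.edu,sinda.crn2.inpe.br,archive.ics.uci.edu,"
               "openflights.org,www.data.gov.bc.ca",
    "site503": "labrosa.ee.columbia.edu/millionsong,datamob.org,research.microsoft.com",
}
NEW = {
    "site404": "www.datawrangling.com,getglue-data.s3.amazonaws.com,"
               "archive.org/details/2011-05-calufa-twitter-sql,www.stats4stem.org,"
               "lib.stat.cmu.edu,http://www.oecd.org/document/0,"
               "census.gov/acs/www/data_documentation/data_release_info/",
    "whtlist": "travis,crawdad.cs.dartmouth.edu,data.nasdaq.com,"
               "137.189.35.203/WebUI/CatDatabase/catData.html,numbrary.com,"
               "www.cmr.osu.edu,gutenberg.org,donnees.gouv.qc.ca,"
               "data.rio.rj.gov.br,ntrl.ntis.gov,openflights.org,www.data.gov.bc.ca",
    "site503": "datamob.org,research.microsoft.com",
}


def removed_entries(old: str, new: str) -> list:
    """Return the entries of comma-separated `old` that are absent from `new`,
    preserving their original order."""
    kept = set(new.split(","))
    return [entry for entry in old.split(",") if entry not in kept]


if __name__ == "__main__":
    # Print the dropped entries for each variable.
    for name in OLD:
        print(name, removed_entries(OLD[name], NEW[name]))
```

Splitting on commas mirrors how the lists are written in the config; it makes no claim about how `awesome_bot` itself parses `--white-list`.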