Scrape plants scientific name information from Agroforestry Species Switchboard 2.0.

Last update: Dec 23, 2021

Overview

Agroforestry Species Switchboard 2.0 Scraper

Scrape plants scientific name information from Species Switchboard 2.0.

Requirements

python >= 3.10 (you can use pyenv for easier python version management)
pipenv

How to run

Install dependencies

cp env.sample .env
pipenv --python 3
pipenv install

Run
```
pipenv run python main.py
```
The result will be placed in a file named result.*.csv

Test Shell

pipenv run scrapy shell 'http://apps.worldagroforestry.org/products/switchboard/index.php/species_search/Acacia%20abyssinica'

Cleanup All Outputs

rm result.* && rm log.*

Special Cases

Case	Link	Note
ICRAF Databases Not Found	Engelhardia spicata
Genus Found	Forficula	What to do next?
Multiple Species Found	Alstonia spectabilis	Get the matched species right?
Species Variant Found	Engelhardtia spicata	Need human to check
Similar Species Found	Costus speciosus	Need human to check

Contributing

Fork this repo
Develop
Create pull request
Tag @rizqirizqi for review
Merge~~

License

GPL-3.0

Scrape plants scientific name information from Agroforestry Species Switchboard 2.0.

Related tags

Overview

Agroforestry Species Switchboard 2.0 Scraper

Requirements

How to run

Test Shell

Cleanup All Outputs

Special Cases

Contributing

License

Owner

Mgs. M. Rizqi Fadhlurrahman

Proxy scraper. Format: IP | PORT | COUNTRY | TYPE

Google Developer Profile Badge Scraper

An automated, headless YouTube Watcher and Scraper

Get paper names from dblp.org

A webdriver-based script for reserving Tsinghua badminton courts.

Libextract: extract data from websites

Scraping Top Repositories for Topics on GitHub,

Extract embedded metadata from HTML markup

Dailyiptvlist.com Scraper With Python

A database scraper created with mechanical soup and sqlite

Scrapy, a fast high-level web crawling & scraping framework for Python.

12306抢票脚本

LSpider 一个为被动扫描器定制的前端爬虫

Discord webhook spammer with proxy support and proxy scraper

Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors

VG-Scraper is a python program using the module called BeautifulSoup which allows anyone to scrape something off an website. This program lets you put in a number trough an input and a number is 1 news article.

Semplice scraper realizzato in Python tramite la libreria BeautifulSoup

CRI Scrape is a tool for get general info about Italian Red Cross in GAIA Platform

🤖 Threaded Scraper to get discord servers from disboard.org written in python3

薅薅乐 - JD 测试脚本