Abstract : Extractify is a free extension for Chromium, developed in JavaScript under Atom, whose purpose is to scrap structured data on the web. It is particularly designed for collecting online comments or online conversations such as forums.
It allows you to:
- Select structured information on a web page (like tables with rows and columns), by direct selection on the web page, or manual selection by entering HTML tags and related CSS code
- Select the pagination of pages with the same structure and level
- Repeat the process as many times as desired for lower levels
- Scrape the whole selection
- Finally, obtain a file in json format that can be easily imported in other software, in L@ME for example.
What it does not allow: everything else!
https://hal-mines-paristech.archives-ouvertes.fr/hal-02404932 Contributor : Frédéric VergnaudConnect in order to contact the contributor Submitted on : Wednesday, December 11, 2019 - 3:42:19 PM Last modification on : Wednesday, February 16, 2022 - 5:11:22 PM