skip to main content
10.1145/3508072.3508106acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicfndsConference Proceedingsconference-collections
research-article

Web Scraper Application for Extracting Scientific Journals Data

Published:13 April 2022Publication History

ABSTRACT

Searching for certain subjects of articles that are disseminated throughout scientific journals would be a time-consuming task, as it would necessitate scouring many digital libraries or journal websites. This process can be performed efficiently by utilizing web scraping technology, in which a scraper is used to extract web page content into more organized and structured datasets. This paper proposes a customized web scraper called ”Research Scraper” that will extract content from scientific journal websites, allowing users to access all results from a single search interface. The proposed technique is simple to use and can help with the process of analyzing publications in a specific field. This paper presents and explains the development steps, system design, and technologies that will be used in the implementation phase.

Skip Supplemental Material Section

Supplemental Material

References

  1. Rabiyatou Diouf, Edouard Ngor Sarr, Ousmane Sall, Babiga Birregah, Mamadou Bousso, and Sény Ndiaye Mbaye. 2019. Web scraping: state-of-the-art and areas of application. In 2019 IEEE International Conference on Big Data (Big Data). IEEE, 6040–6042.Google ScholarGoogle ScholarCross RefCross Ref
  2. [2] Import.io.2022. https://www.import.io/.Google ScholarGoogle Scholar
  3. Yesi Novaria Kunang, Susan Dian Purnamasari, 2018. Web scraping techniques to collect weather data in South Sumatera. In 2018 International Conference on Electrical Engineering and Computer Science (ICECOS). IEEE, 385–390.Google ScholarGoogle Scholar
  4. Software Innovation Lab LLC. 2021. Data Miner. https://data-miner.io/.Google ScholarGoogle Scholar
  5. Deepak Kumar Mahto and Lisha Singh. 2016. A dive into Web Scraper world. In 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom). IEEE, 689–693.Google ScholarGoogle Scholar
  6. Ryan Mitchell. 2018. Web scraping with Python: Collecting more data from the modern web. ” O’Reilly Media, Inc.”.Google ScholarGoogle Scholar
  7. [7] Octoparse.2021. https://www.octoparse.com/.Google ScholarGoogle Scholar
  8. D Pratiba, MS Abhay, Akhil Dua, Giridhar K Shanbhag, Neel Bhandari, and UTKARSH SINGH. 2018. Web Scraping And Data Acquisition Using Google Scholar. In 2018 3rd International Conference on Computational Systems and Information Technology for Sustainable Solutions (CSITSS). IEEE, 277–281.Google ScholarGoogle Scholar
  9. [9] Simplescaper.2020. https://simplescraper.io/.Google ScholarGoogle Scholar
  10. [10] Helium Scraper Software.2021. https://www.heliumscraper.com/.Google ScholarGoogle Scholar
  11. K Sundaramoorthy, R Durga, and S Nagadarshini. 2017. Newsone—an aggregation system for news using web scraping method. In 2017 International Conference on Technical Advancements in Computers and Communications (ICTACC). IEEE, 136–140.Google ScholarGoogle ScholarCross RefCross Ref

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    ICFNDS '21: Proceedings of the 5th International Conference on Future Networks and Distributed Systems
    December 2021
    847 pages
    ISBN:9781450387347
    DOI:10.1145/3508072

    Copyright © 2021 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 13 April 2022

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format