rFITb: Random Forest in the Browser

Authors:

Chandradhar Rao,

Darshan Madesh,

Mamatha H R

ICIEI '23: Proceedings of the 2023 8th International Conference on Information and Education Innovations

Pages 218 - 223

https://doi.org/10.1145/3594441.3594477

Published: 21 August 2023

Abstract

This paper introduces rFITb, a distributed computing platform that enables the execution of computationally intensive random forest jobs on personal devices such as smartphones and personal computers. The platform leverages the increased computational capacity of personal devices to distribute and execute jobs globally, providing an efficient alternative to cloud-based services. The paper describes rFITb's architecture and design optimizations, along with a comparative evaluation of its performance against Python's sklearn ensemble random forest classifier on various datasets. The results show that rFITb outperforms the sklearn classifier in terms of model time, while also providing a mechanism for managing failure-prone volunteers.
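
The abstract does not describe rFITb's internals, so the following is only a minimal sketch of the general idea it names: a random forest is split into independent per-tree jobs, jobs lost to failing volunteers are retried, and the finished trees vote on a prediction. Everything here is an assumption made for illustration. The function names (`train_stump`, `run_forest`, `predict`), the `fail_rate` parameter, the use of a one-split stump in place of a full decision tree, and the simple re-queue policy are hypothetical and are not taken from the paper.

```python
import random
from collections import Counter, deque


def train_stump(rows, rng):
    """Fit a one-split classifier on a bootstrap sample (stand-in for a tree)."""
    sample = [rng.choice(rows) for _ in rows]      # bootstrap: sample with replacement
    feature = rng.randrange(len(rows[0][0]))       # random feature, as in random forests
    best = None
    for x, _ in sample:
        t = x[feature]                             # candidate threshold
        left = [y for xs, y in sample if xs[feature] <= t]
        right = [y for xs, y in sample if xs[feature] > t]
        l_label = Counter(left).most_common(1)[0][0]
        r_label = Counter(right).most_common(1)[0][0] if right else l_label
        errors = sum(y != l_label for y in left) + sum(y != r_label for y in right)
        if best is None or errors < best[0]:
            best = (errors, feature, t, l_label, r_label)
    return best[1:]                                # (feature, threshold, left, right)


def predict_stump(stump, x):
    feature, t, l_label, r_label = stump
    return l_label if x[feature] <= t else r_label


def run_forest(rows, n_trees, fail_rate, seed=0):
    """Simulate volunteers: each job trains one tree; failed jobs are re-queued."""
    rng = random.Random(seed)
    queue = deque(range(n_trees))                  # one job per tree
    forest, attempts = [], 0
    while queue:
        job = queue.popleft()
        attempts += 1
        if rng.random() < fail_rate:               # hypothetical volunteer dropout
            queue.append(job)                      # assumed policy: retry later
            continue
        forest.append(train_stump(rows, rng))
    return forest, attempts


def predict(forest, x):
    """Majority vote over all trees returned by volunteers."""
    votes = Counter(predict_stump(s, x) for s in forest)
    return votes.most_common(1)[0][0]
```

Because each tree depends only on its own bootstrap sample, the jobs need no coordination beyond the queue, which is why this kind of workload suits unreliable volunteer devices. A real system would also need timeouts to detect silent failures, which this simulation replaces with a random draw.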

Index Terms

rFITb: Random Forest in the Browser
1. Computing methodologies
  1. Distributed computing methodologies
    1. Distributed algorithms
    2. Distributed programming languages
  2. Machine learning
2. Software and its engineering
  1. Software notations and tools
    1. General programming languages
      1. Language types
        1. Distributed programming languages

Index terms have been assigned to the content through auto-classification.

Published In

ICIEI '23: Proceedings of the 2023 8th International Conference on Information and Education Innovations

April 2023

243 pages

ISBN:9798400700613

DOI:10.1145/3594441

Copyright © 2023 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article
Research
Refereed limited

Conference

ICIEI 2023

ICIEI 2023: 2023 The 8th International Conference on Information and Education Innovations

April 13 - 15, 2023

Manchester, United Kingdom
