Ensemble Classifiers Employed for Spam Review Detection

Alhassan Jamilu Ibrahim; Maheyzah Siraj; Usman Abubakar Jauro

doi:doi:10.11648/j.es.20210603.11

| Peer-Reviewed

Ensemble Classifiers Employed for Spam Review Detection

Alhassan Jamilu Ibrahim, Maheyzah Siraj, Usman Abubakar Jauro

Published in Engineering Science (Volume 6, Issue 3)

Received: 12 June 2021 Accepted: 7 July 2021 Published: 11 August 2021

Views: Downloads:

Download PDF

Share This Article

Twitter
Linked In
Facebook

Abstract

The advancement of technology and the use of internet have changed many aspects of human culture over the years. Today, consumers take confidence in e-commerce platforms like amazon and eBay for comprehensive understanding of products and services when making a purchase decision. Here the web or user-generated content from consumers of such products and services, known as reviews, are exploited by spam reviewers to falsely promote or downgrade some targeted products. Despite potential solutions, Identifying and preventing review spam are still one of the top challenges faced by web search engines today. Therefore, in the quest to provide a more improved and efficient classification of review spam, this research probed different techniques in order to find most effective solution to spam detection. The research employed three base classifiers, Naïve Bayes, Support Vector Machines and Logistic Regression to form ensemble classifiers complimented with Arching classifier. The Arching classifier performs the weighted voting that produces the final class label with performance and accuracy higher than either of the individual base classifiers. Cross-validation is used as evaluation metrics to measure the performance or effectiveness of the ensemble classifiers while the experimental results shows that the ensemble classifiers achieve the best results compared to the single based classifier in terms of Precision, Recall, F1-measure and Accuracy.

Published in	Engineering Science (Volume 6, Issue 3)
DOI	10.11648/j.es.20210603.11
Page(s)	33-38
Creative Commons	This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited.
Copyright	Copyright © The Author(s), 2021. Published by Science Publishing Group

Keywords

Spam Review, Detection, Ensemble Classifier, Arching Classifier, Weighted Voting

References

[1]	N. Jindal, and B. Liu, “Analyzing and detecting review spam,” Seventh IEEE International Conference on Data Mining, 2007, pp. 547-552.
[2]	A. A. Benczur, K. Csalogany, T. Sarlos, and M. Uher, “SpamRank– fully automatic link spam detection,” Proceedings of the First International Workshop on Adversarial Information Retrieval on the Web, 2005.
[3]	G. Muthukumarasamy, “Spam review detection using a hybrid classification method,” International Journal of Advances in Engineering Sciences, vol. 4, 2014, pp. 22-27.
[4]	J. Halloran, “Classification: naive bayes vs logistic regression,” Technical report, University of Hawaii, 2009.
[5]	R. Rajat, S. Yirong, Y. N. Andrew, and M. Andrew, “Classification with hybrid generative/discriminative models,” The Annual Conference on Neural Information Processing Systems (NIPS), 2003.
[6]	J. Mcauley, and J. Leskovec, “Hidden factors and hidden topics: understanding rating dimensions with review text. ACM Proceedings of the 7th ACM Conference on Recommender Systems, 2013, pp. 165-172.
[7]	B. J. Peterson, “Finding a duplicate in a haystack,” Proceedings of the Thirty-first Annual SAS® Users Group, 2006.
[8]	Y. Bi, “The Impact of diversity on the accuracy of evidential classifier ensembles,” International Journal of Approximate Reasoning, vol. 53 (4), 2012, pp. 584-607.
[9]	P. Melville, and R. J. Mooney, “Constructing diverse classifier ensembles using artificial training examples,” IJCAI Citeseer, 2003, pp. 505-510.
[10]	F. Barigou, N. Barigou, and B. Atmani, “Spam detection system combining cellular automata and naïve bayes classifier,” ICWIT Citeseer, 2012, pp. 250-260.
[11]	Hussain N, Turab Mirza H, Rasool G, Hussain I, Kaleem M. Spam Review Detection Techniques: A Systematic Literature Review. Applied Sciences, 2019; 9 (5): 987.
[12]	Li A, Qin Z, Liu R, Yang Y, Li D. Spam review detection with graph convolutional networks. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019, pp. 2703-2711.
[13]	You Z, Qian T, Liu B. An attribute enhanced domain adaptive model for cold-start spam review detection. In Proceedings of the 27th International Conference on Computational Linguistics, 2018, pp. 1884-1895.
[14]	Hussain N, Mirza HT, Hussain I, Iqbal F, Memon I. Spam review detection using the linguistic and spammer Behavioral methods. IEEE Access, 2020, 8: 53801-16.
[15]	Fayaz M, Khan A, Rahman JU, Alharbi A, Uddin MI, Alouffi B. Ensemble Machine Learning Model for Classification of Spam Product Reviews. Complexity, 2020.
[16]	Shahariar GM, Biswas S, Omar F, Shah FM, Hassan SB. Spam review detection using deep learning. In 2019 IEEE 10th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON), 2019, (pp. 0027-0033). IEEE.

Cite This Article

Plain Text BibTeX RIS

APA Style

Alhassan Jamilu Ibrahim, Maheyzah Siraj, Usman Abubakar Jauro. (2021). Ensemble Classifiers Employed for Spam Review Detection. Engineering Science, 6(3), 33-38. https://doi.org/10.11648/j.es.20210603.11

Copy | Download

ACS Style

Alhassan Jamilu Ibrahim; Maheyzah Siraj; Usman Abubakar Jauro. Ensemble Classifiers Employed for Spam Review Detection. Eng. Sci. 2021, 6(3), 33-38. doi: 10.11648/j.es.20210603.11

Copy | Download

AMA Style

Alhassan Jamilu Ibrahim, Maheyzah Siraj, Usman Abubakar Jauro. Ensemble Classifiers Employed for Spam Review Detection. Eng Sci. 2021;6(3):33-38. doi: 10.11648/j.es.20210603.11

Copy | Download

@article{10.11648/j.es.20210603.11,
  author = {Alhassan Jamilu Ibrahim and Maheyzah Siraj and Usman Abubakar Jauro},
  title = {Ensemble Classifiers Employed for Spam Review Detection},
  journal = {Engineering Science},
  volume = {6},
  number = {3},
  pages = {33-38},
  doi = {10.11648/j.es.20210603.11},
  url = {https://doi.org/10.11648/j.es.20210603.11},
  eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.es.20210603.11},
  abstract = {The advancement of technology and the use of internet have changed many aspects of human culture over the years. Today, consumers take confidence in e-commerce platforms like amazon and eBay for comprehensive understanding of products and services when making a purchase decision. Here the web or user-generated content from consumers of such products and services, known as reviews, are exploited by spam reviewers to falsely promote or downgrade some targeted products. Despite potential solutions, Identifying and preventing review spam are still one of the top challenges faced by web search engines today. Therefore, in the quest to provide a more improved and efficient classification of review spam, this research probed different techniques in order to find most effective solution to spam detection. The research employed three base classifiers, Naïve Bayes, Support Vector Machines and Logistic Regression to form ensemble classifiers complimented with Arching classifier. The Arching classifier performs the weighted voting that produces the final class label with performance and accuracy higher than either of the individual base classifiers. Cross-validation is used as evaluation metrics to measure the performance or effectiveness of the ensemble classifiers while the experimental results shows that the ensemble classifiers achieve the best results compared to the single based classifier in terms of Precision, Recall, F1-measure and Accuracy.},
 year = {2021}
}

Copy | Download

TY  - JOUR
T1  - Ensemble Classifiers Employed for Spam Review Detection
AU  - Alhassan Jamilu Ibrahim
AU  - Maheyzah Siraj
AU  - Usman Abubakar Jauro
Y1  - 2021/08/11
PY  - 2021
N1  - https://doi.org/10.11648/j.es.20210603.11
DO  - 10.11648/j.es.20210603.11
T2  - Engineering Science
JF  - Engineering Science
JO  - Engineering Science
SP  - 33
EP  - 38
PB  - Science Publishing Group
SN  - 2578-9279
UR  - https://doi.org/10.11648/j.es.20210603.11
AB  - The advancement of technology and the use of internet have changed many aspects of human culture over the years. Today, consumers take confidence in e-commerce platforms like amazon and eBay for comprehensive understanding of products and services when making a purchase decision. Here the web or user-generated content from consumers of such products and services, known as reviews, are exploited by spam reviewers to falsely promote or downgrade some targeted products. Despite potential solutions, Identifying and preventing review spam are still one of the top challenges faced by web search engines today. Therefore, in the quest to provide a more improved and efficient classification of review spam, this research probed different techniques in order to find most effective solution to spam detection. The research employed three base classifiers, Naïve Bayes, Support Vector Machines and Logistic Regression to form ensemble classifiers complimented with Arching classifier. The Arching classifier performs the weighted voting that produces the final class label with performance and accuracy higher than either of the individual base classifiers. Cross-validation is used as evaluation metrics to measure the performance or effectiveness of the ensemble classifiers while the experimental results shows that the ensemble classifiers achieve the best results compared to the single based classifier in terms of Precision, Recall, F1-measure and Accuracy.
VL  - 6
IS  - 3
ER  -

Copy | Download

Author Information

Alhassan Jamilu Ibrahim

Information Assurance and Security Research Group (IASRG), Faculty of Computing, Universiti Teknologi Malaysia, Skudai Johor, Malaysia
Maheyzah Siraj

Information Assurance and Security Research Group (IASRG), Faculty of Computing, Universiti Teknologi Malaysia, Skudai Johor, Malaysia
Usman Abubakar Jauro

Information Assurance and Security Research Group (IASRG), Faculty of Computing, Universiti Teknologi Malaysia, Skudai Johor, Malaysia

Download PDF

Sections

Plain Text BibTeX RIS

APA Style

Alhassan Jamilu Ibrahim, Maheyzah Siraj, Usman Abubakar Jauro. (2021). Ensemble Classifiers Employed for Spam Review Detection. Engineering Science, 6(3), 33-38. https://doi.org/10.11648/j.es.20210603.11

Copy | Download

ACS Style

Alhassan Jamilu Ibrahim; Maheyzah Siraj; Usman Abubakar Jauro. Ensemble Classifiers Employed for Spam Review Detection. Eng. Sci. 2021, 6(3), 33-38. doi: 10.11648/j.es.20210603.11

Copy | Download

AMA Style

Alhassan Jamilu Ibrahim, Maheyzah Siraj, Usman Abubakar Jauro. Ensemble Classifiers Employed for Spam Review Detection. Eng Sci. 2021;6(3):33-38. doi: 10.11648/j.es.20210603.11

Copy | Download

@article{10.11648/j.es.20210603.11,
  author = {Alhassan Jamilu Ibrahim and Maheyzah Siraj and Usman Abubakar Jauro},
  title = {Ensemble Classifiers Employed for Spam Review Detection},
  journal = {Engineering Science},
  volume = {6},
  number = {3},
  pages = {33-38},
  doi = {10.11648/j.es.20210603.11},
  url = {https://doi.org/10.11648/j.es.20210603.11},
  eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.es.20210603.11},
  abstract = {The advancement of technology and the use of internet have changed many aspects of human culture over the years. Today, consumers take confidence in e-commerce platforms like amazon and eBay for comprehensive understanding of products and services when making a purchase decision. Here the web or user-generated content from consumers of such products and services, known as reviews, are exploited by spam reviewers to falsely promote or downgrade some targeted products. Despite potential solutions, Identifying and preventing review spam are still one of the top challenges faced by web search engines today. Therefore, in the quest to provide a more improved and efficient classification of review spam, this research probed different techniques in order to find most effective solution to spam detection. The research employed three base classifiers, Naïve Bayes, Support Vector Machines and Logistic Regression to form ensemble classifiers complimented with Arching classifier. The Arching classifier performs the weighted voting that produces the final class label with performance and accuracy higher than either of the individual base classifiers. Cross-validation is used as evaluation metrics to measure the performance or effectiveness of the ensemble classifiers while the experimental results shows that the ensemble classifiers achieve the best results compared to the single based classifier in terms of Precision, Recall, F1-measure and Accuracy.},
 year = {2021}
}

Copy | Download

TY  - JOUR
T1  - Ensemble Classifiers Employed for Spam Review Detection
AU  - Alhassan Jamilu Ibrahim
AU  - Maheyzah Siraj
AU  - Usman Abubakar Jauro
Y1  - 2021/08/11
PY  - 2021
N1  - https://doi.org/10.11648/j.es.20210603.11
DO  - 10.11648/j.es.20210603.11
T2  - Engineering Science
JF  - Engineering Science
JO  - Engineering Science
SP  - 33
EP  - 38
PB  - Science Publishing Group
SN  - 2578-9279
UR  - https://doi.org/10.11648/j.es.20210603.11
AB  - The advancement of technology and the use of internet have changed many aspects of human culture over the years. Today, consumers take confidence in e-commerce platforms like amazon and eBay for comprehensive understanding of products and services when making a purchase decision. Here the web or user-generated content from consumers of such products and services, known as reviews, are exploited by spam reviewers to falsely promote or downgrade some targeted products. Despite potential solutions, Identifying and preventing review spam are still one of the top challenges faced by web search engines today. Therefore, in the quest to provide a more improved and efficient classification of review spam, this research probed different techniques in order to find most effective solution to spam detection. The research employed three base classifiers, Naïve Bayes, Support Vector Machines and Logistic Regression to form ensemble classifiers complimented with Arching classifier. The Arching classifier performs the weighted voting that produces the final class label with performance and accuracy higher than either of the individual base classifiers. Cross-validation is used as evaluation metrics to measure the performance or effectiveness of the ensemble classifiers while the experimental results shows that the ensemble classifiers achieve the best results compared to the single based classifier in terms of Precision, Recall, F1-measure and Accuracy.
VL  - 6
IS  - 3
ER  -

Copy | Download