Enhancing Dark Web Classification: A Dynamic Crawler and Robust Classification Framework
Keywords:
Data Mining module, Dark Web Classification, Neural Network, Support Vector Machine, Dark Web LinksAbstract
The dark web presents significant challenges for law enforcement agencies due to its anonymous and constantly evolving nature, making it difficult to trace and monitor illegal activities. To combat this issue, a proposed system collects and cleans dark web pages, focusing on studying the market that specializes in selling illegal and harmful products through three key modules. The Crawler module accesses the market through the Tor Network, gathers data, and extracts crucial information about the products, sellers, and prices. The Pre-Processing module cleans and organizes the extracted data, ensuring its integrity and transforming it into a mineable format. The Data Mining module extracts insights and knowledge from the processed data, using techniques like clustering, classification, and association rule mining to identify patterns and trends. These modules provide valuable insights to law enforcement agencies and security researchers to combat illicit activities on the dark web market. The system's efficiency is evaluated through metrics like throughput and speedup, demonstrating its capability to handle large datasets and improve performance through parallel processing. Additionally, the proposed Support Vector Machine (SVM) with Neural Network (NN) outperforms other methodologies, highlighting its accuracy in predicting dark web links and establishing a robust classification framework. The research contributes to a comprehensive understanding of the dark web landscape and fosters advancements in cybersecurity and law enforcement practices. By integrating these solutions, this study aims to enhance the accuracy, adaptability, and effectiveness of dark web classification. The research contributes to a comprehensive understanding of the dark web landscape and fosters advancements in cybersecurity and law enforcement practices.
Downloads
References
Lautenschlager, S. (2016). Surface web, deep web, dark web—What’s the difference. Cambia research [Online] Available at: https://www.cambiaresearch.com/articles/85/surface-web-deep-web-dark-web-whats-the-difference.
Dingledine, R., Mathewson, N., & Syverson, P. F. (2004, August). Tor: The second-generation Ahmad, Maynard, A., S., & Gupta, A. (2019). The dark web as a phenomenon: A review and research agenda. In 30th Australasian Conference on Information Systems. Perth, Australia. Onion router USENIX security symposium, 4.
Baney, L. L., & Lewis, N. M. (2018). Internet pharmacies: Trends, opportunities, and risks. Health Law, 31, 1.
Magnúsdóttir, H. (2019). Darknet drug markets in a Swedish context: A descriptive analysis of Wall Street market and flugsvamp 3.0.
Meland, P. H., Bayoumy, Y. F. F., & Sindre, G. (2020). The Ransomware-as-a-Service economy within the darknet. Computers and Security, 92, 101762. https://doi.org/10.1016/j.cose.2020.101762
Bracci, A., Nadini, M., Aliapoulios, M., McCoy, D., Gray, I., Teytelboym, A., Gallo, A., & Baronchelli, A. (2022). Vaccines and more: The response of Dark Web marketplaces to the ongoing COVID-19 pandemic. PLOS ONE, 17(11), e0275288. https://doi.org/10.1371/journal.pone.0275288
Abdel Samad, Y. (2021). Case study: Dark web markets. Dark web investigation (pp. 237–247).
Rajawat, A. S., Bedi, P., Goyal, S. B., Kautish, S., Xihua, Z., Aljuaid, H., & Mohamed, A. W. (2022). Dark web data classification using neural network. Computational Intelligence and Neuroscience, 2022, 8393318. https://doi.org/10.1155/2022/8393318
Ahmad, A., Maynard, S., & Gupta, A. (2019). The dark web as a phenomenon: A review and research agenda. In 30th Australasian Conference on Information Systems. Perth, Australia.
Rhumorbarbe, D., Werner, D., Gilliéron, Q., Staehli, L., Broséus, J., & Rossy, Q. (2018). Characterizing the online weapons trafficking on crypto markets. Forensic Science International, 283, 16–20. https://doi.org/10.1016/j.forsciint.2017.12.008
Liggett, R., Lee, J. R., Roddy, A. L., & Wallin, M. A. (2020). The dark web as a platform for crime: An exploration of illicit drug, firearm, CSAM, and cybercrime markets. The Palgrave handbook of international cybercrime and cyberdeviance (pp. 91–116).
Nazah, S., Huda, S., Abawajy, J., & Hassan, M. M. (2020). Evolution of dark web threat analysis and detection: A systematic approach. IEEE Access, 8, 171796–171819. https://doi.org/10.1109/ACCESS.2020.3024198
Gulati, H., Saxena, A., Pawar, N., Tanwar, P., & Sharma, S. (2022, January). Dark web in modern world theoretical perspective: A survey. In International Conference on Computer Communication and Informatics (ICCCI), 2022 (pp. 1–10). IEEE Publications. https://doi.org/10.1109/ICCCI54379.2022.9740785
Dalvi, A., Paranjpe, S., Amale, R., Kurumkar, S., Kazi, F., & Bhirud, S. G. (2021, May). SpyDark: Surface and dark web crawler. In 2nd International Conference on Secure Cyber Computing and Communications (ICSCCC), 2021 (pp. 45–49). IEEE Publications. https://doi.org/10.1109/ICSCCC51823.2021.9478098
Kadoguchi, M., Hayashi, S., Hashimoto, M., & Otsuka, A. (2019, July). Exploring the dark web for cyber threat intelligence using machine leaning. In IEEE International Conference on Intelligence and Security Informatics (ISI), 2019 (pp. 200–202). IEEE Publications. https://doi.org/10.1109/ISI.2019.8823360
Sasnouskaya, T. (2023). Unveiling the dark web and the impact of REvil's cyberattacks.
Saleem, J., Islam, R., & Kabir, M. A. (2022). The anonymity of the dark web: A survey. IEEE Access, 10, 33628–33660. https://doi.org/10.1109/ACCESS.2022.3161547
Qasem, A. E., & Sajid, M. (2022, October). Exploring the effect of N-grams with BOW and TF-IDF representations on detecting fake news. In International Conference on Data Analytics for Business and Industry (ICDABI), 2022 (pp. 741–746). IEEE Publications. https://doi.org/10.1109/ICDABI56818.2022.10041537
Nezhad, S. Z. (2023). Dark web traffic detection using supervised machine learning.
Basheer, R., & Alkhatib, B. (2021). Threats from the dark: A review over dark web investigation research for cyber threat intelligence. Journal of Computer Networks and Communications, 2021, 1–21. https://doi.org/10.1155/2021/1302999
Prabha, C., & Mittal, A. (2023, February). Dark Web: A review on the deeper side of the Web. In O. International (Ed.) Technology Conference on Emerging Technologies for Sustainable Development (OTCON), 2022 (pp. 1–6). IEEE Publications. https://doi.org/ 10.1109/OTCON56053.2023.10113989
Alaidi, A. H. M. (2022). Dark web illegal activities crawling and classifying using data mining techniques. iJIM. Roa’a, M. ALRikabi, H.T.S., Aljazaery, I. A, and Abbood, S.H., 16(10), 123.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.