Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3379247.3379272acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccdeConference Proceedingsconference-collections
research-article

Crawling and Analysis of Dark Network Data

Published: 07 March 2020 Publication History

Abstract

Due to its anonymity and non-traceability, it is very difficult to research websites on the dark network. The research of the dark network is very important for our network security. Now there is very little data for studying the dark network, so we independently developed dark web crawler that runs automatically. This article will detail the implementation process of our dark web crawler and the data analysis process of crawled data. Currently, we can use crawled data to detect if multiple urls belong to the same site. We can use data to extract features of similar websites and we have generated an ever-increaing data set that can be used for simple website classification.We use the crawled data as a categorical dataset to categorize newly discovered urls.When we get the a certain number of new urls, we crawl again and the crawled data will be added to the previous data set. After multiple rounds of crawling, our data sets will be more and more abundant. through our approach, we can solve the problem that the dark network data is small, researchers can use our method to get enough data to study all aspects of the dark network.

References

[1]
Jardine, E. (2018). Privacy, censorship, data breaches and Internet freedom: The drivers of support and opposition to Dark Web technologies. new media & society, 20(8), 2824--2843.
[2]
He Gaofeng, Yang Ming, Luo Junzhou, & Zhang Wei. (2013). Tor Anonymous Communication Traffic Online Identification Method. Journal of Software, 24(3), 540--556.
[3]
Reddit. Available online: https://www.reddit.com/r/darknetmarkets (accessed on 6 August 2019)
[4]
Ghosh, S., Das, A., Porras, P., Yegneswaran, V., & Gehani, A. (2017, August). Automated categorization of onion sites for analyzing the darkweb ecosystem. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 1793--1802). ACM.
[5]
Wu, Y., Zhao, F., Chen, X., Skums, P., Sevigny, E. L., Maimon, D., ... & Zhang, Y. (2019, July). Python Scrapers for Scraping Cryptomarkets on Tor. In International Conference on Security, Privacy and Anonymity in Computation, Communication and Storage (pp. 244--260). Springer, Cham.
[6]
Alex Biryukov, Ivan Pustogarov, Fabrice Thill, Ralf-Philipp Weinmann, Content and Popularity Analysis of Tor Hidden Services, Proceedings of the 2014 IEEE 34th International Conference on Distributed Computing Systems Workshops, p.188--193, June 30-July 03, 2014 [doi>10.1109/ICDCSW.2014.20]
[7]
A. Biryukov, I. Pustogarov, and R.-P. Weinmann. 2013. Trawling for Tor Hidden Services: Detection, Measurement, Deanonymization. In IEEE-SP.
[8]
Nicolas Christin, Traveling the silk road: a measurement analysis of a large anonymous online marketplace, Proceedings of the 22nd international conference on World Wide Web, May 13-17, 2013, Rio de Janeiro, Brazil [doi>10.1145/2488388.2488408]
[9]
Nunes, E., Diab, A., Gunn, A., Marin, E., Mishra, V., Paliath, V., ... & Shakarian, P. (2016, September). Darknet and deepnet mining for proactive cybersecurity threat intelligence. In 2016 IEEE Conference on Intelligence and Security Informatics (ISI) (pp. 7--12). IEEE.
[10]
G. Owen and N. Savage. 2016. Empirical analysis of Tor Hidden Services. IET Info. Sec. 10 (2016). Issue 3.
[11]
Tai, X. H., Soska, K., & Christin, N. (2019). Adversarial Matching of Dark Net Market Vendor Accounts.

Cited By

View all
  • (2024)Data Analysis of Dark Web Marketplaces using Machine Learning2024 7th International Conference on Signal Processing and Information Security (ICSPIS)10.1109/ICSPIS63676.2024.10812633(1-6)Online publication date: 12-Nov-2024
  • (2024)Dark Web Content Exploration using Network Analysis based on Data Crawling2024 4th International Conference of Science and Information Technology in Smart Administration (ICSINTESA)10.1109/ICSINTESA62455.2024.10747942(201-206)Online publication date: 12-Jul-2024
  • (2024)A Comparative Analysis of Models for Dark Web Data ClassificationProceedings of International Joint Conference on Advances in Computational Intelligence10.1007/978-981-97-0180-3_20(245-257)Online publication date: 2-Apr-2024
  • Show More Cited By

Index Terms

  1. Crawling and Analysis of Dark Network Data

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICCDE '20: Proceedings of 2020 6th International Conference on Computing and Data Engineering
    January 2020
    279 pages
    ISBN:9781450376730
    DOI:10.1145/3379247
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 07 March 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Dark net
    2. crawl
    3. darknet data set
    4. data analysis

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICCDE 2020

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)28
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 03 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Data Analysis of Dark Web Marketplaces using Machine Learning2024 7th International Conference on Signal Processing and Information Security (ICSPIS)10.1109/ICSPIS63676.2024.10812633(1-6)Online publication date: 12-Nov-2024
    • (2024)Dark Web Content Exploration using Network Analysis based on Data Crawling2024 4th International Conference of Science and Information Technology in Smart Administration (ICSINTESA)10.1109/ICSINTESA62455.2024.10747942(201-206)Online publication date: 12-Jul-2024
    • (2024)A Comparative Analysis of Models for Dark Web Data ClassificationProceedings of International Joint Conference on Advances in Computational Intelligence10.1007/978-981-97-0180-3_20(245-257)Online publication date: 2-Apr-2024
    • (2023)A Forensic Analysis Procedure for Dark Web Drug Marketplaces2023 14th International Conference on Computing Communication and Networking Technologies (ICCCNT)10.1109/ICCCNT56998.2023.10307116(1-6)Online publication date: 6-Jul-2023
    • (2023)Exploring Dark Web Crawlers: A Systematic Literature Review of Dark Web Crawlers and Their ImplementationIEEE Access10.1109/ACCESS.2023.325516511(35914-35933)Online publication date: 2023
    • (2022)SoK: An Evaluation of the Secure End User Experience on the Dark Net through Systematic Literature ReviewJournal of Cybersecurity and Privacy10.3390/jcp20200182:2(329-357)Online publication date: 27-May-2022
    • (2022)A Comparative Study of Artificial Intelligence based Vehicle Classification Algorithms used to Provide Smart Mobility2022 4th International Conference on Advances in Computing, Communication Control and Networking (ICAC3N)10.1109/ICAC3N56670.2022.10074282(2341-2348)Online publication date: 16-Dec-2022
    • (2021)Black Widow Crawler for TOR network to search for criminal patterns2021 Second International Conference on Information Systems and Software Technologies (ICI2ST)10.1109/ICI2ST51859.2021.00023(108-113)Online publication date: Mar-2021

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media