Skip to main content

SENSEI: Scraper for ENhanced AnalySis to Evaluate Illicit Trends

  • Conference paper
  • First Online:
Service-Oriented Computing – ICSOC 2022 Workshops (ICSOC 2022)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13821))

Included in the following conference series:

  • 638 Accesses

Abstract

Over the last years, we faced an exponential growth of illegal online market services in the Dark Web, making it easier than ever before of acquiring illicit goods online via a simple service interaction. To study and understand this emerging illegal services economy, we developed a trend analysis and (dark-)web services monitoring tool: SENSEI, which stands for ‘Scraper for ENhanced analySis to Evaluate Illicit trends’. SENSEI extracts specific service transaction trends and analyses the human behaviours behind, to produce symmetric insights on specific service transaction habits from both customers and vendors on the Dark Web. Moreover, a trend analysis tool is provided to discover and typify relationships among different criminal activities and hence provide evidence and support investigation activities and Law Enforcement Agencies (LEAs) detecting criminal operations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.palantir.com/.

References

  1. Appendix: Sensei (2022). https://doi.org/10.6084/m9.figshare.21131557.v2

  2. Chen, H.: Dark web: exploring and mining the dark side of the web. In: 2011 European Intelligence and Security Informatics Conference, pp. 1–2. IEEE (2011)

    Google Scholar 

  3. EMCDDA: European drug report (2021). https://doi.org/10.2810/18539

  4. Foster, E.C., Godbole, S.: Overview of MySQL. In: Database Systems. Apress, Berkeley, CA (2016). https://doi.org/10.1007/978-1-4842-1191-5_24

  5. Győrödi, C., et al.: A comparative study: MongoDB vs. MySQL. In: 2015 13th International Conference on Engineering of Modern Electric Systems (EMES), pp. 1–6. IEEE (2015)

    Google Scholar 

  6. Mehrotra, T., Mehtre, B.M.: Forensic analysis of Wickr application on android devices. In: 2013 IEEE International Conference on Computational Intelligence and Computing Research, pp. 1–6. IEEE (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Daniel De Pascale .

Editor information

Editors and Affiliations

Appendices

Requirement

In the following, we provide installation information and the interaction process between the SENSEI platform and the SENSEI SOA architecture.

1.1 Installation

The SENSEI architecture lays on Docker to build and run its components. Docker-compose is used to create an interconnected infrastructure, where the database can communicate with the SENSEI SOA tool and then with the SENSEI platform. The usage of a docker-compose eases the configuration settings, the building process, and the execution process. Indeed, to build and execute the entire framework, it is sufficient to run, the first time, from the main project folder, the following commands:

figure a

to create the network where the SENSEI framework operates and this command:

figure b

to build and execute the framework. The platform is available in the GitHub repository: https://github.com/danieldp92/SENSEI.

Services

1.1 Load Services

Table 1. Original dataset. The attribute name is an identifier. Instead age, gender and postcode are quasi-identifiers.

Load services manage the loading of datasets and the extraction of data from HTML pages passed as input. Table 1 shows the endpoint list of all LOAD services developed.

The upload dump endpoint reads the HTML page received and extract information to store into the database. As shown in Fig. 1, the tool has as many scrapers as the number of dark markets taken into account. This work takes into account three dark markets: Agartha, Cannazon and Dark Market.

The delete dumps is used to delete one or more markets scraped in the past. It is possible to set parameters as ‘timestamp’ and ‘market name’.

1.2 Get Services

Table 2. Get services.

Get services allow the end-user to effectively retrieve information stored during the loading process. The information retrieved by the GET services includes information like vendors and the product list and data selection to build the graph analysis by vendor’s name or PGP key. The services provided in this macro category can be grouped into four main categories:

  • Market services: services used to get information regarding marketplaces, such as details of a specific marketplace stored, timestamp, size of the dump, and the number of pages.

  • Vendor services: services used to retrieve information about vendors.

  • Product services: services used to retrieve information about products.

  • Graph analysis services: services used to show the connections of a vendor among marketplaces, leveraging the vendor’s username and the vendor’s PGP key.

In Table 2 we provided a detailed list of all the services under the GET category. In addition, we listed all the service names and a short description for each of them to explain their purpose.

Trend Analysis Platform

1.1 Home Page

Fig. 2.
figure 2

SENSEI platform screen: home. It contains general info about the top vendors, a trend analysis of the last month recorder, general info for each country and a map containing the number of products sold in every country.

Table 3. Platform home services.

1.2 Trend Analysis Page

Fig. 3.
figure 3

SENSEI platform screen: trend analysis. Comparison between two countries based on the amount of drug, counted as the total price, in all the markets.

Table 4. Platform trend analysis services.
Table 5. Platform trend analysis comparison services.

1.3 Vendor’s Tree-Map Page

In Table 6 we provide the list of services used to build the tree-map analysis. In the first row of the table, we have GET /vendor/treemap/n-products, the service is in charge of retrieving the number of products on sale per each vendor. Next, the service GET /market/list/ is used to show the market list as a treemap’s filtering option. The last service, GET /vendor/treemap/vendor-name extracts further details about the number of products on sale per each drug of a specific vendor.

Table 6. Platform treemap services.

1.4 Vendor’s Search Page

Fig. 4.
figure 4

SENSEI platform screen: vendors general info with detailed information regarding a specific vendor.

In Table 7 are shown the two services used in the vendor search page. The first service is in charge of retrieving the general data of all the vendors. While, the second one GET /vendor/info/vendor-name retrieves the information of a specific vendor.

Table 7. Platform search vendor services.

1.5 Vendor-Market Graph Analysis

Table 8 shows the services used to build the interactive graph. First, GET /market/n-products/ is used to estimate the impact of the vendor in a market, retrieving the number of products sold. Next, GET /market/graph/ service provides the list of all the vendors from a specific market. Last, the service GET /market/graph/vendor/ provides additional information, like the number of markets connected to a vendor and the number of products for each market.

Table 8. Platform interactive graph services.

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

De Pascale, D., Cascavilla, G., Tamburri, D.A., Van Den Heuvel, WJ. (2023). SENSEI: Scraper for ENhanced AnalySis to Evaluate Illicit Trends. In: Troya, J., et al. Service-Oriented Computing – ICSOC 2022 Workshops. ICSOC 2022. Lecture Notes in Computer Science, vol 13821. Springer, Cham. https://doi.org/10.1007/978-3-031-26507-5_36

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-26507-5_36

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-26506-8

  • Online ISBN: 978-3-031-26507-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics