Skip to content

Searching on Tor

Understanding Tor

The darknet is an encrypted area of the internet that is accessible only through special software. It is commonly used by cybercriminals.

Cyber attackers use it to leak victims' data from ransomware attacks. This data is then often used to identify new targets, particularly for phishing schemes.

Beyond cybercrime, political and religious activists use the darknet to discuss their upcoming actions and activities.

What OwlyScan Indexes?

OwlyScan indexes all unencrypted and unprotected data and archives, filtering out problematic content such as sexual content, drugs, and weapons.

OwlyScan focuses on textual data (it does not index images, videos, or any other multimedia content). It can handle various document types that contain text, even if they are within archives (e.g., a DOCX attached to an email inside a ZIP file).

OwlyScan extracts text content from documents, splitting larger documents while preserving paragraph integrity (approximately 100,000 words per segment). Queries are executed on these individual splits.

Writing a Query

OwlyScan searches are deliberately non-semantic, focusing on keyword-based matching to precisely identify results that relate to the specific domain of interest.

For example, if you are searching for entity X, you wouldn't want results about Y, even if Y operates in the same sector or is in the same city.

Once the initial results are gathered, further semantic searches can be performed by using the AI assistant.

Queries are conducted using keywords and optional advanced filters, called dorks.

Crafting an Effective Query

An effective query returns a manageable number of results that are contextually relevant to your specific needs.

Funnel of a good query

TIP

Narrow or expand your search to get between 100 and 1,000 results for effective analysis.

Real-Time Query Preview

The right-hand panel provides a real-time approximation of the number of results available for your query. This helps you refine the query before submitting it, as searches consume a query token only upon confirmation.

Results are categorized based on their source:

  • Ransomware: Related to ransomware groups
  • Forum: Discussion groups with varying topics, akin to forums like 4chan or more localized activist cells
  • Shop: Online marketplaces for goods
  • Service: Online service providers
  • BTC: Bitcoin-related content
  • Links: Link directories

Preview of a tor query

WARNING

A query token is consumed as soon as the Launch Search button is clicked.

Using the Query assistant

OwlyScan provides a query assistant to help users craft effective queries through an interactive form.

A query that uses the assistant

This assistant is similar to the Google query assistant, guiding users by suggesting and adding appropriate dorks to refine their search criteria.

The query as generated by the assistant

Running a Query

To initiate a search, click the "Run Search" button. A query token will be consumed upon submission.

Depending on the query's complexity, results will be displayed within 20 to 30 seconds.

A search on the Darknet includes:

  • Collecting results matching the search expression
  • Generating exportable reports (in tabular format)
  • Initializing the AI assistant to help interpret and utilize the results

Interpreting the Results

Display and Grouping of Data

Search results are presented grouped by URL to condense the display, especially when an URL contains multiple results or points to an archive. This grouping is indicated by the following symbol: Results are grouped icon

To expand the grouped results, click on them. To return to the general results list, click the blue "Back to results list" button: Results are grouped icon

Each result provides a snippet of text that can be expanded to display up to 100,000 characters.

Only the first 500 results are displayed initially. Users can click the "Load more results" button at the end of the page to load additional results, 500 at a time.

Reports can be exported by clicking the export button at the top of the page. This export will generate as many tabular files as there are multiples of 500 results:

Export the results

WARNING

Avoid using the browser's back button, as it may disrupt the navigation experience.

Utilizing the AI Assistant

The AI assistant is specifically designed to provide insights tailored to your search results. For example, it will exclude unrelated results for entity X even if entity Y appears in the same document.

AI assistant icon

Explaining a specific result

Click on the AI assistant icon to explain a specific result:

AI assistant explain a result icon

AI assistant explain a result

Conducting Semantic Searches Within Results

The AI assistant can also facilitate semantic searches among the returned results: AI assistant semantic query

Handling "Illegal" Content

Some results are marked as "illegal". These contain content that is restricted based on the client's level of accreditation, such as violent or otherwise shocking material.

Export the results

If your results contain only illegal content, your query token will be automatically credited back.

Checking resource availability

OwlyScan now includes a feature to verify the availability of resources in real time directly from the search results. This ensures users can quickly determine whether a linked resource is still accessible on the Tor network before engaging with it.

Each result includes a "Check Availability" button. This button allows you to perform a real-time query on the Tor network to verify if the resource is still available. The status is displayed in a matter of seconds, helping you avoid time wasted on inaccessible resources.

Real-time check of the availability of a result

The "Check Availability" button can display three statuses:

  • Unset: When the status of the resource has not yet been checked, the button remains in its initial state.
  • Resource Available: When the resource is confirmed as accessible on the Tor network, the button will display a success indicator.
  • Resource Unavailable: If the resource is no longer accessible, the button will indicate that the resource is unavailable.

The availability check only verifies the resource at the time of the query.

Why might a resource be unavailable?

There are several scenarios where a resource may no longer be accessible:

  1. The website no longer exists: The site has been removed entirely from the Tor network.
  2. The website is temporarily unavailable: This is common, particularly for newer or less stable ransomware groups.
  3. The website is online, but the specific resource is unavailable: This can happen when data has been bought, removed, or is of interest to the operators.

Advanced Query Examples

Search Filters

All fields are case-insensitive.

FilterDescriptionExample
textCase-insensitive search within documentstext: myquery
myquery
urlFull match on URL parts (separated by / - : ? & .)url: test.onion
titleCase-insensitive search within document titlestitle: myquery
myquery
file(archives only) Full match on files (separated by / - : ? & . )file: secrets.docx
dateFilter on date (equals, before, after)date: 2024-02-29
date: <2024-02-29
date: >2024-02-29
level0Full match on TOR website categories: Ransomware, Shop, Forum, Links, Illegallevel0: Ransomware

Examples of Queries

Resources containing /2024/ in the url

plaintext
url:2024

Resources containing both guns and buy in their title

plaintext
title:guns title:buy

Resources containing "Airbus", indexed after the fifth of May

plaintext
airbus date:>2024-05-20

Note that this compact form is equivalent to:

plaintext
(text:airbus OR title:airbus OR url:airbus) AND date:>2024-05-20

AND operators can generally be ignored.

Airbus and Boeing door incidents, from ransomware groups Note the use of the glob operator (*) to match plurals.

plaintext
(airbus OR boeing) "door*" (accident* OR incident*) level0:ransomware

Operators

Search Term

plaintext
"Airbus planes"

Equivalent to:

plaintext
text:"Airbus planes" OR title:"Airbus planes" OR url:"Airbus planes"

OR Operator

The OR operator searches for a given search term OR an equivalent term.

plaintext
expr1 OR expr2

AND Operator

The AND operator searches for results that contain both terms. This is equivalent to:

plaintext
expr1 AND expr2

Or simply:

plaintext
expr1 expr2

Glob Operator

The glob operator matches the start of a longer word. Note it can only be used at the end (postfix) and not at the start of a word.

plaintext
startOfALongerWord*

Exclude Results

To exclude results containing a specific word:

plaintext
-wordToExclude