Network Log Analysis Using Categorical Anomaly Detection

Data Transformation Map

With the transformation in place, I was able to ingest the records and build a tree to visualize the connection history, ultimately giving us insight into a general fingerprint of conversation behavior. Once the system has recognized the fingerprint, it will begin to highlight connection paths that have deviated from normal behavior.

Visualization Of Communication Patterns

The principle reason for using thatDot’s Novelty Detector for this analysis however, is to surface the “novel” data from amongst the volumes of “normal” data. This sampled plot chart does a nice job of identifying the highly novel network conversations. The items highest on the X axis are the most Novel observations which may or may not also be Unique in the data. It is always interesting to see when Unique data, shown via the coloring, is NOT Novel. Differentiating such “false-positive” events is a significant benefit of including categorical data in our analysis.

Observation Detail Visualization

This same mechanism is useful for a range of use cases:

Real-time DDoS detection, such as TCP half-open (SYN flood) attacks.

Public-Private hosts communications. Use to determine which hosts are trying to connect and why (protocol, port, etc)

New protocol use between known hosts

New hosts successfully communicating with known hosts

In summary, this turns out to be a useful tool to aid in enriching existing telemetry data to aid in discovery, remediation and automation.

thatDot Novelty Detector

thatDot Novelty Detector is the first general-use application designed for finding anomalies in real-time in data sets that include categorical data. Available as an application for deployment in any cloud or data center thatDot Novelty Detector exposes an API that scores submitted observations for their “novelty” enabling real-time anomaly detention with fewer false positives than traditional threshold based metric analysis.

‍

The Secret Ingredient in the Alphabet Soup of Cybersecurity

by John Cloonan | Mar 4, 2025

This is the first in a series of blogs exploring how the Quine Streaming Graph analytics engine is the secret ingredient in the Alphabet Soup of...

Streaming Graph Get Started

by thatDot | Jul 23, 2024

It's been said that graphs are everywhere. Graph-based data models provide a flexible and intuitive way to represent complex relationships and...

Streaming Graph for Real-Time Risk Analysis at Data Connect in Columbus 2024

by thatDot | Jul 23, 2024

After more than 25 years in the data management and analysis industry, I had a brand new experience. I attended a technical conference. No, that...

Cypher all the things!

by thatDot | Jul 3, 2024

Uses for individual data engineering technologies are often broadened to more than just interacting with databases. The same goes for graph database techniques and, specifically, the leading language for building and querying graph databases – Cypher.

thatDot CEO Explains Streaming Graph to Cybersecurity Thought Leader

by Paige Roberts | Jul 2, 2024

Briefing Room on demand webinar on thatDot Youtube channel: The Unreasonable Effectiveness of Streaming Graph thatDot founder and CEO Ryan Wright...

Microservice Hell: The State of the Art in Streaming Services

by thatDot | Jun 19, 2024

Exploring the challenges of data processing in microservices, the article introduces thatDot’s Streaming Graph, which seamlessly integrates various data sources like Apache Kafka, AWS Kinesis, and more.

Network Log Analysis Using Categorical Anomaly Detection

Data Transformation Map

Visualization Of Communication Patterns

Example Observation Detail Visualization

Observation Detail Visualization

thatDot Novelty Detector

Read more