Left bracket

success story

Right bracket

Fragile OSINT architecture burning 94% of the infrastructure budget

Placeholder

Type

Legacy Rescue & Modernization

Year

2022 - Ongoing

The Situation

An international client creating services and technologies in the Finance sector contacted us to evaluate the critical issues of an OSINT SaaS product inherited from a previous acquisition. The system had grown disorderly, resulting in high infrastructure costs, low reliability, maintenance difficulties, and a deteriorating time-to-market.

We had already successfully collaborated with the client in the past; this accelerated the project kickoff and allowed for an incremental journey: stabilization of the existing system, gradual redesign, and an evolutionary/maintenance phase that is still active.

The engagement involved an initial assessment phase—our Technical Due Diligence—followed by a second phase of incremental redesign, and a third evolutionary and maintenance phase, which is currently ongoing.

What we identified

The request was clear: get a platform back on track that was born with an excellent service intuition but had become, over time, a fragile and expensive system to manage.

At the time of engagement, the platform contained approximately 50 billion records, totaling 12.5 TB. However, the system was unreliable, costly to maintain, and increasingly difficult to evolve.

The infrastructure had grown haphazardly, with a direct impact on costs, maintenance, and scalability.

In this scenario, our Technical Due Diligence was decisive. We analyzed the infrastructure, codebase, configurations, and end-to-end data flows (ingestion → normalization → indexing → search → watchlist). The resulting map of critical issues included:



In parallel, we formalized the domain constraints: datasets on the order of tens/hundreds of billions of documents with significant monthly growth; real-time exact-match queries on a single field; the need to create new indexes with minimal service impact; "spiky" search loads; and a watchlist system that saves and notifies new matches.

The Solution

Our approach

As simple as possible, but not simpler

In the complex context we inherited, our goal was twofold: to radically simplify the architecture while guaranteeing performance, reliability, and scalability. We therefore adopted a “mechanical sympathy” approach: reducing components and dependencies, leveraging managed cloud primitives, and building linear, measurable, and easily optimizable data paths.

Search Engine: Deterministic partitioning on object storage

We built an indexing and search engine based on a simple principle: dividing the dataset into deterministic partitions on AWS S3 so that every query becomes a targeted access to a small, predictable portion of the data.

This choice enables stable lookup times, natural scalability via on-demand compute, and stricter cost control compared to approaches based on always-on components.

Data Pipeline: Standardization, compression, and incremental updates

We restructured the ingestion flows, making them linear and observable, standardizing formats and transformation rules throughout the process (ingestion → normalization → master → indexing).

Managed AWS services and serverless to reduce operational overhead

Wherever possible, we shifted the operational load to managed AWS services, reducing systemic complexity and points of failure.

Protection and governance: consumption control and stability under load

We introduced a governance layer to make the platform more robust and sustainable over time:

Operability: observability and bottleneck reduction

Finally, we made the entire system more “operable” in a structural way:

Results

The intervention produced measurable results both technically and economically:

Graph

Project Highlights

Key Results

-93%

operational costs

 

+400%

managed data volume

Project Duration

1 year

(with active maintenance phase)

Conclusion

Thanks to the new architecture, the client can now rely on a lean, sustainable OSINT SaaS platform that is growing once again: a solid technical foundation, designed to scale with massive datasets while keeping cloud spending under control.

Beyond the technical aspects, the project delivered these results thanks to a relationship of mutual trust and a pragmatic working method: rigorous assessment, POC-validated choices, incremental migration, and constant alignment with business constraints (costs, maintainability, evolvability).