Data Stack Migration:

Overcoming Legacy Bottlenecks to Scale on Snowflake

Unstructured was facing significant operational friction with its existing Microsoft Fabric infrastructure. Frequent pipeline execution errors, compute capacity limits that halted report loading, and unreliable native connectors had forced the team into highly manual, time-consuming data ingestion processes. To support their rapid organizational growth, they partnered with Dynamic Data to replace this setup with a highly stable, automated Snowflake environment.

Why Dynamic Data?

Multidisciplinary team that came together to offer a comprehensive service

A rare combination of
technical abilities and
interpersonal skills

Fully customized service,
adaptable to the
client's needs

About Unstructured

www.unstructured.io

Industry: Information Technology & AI

Lines of business: Data Infrastructure and Analytics

Unstructured helps organizations extract and convert complex unstructured data from documents into AI-ready formats. To support their business analytics, they required a centralized, reliable data platform capable of being maintained without friction by a single-member data team.

The process

The goal

Execute a 100% accurate architectural overhaul in one month.

Dynamic Data replaced a complex infrastructure that was severely bottlenecking the company's data operations on a one-month timeline over the holiday season.

To succeed, the team had to resolve several deep operational blockers:

- Compute & Stability Issues: Fabric compute capacity limits, "out-of-sync" integrations, and SQL analytics endpoint failures were consistently disrupting daily reporting.

- Ingestion Bottlenecks: Unreliable API connections and failing native connectors meant marketplace data (AWS/Azure) and Postgres database dumps were heavily reliant on manual downloads and uploads.

- Complex Data Debt: Handling deeply nested JSON arrays from Cosmos DB required extensive, manual manipulation within Fabric notebooks that was difficult to version-control or scale.

"We had a very shortened timeframe for the project to be completed, and the team came through flawlessly."

Nicholas Pepera

Senior Data Analyst at Unstructured

The stack

To resolve these systemic issues, the migration completely replaced the Microsoft Fabric environment with a modern data stack centered around a Snowflake Medallion architecture. The team used Snowflake, dbt, Fivetran, and Census to execute the migration.

unstructured-fabric-to-snowflake-migration

Standardizing Transformations and Ensuring Data Quality.

In the legacy Microsoft Fabric environment, Unstructured’s data transformations lacked standardization. Logic was fragmented, some scripts were written in Python, while others relied on SQL. This lack of uniformity meant that troubleshooting broken pipelines required advanced programming knowledge and deep historical context of the system. Furthermore, there were no automated data tests in place, leaving the system vulnerable to data quality issues.

By migrating the transformation layer to dbt, Dynamic Data standardized all data modeling exclusively into SQL. This strategic shift democratized the data stack: moving forward, any analyst with standard SQL knowledge can easily read, maintain, and update the pipelines without needing advanced Python expertise. To further solidify the new architecture, the team leveraged dbt’s native testing framework to configure rigorous, automated data tests, guaranteeing long-term data integrity and reliability.

Team Empowerment and Training.

Beyond just deploying the technology, Dynamic Data actively introduced Snowflake, Fivetran, and dbt to the Unstructured team. Through hands-on training and the delivery of highly detailed documentation, we empowered the client’s single-member data team to take full ownership of the modern stack, ensuring they could confidently navigate, troubleshoot, and scale the infrastructure independently.

The results

Successfully migrated the required pipelines, completely eliminating the compute limit failures experienced in the old environment.

Eradicated the need for manual data downloads by deploying fully automated, reliable ingestion workflows.

Delivered comprehensive system documentation and configured environments, enabling a smooth, immediate transition to the internal data team.

Successful deployment within the highly restrictive, shortened timeframe.

Nicholas Pepera

Senior Data Analyst at Unstructured

"The depth of knowledge the team members displayed combined with their willingness to dive into the work was truly a fantastic experience! I loved working with them and will definitely reach out to them again if a project of this scale requires their expertise."

Return to case studies