Customer Story

Processed 1.5 billion events per day for a large communication analytics company

To power real-time decision making on large data sets, enterprises need an expert team, high-performing hardware systems, and a scalable ETL solution that can accelerate development and deployment of ETL frameworks, while swiftly accommodating changing business needs.

Next-generation ETL tools allow enterprises to eff­ectively design and create an environment to mine and analyze data for making informed decisions. They isolate data from transactional systems, which ensures business-as-usual while data is analyzed in an optimized environment. These frameworks also help users solve business problems without spending cycles perfecting boilerplate code.

Business needs

A communication analytics solutions provider wanted to modernize their existing data applications and was looking for an easy-to-use and scalable solution that could process over 1.5 billion user interactions generated per day from multiple real-time feeds.

Solution

Gathr enabled the client to implement applications that run on a scalable Spark compute engine as structured streaming data pipelines while providing self-service and analytics capabilities for large-scale data processing.
The ETL solution used Gathr’s vast library of components for data acquisition, processing, enrichment, and storage. The entire data flow was created and orchestrated using a low-code methodology.

Results

Gathr enabled end-to-end data ingestion, enrichment, machine learning, action triggers, and visualization to modernize hand-written data applications to Spark structured streaming in weeks. This, in turn, helped the customer realize several strategic benefits:

  • Replaced roughly ~1 million lines of code in ~3 weeks using Gathr frameworks
  • Achieved a high throughput of 100000+ transactions/second, enabling processing of 1.5 billion records per day
  • Reduced the overall release cycle from 8 months to 8 weeks

    Gathr Data Inc will use the data provided here in accordance with our Privacy Policy.

      Gathr Data Inc will use the data provided here in accordance with our Privacy Policy.

      Meet Gathr.

      The only all-in-one data pipeline platform

      • One platform to do it all - ETL, ELT, ingestion, CDC, ML
      • Self Service, zero-code, drag and drop interface
      • Built-in DataOps, MLOps, and DevOps tools
      • Cloud-agnostic and interoperable
      • Data
        Ingestion

      • Change Data
        Capture

      • ETL/ELT Data
        Integration

      • Streaming
        Analytics

      • Data
        Preparation

      • Machine
        Learning

      Expert Opinion

      Gathr is an end-to-end, unified data platform that handles ingestion, integration/ETL (extract, transform, load), streaming analytics, and machine learning. It offers strengths in usability, data connectors, tools, and extensibilty.


      Customer Speak

      Gathr helped us build “in-the-moment” actionable insights from massive volumes of complex operational data to effectively solve multiple use cases and improve the customer experience.


      IN THE SPOTLIGHT

      Learning and Insights

      Stay ahead of the curve

      Q&A with Forrester

      Building a modern data stack: What playbooks don’t tell you

      Blog

      4 common data integration pitfalls to avoid

      Blog

      Why modernizing ETL is imperative for massive scale, real-time data processing

      Fireside Chat

      Don’t just migrate. Modernize your legacy ETL.