Customer Story

Processed 1.5 billion events per day for a large communication analytics company

To power real-time decision making on large data sets, enterprises need an expert team, high-performing hardware systems, and a scalable ETL solution that can accelerate development and deployment of ETL frameworks, while swiftly accommodating changing business needs.

Next-generation ETL tools allow enterprises to eff­ectively design and create an environment to mine and analyze data for making informed decisions. They isolate data from transactional systems, which ensures business-as-usual while data is analyzed in an optimized environment. These frameworks also help users solve business problems without spending cycles perfecting boilerplate code.

Business needs

A communication analytics solutions provider wanted to modernize their existing data applications and was looking for an easy-to-use and scalable solution that could process over 1.5 billion user interactions generated per day from multiple real-time feeds.


Gathr enabled the client to implement applications that run on a scalable Spark compute engine as structured streaming data pipelines while providing self-service and analytics capabilities for large-scale data processing.
The ETL solution used Gathr’s vast library of components for data acquisition, processing, enrichment, and storage. The entire data flow was created and orchestrated using a low-code methodology.


Gathr enabled end-to-end data ingestion, enrichment, machine learning, action triggers, and visualization to modernize hand-written data applications to Spark structured streaming in weeks. This, in turn, helped the customer realize several strategic benefits:

  • Replaced roughly ~1 million lines of code in ~3 weeks using Gathr frameworks
  • Achieved a high throughput of 100000+ transactions/second, enabling processing of 1.5 billion records per day
  • Reduced the overall release cycle from 8 months to 8 weeks

    Yes, Gathr may contact me via email and telephone. I can opt out at any time.
    Gathr Data Inc will use the data provided here in accordance with our Privacy Policy.

      Yes, Gathr may contact me via email and telephone. I can opt out at any time.
      Gathr Data Inc will use the data provided here in accordance with our Privacy Policy.

      Meet Gathr.

      The only all-in-one data pipeline platform

      • One platform to do it all - ETL, ELT, ingestion, CDC, ML
      • Self Service, zero-code, drag and drop interface
      • Built-in DataOps, MLOps, and DevOps tools
      • Cloud-agnostic and interoperable
      • Data

      • Change Data

      • ETL/ELT Data

      • Streaming

      • Data

      • Machine

      Expert Opinion

      Gathr is an end-to-end, unified data platform that handles ingestion, integration/ETL (extract, transform, load), streaming analytics, and machine learning. It offers strengths in usability, data connectors, tools, and extensibilty.

      Customer Speak

      Gathr helped us build “in-the-moment” actionable insights from massive volumes of complex operational data to effectively solve multiple use cases and improve the customer experience.


      Learning and Insights

      Stay ahead of the curve

      Q&A with Forrester

      Building a modern data stack: What playbooks don’t tell you


      4 common data integration pitfalls to avoid


      Why modernizing ETL is imperative for massive scale, real-time data processing

      Fireside Chat

      Don’t just migrate. Modernize your legacy ETL.