A-Team Insight Blogs

Big Data – The Other Side of Low Latency

15 August 2012

We write a lot here about the latency of moving data from point A to point B. But latency is also inherent in the processing of that data at points A and B. Most likely, the processing of data is a more complex undertaking than transporting it. And that’s where big data comes in.

For sure, big data (or Big Data as other publications refer to it) is the tech buzzword – and investment focus – of the moment, much as low latency has been in the past. Also like low latency, the definition of big data is fluid, and is often bent to suit a particular vendor’s business. We’re no exception, so here’s the definition we use at BigDataForFinance.com …

“Datasets whose characteristics – size, data type and frequency – are beyond efficient processing, storage and extraction by traditional database management tools.”

Note that it’s not all about size, and that’s especially true for the financial markets, where even many years of time series tick data does not come close to the data volumes processed by the likes of Google and Facebook. But what the financial markets might lack in data size, they make up for in complexity and frequency, and in the need for accuracy and precision.

One of the more common big data applications is the storage and analysis of tick-by-tick time series data. Here, the need is to capture data arriving at rates of up to several million updates per second, for markets such as North American options. This generally requires specialised in-memory approaches, since even massively parallel processing (MPP) databases – think EMC Greenplum, IBM Netezza and ParAccel – are not going to keep up. Vendors such as OneMarketData and Kx Systems are likely to be called upon for such storage, the latter perhaps paired with Kove’s RAM-based storage appliance.

Time series applications include creation and back-testing of quantitative trading models, some pre-trade risk checks, and transaction cost analysis (TCA). In the future, time series could support more complex execution algorithms too.
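To make the TCA use case concrete, here is a minimal sketch in Python that benchmarks a fill against the volume-weighted average price (VWAP) of stored ticks. The `Tick` record, the timestamps and the sample prices are all hypothetical, and VWAP is only one of several benchmarks a real TCA system might use.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tick:
    ts: float     # hypothetical timestamp, in seconds
    price: float  # trade price
    size: int     # trade size


def vwap(ticks, start, end):
    """Volume-weighted average price of ticks with start <= ts < end."""
    window = [t for t in ticks if start <= t.ts < end]
    volume = sum(t.size for t in window)
    if volume == 0:
        raise ValueError("no traded volume in window")
    return sum(t.price * t.size for t in window) / volume


def slippage_bps(fill_price, benchmark, side):
    """Signed cost in basis points; positive means worse than the benchmark."""
    sign = 1 if side == "buy" else -1
    return sign * (fill_price - benchmark) / benchmark * 10_000


# Illustrative data only, not real market prices.
ticks = [
    Tick(1.0, 100.00, 200),
    Tick(2.0, 100.10, 100),
    Tick(3.0, 100.20, 100),
]
benchmark = vwap(ticks, 0.0, 10.0)
cost = slippage_bps(100.15, benchmark, "buy")
print(f"VWAP {benchmark:.3f}, slippage {cost:.2f} bps")
```

At production tick rates the list comprehension in `vwap` would give way to the in-memory, column-oriented stores discussed above; the arithmetic stays the same.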

Another growing big data application is natural language processing of real-time and historical text in order to derive trading signals from news sources and social media. As an example, so-called sentiment analysis might process messages related to a company or market segment from Twitter, building a view on whether the subject of the tweets is being referred to in a positive or negative manner. This sentiment – combined with other inputs – can be used to make trading decisions ahead of systems that key off price changes in the market.
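As a rough illustration of the idea, the Python sketch below scores messages against a tiny word list and averages the result. The word lists, the company name and the sample messages are invented for the example, and a production system would use far richer language models than simple keyword counting.

```python
import re

# Hypothetical word lists, chosen only for illustration.
POSITIVE = {"beat", "beats", "upgrade", "strong", "record", "growth"}
NEGATIVE = {"miss", "misses", "downgrade", "weak", "recall", "lawsuit"}


def score_message(text):
    """Count of positive words minus count of negative words in one message."""
    words = re.findall(r"[a-z']+", text.lower())
    return sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)


def net_sentiment(messages):
    """Average per-message score: above 0 is net positive, below 0 net negative."""
    if not messages:
        return 0.0
    return sum(score_message(m) for m in messages) / len(messages)


# Invented messages about a fictional company.
messages = [
    "ACME beats estimates on strong growth",
    "Analyst downgrade for ACME after product recall",
    "ACME posts record quarter",
]
signal = net_sentiment(messages)
print(f"net sentiment: {signal:+.2f}")
```

The single number produced here is the kind of input that, as the paragraph above notes, would be combined with other signals before any trading decision is made.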

The processing of big data generally requires grid or cluster infrastructure, with networks connecting 10s, 100s or 1000s of servers and processing nodes. No wonder, then, that many of the network and middleware vendors engaged in low-latency connectivity also have a ‘story’ for big data. Names such as Informatica, Tibco Software, Cisco Systems, Arista Networks, Solarflare Communications, Mellanox Technologies, Tervela and Solace Systems all spring to mind. A couple of those – Informatica and Tibco – also offer more traditional big data analysis applications.

One technology often mentioned in the context of big data is Hadoop, an open source implementation of the MapReduce framework for processing very large datasets, for tasks such as searching for data patterns. It too is a multi-server parallel approach, but one that is historically batch oriented. One direction for it is to become more real time, introducing in-memory techniques and low-latency middleware to boost performance. A real-time Hadoop approach would lend itself to driving electronic trading and on-demand determination of risk across an enterprise.
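For readers unfamiliar with the MapReduce pattern, the following single-process Python sketch shows its three steps (map, shuffle, reduce) by totalling traded volume per symbol. It does not use Hadoop or its API, and the function names and trade records are made up for illustration. In a real cluster, the map and reduce steps would run in parallel across many servers.

```python
from collections import defaultdict


def map_phase(record):
    """Emit (key, value) pairs for one input record: here, (symbol, size)."""
    symbol, size = record
    yield symbol, size


def shuffle(pairs):
    """Group all emitted values by key, as the framework does between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Collapse the values for one key into a single result."""
    return key, sum(values)


def map_reduce(records):
    pairs = (pair for record in records for pair in map_phase(record))
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


# Invented trade records: (symbol, size).
trades = [("IBM", 100), ("MSFT", 50), ("IBM", 25), ("MSFT", 75), ("ORCL", 10)]
totals = map_reduce(trades)
print(totals)
```

Because each job reads its whole input before producing a result, the pattern is naturally batch oriented, which is the limitation the real-time efforts mentioned above aim to address.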

As automated trading systems move to leverage cloud-based infrastructure, accessing cloud-based big data services will be a natural route to take, leading to faster implementation and less ongoing management.

As we have suggested before, for many trading firms, tapping into the convergence of low latency, cloud and big data technologies will be the way to go. You’ll be hearing more on this architectural approach – so stay tuned!
