Computational Storage: Potential Benefits of Reducing Data Movement

Talk of the Storage town today is Computational Storage.

In the previous blog, we saw the evolution of storage architectures and emerging storage architectures. And one of the widely talked topics is computational storage. If you would have got a chance to attend SNIA SDC USA 2020, you would have seen the inclination of the session and discussions towards this trending storage architecture.

In yet another previous blog, we saw the 5Ws that you need to know to understand everything you need to know about Computational Storage. We saw that Computational Storage is providing higher capacity solutions with lower power consumption, as we use distributed processing. As a result, Computational Storage provides improved efficiency and performance.

In one of the keynote sessions at SNIA SDC USA 2020, JB Baker from ScaleFlux talked about the advantages of reducing data movement in Computational Storage. He categorized the advantages into two – Saving Time and Saving Money.

  1. Saving Time: With growing storage media, interfaces, and networks, and speeding bandwidth, data movement is becoming sluggish. Moving Tera Bytes and Zetta Bytes of data to perform mission-critical tasks such as transactional processing, big data analytics, and machine learning, can become time-consuming and reduce efficiencies.
  2. Saving Money: Supporting infrastructure to handle all this massive data movement requires consistent investment, creating a lot of challenges for all those managing the data.

So, reducing data movement might help reduce processing time and infrastructure costs. This can be a boon for the IT department and data center architects. Further, Baker gave an example of utilizing Computational Storage to reduce data movement. He said that it would be through a Data Filtering Computational Storage Service (CSS). Let’s dig deeper into the example and results explained by Baker.

Let’s take a 12TB data set that represents all the transactions, worldwide purchases that happened over the past several years. Let’s say a data analyst needs to run a query that covers just 4 months in 2016, rounding up to only 100 GB of data relevant to this query, which is <1% of the entire data set. With ordinary storage, we might have to take all 12TB and push it up through the CPU, which is an invitation to the bottleneck up there to do that filtering and then complete the query.

Instead, if we implement data filtering CSS down at the drives, it filters out the relevant data before it even leaves the drive. So we will have to move only that 1% of data relevant to the query across the PCI bus and to the CPU. This will reduce the total data movement by 99% in this case and even reduce the data processing by the CPU to finish the query, resulting in a faster query completion time. This enables more queries to run in parallel and scale more rapidly. Baker also supported the theory with the practical implementation of the above example by measuring the data movement, CPU utilization, and query completion time for ordinary storage and data filtering CSS.

The results were as below:

  Ordinary Storage Data Filtering CSS
Data Movement High bandwidth for a very long period to move massive data High data movement, but for a very short period
CPU Utilization Experienced bottleneck CPU scaled nicely due to less data movement
Query Completion Time Slower query completion Rapid query completion (2-4 times faster than the usual)

This example clearly explains the potential benefit of using Computational Storage and Data Filtering Service over ordinary storage. To know more about advantages of Computational Storage, don’t forget to register for our upcoming keynote session at SNIA SDC India 2020, in which we will elaborate on the idea of computational storage and its position in the market. You can reach out to our speaker Rohit Srivastava during the session and ask your queries about CS at the event.

You can also watch the complete keynote session by Baker at SNIA SDC USA 2020 here and read more on SNIA.org/computational

 
Share:

Related Posts

Understanding the Potential of Storage and Security in IoT

Understanding the Potential of Storage and Security in IoT

The potential of storage and security in IoT plays a significant role in transforming industries and the lives of people. However, tackling challenges such as data isolation, interoperability, and scalability will be essential in underpinning this potential. To embrace the full potential of storage and security in IoT involves a holistic method, incorporating technological advancements with comprehensive tactics. Read the blog to understand the potential of security and storage in the IoT ecosystem, its challenges, and keyways to overcome them.

Share:
MicroStream: Modernizing Data Storage

MicroStream: Modernizing Data Storage

MicroStream is an efficient Java persistence framework meant for continuous object storage and recovery. MicroStream’s lightweight solution and in-memory storage policy enables rapid development and augments application speed, making it an ideal choice for microservices architectures, serverless systems, embedded systems, and real-time applications. Read the blog to explore how MicroStream is reforming the outlook of data persistence.

Share:
The Effects of Edge on Data Storage

The Effects of Edge on Data Storage

Edge computing is creating a ripple effect across the tech industry, and it is most strongly felt in the Data Storage sector. In this article, we explore four major ways Edge computing is changing Data Storage. Read on…

Share:
6 Challenges In Going Cloud-Native - And The Perfect Solution

6 Challenges In Going Cloud-Native – And The Perfect Solution

Going cloud-native carries with it a number of challenges, but danger, properly harnessed, becomes opportunity. Check out this article to see what obstacles lie in your path – click here!

Share:

What CTOs must know about Cloud-Native Development

Cloud-native development offers huge opportunities to companies. CTOs need to know how to navigate the upcoming changes to grab them – this article explores how.

Share:
The Growth of XaaS in Enterprise Data Storage Strategies

The Growth of XaaS in Enterprise Data Storage Strategies

As the world grows ever more virtualized, more and more service become unshackled from the hardware they run on. Now, Everything-as-a-Service is here, and it’s changing the way data storage works. Somenath Nag explores how, in this article.

Share: