34: GreyBeards talk Copy Data Management with Ash Ashutosh, CEO Actifio

In this episode, we talk with Ash Ashutosh (@ashashutosh), CEO of Actifio a copy data virtualization company. Howard met up with Ash at TechFieldDay11 (TFD11) a couple of weeks back and wanted another chance to talk with him.  Ash seems to have been around forever, the first time we met I was at a former employer and he was with AppIQ (later purchased by HP).  Actifio is populated by a number of industry veterans and since being founded in 2009 is doing really well, with over 1000 customers.

So what’s copy data virtualization (management) anyway?  At my former employer, we did an industry study that determined that IT shops (back in the 90’s) were making 9-13 copies of their data. These days,  IT is making, even more, copies of the exact same data.

Data copies proliferate like weeds

Engineers use snapshots for development, QA and validation. Analysts use data copies to better understand what’s going on in their customer-partner interactions, manufacturing activities, industry trends, etc. Finance, marketing , legal, etc. all have similar needs which just makes the number of data copies grow out of sight. And we haven’t even started to discuss backup.

Ash says things reached a tipping point when server virtualization become the dominant approach to running applications, which led to an ever increasing need for data copies as app’s started being developed and run all over the place. Then came along data deduplication which displaced tape in IT’s backup process, so that backup data (copies) now could reside on disk.  Finally, with the advent of disk deduplication, backups no longer had to be in TAR (backup) formats but could now be left in-app native formats. In native formats, any app/developer/analyst could access the backup data copy.

Actifio Copy Data Virtualization

So what is Actifio? It’s essentially a massively distributed object storage with a global name space, file system on top of it. Application hosts/servers run agents in their environments (VMware, SQL Server, Oracle, etc.) to provide change block tracking and other metadata as to what’s going on with the primary data to be backed up. So when a backup is requested, only changed blocks have to be transferred to Actifio and deduped. From that deduplicated change block backup, a full copy can be synthesized, in native format, for any and all purposes.

With change block tracking, backups become very efficient and deduplication only has to work on changed data so that also becomes more effective. Data copying can also be done more effectively since their only tracking deduplicated data. If necessary, changed blocks can also be applied to data copies to bring them up to date and current.

With Actifio, one can apply SLA’s to copy data. These SLA’s can take the form of data governance, such that some copies can’t be viewed outside the country, or by certain users. And they can also provide analytics on data copies. Both of these capabilities take copy data to whole new level.

We didn’t get into all Actifio’s offerings on the podcast but Actifio CDS is as a high availability appliance which runs their  object/file system and contains data storage. Actifio also comes in a virtual appliance as Actifio SKY, which runs as a VM under VMware, using anyone’s storage.  Actifio supports NFS, SMB/CIFS, FC, and iSCSI access to data copies, depending on the solution chosen. There’s a lot more information on their website.

It sounds a little bit like PrimaryData but focused on data copies rather than data migration and mostly tier 2 data access.

The podcast runs ~46 minutes and  covers a lot of ground. I spent most of the time asking Ash to explain Actifio (for Howard, TFD11 filled this in). Howard had some technical difficulties during the call which caused him to go offline but then came back on the call. Ash and I never missed him :), listen to the podcast to learn more.

Ash Ashutosh, CEO Actifio

Ash Ashutosh Hi Res copy-resizedAsh Ashutosh brings more than 25 years of storage industry and entrepreneurship experience to his role of CEO at Actifio. Ashutosh is a recognized leader and architect in the storage industry where he has spearheaded several major industry initiatives, including iSCSI and storage virtualization, and led the authoring of numerous storage industry standards. Ashutosh was most recently a Partner with Greylock Partners where he focused on making investments in enterprise IT companies. Prior to Greylock, he was Vice President and Chief Technologist for HP Storage.

Ashutosh founded and led AppIQ, a market leader of Storage Resource Management (SRM) solutions, which was acquired by HP in 2005. He was also the founder of Serano Systems, a Fibre Channel controller solutions provider, acquired by Vitesse Semiconductor in 1999. Prior to Serano, Ashutosh was Senior Vice President at StorageNetworks, the industry’s first Storage Service Provider. He previously worked as an architect and engineer at LSI and Intergraph.

GreyBeards talk hyper-convergence with Jason Collier, CTO & Co-founder Scale Computing

Welcome to our 11th monthly episode where we return to discussing hyper converged systems, this time with Jason Collier, CTO and Co-Founder Scale Computing. The GreyBeards haven’t talked with an hyper converged system vendor since our 1st episode and we had talked with Scale Computing and Jason at Storage Field Day 5 (SFD5) a couple of months ago. Jason is a blast to talk with so it seemed like a good time to talk with him.

Jason started out in the IT end of this business helping to scale a Web 2.0 company transactions almost 300X with architectural innovations.   Scale Computing originally came out as clustered, scale out storage but their intent all along was to provide hyper converged systems which include storage, networking, compute and virtualization under one and the same system and UI. When they came out with the full solution they dropped their standalone, scale out storage offering.

This month’s episode comes in at around 42 minutes long. 

In this podcast we discuss the “true meaning” of hyper converged systems and how this differs from most of the rest of the industry’s converged system architectures.  They are targeting their solution to the SMB and mid-range marketplace.  Scale Computing offers a fully bundled solution, hardware and software for one fee, which includes all the server virtualization software.  As an example, a 3-node entry level cluster, using 1000 series nodes, only costs $25K MSRP. The GreyBeards go a little deep into how they do storage but it’s not as deep as some prior talks.  Nonetheless, Jason’s a gas to talk with and Howard and I really enjoyed talking again with him. The talk seemed to last only 10 minutes or so but when I looked at the clock we were already out of time. For some reason I never got around to asking Jason why his Twitter id is @bocanuts.  Listen to podcast to learn more.

If your interested in more of in-depth, deep dive on their technology the Greybeards would suggest you view the set of videos from Scale Computing’s SFD5 (link above) sessions.

Jason Collier, CTO and Co-founder Scale Computing

Jason Collier 2

As a founder and Chief Technology Officer, Jason is responsible for the technology vision of the company. Previously, Jason was VP of Technical Operations at Corvigo where he oversaw sales engineering, technical support, internal IT and datacenter operations. Prior to Corvigo, Jason was VP of Information Technology and Infrastructure at Radiate. There he architected and oversaw the deployment of the entire Radiate ad-network infrastructure, scaling it from under one million transactions per month when he started to more than 300 million at its peak.