Microsoft open sources its data compression algorithm and hardware for the cloud

5 years ago
Anonymous $Dftgs0JzgE

https://techcrunch.com/2019/03/14/zipline-microsoft-open-sources-its-data-compression-algorithm-and-hardware-for-the-cloud/

The amount of data that the big cloud computing providers now store is staggering, so it’s no surprise that most store all of this information as compressed data in some form or another, just like you used to zip your files back in the days of floppy disks, CD-ROMs and low-bandwidth connections. Typically, those systems are closely guarded secrets, but today, Microsoft open sourced the algorithm, hardware specification and Verilog source code for how it compresses data in its Azure cloud. The company is contributing all of this to the Open Compute Project (OCP).

Project Zipline, as Microsoft calls this project, can achieve 2x higher compression ratios compared to the standard Zlib-L4 64KB model. To do this, the algorithm and its hardware implementation were specifically tuned for the kind of large datasets Microsoft sees in its cloud. Because the system operates at the systems level, there is virtually no overhead, and Microsoft says that it is actually able to manage higher throughput rates and lower latency than other algorithms are currently able to achieve.
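
For a sense of what that baseline means in practice, here is a minimal Python sketch that measures the compression ratio of plain zlib on your own data. It assumes that "Zlib-L4 64KB" refers to zlib at compression level 4 applied to independent 64 KB blocks, which is an interpretation and not something the announcement spells out. The function name `baseline_ratio` and the sample log line are made up for illustration, and none of this is Project Zipline itself.

```python
import zlib

BLOCK_SIZE = 64 * 1024  # assumed reading of "64KB": independent 64 KB blocks
LEVEL = 4               # assumed reading of "L4": zlib compression level 4


def baseline_ratio(data: bytes, block_size: int = BLOCK_SIZE, level: int = LEVEL) -> float:
    """Return uncompressed size / compressed size, compressing each block on its own."""
    if not data:
        raise ValueError("data must be non-empty")
    compressed_total = 0
    for start in range(0, len(data), block_size):
        block = data[start:start + block_size]
        compressed_total += len(zlib.compress(block, level))
    return len(data) / compressed_total


if __name__ == "__main__":
    # Hypothetical, highly repetitive log-like sample; real data will compress far less.
    sample = b"timestamp=2019-03-14 level=INFO msg=request served\n" * 20000
    ratio = baseline_ratio(sample)
    print(f"zlib level {LEVEL}, {BLOCK_SIZE // 1024} KB blocks: {ratio:.1f}x")
```

Under this reading, a "2x higher compression ratio" would mean roughly doubling the number this function returns for the same input.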