Diego Klabjan

SPARK AND IN-MEMORY DATABASES: TACHYON LEADING THE PACK

1/7/2015

The biggest gripes about Hadoop are its batch-processing focus and the fact that iterative algorithms cannot be written efficiently. For this reason it is mostly used in data lakes: for storing huge datasets, for its ETL/ELT capabilities, and for running ad hoc queries with MapReduce.

In-memory databases, on the other hand, offer great response times but are limited in capacity by physical memory. The market is embracing several solutions, from SAP's HANA to VoltDB, MemSQL, Redis, and others.

Then came Spark with its brilliant idea of resilient distributed datasets (RDDs), which make it possible to mimic MapReduce while holding the data in a (persistent) cache. While a single MapReduce pass is not much faster in Spark than in Hadoop's MapReduce, algorithms that iterate over the same dataset are far more efficient, since the data stays in the memory cache and is reused across iterations.
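To make the iteration argument concrete, here is a minimal plain-Python sketch. It is a toy model, not Spark code: `ToyDataset`, `load_from_disk`, and `iterative_mean` are hypothetical names invented for illustration, and the "expensive load" is simulated by a counter. It only shows why an algorithm that scans the same data many times benefits from keeping it in memory after the first pass.

```python
from typing import Callable, List, Optional


class ToyDataset:
    """Hypothetical stand-in for a dataset whose loading is the expensive step."""

    def __init__(self, loader: Callable[[], List[float]]) -> None:
        self._loader = loader
        self._cached: Optional[List[float]] = None  # filled on first read if caching is on
        self._cache_enabled = False
        self.loads = 0  # how many times the expensive load actually ran

    def cache(self) -> "ToyDataset":
        """Ask for the data to be kept in memory after the first read."""
        self._cache_enabled = True
        return self

    def records(self) -> List[float]:
        if self._cache_enabled and self._cached is not None:
            return self._cached  # served from memory, no reload
        self.loads += 1
        data = self._loader()
        if self._cache_enabled:
            self._cached = data
        return data


def iterative_mean(ds: ToyDataset, iterations: int) -> float:
    """Gradient-descent estimate of the mean; every iteration scans the full dataset."""
    estimate = 0.0
    for _ in range(iterations):
        data = ds.records()
        gradient = sum(estimate - x for x in data) / len(data)
        estimate -= 0.5 * gradient
    return estimate


def load_from_disk() -> List[float]:
    """Assumed expensive read; here it just returns a small fixed list."""
    return [1.0, 2.0, 3.0, 4.0]


uncached = ToyDataset(load_from_disk)
cached = ToyDataset(load_from_disk).cache()

result_uncached = iterative_mean(uncached, 10)
result_cached = iterative_mean(cached, 10)

print(uncached.loads, cached.loads)  # 10 1
```

Both runs compute the same estimate; the only difference is that the uncached dataset is reloaded on every iteration while the cached one is loaded once.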

Spark, being a processing framework, is not a database or a filesystem, although it offers connectors to many databases and filesystems. Its memory-oriented cache offers great computational speed but no storage capabilities. Combining that speed with the quick access of in-memory databases is therefore the holy grail of computational efficiency and storage.

As an example, MemSQL announced a driver for Spark. Spark's functionality is now readily accessible on top of data residing in the MemSQL in-memory database. Real-time use cases such as fraud detection are sure to benefit from the marriage of the two.

A step further is Tachyon, developed at Berkeley. It offers in-memory storage with seamless Spark integration. If several Spark jobs access the same dataset stored in Tachyon, the dataset is not replicated but loaded only once. This is the ultimate in storage and computation efficiency.
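The "loaded only once" point can be illustrated with another small plain-Python sketch. This is not the Tachyon API: `SharedMemoryStore`, `read_from_disk`, and the path `events.dat` are hypothetical, and the two "jobs" are just two computations in the same process. The idea shown is simply that jobs asking a shared in-memory store for the same dataset trigger a single load and see the same copy.

```python
from typing import Callable, Dict, List


class SharedMemoryStore:
    """Hypothetical in-memory store shared by several jobs (illustration only)."""

    def __init__(self) -> None:
        self._data: Dict[str, List[int]] = {}
        self.loads = 0  # number of times data was read from the slow source

    def get(self, path: str, loader: Callable[[str], List[int]]) -> List[int]:
        # Load on first request only; later requests reuse the in-memory copy.
        if path not in self._data:
            self.loads += 1
            self._data[path] = loader(path)
        return self._data[path]


def read_from_disk(path: str) -> List[int]:
    """Assumed slow read; here it returns the integers 1..100."""
    return list(range(1, 101))


store = SharedMemoryStore()
path = "events.dat"  # hypothetical dataset name

# Two independent "jobs" working on the same dataset.
total = sum(store.get(path, read_from_disk))
maximum = max(store.get(path, read_from_disk))

# Both jobs were handed the very same in-memory object, not separate copies.
same_object = store.get(path, read_from_disk) is store.get(path, read_from_disk)

print(total, maximum, store.loads, same_object)  # 5050 100 1 True
```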

Just as Hadoop will not supplant the RDBMS (at least in the foreseeable future), Spark with Tachyon (or any other in-memory database) will not make the other two extinct. Huge datasets are unlikely to fit in memory economically, so the three roommates will continue to dance together and occasionally bump into each other.


    Diego Klabjan

    Professor at Northwestern University, Department of Industrial Engineering and Management Sciences. Founding Director, Master of Science in Analytics.
