Diego Klabjan

CSV to Spark SQL tables

3/20/2015

Recently we were involved in a project that required reading and importing more than 30 CSV files into Spark SQL. We started writing Scala code to import the files "manually", one by one, but soon realized that the code was highly repetitive.

As a result, we created a helper object that takes as input information about the CSV files and automatically creates a schema for each file. Each CSV file must have a header row, which determines the column names in the corresponding table. The user customizes an object in which the details are listed per file. For example, for each file the user can specify:

  • The name of the schema
  • Delimiter
  • An optional formatter function that takes a string value from the input CSV file and returns the corresponding Scala object used in the SchemaRDD (by default, i.e., if no formatter is specified, values are treated as strings).
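
To make the per-file settings listed above concrete, here is a minimal sketch in plain Scala. It is not the code from the package: the names CsvSpec, CsvSketch, splitLine and parse are hypothetical, the sample data is made up, and Spark itself is left out so that the snippet runs on its own. In a Spark job, the header and the converted rows would presumably feed the schema and the SchemaRDD instead of plain collections.

```scala
import java.util.regex.Pattern

// Hypothetical per-file settings; these names are NOT taken from spark-csv2sql.
final case class CsvSpec(
    tableName: String,                                  // name of the resulting table/schema
    delimiter: Char = ',',                              // field separator used in the file
    formatters: Map[String, String => Any] = Map.empty  // column name -> converter
)

object CsvSketch {
  // Default formatter: keep the raw string, matching the "strings are assumed" rule.
  private val keepAsString: String => Any = s => s

  /** Splits one line on the delimiter. Quoting and escaping are deliberately ignored. */
  def splitLine(line: String, delimiter: Char): Vector[String] =
    line.split(Pattern.quote(delimiter.toString), -1).map(_.trim).toVector

  /** Reads column names from the header, then converts each row field by column. */
  def parse(lines: Seq[String], spec: CsvSpec): (Vector[String], Vector[Vector[Any]]) = {
    require(lines.nonEmpty, s"${spec.tableName}: a header line is required")
    val header = splitLine(lines.head, spec.delimiter)
    // One converter per column, falling back to the string default.
    val converters: Vector[String => Any] =
      header.map(col => spec.formatters.getOrElse(col, keepAsString))
    // zip silently truncates rows that are longer or shorter than the header.
    val rows = lines.tail.toVector.map { line =>
      splitLine(line, spec.delimiter).zip(converters).map {
        case (value, convert) => convert(value)
      }
    }
    (header, rows)
  }

  def main(args: Array[String]): Unit = {
    // Made-up example: a pipe-delimited file with two typed columns.
    val spec = CsvSpec(
      tableName = "orders",
      delimiter = '|',
      formatters = Map[String, String => Any](
        "qty"   -> ((s: String) => s.toInt),
        "price" -> ((s: String) => s.toDouble)
      )
    )
    val (header, rows) = parse(Seq("id|qty|price", "a1|3|2.50", "b2|10|0.99"), spec)
    println(s"${spec.tableName}: columns = $header")
    rows.foreach(println)
  }
}
```

Keeping the formatters in a map keyed by column name means that only the columns that are not strings need an entry, so a file with many text columns needs almost no configuration.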

The code is available at https://github.com/wxhC3SC6OPm8M1HXboMy/spark-csv2sql.git and as a package on www.spark-packages.org.


    Diego Klabjan

    Professor at Northwestern University, Department of Industrial Engineering and Management Sciences. Founding Director, Master of Science in Analytics.
