Wednesday, October 26, 2016

10 Required Non-technical Skills for a Data Scientist

"Data Science(DS)" is nothing new but the term itself and the recent level of interest in it. As a practice it has commercially (not academically) existed for more than 25 years, mainly under "Data Mining (DM)" and "predictive analytics(PA)," since early 1990's. DM and PA got a lot of traction originally in financial, Telco, and retail industries that had a lot of granular historical data. Like anything that gets sudden attention and interest, DS has been misused and abused in a variety of ways. Given the fast surge in market demand in the last several years, many claim to be or want to be data scientists. True data scientists and DS managers who had to deal with screening DS resumes, can testify to the level of present noise (false positives) in that application process.


"Data Science" tries to be an umbrella field that covers more of what data mining and predictive analytics practices have covered. That is justified since with the growth of data of all kinds in recent decade and what is expected in the coming years, we need a lot more of the people with relevant DS skill sets. The challenge however has been the definition of that "skillset." What makes a good data scientist?

In my previous post "What is BuDAI?," I explained that a successful DS project requires the involvement of the data science team through the whole cycle. The core part of a data science project deliverable is the insight and decision coming out of analytics. The analytics could be trivial (generally aggregated view of data and only looking at a handful of variables together) where in that case there would be no need for DS. That would be in the realm of a data or business analyst. DS comes into picture usually where:
  • More sophisticated analytics approaches are required,
  • More complex transformations are required to prepare the data,
  • Granular or atomic analysis of entities of interests is required,
  • Analytics could be straightforward but big data is involved requiring attention to optimization of analytics,
  • ...
Within BuDAI process, the S team has to interact with business, data engineers, data architects, project managers, and product managers to name a few. Aside from some relevant technical skills/knowledge[1] in math, stats, machine learning, programming, databases, and systems (the breadth and depth will depend on the level of seniority of the Data Scientist), through the years I have found the following ten traits to be as important as technical skills for junior hires and absolutely essential for senior data scientists.

  1. Problem solving ability 
  2. Business acumen
  3. Ability to question the work of self and others,
  4. Passion for data (the more data, the better)
  5. Attention to details and ability to validate own work in multiple ways
  6. Statistical thinking (a thinker who knows when to reason deterministically and when not)
  7. Passion for exploration and discovery (quick learner from fails)
  8. Ability to devise optimal ways to experiment new or creativity (finding novel useful insight is cumbersome. One can never find a sure way to find it)
  9. Presentation ability (written and oral)
  10. Ability to simplify complex concepts for explaining to others.

----------------------
[1] This is the subject of another blog and given the today's coverage of data science, the required technical abilities vary greatly.

---------------------------------------------------------------------------
I discuss these topics in detail in my book. Visit the book site for "High-Performance Data Mining and Big Data Analytics: The Story of Insight from Big Data" (http://bigdataminingbook.info ).

77 comments:

  1. Thank you very much for telling us about the big data and its insights. The information were helpful for me and it does to others as well .

    AWS Cloud Migration Services
    Azure Cloud Migration Services
    VMware Cloud Migration Services
    Cloud Migration tool
    Database Migration Services
    Cloud Migration Services

    ReplyDelete
  2. Very Good Information...
    Data science Course in Mumbai


    Thank You Very Much For Sharing These Nice Tips..

    ReplyDelete
  3. Thank you for sharing the article. The data that you provided in the blog is informative and effective.
    Best Data Science Online Training Institute

    ReplyDelete
  4. Wow Such a great Blog. I thought that it was exceptionally helpful. I discovered this which is exceptionally utilize full. Extraordinary article and data continue sharing more! Love yours blog. Heap of Thanks.

    Power Bi online training

    ReplyDelete
  5. Nice and good article. It is very useful for me to learn and understand easily.
    Data Science Training in Delhi
    Data Science Training institute in Delhi

    ReplyDelete
  6. Python web development is quite in demand and a very good option for Python developers. In over the span of 25 years, Python has managed to reach a level that is high above others making it the fastest growing language.

    Best Python Training Center in Delhi, India

    Advanced Python Training Institute in Delhi
    Advanced Python Training Institute in Noida

    ReplyDelete
  7. It was great knowledge after reading this. Thanks for sharing such good stuff with us. I am pleased for sharing on this blog its remarkable blog I really fascinated. Otherwise If anyone Want to learn Basic to Adv. MIS Training & Complete Data Science So Contact here-9311002620.

    Certified MIS & Data Science Training Center in Delhi, India

    MIS Training Institute in Noida
    MIS Training Institute in Delhi
    MIS Training Institute in Faridabad

    ReplyDelete
  8. We are really grateful for your blog post. You will find a lot of approaches after visiting your post. I was exactly searching for. Thanks for such post and please keep it up. Great work. install tensorflow anaconda

    ReplyDelete
  9. Thank you for sharing valuable information. Nice post. I enjoyed reading this post. The whole blog is very nice found some good stuff and good information here Thanks..Also visit my page. Binary Demand

    ReplyDelete
  10. Thank you for sharing this post the data in the post is very effective and informative

    Best Data Science Online Training Institute

    ReplyDelete
  11. Thank you for sharing this wonderful information....keep sharing.

    Looking for best Data Science Certification in Bangalore. Prwatech offers big data certification courses with 100% Placement assistance to All Our Students. Also it provides placement assistance service in Bangalore for IT.

    Some training courses we offered are:
    Big Data Hadoop Online Training.
    Best Tableau Classroom Training in Pune.

    ReplyDelete
  12. I have been through a blog, it was so distinct & I had a chance to collect the information that helps me a lot to improvise myself. I hope this will help many readers who are in need of this vital piece of information. Thanks for sharing & keep your blog updated.Visit my blog HQL Sales Campaigns

    ReplyDelete
  13. Randomly found your blog. Your blog is away-some. Get data science courses in Mumbai, Pune. And you can get training from one of the best training for other courses also like Artificial Intelligence, Machine Learning, Machine Learning, SAS Training, Python Programming etc.

    ReplyDelete
  14. Look at our Latest listed properties and check out the facilities on them, We have already sold more than 5,000 Homes and we are still going at very good pace. We would love you to look into these properties and we hope that you will find something match-able to your needs.
    Properties In Hyderabad

    ReplyDelete
  15. Informative blog that really helps to Data Science Developers. Thanks for valuable blog.

    Vicky from Way2Smile Solutions DMCC (A Leading Data Analytics Solutions provider in Dubai)

    ReplyDelete
  16. This comment has been removed by the author.

    ReplyDelete
  17. Excellent info, I really appreciate your work. Continue sharing more with latest updates.Thanks for your excellent blog and giving great kind of information. So useful. Nice work keep it up thanks for sharing the knowledge.
    Salesforce Training in Chennai

    Salesforce Online Training in Chennai

    Salesforce Training in Bangalore

    Salesforce Training in Hyderabad

    Salesforce training in ameerpet

    Salesforce Training in Pune

    Salesforce Online Training

    Salesforce Training

    ReplyDelete
  18. data warehousing solutions should understand the need of Data, and they should work to build more appropriate services to meet the requirements of their clients.

    ReplyDelete
  19. This is most informative and also this post most user friendly and super navigation to all posts.
    Best Data Science Certification Course in Bangalore
    Big Data Certification Course in Bangalore

    ReplyDelete
  20. Thank you for sharing the article. The data that you provided in the blog is informative and effective.

    Power Bi Training in Hyderabad

    ReplyDelete
  21. Great post! I am actually getting ready to across this information, It’s very helpful for this blog. Also great with all of the valuable information you have Keep up the good work you are doing well.
    paloalto secure

    ReplyDelete
  22. THanks a lot for such a nice blog. Please read my blog on How to Learn Data Science

    ReplyDelete
  23. Thanks a lot for such a nice blog. Please read my blog on How to Learn Data Science
    Linkedin URL How to Learn Data Science

    ReplyDelete
  24. Stunning! Such an astonishing and supportive post this is. I incredibly love it. It's so acceptable thus wonderful. I am simply astounded.

    data science course

    ReplyDelete
  25. Great Article… I love to read your articles because your writing style is too good, its is very very helpful for all of us and I never get bored while reading your article because, they are becomes a more and more interesting from the starting lines until the end.as I have joined for a data science course in Learn Digital Academy which is one of the best institute for data analytics courses online. Keep doing the good work.

    ReplyDelete
  26. This is a great post I saw thanks to sharing. I really want to hope that you will continue to share great posts in the future.
    data science course in noida

    ReplyDelete
  27. Thanks for providing such a valuable Knowledge on B2B Data Solutions . This is really very nice blog, your content is very interesting and worth reading it.Keep sharing. Very knowledgeable Blog.

    ReplyDelete


  28. Excellent Blog! I would like to thank for the efforts you have made in writing this post.

    Data Scientist Course in pune

    ReplyDelete
  29. iot training in chennai - Iot Training in Chennai - Iot is an fastest growing technology which has an lot of job opportunities both in India as well as abroad. Join the Best IOT Training Institute in Chennai today.

    DevOps training in chennai - DeVops is an course which is creating a lot of Job Oppurtunities in the IT Sector.

    blue prism training in Chennai - Blue prism is an best and wonderful course to start learning for college goers and as well as the freshers.

    uipath training in Chennai - Join the uipath course and training in Chennai.uipath is an IT course and quite easier to learn when getting trained in the Best Ui path training in Chennai.

    microsoft azure training in chennai - Microsoft azure is an course for both freshers and experienced, get trained under the Best Microsoft azure training Institute in Chennai.

    ReplyDelete
  30. Excellent effort to make this blog more wonderful and attractive. ExcelR Data Science Courses

    ReplyDelete
  31. Thanks for the detailed blog.The blog conssit of informational content about the topic.I really appreciate your effort in posting such and topic.You may also visit to the GLobal Tech Council to get the best deal.

    Visit- big data certification course

    ReplyDelete
  32. Really awesome blog!!! I finally found great post here.I really enjoyed reading this article. It's really a nice experience to read your post. Thanks for sharing your innovative ideas. Defence Simulation

    ReplyDelete

  33. Thanks for the detailed blog.The blog consist of informational data about what a user basically serach.You may visit to the Global Tech Council to get the best deal.

    Visit-Big data analytics certification

    ReplyDelete
  34. We provide Classroom training on IBM Certified Data Science at Hyderabad for the individuals who believe hand-held training. We teach as per the Indian Standard Time (IST) with In-depth practical Knowledge on each topic in classroom training, 80 – 90 Hrs of Real-time practical training classes. There are different slots available on weekends or weekdays according to your choices. We are also available over the call or mail or direct interaction with the trainer for active learning.
    For any queries feel free to Call/WhatsApp us on +91-9951666670 or mail at info@innomatics.in
    data science training in hyderabad
    Data Science Course in Hyderabad

    ReplyDelete
  35. Thanks for the detailed blog.The blog consist of informational content about the topic.I really appreciate the blog post.YOu may also visit to the Global tech council to get the best deal.

    Just click- Data science certificate online

    ReplyDelete
  36. Thank you for sharing this article with us.
    Thanks to this very good post here, I really like it and look for those topics (data mining) and everything relevant to them.

    ReplyDelete
  37. This comment has been removed by the author.

    ReplyDelete
  38. Thank you for sharing the article. The data that you provided in the blog is informative and effective.
    Data Analyst Course in Pune

    ReplyDelete
  39. This comment has been removed by the author.

    ReplyDelete
  40. Good Post and informative one. Thank you for sharing this good article.
    Data Analytics Certification

    ReplyDelete
  41. Thank you for sharing valuable information. Nice post. I enjoyed reading this post. The whole blog is very nice found some good stuff here Thanks..Also visit my page B2B Data Solutions

    ReplyDelete
  42. "Big data" is the buzzword of the year, and while it's not exactly fresh news, it's just starting to become a reality. Big data services refers to the analysis of very large datasets, like the hundreds of millions of Gmail messages that Google processes every day, or the terabytes of data collected by the Large Hadron Collider. The key to big data's success is its ability to collect, store, and analyze massive amounts of data.

    ReplyDelete
  43. This comment has been removed by the author.

    ReplyDelete
  44. Innomatics Research Labs is collaborated with JAIN (Deemed-to-be University) and offering the Online MBA in Artificial Intelligence & Business Intelligence Program. It is a sublime program of getting an MBA degree from one of the best renowned university – JAIN University and an IBM certification program in Data Science, Artificial Intelligence, and Business Intelligence from Innomatics Research Labs in collaboration with Royal Society London.
    Online MBA in Artificial intelligence from Jain University

    ReplyDelete
  45. Extremely helpful post, thanks for giving this wonderful article.
    Visit us: Data Science Course in Ranchi

    ReplyDelete
  46. Thank you for sharing valuable information. Nice post. I enjoyed reading this post. The whole blog is very nice found some good stuff here Thanks..Also visit my page B2B Data Solution Services

    ReplyDelete
  47. I as of late went over your blog and have been perusing along. I figured I would leave my first remark. I don't have the foggiest idea what to say aside from that I have delighted in perusing. Decent blog, I will continue to visit this blog regularly…

    AI Training in Hyderabad

    ReplyDelete
  48. Succeed in your Data Science career by leveraging real-world job-centric skills that would get you hired successfully by joining AI Patasala’s Data Science Course in Hyderabad.
    AI Patasala Data Science Training

    ReplyDelete
  49. Thanks for sharing this blog with us. Really awesome blog. Keep sharing stuff like this.
    Data Science Course Training in Hyderabad

    ReplyDelete
  50. Thanks for sharing this blog with us. Really awesome blog, information and knowledgeable content. Keep sharing more.
    Data Science Training with Placements in Hyderabad

    ReplyDelete
  51. Really impressed! Everything is very open and very clear clarification of issues. It contains true facts. Your website is very valuable. Thanks for sharing.
    business analytics course in hyderabad

    ReplyDelete
  52. Just saying thanks will not just be sufficient, for the fantasti c lucidity in your writing. I will instantly grab your rss feed to stay informed of any updates. business analytics course in mysore

    ReplyDelete