Data scientist vs machine learning engineerData Science in C (or C++)data science / machine learning resources?Understanding portfolio-level risk modelsWhat is valued more in the data science job market, statistical analysis or data processing?Small data set in machine learningLearning AI, Machine Learning, Deep LearningData Matching Using Machine Learning

Is there a word to describe the feeling of being transfixed out of horror?

Make "apt-get update" show the exact output as `apt update`

How do I repair my stair bannister?

Death of a family member

Hostile work environment after whistle-blowing on coworker and our boss. What do I do?

Negative correlation but positive beta value

Getting the lowest value with key in array

Reply ‘no position’ while the job posting is still there (‘HiWi’ position in Germany)

Is it possible to build a CPA Secure encryption scheme which remains secure even when the encryption of secret key is given?

tikz grid without top edge

Simple image editor tool to draw a simple box/rectangle in an existing image

does this mean what I think it means - 4th last time

Find fails if filename contains brackets

Freedom of speech and where it applies

Would it be legal for a US State to ban exports of a natural resource?

Can a controlled ghast be a leader of a pack of ghouls?

Organic chemistry Iodoform Reaction

JavaScript array of objects contains the same array data

Can an armblade require double attunement if it integrates a magic weapon that normally requires attunement?

Superhero words!

What does the "3am" section means in manpages?

When is separating the total wavefunction into a space part and a spin part possible?

Should my PhD thesis be submitted under my legal name?

What (else) happened July 1st 1858 in London?



Data scientist vs machine learning engineer


Data Science in C (or C++)data science / machine learning resources?Understanding portfolio-level risk modelsWhat is valued more in the data science job market, statistical analysis or data processing?Small data set in machine learningLearning AI, Machine Learning, Deep LearningData Matching Using Machine Learning













60












$begingroup$


What are the differences, if any, between a "data scientist" and a "machine learning engineer"?



Over the past year or so "machine learning engineer" has started to show up a lot in job postings. This is particularly noticeable in San Francisco, which is arguably where the term "data scientist" originated. At one point "data scientist" overtook "statistician", and I'm wondering if the same is now slowly beginning to happen to "data scientist".



Career advice is listed as off-topic on this site, but I view my question as highly relevant since I'm asking about definitions; I'm not asking about recommendations given my own career trajectory or personal circumstances like other off-topic questions have.



This question is on-topic because it might someday have significant implications for many users of this site. In fact, this stack-exchange site might not exist if the "statistician" vs "data scientist" evolution had not occurred. In that sense, this is a rather pertinent, potentially existential question.










share|improve this question











$endgroup$







  • 2




    $begingroup$
    Data scientist sounds like a designation with little clarity on what the actual work will be, while machine learning engineer is more specific. In first case, your company will give you a target and you need to figure out what approach (machine learning, image processing, neural network, fuzzy logic, etc) you would use. In second case, you company has already narrowed down to what approach has to be used.
    $endgroup$
    – gurvinder372
    Feb 20 '18 at 6:31










  • $begingroup$
    Related: data science vs operations research . Also, a scientist is something different than an engineer. Unfortunately, industry doesn't seem to care about this.
    $endgroup$
    – Discrete lizard
    Feb 21 '18 at 9:56







  • 1




    $begingroup$
    As someone else pointed out, a ML engineer is simply someone who puts ML models into production. He's not expected to understand in depth the actual predictive models and their underlying mathematics, they're required however to master the software tools that make these models usable. A Data Scientist is expected to have a deep understanding of stats/math and ML/AI, and is often the person who creates the tools used by ML engineers. So a ML engineer is basically closer to a specialised software engineer and a DS is closer to a computational statistician.
    $endgroup$
    – Digio
    Aug 26 '18 at 12:19















60












$begingroup$


What are the differences, if any, between a "data scientist" and a "machine learning engineer"?



Over the past year or so "machine learning engineer" has started to show up a lot in job postings. This is particularly noticeable in San Francisco, which is arguably where the term "data scientist" originated. At one point "data scientist" overtook "statistician", and I'm wondering if the same is now slowly beginning to happen to "data scientist".



Career advice is listed as off-topic on this site, but I view my question as highly relevant since I'm asking about definitions; I'm not asking about recommendations given my own career trajectory or personal circumstances like other off-topic questions have.



This question is on-topic because it might someday have significant implications for many users of this site. In fact, this stack-exchange site might not exist if the "statistician" vs "data scientist" evolution had not occurred. In that sense, this is a rather pertinent, potentially existential question.










share|improve this question











$endgroup$







  • 2




    $begingroup$
    Data scientist sounds like a designation with little clarity on what the actual work will be, while machine learning engineer is more specific. In first case, your company will give you a target and you need to figure out what approach (machine learning, image processing, neural network, fuzzy logic, etc) you would use. In second case, you company has already narrowed down to what approach has to be used.
    $endgroup$
    – gurvinder372
    Feb 20 '18 at 6:31










  • $begingroup$
    Related: data science vs operations research . Also, a scientist is something different than an engineer. Unfortunately, industry doesn't seem to care about this.
    $endgroup$
    – Discrete lizard
    Feb 21 '18 at 9:56







  • 1




    $begingroup$
    As someone else pointed out, a ML engineer is simply someone who puts ML models into production. He's not expected to understand in depth the actual predictive models and their underlying mathematics, they're required however to master the software tools that make these models usable. A Data Scientist is expected to have a deep understanding of stats/math and ML/AI, and is often the person who creates the tools used by ML engineers. So a ML engineer is basically closer to a specialised software engineer and a DS is closer to a computational statistician.
    $endgroup$
    – Digio
    Aug 26 '18 at 12:19













60












60








60


34



$begingroup$


What are the differences, if any, between a "data scientist" and a "machine learning engineer"?



Over the past year or so "machine learning engineer" has started to show up a lot in job postings. This is particularly noticeable in San Francisco, which is arguably where the term "data scientist" originated. At one point "data scientist" overtook "statistician", and I'm wondering if the same is now slowly beginning to happen to "data scientist".



Career advice is listed as off-topic on this site, but I view my question as highly relevant since I'm asking about definitions; I'm not asking about recommendations given my own career trajectory or personal circumstances like other off-topic questions have.



This question is on-topic because it might someday have significant implications for many users of this site. In fact, this stack-exchange site might not exist if the "statistician" vs "data scientist" evolution had not occurred. In that sense, this is a rather pertinent, potentially existential question.










share|improve this question











$endgroup$




What are the differences, if any, between a "data scientist" and a "machine learning engineer"?



Over the past year or so "machine learning engineer" has started to show up a lot in job postings. This is particularly noticeable in San Francisco, which is arguably where the term "data scientist" originated. At one point "data scientist" overtook "statistician", and I'm wondering if the same is now slowly beginning to happen to "data scientist".



Career advice is listed as off-topic on this site, but I view my question as highly relevant since I'm asking about definitions; I'm not asking about recommendations given my own career trajectory or personal circumstances like other off-topic questions have.



This question is on-topic because it might someday have significant implications for many users of this site. In fact, this stack-exchange site might not exist if the "statistician" vs "data scientist" evolution had not occurred. In that sense, this is a rather pertinent, potentially existential question.







machine-learning






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Feb 20 '18 at 13:27









Stephen Rauch

1,52551229




1,52551229










asked Feb 20 '18 at 6:15









Ryan ZottiRyan Zotti

2,57931227




2,57931227







  • 2




    $begingroup$
    Data scientist sounds like a designation with little clarity on what the actual work will be, while machine learning engineer is more specific. In first case, your company will give you a target and you need to figure out what approach (machine learning, image processing, neural network, fuzzy logic, etc) you would use. In second case, you company has already narrowed down to what approach has to be used.
    $endgroup$
    – gurvinder372
    Feb 20 '18 at 6:31










  • $begingroup$
    Related: data science vs operations research . Also, a scientist is something different than an engineer. Unfortunately, industry doesn't seem to care about this.
    $endgroup$
    – Discrete lizard
    Feb 21 '18 at 9:56







  • 1




    $begingroup$
    As someone else pointed out, a ML engineer is simply someone who puts ML models into production. He's not expected to understand in depth the actual predictive models and their underlying mathematics, they're required however to master the software tools that make these models usable. A Data Scientist is expected to have a deep understanding of stats/math and ML/AI, and is often the person who creates the tools used by ML engineers. So a ML engineer is basically closer to a specialised software engineer and a DS is closer to a computational statistician.
    $endgroup$
    – Digio
    Aug 26 '18 at 12:19












  • 2




    $begingroup$
    Data scientist sounds like a designation with little clarity on what the actual work will be, while machine learning engineer is more specific. In first case, your company will give you a target and you need to figure out what approach (machine learning, image processing, neural network, fuzzy logic, etc) you would use. In second case, you company has already narrowed down to what approach has to be used.
    $endgroup$
    – gurvinder372
    Feb 20 '18 at 6:31










  • $begingroup$
    Related: data science vs operations research . Also, a scientist is something different than an engineer. Unfortunately, industry doesn't seem to care about this.
    $endgroup$
    – Discrete lizard
    Feb 21 '18 at 9:56







  • 1




    $begingroup$
    As someone else pointed out, a ML engineer is simply someone who puts ML models into production. He's not expected to understand in depth the actual predictive models and their underlying mathematics, they're required however to master the software tools that make these models usable. A Data Scientist is expected to have a deep understanding of stats/math and ML/AI, and is often the person who creates the tools used by ML engineers. So a ML engineer is basically closer to a specialised software engineer and a DS is closer to a computational statistician.
    $endgroup$
    – Digio
    Aug 26 '18 at 12:19







2




2




$begingroup$
Data scientist sounds like a designation with little clarity on what the actual work will be, while machine learning engineer is more specific. In first case, your company will give you a target and you need to figure out what approach (machine learning, image processing, neural network, fuzzy logic, etc) you would use. In second case, you company has already narrowed down to what approach has to be used.
$endgroup$
– gurvinder372
Feb 20 '18 at 6:31




$begingroup$
Data scientist sounds like a designation with little clarity on what the actual work will be, while machine learning engineer is more specific. In first case, your company will give you a target and you need to figure out what approach (machine learning, image processing, neural network, fuzzy logic, etc) you would use. In second case, you company has already narrowed down to what approach has to be used.
$endgroup$
– gurvinder372
Feb 20 '18 at 6:31












$begingroup$
Related: data science vs operations research . Also, a scientist is something different than an engineer. Unfortunately, industry doesn't seem to care about this.
$endgroup$
– Discrete lizard
Feb 21 '18 at 9:56





$begingroup$
Related: data science vs operations research . Also, a scientist is something different than an engineer. Unfortunately, industry doesn't seem to care about this.
$endgroup$
– Discrete lizard
Feb 21 '18 at 9:56





1




1




$begingroup$
As someone else pointed out, a ML engineer is simply someone who puts ML models into production. He's not expected to understand in depth the actual predictive models and their underlying mathematics, they're required however to master the software tools that make these models usable. A Data Scientist is expected to have a deep understanding of stats/math and ML/AI, and is often the person who creates the tools used by ML engineers. So a ML engineer is basically closer to a specialised software engineer and a DS is closer to a computational statistician.
$endgroup$
– Digio
Aug 26 '18 at 12:19




$begingroup$
As someone else pointed out, a ML engineer is simply someone who puts ML models into production. He's not expected to understand in depth the actual predictive models and their underlying mathematics, they're required however to master the software tools that make these models usable. A Data Scientist is expected to have a deep understanding of stats/math and ML/AI, and is often the person who creates the tools used by ML engineers. So a ML engineer is basically closer to a specialised software engineer and a DS is closer to a computational statistician.
$endgroup$
– Digio
Aug 26 '18 at 12:19










8 Answers
8






active

oldest

votes


















52












$begingroup$

Good question. Actually there is a lot of confusion on this subject, mainly because both are quite new jobs. But if we focus on the semantics, the real meaning of the jobs become clear.



Beforehand is better to compare apples with apples, talking about a single subject, the Data. Machine Learning and it's sons (Deep Learning, etc.) is just one sub-subject of the Data World, together with the statistic theories, the data acquisition (DAQ), the processing (which can be non-machine learning driven), the interpretation of the results, etc.



So, for my explanation, I will broad the Machine Learning Engineer role to the one of Data Engineer.



Science is about experiment, trials and fails, theory building, phenomenological understanding.
Engineering is about work on what science already knows, perfecting it and carry to the "real world".



Think about a proxy: what is the difference between a nuclear scientist and a nuclear engineer?



The nuclear scientist is the one which know the science behind the atom, the interaction between them, the one which wrote the recipe which allow to get energy from the atoms.



The nuclear engineer is the guy charged to take the recipe of the scientist, and carry it to the real world. So it's knowledge about the atomic physics is quite limited, but he also know about materials, buildings, economics, and whatever else useful to build a proper nuclear plant.



Coming back to the Data world, here another example: the guys which developed Convolutional Neural Networks (Yann LeCun) is a Data Scientist, the guy which deploy the model to recognize faces in pictures is a Machine Learning Engineer. The guy responsible of the whole process, from the data acquisition to the registration of the .JPG image, is a Data Engineer.



So, basically, 90% of the Data Scientist today are actually Data Engineers or Machine Learning Engineers, and 90% of the positions opened as Data Scientist actually need Engineers. An easy check: in the interview, you will be asked about how many ML models you deployed in production, not on how many papers on new methods you published.



Instead, when you see announces about "Machine Learning Engineer", that means that the recruiters are well aware of the difference, and they really need someone able to put some model in production.






share|improve this answer









$endgroup$












  • $begingroup$
    I've never thought of the nuclear scientists vs. engineer I think this is a thorough answer. It's appropriate to my experience, when i'm doing analysis it's like that white lab coat (jupyter and pretty graphs). When i'm "getting my hands dirty" with engineering production work (etl & webapp containers), i'm constantly finding weird edge cases, bugs, and bad code smell.
    $endgroup$
    – Tony
    Feb 20 '18 at 14:52










  • $begingroup$
    Isn't Yann LeCun a Computer Scientist? And a Data Scientist would be someone who uses pre-made computer algorithms and techniques (invented by Computer Scientists like Yann LeCun) to perform scientific analysis of data ? The same way that other scientists leverage computers in their work? So acquiring data, cleaning it, combining different analysis techniques (plotting, pattern matching, ML models, etc.) together in order to learn hidden truths within the data?
    $endgroup$
    – Didier A.
    Jan 26 at 7:16










  • $begingroup$
    YLC, is a Computer Scientist indeed, but he is specialized in Data. CS has become a too broad field, from which all those new definitions (like DS) camed out. And so using CS become not really discriminant. Like the appellative "Physicist" a couple of hundreds of years ago: today that word actually do not define someone's job, unless you specify it better (ex. Particle P., Solid State P., etc.). But anyway, a Scientist (CS, DS, any -S) is not someone who limit himself on use other's discoveries. Instead, his job is to understand, and by this mean, make discoveries.
    $endgroup$
    – Vincenzo Lavorini
    Jan 27 at 8:27










  • $begingroup$
    Could you kindly answer this question regarding Data Engineer career guidance.
    $endgroup$
    – stom
    Feb 1 at 6:44










  • $begingroup$
    How is science about "phenomenological understanding"?
    $endgroup$
    – ubadub
    Feb 25 at 20:47


















10












$begingroup$

The terms are nebulous because they are new



Being in the middle of a job search in the 'data science' field, I think that there are two things going on here. First, the jobs are new, and there is no set definitions of various terms, so no commonly agreed upon matching of terms with job descriptions. Compare this to 'web developer' or 'back-end developer.' These are two similar jobs that have reasonably well agreed upon and distinct descriptions.



Second, a lot of people doing the job posting and initial interviews don't know that well what they are hiring for. This is particularly true in the case of small to medium sized-companies that hire recruiters to find applicants for them. It is these intermediaries that are posting the job descriptions on CareerBuilder or whatever forum. This isn't to say that many of them don't know their stuff, many of them are quite knowledgeable about the companies they represent and the requirements of the workplace. But, without well defined terms to describe different specific jobs, nebulous job titles are often the result.



There are three general divisions of the field



In my experience, there are three general divisions of the 'job space' of data science.



The first is the development of the mathematical and computational techniques that make data science possible. This covers things like statistical research into new machine learning methods, the implementation of these methods, and the building of computational infrastructure to employ these methods in the real world. This is the division farthest separated from the customer, and the smallest division. Much of this work is done by either academics or researchers at the big companies (Google, Facebook, etc). This is for things like developing Google's TensorFlow, IBM's SPSS neural nets, or whatever the next big graph database is going to be.



The second division is using the underlying tools to create application specific packages to perform whatever data analysis needs to be done. People are hired to use Python or R or whatever to build analysis capability on some set of data. A lot of this work, in my experience, involves doing the 'data laundry,' turning raw data in whatever form into something usable. Another big chunk of this work is databasing; figuring out how to store the data in a way that it can be accessed in whatever timeline you need it in. This job isn't so much taking tools, but using existing database, statistics, and graphical analysis libraries to produce some results.



The third division is producing analysis from the newly organized and accessible data. This is the most customer facing side, depending on your organization. You have to produce analysis that business leaders can use to make decisions. This would be the least technical of the three divisions; many jobs are hybrids between the second and third divisions at this point, since data science is in its infancy. But in the future, I strongly suspect that there will be a more clean division between these two jobs, with people win the second job needing a technical, computer science or statistics based education, and this third job needing only a general education.



In general, all three could describe themselves as 'data scientist', but only the first two could reasonably describe themselves as 'machine learning engineer.'



Conclusion



For the time being, you will have to find out yourself what each job entails. My current job hired me on as an 'analyst,' to do some machine learning stuff. But as we got to work, it became apparent that the company's databasing was inadequate, and now probably 90% of my time is spent working on the databases. My machine learning exposure is now just quickly running stuff through whatever scikit-learn package seems most appropriate, and shooting csv files to the third division analysts to make powerpoint presentations for the customer.



The field is in flux. A lot of organizations are trying to add data science decision making to their processes, but without knowing clearly what that means. Its not their fault, its pretty hard to predict the future, and the ramifications of a new technology are never very clear. Until the field is more established, many jobs themselves will be as nebulous as the terms used to describe them.






share|improve this answer











$endgroup$




















    9












    $begingroup$

    [Completely a personal opinion]



    When the term 'Data Scientist' overtook 'Statistician', it is more towards sounding cool, rather than any major difference. Similarly, the term 'Deep Learning'. It is just neural networks (which is another Machine Learning algorithm) with a couple of more layers. No one can explain when a particular neural net can be called DL, rather than ML, cause the definition itself is fuzzy. So, is the term 'Data Scientist'.



    However, as companies are adopting the DevOps mindset to data science, the term ML Engineer evolved.



    What is the DevOps mindset to data science?



    This is where you build the model, deploy it and also expected to maintain it in production. This helps in avoiding a lot of friction in software teams.



    [PS: DevOps is a way of doing software, more like a philosophy. So, using it as a designation, again confuses me].



    So, ML engineers are supposed to know the nuances of systems engineering, ML, and stats (obviously).



    A vague generalization would be Data Engineer + Data Scientist = ML Engineer.



    Having said that, the designations in this space are becoming vague day by day, and the term 'Statistician' is becoming more and more relevant (the irony!).






    share|improve this answer











    $endgroup$








    • 2




      $begingroup$
      Machine Learning is much more than just neural nets (just as an example, consider all kinds of tree-based classifiers), so don't see how "Deep Learning is just Machine Learning with a couple of more layers".
      $endgroup$
      – Stephan Kolassa
      Feb 20 '18 at 12:38










    • $begingroup$
      @StephanKolassa Yeah. Agree. Shouldn't have generalized it too much :) Thanks for pointing it out.
      $endgroup$
      – Dawny33
      Feb 20 '18 at 13:49






    • 1




      $begingroup$
      (+1) but I don't think "statistician" becoming more relevant is an irony, just... an expected transition? Where are the "operational researchers" these days? ;)
      $endgroup$
      – usεr11852
      Feb 20 '18 at 22:28



















    7












    $begingroup$

    It may vary from company to company, but Data Scientist as a designation has been around for some time now and is usually meant for extracting knowledge and insights from data.



    I have seen data scientists doing



    • Writing Image processing and image recognition algorithms,

    • Design and implement decision trees for a business use case,

    • Or simply design and implement some reports or write ETLs for data transformations.

    Data science, however, is a super-domain of machine learning




    It employs techniques and theories drawn from many fields within the
    broad areas of mathematics, statistics, information science, and
    computer science, in particular from the subdomains of machine
    learning, classification, cluster analysis, uncertainty
    quantification, computational science, data mining, databases, and
    visualization
    .




    Machine learning engineer seems to be a designation where your employer has already narrowed down to the



    • Approach,

    • Tools,

    • and a rough model (of what to deliver)

    to extract knowledge or insights from data using machine learning and your work will be to design and implement machine learning algorithms to deliver the same.






    share|improve this answer









    $endgroup$




















      4












      $begingroup$

      Machine Learning Engineers and engineering focused Data Scientist are the same, but not all Data Scientist are engineering focused. About 5 years ago almost all Data Scientist were engineering focused, e.g, they had to write production code. Now, however, there are many Data Scientist roles that are for most part: playing in Jupyter notebook, understanding data, making pretty graphs, explaining to clients, managers, analysts... They don't do any engineering. And I believe that term Machine Learning Engineers came up to underline that this an engineering position.






      share|improve this answer









      $endgroup$




















        2












        $begingroup$

        TL;DR: It depends on who is asking.



        The answer to this question depends largely on the expectations, knowledge, and experience of whomever is asking. An analogous question with just as fuzzy of an answer is:




        What is the difference between a software developer, a software
        engineer, and a computer scientist?




        To some people, particularly people who study or teach computer science and software engineering, there is a large and defined difference between these fields. But to the average HR worker, technical recruiter, or manager, these are all just "Computer People".



        I love this quote by Vincent Granville, emphasis mine:




        Earlier in my career (circa 1990) I worked on image remote sensing
        technology, among other things to identify patterns (or shapes or
        features, for instance lakes) in satellite images and to perform image
        segmentation: at that time my research was labeled as computational
        statistics, but the people doing the exact same thing in the computer
        science department next door in my home university, called their
        research artificial intelligence. Today, it would be called data
        science
        or artificial intelligence, the sub-domains being signal
        processing, computer vision or IoT.







        share|improve this answer











        $endgroup$




















          0












          $begingroup$

          I don't disagree with any of the answers given. However, I do think that there is a role of Data Scientist that is being glossed over in virtually all of the answers here. Most of these answers say something to the effect of, "Well, an engineer just writes and deploys the model . . . ". Hold on a sec - there's A LOT of work in those two steps!



          My core definition of a Data Scientist is someone that applies the scientific method to working with data. So I'm constantly thinking of hypostheses, designing tests, collecting my data and executing those tests, checking my cross validation results, trying new approaches, transforming my data, etc, etc. That's essentially what goes into "just writes and deploys the model" in a professional setting.



          So, for your answer, I think "the devil is in the details" because you can't just gloss over some of these steps/terms. Also, if you are job hunting, you should be careful because "data engineer" and "data scientist" can have woefully different pay scales - you do not want to be a data scientist on a data engineer salary!



          I always put myself out there as a data scientist, I tell companies that I work on predictive models (not just analytical) and that I'm not an Excel jockey - I write in programming languages (R, Python, etc). If you can find a position that let's you do both of those, then you're on your way to being a data scientist.






          share|improve this answer









          $endgroup$




















            0












            $begingroup$

            I think Machine learning engineer and Data Scientist are very much different . Many people get confused because machine learning is included in Data Science. But it is not that similar as the knowledge of machine is put together in Data Science where as The knowledge of Data Scientist comprises of Machine Learning , python, R , Statistics and basic mathematical skills. A machine learning engineer have proper knowledge of Machine learning only but Data Scientist will have the proper knowledge of all the above mentioned topic.






            share|improve this answer









            $endgroup$












              Your Answer





              StackExchange.ifUsing("editor", function ()
              return StackExchange.using("mathjaxEditing", function ()
              StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
              StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
              );
              );
              , "mathjax-editing");

              StackExchange.ready(function()
              var channelOptions =
              tags: "".split(" "),
              id: "557"
              ;
              initTagRenderer("".split(" "), "".split(" "), channelOptions);

              StackExchange.using("externalEditor", function()
              // Have to fire editor after snippets, if snippets enabled
              if (StackExchange.settings.snippets.snippetsEnabled)
              StackExchange.using("snippets", function()
              createEditor();
              );

              else
              createEditor();

              );

              function createEditor()
              StackExchange.prepareEditor(
              heartbeatType: 'answer',
              autoActivateHeartbeat: false,
              convertImagesToLinks: false,
              noModals: true,
              showLowRepImageUploadWarning: true,
              reputationToPostImages: null,
              bindNavPrevention: true,
              postfix: "",
              imageUploader:
              brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
              contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
              allowUrls: true
              ,
              onDemand: true,
              discardSelector: ".discard-answer"
              ,immediatelyShowMarkdownHelp:true
              );



              );













              draft saved

              draft discarded


















              StackExchange.ready(
              function ()
              StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f28006%2fdata-scientist-vs-machine-learning-engineer%23new-answer', 'question_page');

              );

              Post as a guest















              Required, but never shown

























              8 Answers
              8






              active

              oldest

              votes








              8 Answers
              8






              active

              oldest

              votes









              active

              oldest

              votes






              active

              oldest

              votes









              52












              $begingroup$

              Good question. Actually there is a lot of confusion on this subject, mainly because both are quite new jobs. But if we focus on the semantics, the real meaning of the jobs become clear.



              Beforehand is better to compare apples with apples, talking about a single subject, the Data. Machine Learning and it's sons (Deep Learning, etc.) is just one sub-subject of the Data World, together with the statistic theories, the data acquisition (DAQ), the processing (which can be non-machine learning driven), the interpretation of the results, etc.



              So, for my explanation, I will broad the Machine Learning Engineer role to the one of Data Engineer.



              Science is about experiment, trials and fails, theory building, phenomenological understanding.
              Engineering is about work on what science already knows, perfecting it and carry to the "real world".



              Think about a proxy: what is the difference between a nuclear scientist and a nuclear engineer?



              The nuclear scientist is the one which know the science behind the atom, the interaction between them, the one which wrote the recipe which allow to get energy from the atoms.



              The nuclear engineer is the guy charged to take the recipe of the scientist, and carry it to the real world. So it's knowledge about the atomic physics is quite limited, but he also know about materials, buildings, economics, and whatever else useful to build a proper nuclear plant.



              Coming back to the Data world, here another example: the guys which developed Convolutional Neural Networks (Yann LeCun) is a Data Scientist, the guy which deploy the model to recognize faces in pictures is a Machine Learning Engineer. The guy responsible of the whole process, from the data acquisition to the registration of the .JPG image, is a Data Engineer.



              So, basically, 90% of the Data Scientist today are actually Data Engineers or Machine Learning Engineers, and 90% of the positions opened as Data Scientist actually need Engineers. An easy check: in the interview, you will be asked about how many ML models you deployed in production, not on how many papers on new methods you published.



              Instead, when you see announces about "Machine Learning Engineer", that means that the recruiters are well aware of the difference, and they really need someone able to put some model in production.






              share|improve this answer









              $endgroup$












              • $begingroup$
                I've never thought of the nuclear scientists vs. engineer I think this is a thorough answer. It's appropriate to my experience, when i'm doing analysis it's like that white lab coat (jupyter and pretty graphs). When i'm "getting my hands dirty" with engineering production work (etl & webapp containers), i'm constantly finding weird edge cases, bugs, and bad code smell.
                $endgroup$
                – Tony
                Feb 20 '18 at 14:52










              • $begingroup$
                Isn't Yann LeCun a Computer Scientist? And a Data Scientist would be someone who uses pre-made computer algorithms and techniques (invented by Computer Scientists like Yann LeCun) to perform scientific analysis of data ? The same way that other scientists leverage computers in their work? So acquiring data, cleaning it, combining different analysis techniques (plotting, pattern matching, ML models, etc.) together in order to learn hidden truths within the data?
                $endgroup$
                – Didier A.
                Jan 26 at 7:16










              • $begingroup$
                YLC, is a Computer Scientist indeed, but he is specialized in Data. CS has become a too broad field, from which all those new definitions (like DS) camed out. And so using CS become not really discriminant. Like the appellative "Physicist" a couple of hundreds of years ago: today that word actually do not define someone's job, unless you specify it better (ex. Particle P., Solid State P., etc.). But anyway, a Scientist (CS, DS, any -S) is not someone who limit himself on use other's discoveries. Instead, his job is to understand, and by this mean, make discoveries.
                $endgroup$
                – Vincenzo Lavorini
                Jan 27 at 8:27










              • $begingroup$
                Could you kindly answer this question regarding Data Engineer career guidance.
                $endgroup$
                – stom
                Feb 1 at 6:44










              • $begingroup$
                How is science about "phenomenological understanding"?
                $endgroup$
                – ubadub
                Feb 25 at 20:47















              52












              $begingroup$

              Good question. Actually there is a lot of confusion on this subject, mainly because both are quite new jobs. But if we focus on the semantics, the real meaning of the jobs become clear.



              Beforehand is better to compare apples with apples, talking about a single subject, the Data. Machine Learning and it's sons (Deep Learning, etc.) is just one sub-subject of the Data World, together with the statistic theories, the data acquisition (DAQ), the processing (which can be non-machine learning driven), the interpretation of the results, etc.



              So, for my explanation, I will broad the Machine Learning Engineer role to the one of Data Engineer.



              Science is about experiment, trials and fails, theory building, phenomenological understanding.
              Engineering is about work on what science already knows, perfecting it and carry to the "real world".



              Think about a proxy: what is the difference between a nuclear scientist and a nuclear engineer?



              The nuclear scientist is the one which know the science behind the atom, the interaction between them, the one which wrote the recipe which allow to get energy from the atoms.



              The nuclear engineer is the guy charged to take the recipe of the scientist, and carry it to the real world. So it's knowledge about the atomic physics is quite limited, but he also know about materials, buildings, economics, and whatever else useful to build a proper nuclear plant.



              Coming back to the Data world, here another example: the guys which developed Convolutional Neural Networks (Yann LeCun) is a Data Scientist, the guy which deploy the model to recognize faces in pictures is a Machine Learning Engineer. The guy responsible of the whole process, from the data acquisition to the registration of the .JPG image, is a Data Engineer.



              So, basically, 90% of the Data Scientist today are actually Data Engineers or Machine Learning Engineers, and 90% of the positions opened as Data Scientist actually need Engineers. An easy check: in the interview, you will be asked about how many ML models you deployed in production, not on how many papers on new methods you published.



              Instead, when you see announces about "Machine Learning Engineer", that means that the recruiters are well aware of the difference, and they really need someone able to put some model in production.






              share|improve this answer









              $endgroup$












              • $begingroup$
                I've never thought of the nuclear scientists vs. engineer I think this is a thorough answer. It's appropriate to my experience, when i'm doing analysis it's like that white lab coat (jupyter and pretty graphs). When i'm "getting my hands dirty" with engineering production work (etl & webapp containers), i'm constantly finding weird edge cases, bugs, and bad code smell.
                $endgroup$
                – Tony
                Feb 20 '18 at 14:52










              • $begingroup$
                Isn't Yann LeCun a Computer Scientist? And a Data Scientist would be someone who uses pre-made computer algorithms and techniques (invented by Computer Scientists like Yann LeCun) to perform scientific analysis of data ? The same way that other scientists leverage computers in their work? So acquiring data, cleaning it, combining different analysis techniques (plotting, pattern matching, ML models, etc.) together in order to learn hidden truths within the data?
                $endgroup$
                – Didier A.
                Jan 26 at 7:16










              • $begingroup$
                YLC, is a Computer Scientist indeed, but he is specialized in Data. CS has become a too broad field, from which all those new definitions (like DS) camed out. And so using CS become not really discriminant. Like the appellative "Physicist" a couple of hundreds of years ago: today that word actually do not define someone's job, unless you specify it better (ex. Particle P., Solid State P., etc.). But anyway, a Scientist (CS, DS, any -S) is not someone who limit himself on use other's discoveries. Instead, his job is to understand, and by this mean, make discoveries.
                $endgroup$
                – Vincenzo Lavorini
                Jan 27 at 8:27










              • $begingroup$
                Could you kindly answer this question regarding Data Engineer career guidance.
                $endgroup$
                – stom
                Feb 1 at 6:44










              • $begingroup$
                How is science about "phenomenological understanding"?
                $endgroup$
                – ubadub
                Feb 25 at 20:47













              52












              52








              52





              $begingroup$

              Good question. Actually there is a lot of confusion on this subject, mainly because both are quite new jobs. But if we focus on the semantics, the real meaning of the jobs become clear.



              Beforehand is better to compare apples with apples, talking about a single subject, the Data. Machine Learning and it's sons (Deep Learning, etc.) is just one sub-subject of the Data World, together with the statistic theories, the data acquisition (DAQ), the processing (which can be non-machine learning driven), the interpretation of the results, etc.



              So, for my explanation, I will broad the Machine Learning Engineer role to the one of Data Engineer.



              Science is about experiment, trials and fails, theory building, phenomenological understanding.
              Engineering is about work on what science already knows, perfecting it and carry to the "real world".



              Think about a proxy: what is the difference between a nuclear scientist and a nuclear engineer?



              The nuclear scientist is the one which know the science behind the atom, the interaction between them, the one which wrote the recipe which allow to get energy from the atoms.



              The nuclear engineer is the guy charged to take the recipe of the scientist, and carry it to the real world. So it's knowledge about the atomic physics is quite limited, but he also know about materials, buildings, economics, and whatever else useful to build a proper nuclear plant.



              Coming back to the Data world, here another example: the guys which developed Convolutional Neural Networks (Yann LeCun) is a Data Scientist, the guy which deploy the model to recognize faces in pictures is a Machine Learning Engineer. The guy responsible of the whole process, from the data acquisition to the registration of the .JPG image, is a Data Engineer.



              So, basically, 90% of the Data Scientist today are actually Data Engineers or Machine Learning Engineers, and 90% of the positions opened as Data Scientist actually need Engineers. An easy check: in the interview, you will be asked about how many ML models you deployed in production, not on how many papers on new methods you published.



              Instead, when you see announces about "Machine Learning Engineer", that means that the recruiters are well aware of the difference, and they really need someone able to put some model in production.






              share|improve this answer









              $endgroup$



              Good question. Actually there is a lot of confusion on this subject, mainly because both are quite new jobs. But if we focus on the semantics, the real meaning of the jobs become clear.



              Beforehand is better to compare apples with apples, talking about a single subject, the Data. Machine Learning and it's sons (Deep Learning, etc.) is just one sub-subject of the Data World, together with the statistic theories, the data acquisition (DAQ), the processing (which can be non-machine learning driven), the interpretation of the results, etc.



              So, for my explanation, I will broad the Machine Learning Engineer role to the one of Data Engineer.



              Science is about experiment, trials and fails, theory building, phenomenological understanding.
              Engineering is about work on what science already knows, perfecting it and carry to the "real world".



              Think about a proxy: what is the difference between a nuclear scientist and a nuclear engineer?



              The nuclear scientist is the one which know the science behind the atom, the interaction between them, the one which wrote the recipe which allow to get energy from the atoms.



              The nuclear engineer is the guy charged to take the recipe of the scientist, and carry it to the real world. So it's knowledge about the atomic physics is quite limited, but he also know about materials, buildings, economics, and whatever else useful to build a proper nuclear plant.



              Coming back to the Data world, here another example: the guys which developed Convolutional Neural Networks (Yann LeCun) is a Data Scientist, the guy which deploy the model to recognize faces in pictures is a Machine Learning Engineer. The guy responsible of the whole process, from the data acquisition to the registration of the .JPG image, is a Data Engineer.



              So, basically, 90% of the Data Scientist today are actually Data Engineers or Machine Learning Engineers, and 90% of the positions opened as Data Scientist actually need Engineers. An easy check: in the interview, you will be asked about how many ML models you deployed in production, not on how many papers on new methods you published.



              Instead, when you see announces about "Machine Learning Engineer", that means that the recruiters are well aware of the difference, and they really need someone able to put some model in production.







              share|improve this answer












              share|improve this answer



              share|improve this answer










              answered Feb 20 '18 at 8:57









              Vincenzo LavoriniVincenzo Lavorini

              1,314416




              1,314416











              • $begingroup$
                I've never thought of the nuclear scientists vs. engineer I think this is a thorough answer. It's appropriate to my experience, when i'm doing analysis it's like that white lab coat (jupyter and pretty graphs). When i'm "getting my hands dirty" with engineering production work (etl & webapp containers), i'm constantly finding weird edge cases, bugs, and bad code smell.
                $endgroup$
                – Tony
                Feb 20 '18 at 14:52










              • $begingroup$
                Isn't Yann LeCun a Computer Scientist? And a Data Scientist would be someone who uses pre-made computer algorithms and techniques (invented by Computer Scientists like Yann LeCun) to perform scientific analysis of data ? The same way that other scientists leverage computers in their work? So acquiring data, cleaning it, combining different analysis techniques (plotting, pattern matching, ML models, etc.) together in order to learn hidden truths within the data?
                $endgroup$
                – Didier A.
                Jan 26 at 7:16










              • $begingroup$
                YLC, is a Computer Scientist indeed, but he is specialized in Data. CS has become a too broad field, from which all those new definitions (like DS) camed out. And so using CS become not really discriminant. Like the appellative "Physicist" a couple of hundreds of years ago: today that word actually do not define someone's job, unless you specify it better (ex. Particle P., Solid State P., etc.). But anyway, a Scientist (CS, DS, any -S) is not someone who limit himself on use other's discoveries. Instead, his job is to understand, and by this mean, make discoveries.
                $endgroup$
                – Vincenzo Lavorini
                Jan 27 at 8:27










              • $begingroup$
                Could you kindly answer this question regarding Data Engineer career guidance.
                $endgroup$
                – stom
                Feb 1 at 6:44










              • $begingroup$
                How is science about "phenomenological understanding"?
                $endgroup$
                – ubadub
                Feb 25 at 20:47
















              • $begingroup$
                I've never thought of the nuclear scientists vs. engineer I think this is a thorough answer. It's appropriate to my experience, when i'm doing analysis it's like that white lab coat (jupyter and pretty graphs). When i'm "getting my hands dirty" with engineering production work (etl & webapp containers), i'm constantly finding weird edge cases, bugs, and bad code smell.
                $endgroup$
                – Tony
                Feb 20 '18 at 14:52










              • $begingroup$
                Isn't Yann LeCun a Computer Scientist? And a Data Scientist would be someone who uses pre-made computer algorithms and techniques (invented by Computer Scientists like Yann LeCun) to perform scientific analysis of data ? The same way that other scientists leverage computers in their work? So acquiring data, cleaning it, combining different analysis techniques (plotting, pattern matching, ML models, etc.) together in order to learn hidden truths within the data?
                $endgroup$
                – Didier A.
                Jan 26 at 7:16










              • $begingroup$
                YLC, is a Computer Scientist indeed, but he is specialized in Data. CS has become a too broad field, from which all those new definitions (like DS) camed out. And so using CS become not really discriminant. Like the appellative "Physicist" a couple of hundreds of years ago: today that word actually do not define someone's job, unless you specify it better (ex. Particle P., Solid State P., etc.). But anyway, a Scientist (CS, DS, any -S) is not someone who limit himself on use other's discoveries. Instead, his job is to understand, and by this mean, make discoveries.
                $endgroup$
                – Vincenzo Lavorini
                Jan 27 at 8:27










              • $begingroup$
                Could you kindly answer this question regarding Data Engineer career guidance.
                $endgroup$
                – stom
                Feb 1 at 6:44










              • $begingroup$
                How is science about "phenomenological understanding"?
                $endgroup$
                – ubadub
                Feb 25 at 20:47















              $begingroup$
              I've never thought of the nuclear scientists vs. engineer I think this is a thorough answer. It's appropriate to my experience, when i'm doing analysis it's like that white lab coat (jupyter and pretty graphs). When i'm "getting my hands dirty" with engineering production work (etl & webapp containers), i'm constantly finding weird edge cases, bugs, and bad code smell.
              $endgroup$
              – Tony
              Feb 20 '18 at 14:52




              $begingroup$
              I've never thought of the nuclear scientists vs. engineer I think this is a thorough answer. It's appropriate to my experience, when i'm doing analysis it's like that white lab coat (jupyter and pretty graphs). When i'm "getting my hands dirty" with engineering production work (etl & webapp containers), i'm constantly finding weird edge cases, bugs, and bad code smell.
              $endgroup$
              – Tony
              Feb 20 '18 at 14:52












              $begingroup$
              Isn't Yann LeCun a Computer Scientist? And a Data Scientist would be someone who uses pre-made computer algorithms and techniques (invented by Computer Scientists like Yann LeCun) to perform scientific analysis of data ? The same way that other scientists leverage computers in their work? So acquiring data, cleaning it, combining different analysis techniques (plotting, pattern matching, ML models, etc.) together in order to learn hidden truths within the data?
              $endgroup$
              – Didier A.
              Jan 26 at 7:16




              $begingroup$
              Isn't Yann LeCun a Computer Scientist? And a Data Scientist would be someone who uses pre-made computer algorithms and techniques (invented by Computer Scientists like Yann LeCun) to perform scientific analysis of data ? The same way that other scientists leverage computers in their work? So acquiring data, cleaning it, combining different analysis techniques (plotting, pattern matching, ML models, etc.) together in order to learn hidden truths within the data?
              $endgroup$
              – Didier A.
              Jan 26 at 7:16












              $begingroup$
              YLC, is a Computer Scientist indeed, but he is specialized in Data. CS has become a too broad field, from which all those new definitions (like DS) camed out. And so using CS become not really discriminant. Like the appellative "Physicist" a couple of hundreds of years ago: today that word actually do not define someone's job, unless you specify it better (ex. Particle P., Solid State P., etc.). But anyway, a Scientist (CS, DS, any -S) is not someone who limit himself on use other's discoveries. Instead, his job is to understand, and by this mean, make discoveries.
              $endgroup$
              – Vincenzo Lavorini
              Jan 27 at 8:27




              $begingroup$
              YLC, is a Computer Scientist indeed, but he is specialized in Data. CS has become a too broad field, from which all those new definitions (like DS) camed out. And so using CS become not really discriminant. Like the appellative "Physicist" a couple of hundreds of years ago: today that word actually do not define someone's job, unless you specify it better (ex. Particle P., Solid State P., etc.). But anyway, a Scientist (CS, DS, any -S) is not someone who limit himself on use other's discoveries. Instead, his job is to understand, and by this mean, make discoveries.
              $endgroup$
              – Vincenzo Lavorini
              Jan 27 at 8:27












              $begingroup$
              Could you kindly answer this question regarding Data Engineer career guidance.
              $endgroup$
              – stom
              Feb 1 at 6:44




              $begingroup$
              Could you kindly answer this question regarding Data Engineer career guidance.
              $endgroup$
              – stom
              Feb 1 at 6:44












              $begingroup$
              How is science about "phenomenological understanding"?
              $endgroup$
              – ubadub
              Feb 25 at 20:47




              $begingroup$
              How is science about "phenomenological understanding"?
              $endgroup$
              – ubadub
              Feb 25 at 20:47











              10












              $begingroup$

              The terms are nebulous because they are new



              Being in the middle of a job search in the 'data science' field, I think that there are two things going on here. First, the jobs are new, and there is no set definitions of various terms, so no commonly agreed upon matching of terms with job descriptions. Compare this to 'web developer' or 'back-end developer.' These are two similar jobs that have reasonably well agreed upon and distinct descriptions.



              Second, a lot of people doing the job posting and initial interviews don't know that well what they are hiring for. This is particularly true in the case of small to medium sized-companies that hire recruiters to find applicants for them. It is these intermediaries that are posting the job descriptions on CareerBuilder or whatever forum. This isn't to say that many of them don't know their stuff, many of them are quite knowledgeable about the companies they represent and the requirements of the workplace. But, without well defined terms to describe different specific jobs, nebulous job titles are often the result.



              There are three general divisions of the field



              In my experience, there are three general divisions of the 'job space' of data science.



              The first is the development of the mathematical and computational techniques that make data science possible. This covers things like statistical research into new machine learning methods, the implementation of these methods, and the building of computational infrastructure to employ these methods in the real world. This is the division farthest separated from the customer, and the smallest division. Much of this work is done by either academics or researchers at the big companies (Google, Facebook, etc). This is for things like developing Google's TensorFlow, IBM's SPSS neural nets, or whatever the next big graph database is going to be.



              The second division is using the underlying tools to create application specific packages to perform whatever data analysis needs to be done. People are hired to use Python or R or whatever to build analysis capability on some set of data. A lot of this work, in my experience, involves doing the 'data laundry,' turning raw data in whatever form into something usable. Another big chunk of this work is databasing; figuring out how to store the data in a way that it can be accessed in whatever timeline you need it in. This job isn't so much taking tools, but using existing database, statistics, and graphical analysis libraries to produce some results.



              The third division is producing analysis from the newly organized and accessible data. This is the most customer facing side, depending on your organization. You have to produce analysis that business leaders can use to make decisions. This would be the least technical of the three divisions; many jobs are hybrids between the second and third divisions at this point, since data science is in its infancy. But in the future, I strongly suspect that there will be a more clean division between these two jobs, with people win the second job needing a technical, computer science or statistics based education, and this third job needing only a general education.



              In general, all three could describe themselves as 'data scientist', but only the first two could reasonably describe themselves as 'machine learning engineer.'



              Conclusion



              For the time being, you will have to find out yourself what each job entails. My current job hired me on as an 'analyst,' to do some machine learning stuff. But as we got to work, it became apparent that the company's databasing was inadequate, and now probably 90% of my time is spent working on the databases. My machine learning exposure is now just quickly running stuff through whatever scikit-learn package seems most appropriate, and shooting csv files to the third division analysts to make powerpoint presentations for the customer.



              The field is in flux. A lot of organizations are trying to add data science decision making to their processes, but without knowing clearly what that means. Its not their fault, its pretty hard to predict the future, and the ramifications of a new technology are never very clear. Until the field is more established, many jobs themselves will be as nebulous as the terms used to describe them.






              share|improve this answer











              $endgroup$

















                10












                $begingroup$

                The terms are nebulous because they are new



                Being in the middle of a job search in the 'data science' field, I think that there are two things going on here. First, the jobs are new, and there is no set definitions of various terms, so no commonly agreed upon matching of terms with job descriptions. Compare this to 'web developer' or 'back-end developer.' These are two similar jobs that have reasonably well agreed upon and distinct descriptions.



                Second, a lot of people doing the job posting and initial interviews don't know that well what they are hiring for. This is particularly true in the case of small to medium sized-companies that hire recruiters to find applicants for them. It is these intermediaries that are posting the job descriptions on CareerBuilder or whatever forum. This isn't to say that many of them don't know their stuff, many of them are quite knowledgeable about the companies they represent and the requirements of the workplace. But, without well defined terms to describe different specific jobs, nebulous job titles are often the result.



                There are three general divisions of the field



                In my experience, there are three general divisions of the 'job space' of data science.



                The first is the development of the mathematical and computational techniques that make data science possible. This covers things like statistical research into new machine learning methods, the implementation of these methods, and the building of computational infrastructure to employ these methods in the real world. This is the division farthest separated from the customer, and the smallest division. Much of this work is done by either academics or researchers at the big companies (Google, Facebook, etc). This is for things like developing Google's TensorFlow, IBM's SPSS neural nets, or whatever the next big graph database is going to be.



                The second division is using the underlying tools to create application specific packages to perform whatever data analysis needs to be done. People are hired to use Python or R or whatever to build analysis capability on some set of data. A lot of this work, in my experience, involves doing the 'data laundry,' turning raw data in whatever form into something usable. Another big chunk of this work is databasing; figuring out how to store the data in a way that it can be accessed in whatever timeline you need it in. This job isn't so much taking tools, but using existing database, statistics, and graphical analysis libraries to produce some results.



                The third division is producing analysis from the newly organized and accessible data. This is the most customer facing side, depending on your organization. You have to produce analysis that business leaders can use to make decisions. This would be the least technical of the three divisions; many jobs are hybrids between the second and third divisions at this point, since data science is in its infancy. But in the future, I strongly suspect that there will be a more clean division between these two jobs, with people win the second job needing a technical, computer science or statistics based education, and this third job needing only a general education.



                In general, all three could describe themselves as 'data scientist', but only the first two could reasonably describe themselves as 'machine learning engineer.'



                Conclusion



                For the time being, you will have to find out yourself what each job entails. My current job hired me on as an 'analyst,' to do some machine learning stuff. But as we got to work, it became apparent that the company's databasing was inadequate, and now probably 90% of my time is spent working on the databases. My machine learning exposure is now just quickly running stuff through whatever scikit-learn package seems most appropriate, and shooting csv files to the third division analysts to make powerpoint presentations for the customer.



                The field is in flux. A lot of organizations are trying to add data science decision making to their processes, but without knowing clearly what that means. Its not their fault, its pretty hard to predict the future, and the ramifications of a new technology are never very clear. Until the field is more established, many jobs themselves will be as nebulous as the terms used to describe them.






                share|improve this answer











                $endgroup$















                  10












                  10








                  10





                  $begingroup$

                  The terms are nebulous because they are new



                  Being in the middle of a job search in the 'data science' field, I think that there are two things going on here. First, the jobs are new, and there is no set definitions of various terms, so no commonly agreed upon matching of terms with job descriptions. Compare this to 'web developer' or 'back-end developer.' These are two similar jobs that have reasonably well agreed upon and distinct descriptions.



                  Second, a lot of people doing the job posting and initial interviews don't know that well what they are hiring for. This is particularly true in the case of small to medium sized-companies that hire recruiters to find applicants for them. It is these intermediaries that are posting the job descriptions on CareerBuilder or whatever forum. This isn't to say that many of them don't know their stuff, many of them are quite knowledgeable about the companies they represent and the requirements of the workplace. But, without well defined terms to describe different specific jobs, nebulous job titles are often the result.



                  There are three general divisions of the field



                  In my experience, there are three general divisions of the 'job space' of data science.



                  The first is the development of the mathematical and computational techniques that make data science possible. This covers things like statistical research into new machine learning methods, the implementation of these methods, and the building of computational infrastructure to employ these methods in the real world. This is the division farthest separated from the customer, and the smallest division. Much of this work is done by either academics or researchers at the big companies (Google, Facebook, etc). This is for things like developing Google's TensorFlow, IBM's SPSS neural nets, or whatever the next big graph database is going to be.



                  The second division is using the underlying tools to create application specific packages to perform whatever data analysis needs to be done. People are hired to use Python or R or whatever to build analysis capability on some set of data. A lot of this work, in my experience, involves doing the 'data laundry,' turning raw data in whatever form into something usable. Another big chunk of this work is databasing; figuring out how to store the data in a way that it can be accessed in whatever timeline you need it in. This job isn't so much taking tools, but using existing database, statistics, and graphical analysis libraries to produce some results.



                  The third division is producing analysis from the newly organized and accessible data. This is the most customer facing side, depending on your organization. You have to produce analysis that business leaders can use to make decisions. This would be the least technical of the three divisions; many jobs are hybrids between the second and third divisions at this point, since data science is in its infancy. But in the future, I strongly suspect that there will be a more clean division between these two jobs, with people win the second job needing a technical, computer science or statistics based education, and this third job needing only a general education.



                  In general, all three could describe themselves as 'data scientist', but only the first two could reasonably describe themselves as 'machine learning engineer.'



                  Conclusion



                  For the time being, you will have to find out yourself what each job entails. My current job hired me on as an 'analyst,' to do some machine learning stuff. But as we got to work, it became apparent that the company's databasing was inadequate, and now probably 90% of my time is spent working on the databases. My machine learning exposure is now just quickly running stuff through whatever scikit-learn package seems most appropriate, and shooting csv files to the third division analysts to make powerpoint presentations for the customer.



                  The field is in flux. A lot of organizations are trying to add data science decision making to their processes, but without knowing clearly what that means. Its not their fault, its pretty hard to predict the future, and the ramifications of a new technology are never very clear. Until the field is more established, many jobs themselves will be as nebulous as the terms used to describe them.






                  share|improve this answer











                  $endgroup$



                  The terms are nebulous because they are new



                  Being in the middle of a job search in the 'data science' field, I think that there are two things going on here. First, the jobs are new, and there is no set definitions of various terms, so no commonly agreed upon matching of terms with job descriptions. Compare this to 'web developer' or 'back-end developer.' These are two similar jobs that have reasonably well agreed upon and distinct descriptions.



                  Second, a lot of people doing the job posting and initial interviews don't know that well what they are hiring for. This is particularly true in the case of small to medium sized-companies that hire recruiters to find applicants for them. It is these intermediaries that are posting the job descriptions on CareerBuilder or whatever forum. This isn't to say that many of them don't know their stuff, many of them are quite knowledgeable about the companies they represent and the requirements of the workplace. But, without well defined terms to describe different specific jobs, nebulous job titles are often the result.



                  There are three general divisions of the field



                  In my experience, there are three general divisions of the 'job space' of data science.



                  The first is the development of the mathematical and computational techniques that make data science possible. This covers things like statistical research into new machine learning methods, the implementation of these methods, and the building of computational infrastructure to employ these methods in the real world. This is the division farthest separated from the customer, and the smallest division. Much of this work is done by either academics or researchers at the big companies (Google, Facebook, etc). This is for things like developing Google's TensorFlow, IBM's SPSS neural nets, or whatever the next big graph database is going to be.



                  The second division is using the underlying tools to create application specific packages to perform whatever data analysis needs to be done. People are hired to use Python or R or whatever to build analysis capability on some set of data. A lot of this work, in my experience, involves doing the 'data laundry,' turning raw data in whatever form into something usable. Another big chunk of this work is databasing; figuring out how to store the data in a way that it can be accessed in whatever timeline you need it in. This job isn't so much taking tools, but using existing database, statistics, and graphical analysis libraries to produce some results.



                  The third division is producing analysis from the newly organized and accessible data. This is the most customer facing side, depending on your organization. You have to produce analysis that business leaders can use to make decisions. This would be the least technical of the three divisions; many jobs are hybrids between the second and third divisions at this point, since data science is in its infancy. But in the future, I strongly suspect that there will be a more clean division between these two jobs, with people win the second job needing a technical, computer science or statistics based education, and this third job needing only a general education.



                  In general, all three could describe themselves as 'data scientist', but only the first two could reasonably describe themselves as 'machine learning engineer.'



                  Conclusion



                  For the time being, you will have to find out yourself what each job entails. My current job hired me on as an 'analyst,' to do some machine learning stuff. But as we got to work, it became apparent that the company's databasing was inadequate, and now probably 90% of my time is spent working on the databases. My machine learning exposure is now just quickly running stuff through whatever scikit-learn package seems most appropriate, and shooting csv files to the third division analysts to make powerpoint presentations for the customer.



                  The field is in flux. A lot of organizations are trying to add data science decision making to their processes, but without knowing clearly what that means. Its not their fault, its pretty hard to predict the future, and the ramifications of a new technology are never very clear. Until the field is more established, many jobs themselves will be as nebulous as the terms used to describe them.







                  share|improve this answer














                  share|improve this answer



                  share|improve this answer








                  edited Feb 20 '18 at 15:19

























                  answered Feb 20 '18 at 15:14









                  kingledionkingledion

                  306110




                  306110





















                      9












                      $begingroup$

                      [Completely a personal opinion]



                      When the term 'Data Scientist' overtook 'Statistician', it is more towards sounding cool, rather than any major difference. Similarly, the term 'Deep Learning'. It is just neural networks (which is another Machine Learning algorithm) with a couple of more layers. No one can explain when a particular neural net can be called DL, rather than ML, cause the definition itself is fuzzy. So, is the term 'Data Scientist'.



                      However, as companies are adopting the DevOps mindset to data science, the term ML Engineer evolved.



                      What is the DevOps mindset to data science?



                      This is where you build the model, deploy it and also expected to maintain it in production. This helps in avoiding a lot of friction in software teams.



                      [PS: DevOps is a way of doing software, more like a philosophy. So, using it as a designation, again confuses me].



                      So, ML engineers are supposed to know the nuances of systems engineering, ML, and stats (obviously).



                      A vague generalization would be Data Engineer + Data Scientist = ML Engineer.



                      Having said that, the designations in this space are becoming vague day by day, and the term 'Statistician' is becoming more and more relevant (the irony!).






                      share|improve this answer











                      $endgroup$








                      • 2




                        $begingroup$
                        Machine Learning is much more than just neural nets (just as an example, consider all kinds of tree-based classifiers), so don't see how "Deep Learning is just Machine Learning with a couple of more layers".
                        $endgroup$
                        – Stephan Kolassa
                        Feb 20 '18 at 12:38










                      • $begingroup$
                        @StephanKolassa Yeah. Agree. Shouldn't have generalized it too much :) Thanks for pointing it out.
                        $endgroup$
                        – Dawny33
                        Feb 20 '18 at 13:49






                      • 1




                        $begingroup$
                        (+1) but I don't think "statistician" becoming more relevant is an irony, just... an expected transition? Where are the "operational researchers" these days? ;)
                        $endgroup$
                        – usεr11852
                        Feb 20 '18 at 22:28
















                      9












                      $begingroup$

                      [Completely a personal opinion]



                      When the term 'Data Scientist' overtook 'Statistician', it is more towards sounding cool, rather than any major difference. Similarly, the term 'Deep Learning'. It is just neural networks (which is another Machine Learning algorithm) with a couple of more layers. No one can explain when a particular neural net can be called DL, rather than ML, cause the definition itself is fuzzy. So, is the term 'Data Scientist'.



                      However, as companies are adopting the DevOps mindset to data science, the term ML Engineer evolved.



                      What is the DevOps mindset to data science?



                      This is where you build the model, deploy it and also expected to maintain it in production. This helps in avoiding a lot of friction in software teams.



                      [PS: DevOps is a way of doing software, more like a philosophy. So, using it as a designation, again confuses me].



                      So, ML engineers are supposed to know the nuances of systems engineering, ML, and stats (obviously).



                      A vague generalization would be Data Engineer + Data Scientist = ML Engineer.



                      Having said that, the designations in this space are becoming vague day by day, and the term 'Statistician' is becoming more and more relevant (the irony!).






                      share|improve this answer











                      $endgroup$








                      • 2




                        $begingroup$
                        Machine Learning is much more than just neural nets (just as an example, consider all kinds of tree-based classifiers), so don't see how "Deep Learning is just Machine Learning with a couple of more layers".
                        $endgroup$
                        – Stephan Kolassa
                        Feb 20 '18 at 12:38










                      • $begingroup$
                        @StephanKolassa Yeah. Agree. Shouldn't have generalized it too much :) Thanks for pointing it out.
                        $endgroup$
                        – Dawny33
                        Feb 20 '18 at 13:49






                      • 1




                        $begingroup$
                        (+1) but I don't think "statistician" becoming more relevant is an irony, just... an expected transition? Where are the "operational researchers" these days? ;)
                        $endgroup$
                        – usεr11852
                        Feb 20 '18 at 22:28














                      9












                      9








                      9





                      $begingroup$

                      [Completely a personal opinion]



                      When the term 'Data Scientist' overtook 'Statistician', it is more towards sounding cool, rather than any major difference. Similarly, the term 'Deep Learning'. It is just neural networks (which is another Machine Learning algorithm) with a couple of more layers. No one can explain when a particular neural net can be called DL, rather than ML, cause the definition itself is fuzzy. So, is the term 'Data Scientist'.



                      However, as companies are adopting the DevOps mindset to data science, the term ML Engineer evolved.



                      What is the DevOps mindset to data science?



                      This is where you build the model, deploy it and also expected to maintain it in production. This helps in avoiding a lot of friction in software teams.



                      [PS: DevOps is a way of doing software, more like a philosophy. So, using it as a designation, again confuses me].



                      So, ML engineers are supposed to know the nuances of systems engineering, ML, and stats (obviously).



                      A vague generalization would be Data Engineer + Data Scientist = ML Engineer.



                      Having said that, the designations in this space are becoming vague day by day, and the term 'Statistician' is becoming more and more relevant (the irony!).






                      share|improve this answer











                      $endgroup$



                      [Completely a personal opinion]



                      When the term 'Data Scientist' overtook 'Statistician', it is more towards sounding cool, rather than any major difference. Similarly, the term 'Deep Learning'. It is just neural networks (which is another Machine Learning algorithm) with a couple of more layers. No one can explain when a particular neural net can be called DL, rather than ML, cause the definition itself is fuzzy. So, is the term 'Data Scientist'.



                      However, as companies are adopting the DevOps mindset to data science, the term ML Engineer evolved.



                      What is the DevOps mindset to data science?



                      This is where you build the model, deploy it and also expected to maintain it in production. This helps in avoiding a lot of friction in software teams.



                      [PS: DevOps is a way of doing software, more like a philosophy. So, using it as a designation, again confuses me].



                      So, ML engineers are supposed to know the nuances of systems engineering, ML, and stats (obviously).



                      A vague generalization would be Data Engineer + Data Scientist = ML Engineer.



                      Having said that, the designations in this space are becoming vague day by day, and the term 'Statistician' is becoming more and more relevant (the irony!).







                      share|improve this answer














                      share|improve this answer



                      share|improve this answer








                      edited Feb 20 '18 at 13:50

























                      answered Feb 20 '18 at 6:33









                      Dawny33Dawny33

                      5,50183188




                      5,50183188







                      • 2




                        $begingroup$
                        Machine Learning is much more than just neural nets (just as an example, consider all kinds of tree-based classifiers), so don't see how "Deep Learning is just Machine Learning with a couple of more layers".
                        $endgroup$
                        – Stephan Kolassa
                        Feb 20 '18 at 12:38










                      • $begingroup$
                        @StephanKolassa Yeah. Agree. Shouldn't have generalized it too much :) Thanks for pointing it out.
                        $endgroup$
                        – Dawny33
                        Feb 20 '18 at 13:49






                      • 1




                        $begingroup$
                        (+1) but I don't think "statistician" becoming more relevant is an irony, just... an expected transition? Where are the "operational researchers" these days? ;)
                        $endgroup$
                        – usεr11852
                        Feb 20 '18 at 22:28













                      • 2




                        $begingroup$
                        Machine Learning is much more than just neural nets (just as an example, consider all kinds of tree-based classifiers), so don't see how "Deep Learning is just Machine Learning with a couple of more layers".
                        $endgroup$
                        – Stephan Kolassa
                        Feb 20 '18 at 12:38










                      • $begingroup$
                        @StephanKolassa Yeah. Agree. Shouldn't have generalized it too much :) Thanks for pointing it out.
                        $endgroup$
                        – Dawny33
                        Feb 20 '18 at 13:49






                      • 1




                        $begingroup$
                        (+1) but I don't think "statistician" becoming more relevant is an irony, just... an expected transition? Where are the "operational researchers" these days? ;)
                        $endgroup$
                        – usεr11852
                        Feb 20 '18 at 22:28








                      2




                      2




                      $begingroup$
                      Machine Learning is much more than just neural nets (just as an example, consider all kinds of tree-based classifiers), so don't see how "Deep Learning is just Machine Learning with a couple of more layers".
                      $endgroup$
                      – Stephan Kolassa
                      Feb 20 '18 at 12:38




                      $begingroup$
                      Machine Learning is much more than just neural nets (just as an example, consider all kinds of tree-based classifiers), so don't see how "Deep Learning is just Machine Learning with a couple of more layers".
                      $endgroup$
                      – Stephan Kolassa
                      Feb 20 '18 at 12:38












                      $begingroup$
                      @StephanKolassa Yeah. Agree. Shouldn't have generalized it too much :) Thanks for pointing it out.
                      $endgroup$
                      – Dawny33
                      Feb 20 '18 at 13:49




                      $begingroup$
                      @StephanKolassa Yeah. Agree. Shouldn't have generalized it too much :) Thanks for pointing it out.
                      $endgroup$
                      – Dawny33
                      Feb 20 '18 at 13:49




                      1




                      1




                      $begingroup$
                      (+1) but I don't think "statistician" becoming more relevant is an irony, just... an expected transition? Where are the "operational researchers" these days? ;)
                      $endgroup$
                      – usεr11852
                      Feb 20 '18 at 22:28





                      $begingroup$
                      (+1) but I don't think "statistician" becoming more relevant is an irony, just... an expected transition? Where are the "operational researchers" these days? ;)
                      $endgroup$
                      – usεr11852
                      Feb 20 '18 at 22:28












                      7












                      $begingroup$

                      It may vary from company to company, but Data Scientist as a designation has been around for some time now and is usually meant for extracting knowledge and insights from data.



                      I have seen data scientists doing



                      • Writing Image processing and image recognition algorithms,

                      • Design and implement decision trees for a business use case,

                      • Or simply design and implement some reports or write ETLs for data transformations.

                      Data science, however, is a super-domain of machine learning




                      It employs techniques and theories drawn from many fields within the
                      broad areas of mathematics, statistics, information science, and
                      computer science, in particular from the subdomains of machine
                      learning, classification, cluster analysis, uncertainty
                      quantification, computational science, data mining, databases, and
                      visualization
                      .




                      Machine learning engineer seems to be a designation where your employer has already narrowed down to the



                      • Approach,

                      • Tools,

                      • and a rough model (of what to deliver)

                      to extract knowledge or insights from data using machine learning and your work will be to design and implement machine learning algorithms to deliver the same.






                      share|improve this answer









                      $endgroup$

















                        7












                        $begingroup$

                        It may vary from company to company, but Data Scientist as a designation has been around for some time now and is usually meant for extracting knowledge and insights from data.



                        I have seen data scientists doing



                        • Writing Image processing and image recognition algorithms,

                        • Design and implement decision trees for a business use case,

                        • Or simply design and implement some reports or write ETLs for data transformations.

                        Data science, however, is a super-domain of machine learning




                        It employs techniques and theories drawn from many fields within the
                        broad areas of mathematics, statistics, information science, and
                        computer science, in particular from the subdomains of machine
                        learning, classification, cluster analysis, uncertainty
                        quantification, computational science, data mining, databases, and
                        visualization
                        .




                        Machine learning engineer seems to be a designation where your employer has already narrowed down to the



                        • Approach,

                        • Tools,

                        • and a rough model (of what to deliver)

                        to extract knowledge or insights from data using machine learning and your work will be to design and implement machine learning algorithms to deliver the same.






                        share|improve this answer









                        $endgroup$















                          7












                          7








                          7





                          $begingroup$

                          It may vary from company to company, but Data Scientist as a designation has been around for some time now and is usually meant for extracting knowledge and insights from data.



                          I have seen data scientists doing



                          • Writing Image processing and image recognition algorithms,

                          • Design and implement decision trees for a business use case,

                          • Or simply design and implement some reports or write ETLs for data transformations.

                          Data science, however, is a super-domain of machine learning




                          It employs techniques and theories drawn from many fields within the
                          broad areas of mathematics, statistics, information science, and
                          computer science, in particular from the subdomains of machine
                          learning, classification, cluster analysis, uncertainty
                          quantification, computational science, data mining, databases, and
                          visualization
                          .




                          Machine learning engineer seems to be a designation where your employer has already narrowed down to the



                          • Approach,

                          • Tools,

                          • and a rough model (of what to deliver)

                          to extract knowledge or insights from data using machine learning and your work will be to design and implement machine learning algorithms to deliver the same.






                          share|improve this answer









                          $endgroup$



                          It may vary from company to company, but Data Scientist as a designation has been around for some time now and is usually meant for extracting knowledge and insights from data.



                          I have seen data scientists doing



                          • Writing Image processing and image recognition algorithms,

                          • Design and implement decision trees for a business use case,

                          • Or simply design and implement some reports or write ETLs for data transformations.

                          Data science, however, is a super-domain of machine learning




                          It employs techniques and theories drawn from many fields within the
                          broad areas of mathematics, statistics, information science, and
                          computer science, in particular from the subdomains of machine
                          learning, classification, cluster analysis, uncertainty
                          quantification, computational science, data mining, databases, and
                          visualization
                          .




                          Machine learning engineer seems to be a designation where your employer has already narrowed down to the



                          • Approach,

                          • Tools,

                          • and a rough model (of what to deliver)

                          to extract knowledge or insights from data using machine learning and your work will be to design and implement machine learning algorithms to deliver the same.







                          share|improve this answer












                          share|improve this answer



                          share|improve this answer










                          answered Feb 20 '18 at 13:36









                          gurvinder372gurvinder372

                          17613




                          17613





















                              4












                              $begingroup$

                              Machine Learning Engineers and engineering focused Data Scientist are the same, but not all Data Scientist are engineering focused. About 5 years ago almost all Data Scientist were engineering focused, e.g, they had to write production code. Now, however, there are many Data Scientist roles that are for most part: playing in Jupyter notebook, understanding data, making pretty graphs, explaining to clients, managers, analysts... They don't do any engineering. And I believe that term Machine Learning Engineers came up to underline that this an engineering position.






                              share|improve this answer









                              $endgroup$

















                                4












                                $begingroup$

                                Machine Learning Engineers and engineering focused Data Scientist are the same, but not all Data Scientist are engineering focused. About 5 years ago almost all Data Scientist were engineering focused, e.g, they had to write production code. Now, however, there are many Data Scientist roles that are for most part: playing in Jupyter notebook, understanding data, making pretty graphs, explaining to clients, managers, analysts... They don't do any engineering. And I believe that term Machine Learning Engineers came up to underline that this an engineering position.






                                share|improve this answer









                                $endgroup$















                                  4












                                  4








                                  4





                                  $begingroup$

                                  Machine Learning Engineers and engineering focused Data Scientist are the same, but not all Data Scientist are engineering focused. About 5 years ago almost all Data Scientist were engineering focused, e.g, they had to write production code. Now, however, there are many Data Scientist roles that are for most part: playing in Jupyter notebook, understanding data, making pretty graphs, explaining to clients, managers, analysts... They don't do any engineering. And I believe that term Machine Learning Engineers came up to underline that this an engineering position.






                                  share|improve this answer









                                  $endgroup$



                                  Machine Learning Engineers and engineering focused Data Scientist are the same, but not all Data Scientist are engineering focused. About 5 years ago almost all Data Scientist were engineering focused, e.g, they had to write production code. Now, however, there are many Data Scientist roles that are for most part: playing in Jupyter notebook, understanding data, making pretty graphs, explaining to clients, managers, analysts... They don't do any engineering. And I believe that term Machine Learning Engineers came up to underline that this an engineering position.







                                  share|improve this answer












                                  share|improve this answer



                                  share|improve this answer










                                  answered Feb 21 '18 at 3:52









                                  AkavallAkavall

                                  29518




                                  29518





















                                      2












                                      $begingroup$

                                      TL;DR: It depends on who is asking.



                                      The answer to this question depends largely on the expectations, knowledge, and experience of whomever is asking. An analogous question with just as fuzzy of an answer is:




                                      What is the difference between a software developer, a software
                                      engineer, and a computer scientist?




                                      To some people, particularly people who study or teach computer science and software engineering, there is a large and defined difference between these fields. But to the average HR worker, technical recruiter, or manager, these are all just "Computer People".



                                      I love this quote by Vincent Granville, emphasis mine:




                                      Earlier in my career (circa 1990) I worked on image remote sensing
                                      technology, among other things to identify patterns (or shapes or
                                      features, for instance lakes) in satellite images and to perform image
                                      segmentation: at that time my research was labeled as computational
                                      statistics, but the people doing the exact same thing in the computer
                                      science department next door in my home university, called their
                                      research artificial intelligence. Today, it would be called data
                                      science
                                      or artificial intelligence, the sub-domains being signal
                                      processing, computer vision or IoT.







                                      share|improve this answer











                                      $endgroup$

















                                        2












                                        $begingroup$

                                        TL;DR: It depends on who is asking.



                                        The answer to this question depends largely on the expectations, knowledge, and experience of whomever is asking. An analogous question with just as fuzzy of an answer is:




                                        What is the difference between a software developer, a software
                                        engineer, and a computer scientist?




                                        To some people, particularly people who study or teach computer science and software engineering, there is a large and defined difference between these fields. But to the average HR worker, technical recruiter, or manager, these are all just "Computer People".



                                        I love this quote by Vincent Granville, emphasis mine:




                                        Earlier in my career (circa 1990) I worked on image remote sensing
                                        technology, among other things to identify patterns (or shapes or
                                        features, for instance lakes) in satellite images and to perform image
                                        segmentation: at that time my research was labeled as computational
                                        statistics, but the people doing the exact same thing in the computer
                                        science department next door in my home university, called their
                                        research artificial intelligence. Today, it would be called data
                                        science
                                        or artificial intelligence, the sub-domains being signal
                                        processing, computer vision or IoT.







                                        share|improve this answer











                                        $endgroup$















                                          2












                                          2








                                          2





                                          $begingroup$

                                          TL;DR: It depends on who is asking.



                                          The answer to this question depends largely on the expectations, knowledge, and experience of whomever is asking. An analogous question with just as fuzzy of an answer is:




                                          What is the difference between a software developer, a software
                                          engineer, and a computer scientist?




                                          To some people, particularly people who study or teach computer science and software engineering, there is a large and defined difference between these fields. But to the average HR worker, technical recruiter, or manager, these are all just "Computer People".



                                          I love this quote by Vincent Granville, emphasis mine:




                                          Earlier in my career (circa 1990) I worked on image remote sensing
                                          technology, among other things to identify patterns (or shapes or
                                          features, for instance lakes) in satellite images and to perform image
                                          segmentation: at that time my research was labeled as computational
                                          statistics, but the people doing the exact same thing in the computer
                                          science department next door in my home university, called their
                                          research artificial intelligence. Today, it would be called data
                                          science
                                          or artificial intelligence, the sub-domains being signal
                                          processing, computer vision or IoT.







                                          share|improve this answer











                                          $endgroup$



                                          TL;DR: It depends on who is asking.



                                          The answer to this question depends largely on the expectations, knowledge, and experience of whomever is asking. An analogous question with just as fuzzy of an answer is:




                                          What is the difference between a software developer, a software
                                          engineer, and a computer scientist?




                                          To some people, particularly people who study or teach computer science and software engineering, there is a large and defined difference between these fields. But to the average HR worker, technical recruiter, or manager, these are all just "Computer People".



                                          I love this quote by Vincent Granville, emphasis mine:




                                          Earlier in my career (circa 1990) I worked on image remote sensing
                                          technology, among other things to identify patterns (or shapes or
                                          features, for instance lakes) in satellite images and to perform image
                                          segmentation: at that time my research was labeled as computational
                                          statistics, but the people doing the exact same thing in the computer
                                          science department next door in my home university, called their
                                          research artificial intelligence. Today, it would be called data
                                          science
                                          or artificial intelligence, the sub-domains being signal
                                          processing, computer vision or IoT.








                                          share|improve this answer














                                          share|improve this answer



                                          share|improve this answer








                                          edited Aug 17 '18 at 18:42

























                                          answered Feb 21 '18 at 1:29









                                          lfalinlfalin

                                          1213




                                          1213





















                                              0












                                              $begingroup$

                                              I don't disagree with any of the answers given. However, I do think that there is a role of Data Scientist that is being glossed over in virtually all of the answers here. Most of these answers say something to the effect of, "Well, an engineer just writes and deploys the model . . . ". Hold on a sec - there's A LOT of work in those two steps!



                                              My core definition of a Data Scientist is someone that applies the scientific method to working with data. So I'm constantly thinking of hypostheses, designing tests, collecting my data and executing those tests, checking my cross validation results, trying new approaches, transforming my data, etc, etc. That's essentially what goes into "just writes and deploys the model" in a professional setting.



                                              So, for your answer, I think "the devil is in the details" because you can't just gloss over some of these steps/terms. Also, if you are job hunting, you should be careful because "data engineer" and "data scientist" can have woefully different pay scales - you do not want to be a data scientist on a data engineer salary!



                                              I always put myself out there as a data scientist, I tell companies that I work on predictive models (not just analytical) and that I'm not an Excel jockey - I write in programming languages (R, Python, etc). If you can find a position that let's you do both of those, then you're on your way to being a data scientist.






                                              share|improve this answer









                                              $endgroup$

















                                                0












                                                $begingroup$

                                                I don't disagree with any of the answers given. However, I do think that there is a role of Data Scientist that is being glossed over in virtually all of the answers here. Most of these answers say something to the effect of, "Well, an engineer just writes and deploys the model . . . ". Hold on a sec - there's A LOT of work in those two steps!



                                                My core definition of a Data Scientist is someone that applies the scientific method to working with data. So I'm constantly thinking of hypostheses, designing tests, collecting my data and executing those tests, checking my cross validation results, trying new approaches, transforming my data, etc, etc. That's essentially what goes into "just writes and deploys the model" in a professional setting.



                                                So, for your answer, I think "the devil is in the details" because you can't just gloss over some of these steps/terms. Also, if you are job hunting, you should be careful because "data engineer" and "data scientist" can have woefully different pay scales - you do not want to be a data scientist on a data engineer salary!



                                                I always put myself out there as a data scientist, I tell companies that I work on predictive models (not just analytical) and that I'm not an Excel jockey - I write in programming languages (R, Python, etc). If you can find a position that let's you do both of those, then you're on your way to being a data scientist.






                                                share|improve this answer









                                                $endgroup$















                                                  0












                                                  0








                                                  0





                                                  $begingroup$

                                                  I don't disagree with any of the answers given. However, I do think that there is a role of Data Scientist that is being glossed over in virtually all of the answers here. Most of these answers say something to the effect of, "Well, an engineer just writes and deploys the model . . . ". Hold on a sec - there's A LOT of work in those two steps!



                                                  My core definition of a Data Scientist is someone that applies the scientific method to working with data. So I'm constantly thinking of hypostheses, designing tests, collecting my data and executing those tests, checking my cross validation results, trying new approaches, transforming my data, etc, etc. That's essentially what goes into "just writes and deploys the model" in a professional setting.



                                                  So, for your answer, I think "the devil is in the details" because you can't just gloss over some of these steps/terms. Also, if you are job hunting, you should be careful because "data engineer" and "data scientist" can have woefully different pay scales - you do not want to be a data scientist on a data engineer salary!



                                                  I always put myself out there as a data scientist, I tell companies that I work on predictive models (not just analytical) and that I'm not an Excel jockey - I write in programming languages (R, Python, etc). If you can find a position that let's you do both of those, then you're on your way to being a data scientist.






                                                  share|improve this answer









                                                  $endgroup$



                                                  I don't disagree with any of the answers given. However, I do think that there is a role of Data Scientist that is being glossed over in virtually all of the answers here. Most of these answers say something to the effect of, "Well, an engineer just writes and deploys the model . . . ". Hold on a sec - there's A LOT of work in those two steps!



                                                  My core definition of a Data Scientist is someone that applies the scientific method to working with data. So I'm constantly thinking of hypostheses, designing tests, collecting my data and executing those tests, checking my cross validation results, trying new approaches, transforming my data, etc, etc. That's essentially what goes into "just writes and deploys the model" in a professional setting.



                                                  So, for your answer, I think "the devil is in the details" because you can't just gloss over some of these steps/terms. Also, if you are job hunting, you should be careful because "data engineer" and "data scientist" can have woefully different pay scales - you do not want to be a data scientist on a data engineer salary!



                                                  I always put myself out there as a data scientist, I tell companies that I work on predictive models (not just analytical) and that I'm not an Excel jockey - I write in programming languages (R, Python, etc). If you can find a position that let's you do both of those, then you're on your way to being a data scientist.







                                                  share|improve this answer












                                                  share|improve this answer



                                                  share|improve this answer










                                                  answered Feb 20 '18 at 20:47









                                                  I_Play_With_DataI_Play_With_Data

                                                  1,234632




                                                  1,234632





















                                                      0












                                                      $begingroup$

                                                      I think Machine learning engineer and Data Scientist are very much different . Many people get confused because machine learning is included in Data Science. But it is not that similar as the knowledge of machine is put together in Data Science where as The knowledge of Data Scientist comprises of Machine Learning , python, R , Statistics and basic mathematical skills. A machine learning engineer have proper knowledge of Machine learning only but Data Scientist will have the proper knowledge of all the above mentioned topic.






                                                      share|improve this answer









                                                      $endgroup$

















                                                        0












                                                        $begingroup$

                                                        I think Machine learning engineer and Data Scientist are very much different . Many people get confused because machine learning is included in Data Science. But it is not that similar as the knowledge of machine is put together in Data Science where as The knowledge of Data Scientist comprises of Machine Learning , python, R , Statistics and basic mathematical skills. A machine learning engineer have proper knowledge of Machine learning only but Data Scientist will have the proper knowledge of all the above mentioned topic.






                                                        share|improve this answer









                                                        $endgroup$















                                                          0












                                                          0








                                                          0





                                                          $begingroup$

                                                          I think Machine learning engineer and Data Scientist are very much different . Many people get confused because machine learning is included in Data Science. But it is not that similar as the knowledge of machine is put together in Data Science where as The knowledge of Data Scientist comprises of Machine Learning , python, R , Statistics and basic mathematical skills. A machine learning engineer have proper knowledge of Machine learning only but Data Scientist will have the proper knowledge of all the above mentioned topic.






                                                          share|improve this answer









                                                          $endgroup$



                                                          I think Machine learning engineer and Data Scientist are very much different . Many people get confused because machine learning is included in Data Science. But it is not that similar as the knowledge of machine is put together in Data Science where as The knowledge of Data Scientist comprises of Machine Learning , python, R , Statistics and basic mathematical skills. A machine learning engineer have proper knowledge of Machine learning only but Data Scientist will have the proper knowledge of all the above mentioned topic.







                                                          share|improve this answer












                                                          share|improve this answer



                                                          share|improve this answer










                                                          answered Mar 2 at 8:08









                                                          Raj ShivakotiRaj Shivakoti

                                                          1




                                                          1



























                                                              draft saved

                                                              draft discarded
















































                                                              Thanks for contributing an answer to Data Science Stack Exchange!


                                                              • Please be sure to answer the question. Provide details and share your research!

                                                              But avoid


                                                              • Asking for help, clarification, or responding to other answers.

                                                              • Making statements based on opinion; back them up with references or personal experience.

                                                              Use MathJax to format equations. MathJax reference.


                                                              To learn more, see our tips on writing great answers.




                                                              draft saved


                                                              draft discarded














                                                              StackExchange.ready(
                                                              function ()
                                                              StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f28006%2fdata-scientist-vs-machine-learning-engineer%23new-answer', 'question_page');

                                                              );

                                                              Post as a guest















                                                              Required, but never shown





















































                                                              Required, but never shown














                                                              Required, but never shown












                                                              Required, but never shown







                                                              Required, but never shown

































                                                              Required, but never shown














                                                              Required, but never shown












                                                              Required, but never shown







                                                              Required, but never shown







                                                              Popular posts from this blog

                                                              Adding axes to figuresAdding axes labels to LaTeX figuresLaTeX equivalent of ConTeXt buffersRotate a node but not its content: the case of the ellipse decorationHow to define the default vertical distance between nodes?TikZ scaling graphic and adjust node position and keep font sizeNumerical conditional within tikz keys?adding axes to shapesAlign axes across subfiguresAdding figures with a certain orderLine up nested tikz enviroments or how to get rid of themAdding axes labels to LaTeX figures

                                                              Luettelo Yhdysvaltain laivaston lentotukialuksista Lähteet | Navigointivalikko

                                                              Gary (muusikko) Sisällysluettelo Historia | Rockin' High | Lähteet | Aiheesta muualla | NavigointivalikkoInfobox OKTuomas "Gary" Keskinen Ancaran kitaristiksiProjekti Rockin' High