Need help with confusing dataset formats for Images and annotations The Next CEO of Stack Overflow2019 Community Moderator ElectionBest format for recording time stamp and GPSHow to prepare/augment images for neural network?Tools to perform SQL analytics on 350TB of csv dataAnimal Detection datasetFormat for time series data with non-trivial sampled dataHybrid Convolutional and Conventional Neural NetworksHow can I combine images for Matlab deep learning?Neural Network Architecture for Identifying Image CopiesHow to Build Mobile Application for Image Recognition?

Why do remote companies require working in the US?

How to make a variable always equal to the result of some calculations?

How do I go from 300 unfinished/half written blog posts, to published posts?

How do we know the LHC results are robust?

What does "Its cash flow is deeply negative" mean?

Why doesn't a table tennis ball float on a surface of steel balls? How do we calculate buoyancy here?

What happens if you roll doubles 3 times then land on "Go to jail?"

Only print output after finding pattern

Why didn't Theresa May consult with Parliament before negotiating a deal with the EU?

Rotate a column

If/When UK leaves the EU, can a future goverment do a referendum to join EU

How to get regions to plot as graphics

How to start emacs in "nothing" mode (`fundamental-mode`)

Hindi speaking tourist to UK from India

Are there languages with no euphemisms?

What benefits would be gained by using human laborers instead of drones in deep sea mining?

Would a galaxy be visible from outside, but nearby?

What flight has the highest ratio of time difference to flight time?

Help understanding this unsettling image of Titan, Epimetheus, and Saturn's rings?

A "random" question: usage of "random" as adjective in Spanish

Is there a difference between "Fahrstuhl" and "Aufzug"

How to prepend a string to only the lines of text which are numbers

What is the purpose of the Evocation wizard's Potent Cantrip feature?

How to safely derail a train during transit?



Need help with confusing dataset formats for Images and annotations



The Next CEO of Stack Overflow
2019 Community Moderator ElectionBest format for recording time stamp and GPSHow to prepare/augment images for neural network?Tools to perform SQL analytics on 350TB of csv dataAnimal Detection datasetFormat for time series data with non-trivial sampled dataHybrid Convolutional and Conventional Neural NetworksHow can I combine images for Matlab deep learning?Neural Network Architecture for Identifying Image CopiesHow to Build Mobile Application for Image Recognition?










1












$begingroup$


I have a more general question, which I could not answer after quite a bit of google. Actually, I got more confused. I hope you can share some light.



I am familiar with deep learning, Keras but come from time series analysis and now start in image processing.



Background



What I noticed, that most tutorial use open datasets to explain implementations of CNNs. Each dataset (COCO, ImageNet, CIFAR,…) chooses all different formatting of the annotations and representation of the images.



For example, COCO just e.g. uses the name of the image in json while others use the flattened RGB information of the image in csv.



Now I want to create my own dataset with hand-gestures. Generally, I could just give each image one tag and be finished for image classification. However, I see many implementations of complex Network architectures like RetinaNet, which use formats such as COCOs also for object (here hand) recognition.



The Question(s):



Not knowing if I will later need more sophisticated annotations, should I start with a COCO format? But if so, which should I choose of the many formats? COCOs, ImageNets,… ?



Secondly, if I later would write a converter from one to the other (for comparison), I guess it would be best if I already have the most sophisticated formatting, as downgrading is easier than upgrading the information level.



Could you give me a hint, your opinion, or an overview?




Maybe some other points which could give a clearer picture:



  • What is, in your opinion, the best format for hand gesture analysis?
    (different gestures, different background). Or more general, which
    formats suits which tasks the best?

  • Can you use the classic train-test split function with the COCO
    format?

  • Does using CSV mean in most cases to put the flattened image into it,
    while json the image name?

  • Which most know, preset network architectures from Github use which
    formatting. (For using them as pre-trained networks for transfer
    learning)

Thanks a lot



p.s. my answer to you might be delayed, thanks for the patience










share|improve this question







New contributor




Florida Man is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.







$endgroup$
















    1












    $begingroup$


    I have a more general question, which I could not answer after quite a bit of google. Actually, I got more confused. I hope you can share some light.



    I am familiar with deep learning, Keras but come from time series analysis and now start in image processing.



    Background



    What I noticed, that most tutorial use open datasets to explain implementations of CNNs. Each dataset (COCO, ImageNet, CIFAR,…) chooses all different formatting of the annotations and representation of the images.



    For example, COCO just e.g. uses the name of the image in json while others use the flattened RGB information of the image in csv.



    Now I want to create my own dataset with hand-gestures. Generally, I could just give each image one tag and be finished for image classification. However, I see many implementations of complex Network architectures like RetinaNet, which use formats such as COCOs also for object (here hand) recognition.



    The Question(s):



    Not knowing if I will later need more sophisticated annotations, should I start with a COCO format? But if so, which should I choose of the many formats? COCOs, ImageNets,… ?



    Secondly, if I later would write a converter from one to the other (for comparison), I guess it would be best if I already have the most sophisticated formatting, as downgrading is easier than upgrading the information level.



    Could you give me a hint, your opinion, or an overview?




    Maybe some other points which could give a clearer picture:



    • What is, in your opinion, the best format for hand gesture analysis?
      (different gestures, different background). Or more general, which
      formats suits which tasks the best?

    • Can you use the classic train-test split function with the COCO
      format?

    • Does using CSV mean in most cases to put the flattened image into it,
      while json the image name?

    • Which most know, preset network architectures from Github use which
      formatting. (For using them as pre-trained networks for transfer
      learning)

    Thanks a lot



    p.s. my answer to you might be delayed, thanks for the patience










    share|improve this question







    New contributor




    Florida Man is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
    Check out our Code of Conduct.







    $endgroup$














      1












      1








      1





      $begingroup$


      I have a more general question, which I could not answer after quite a bit of google. Actually, I got more confused. I hope you can share some light.



      I am familiar with deep learning, Keras but come from time series analysis and now start in image processing.



      Background



      What I noticed, that most tutorial use open datasets to explain implementations of CNNs. Each dataset (COCO, ImageNet, CIFAR,…) chooses all different formatting of the annotations and representation of the images.



      For example, COCO just e.g. uses the name of the image in json while others use the flattened RGB information of the image in csv.



      Now I want to create my own dataset with hand-gestures. Generally, I could just give each image one tag and be finished for image classification. However, I see many implementations of complex Network architectures like RetinaNet, which use formats such as COCOs also for object (here hand) recognition.



      The Question(s):



      Not knowing if I will later need more sophisticated annotations, should I start with a COCO format? But if so, which should I choose of the many formats? COCOs, ImageNets,… ?



      Secondly, if I later would write a converter from one to the other (for comparison), I guess it would be best if I already have the most sophisticated formatting, as downgrading is easier than upgrading the information level.



      Could you give me a hint, your opinion, or an overview?




      Maybe some other points which could give a clearer picture:



      • What is, in your opinion, the best format for hand gesture analysis?
        (different gestures, different background). Or more general, which
        formats suits which tasks the best?

      • Can you use the classic train-test split function with the COCO
        format?

      • Does using CSV mean in most cases to put the flattened image into it,
        while json the image name?

      • Which most know, preset network architectures from Github use which
        formatting. (For using them as pre-trained networks for transfer
        learning)

      Thanks a lot



      p.s. my answer to you might be delayed, thanks for the patience










      share|improve this question







      New contributor




      Florida Man is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.







      $endgroup$




      I have a more general question, which I could not answer after quite a bit of google. Actually, I got more confused. I hope you can share some light.



      I am familiar with deep learning, Keras but come from time series analysis and now start in image processing.



      Background



      What I noticed, that most tutorial use open datasets to explain implementations of CNNs. Each dataset (COCO, ImageNet, CIFAR,…) chooses all different formatting of the annotations and representation of the images.



      For example, COCO just e.g. uses the name of the image in json while others use the flattened RGB information of the image in csv.



      Now I want to create my own dataset with hand-gestures. Generally, I could just give each image one tag and be finished for image classification. However, I see many implementations of complex Network architectures like RetinaNet, which use formats such as COCOs also for object (here hand) recognition.



      The Question(s):



      Not knowing if I will later need more sophisticated annotations, should I start with a COCO format? But if so, which should I choose of the many formats? COCOs, ImageNets,… ?



      Secondly, if I later would write a converter from one to the other (for comparison), I guess it would be best if I already have the most sophisticated formatting, as downgrading is easier than upgrading the information level.



      Could you give me a hint, your opinion, or an overview?




      Maybe some other points which could give a clearer picture:



      • What is, in your opinion, the best format for hand gesture analysis?
        (different gestures, different background). Or more general, which
        formats suits which tasks the best?

      • Can you use the classic train-test split function with the COCO
        format?

      • Does using CSV mean in most cases to put the flattened image into it,
        while json the image name?

      • Which most know, preset network architectures from Github use which
        formatting. (For using them as pre-trained networks for transfer
        learning)

      Thanks a lot



      p.s. my answer to you might be delayed, thanks for the patience







      neural-network dataset image-classification data-formats






      share|improve this question







      New contributor




      Florida Man is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.











      share|improve this question







      New contributor




      Florida Man is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.









      share|improve this question




      share|improve this question






      New contributor




      Florida Man is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.









      asked Mar 22 at 14:05









      Florida ManFlorida Man

      61




      61




      New contributor




      Florida Man is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.





      New contributor





      Florida Man is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.






      Florida Man is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.




















          0






          active

          oldest

          votes












          Your Answer





          StackExchange.ifUsing("editor", function ()
          return StackExchange.using("mathjaxEditing", function ()
          StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
          StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
          );
          );
          , "mathjax-editing");

          StackExchange.ready(function()
          var channelOptions =
          tags: "".split(" "),
          id: "557"
          ;
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function()
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled)
          StackExchange.using("snippets", function()
          createEditor();
          );

          else
          createEditor();

          );

          function createEditor()
          StackExchange.prepareEditor(
          heartbeatType: 'answer',
          autoActivateHeartbeat: false,
          convertImagesToLinks: false,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: null,
          bindNavPrevention: true,
          postfix: "",
          imageUploader:
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          ,
          onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          );



          );






          Florida Man is a new contributor. Be nice, and check out our Code of Conduct.









          draft saved

          draft discarded


















          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47793%2fneed-help-with-confusing-dataset-formats-for-images-and-annotations%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown

























          0






          active

          oldest

          votes








          0






          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes








          Florida Man is a new contributor. Be nice, and check out our Code of Conduct.









          draft saved

          draft discarded


















          Florida Man is a new contributor. Be nice, and check out our Code of Conduct.












          Florida Man is a new contributor. Be nice, and check out our Code of Conduct.











          Florida Man is a new contributor. Be nice, and check out our Code of Conduct.














          Thanks for contributing an answer to Data Science Stack Exchange!


          • Please be sure to answer the question. Provide details and share your research!

          But avoid


          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.

          Use MathJax to format equations. MathJax reference.


          To learn more, see our tips on writing great answers.




          draft saved


          draft discarded














          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47793%2fneed-help-with-confusing-dataset-formats-for-images-and-annotations%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown





















































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown

































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown







          Popular posts from this blog

          Adding axes to figuresAdding axes labels to LaTeX figuresLaTeX equivalent of ConTeXt buffersRotate a node but not its content: the case of the ellipse decorationHow to define the default vertical distance between nodes?TikZ scaling graphic and adjust node position and keep font sizeNumerical conditional within tikz keys?adding axes to shapesAlign axes across subfiguresAdding figures with a certain orderLine up nested tikz enviroments or how to get rid of themAdding axes labels to LaTeX figures

          Tähtien Talli Jäsenet | Lähteet | NavigointivalikkoSuomen Hippos – Tähtien Talli

          Do these cracks on my tires look bad? The Next CEO of Stack OverflowDry rot tire should I replace?Having to replace tiresFishtailed so easily? Bad tires? ABS?Filling the tires with something other than air, to avoid puncture hassles?Used Michelin tires safe to install?Do these tyre cracks necessitate replacement?Rumbling noise: tires or mechanicalIs it possible to fix noisy feathered tires?Are bad winter tires still better than summer tires in winter?Torque converter failure - Related to replacing only 2 tires?Why use snow tires on all 4 wheels on 2-wheel-drive cars?