Need help with confusing dataset formats for images and annotations

$begingroup$


I have a fairly general question that I could not answer after quite a bit of googling; if anything, I got more confused. I hope you can shed some light on it.



I am familiar with deep learning and Keras, but my background is time series analysis and I am only now starting with image processing.



Background



I have noticed that most tutorials use open datasets to explain CNN implementations. Each dataset (COCO, ImageNet, CIFAR, …) uses a different format for its annotations and a different representation of its images.



For example, COCO stores only the image file name in a JSON file, while other datasets store the flattened RGB values of each image in a CSV file.
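A minimal sketch of the two styles being contrasted, under stated assumptions: the JSON keys follow the publicly documented COCO detection layout, but the file name, class name, box, and pixel values are made up for illustration, and the "label first, then pixels" CSV layout is only an assumption modeled on common tutorial files.

```python
import csv
import io
import json

# COCO-style: the JSON holds only metadata; the pixels stay in the image file.
# Key names follow the public COCO detection layout; all values are invented.
coco_style = {
    "images": [{"id": 1, "file_name": "hand_0001.jpg", "width": 640, "height": 480}],
    "categories": [{"id": 1, "name": "thumbs_up"}],
    "annotations": [
        {
            "id": 1,
            "image_id": 1,       # links the box to an entry in "images"
            "category_id": 1,    # links the box to an entry in "categories"
            "bbox": [120, 80, 200, 220],  # [x, y, width, height] in pixels
            "area": 200 * 220,
            "iscrowd": 0,
        }
    ],
}
coco_json = json.dumps(coco_style)

# CSV-style: one row per image, the label followed by the flattened pixels.
# A hypothetical 2x2 RGB image gives 2 * 2 * 3 = 12 values.
pixels = [[(255, 0, 0), (0, 255, 0)], [(0, 0, 255), (255, 255, 255)]]
flat = [channel for row in pixels for pixel in row for channel in pixel]

buffer = io.StringIO()
csv.writer(buffer).writerow(["thumbs_up"] + flat)
csv_row = buffer.getvalue().strip()
```

The practical difference is where the pixels live: the JSON variant points at image files and can carry boxes per object, while the CSV variant embeds the image itself and has room for little more than one label per row.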



Now I want to create my own dataset of hand gestures. In principle, I could give each image a single tag and be done, as far as image classification goes. However, I see many implementations of complex network architectures such as RetinaNet that use formats like COCO's for object (here: hand) recognition as well.



The Question(s):



Not knowing whether I will later need more sophisticated annotations, should I start with a COCO-style format? And if so, which of the many formats should I choose: COCO's, ImageNet's, …?



Secondly, if I later write a converter from one format to another (for comparison), I guess it would be best to already have the most detailed format, since downgrading the level of information is easier than upgrading it.
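A rough sketch of what "downgrading" could look like, assuming the annotations are held in a COCO-style dictionary with `images`, `annotations`, and `categories` lists. The function name `coco_to_image_labels` and the rule "the most frequent category wins" are hypothetical choices for illustration, not part of any library.

```python
from collections import Counter, defaultdict


def coco_to_image_labels(coco):
    """Collapse COCO-style box annotations into one class name per file name."""
    id_to_name = {cat["id"]: cat["name"] for cat in coco["categories"]}
    id_to_file = {img["id"]: img["file_name"] for img in coco["images"]}

    # Count how often each category occurs in each image.
    per_image = defaultdict(Counter)
    for ann in coco["annotations"]:
        per_image[ann["image_id"]][ann["category_id"]] += 1

    labels = {}
    for image_id, counts in per_image.items():
        # Assumption: the most frequent category becomes the image label.
        category_id, _ = counts.most_common(1)[0]
        labels[id_to_file[image_id]] = id_to_name[category_id]

    # Images without any annotation get no label and are left out.
    return labels
```

Going this way only discards information (box positions, extra objects). The reverse direction would need box coordinates that a per-image label never contained, which is why the richer format is the safer starting point.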



Could you give me a hint, your opinion, or an overview?




Maybe some other points which could give a clearer picture:



  • What is, in your opinion, the best format for hand gesture analysis
    (different gestures, different backgrounds)? Or, more generally, which
    format suits which task best?

  • Can you use the classic train-test split function with the COCO
    format?

  • Does using CSV usually mean storing the flattened image in the file,
    while JSON stores only the image name?

  • Which of the best-known, ready-made network architectures on GitHub
    use which format? (I would like to use them as pre-trained networks
    for transfer learning.)
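To make the train-test split point concrete: one way such a split could be done on a COCO-style dictionary is to split the image ids and then filter the annotations to match, so that all boxes of an image stay on the same side. This is only a sketch using the standard library; `split_coco`, `test_fraction`, and `seed` are hypothetical names, and the dictionary layout (`images`, `annotations`, `categories`) is assumed.

```python
import random


def split_coco(coco, test_fraction=0.2, seed=0):
    """Split a COCO-style dict into (train, test) dicts by image id."""
    image_ids = sorted(img["id"] for img in coco["images"])
    random.Random(seed).shuffle(image_ids)  # seeded for reproducibility

    n_test = int(round(len(image_ids) * test_fraction))
    test_ids = set(image_ids[:n_test])

    def subset(in_test):
        # Keep images and annotations that belong to the requested side.
        images = [i for i in coco["images"] if (i["id"] in test_ids) == in_test]
        annotations = [
            a for a in coco["annotations"] if (a["image_id"] in test_ids) == in_test
        ]
        return {
            "images": images,
            "annotations": annotations,
            "categories": coco["categories"],  # shared by both splits
        }

    return subset(False), subset(True)
```

Splitting the annotation list directly would scatter boxes from one image across both sets, so the split is done at the image level here.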

Thanks a lot



P.S. My replies might be delayed; thanks for your patience.










$endgroup$
















Tags: neural-network, dataset, image-classification, data-formats






asked Mar 22 at 14:05 by Florida Man



