Need help with confusing dataset formats for Images and annotations The Next CEO of Stack Overflow2019 Community Moderator ElectionBest format for recording time stamp and GPSHow to prepare/augment images for neural network?Tools to perform SQL analytics on 350TB of csv dataAnimal Detection datasetFormat for time series data with non-trivial sampled dataHybrid Convolutional and Conventional Neural NetworksHow can I combine images for Matlab deep learning?Neural Network Architecture for Identifying Image CopiesHow to Build Mobile Application for Image Recognition?
Why do remote companies require working in the US?
How to make a variable always equal to the result of some calculations?
How do I go from 300 unfinished/half written blog posts, to published posts?
How do we know the LHC results are robust?
What does "Its cash flow is deeply negative" mean?
Why doesn't a table tennis ball float on a surface of steel balls? How do we calculate buoyancy here?
What happens if you roll doubles 3 times then land on "Go to jail?"
Only print output after finding pattern
Why didn't Theresa May consult with Parliament before negotiating a deal with the EU?
Rotate a column
If/When UK leaves the EU, can a future goverment do a referendum to join EU
How to get regions to plot as graphics
How to start emacs in "nothing" mode (`fundamental-mode`)
Hindi speaking tourist to UK from India
Are there languages with no euphemisms?
What benefits would be gained by using human laborers instead of drones in deep sea mining?
Would a galaxy be visible from outside, but nearby?
What flight has the highest ratio of time difference to flight time?
Help understanding this unsettling image of Titan, Epimetheus, and Saturn's rings?
A "random" question: usage of "random" as adjective in Spanish
Is there a difference between "Fahrstuhl" and "Aufzug"
How to prepend a string to only the lines of text which are numbers
What is the purpose of the Evocation wizard's Potent Cantrip feature?
How to safely derail a train during transit?
Need help with confusing dataset formats for Images and annotations
The Next CEO of Stack Overflow2019 Community Moderator ElectionBest format for recording time stamp and GPSHow to prepare/augment images for neural network?Tools to perform SQL analytics on 350TB of csv dataAnimal Detection datasetFormat for time series data with non-trivial sampled dataHybrid Convolutional and Conventional Neural NetworksHow can I combine images for Matlab deep learning?Neural Network Architecture for Identifying Image CopiesHow to Build Mobile Application for Image Recognition?
$begingroup$
I have a more general question, which I could not answer after quite a bit of google. Actually, I got more confused. I hope you can share some light.
I am familiar with deep learning, Keras but come from time series analysis and now start in image processing.
Background
What I noticed, that most tutorial use open datasets to explain implementations of CNNs. Each dataset (COCO, ImageNet, CIFAR,…) chooses all different formatting of the annotations and representation of the images.
For example, COCO just e.g. uses the name of the image in json while others use the flattened RGB information of the image in csv.
Now I want to create my own dataset with hand-gestures. Generally, I could just give each image one tag and be finished for image classification. However, I see many implementations of complex Network architectures like RetinaNet, which use formats such as COCOs also for object (here hand) recognition.
The Question(s):
Not knowing if I will later need more sophisticated annotations, should I start with a COCO format? But if so, which should I choose of the many formats? COCOs, ImageNets,… ?
Secondly, if I later would write a converter from one to the other (for comparison), I guess it would be best if I already have the most sophisticated formatting, as downgrading is easier than upgrading the information level.
Could you give me a hint, your opinion, or an overview?
Maybe some other points which could give a clearer picture:
- What is, in your opinion, the best format for hand gesture analysis?
(different gestures, different background). Or more general, which
formats suits which tasks the best? - Can you use the classic train-test split function with the COCO
format? - Does using CSV mean in most cases to put the flattened image into it,
while json the image name? - Which most know, preset network architectures from Github use which
formatting. (For using them as pre-trained networks for transfer
learning)
Thanks a lot
p.s. my answer to you might be delayed, thanks for the patience
neural-network dataset image-classification data-formats
New contributor
$endgroup$
add a comment |
$begingroup$
I have a more general question, which I could not answer after quite a bit of google. Actually, I got more confused. I hope you can share some light.
I am familiar with deep learning, Keras but come from time series analysis and now start in image processing.
Background
What I noticed, that most tutorial use open datasets to explain implementations of CNNs. Each dataset (COCO, ImageNet, CIFAR,…) chooses all different formatting of the annotations and representation of the images.
For example, COCO just e.g. uses the name of the image in json while others use the flattened RGB information of the image in csv.
Now I want to create my own dataset with hand-gestures. Generally, I could just give each image one tag and be finished for image classification. However, I see many implementations of complex Network architectures like RetinaNet, which use formats such as COCOs also for object (here hand) recognition.
The Question(s):
Not knowing if I will later need more sophisticated annotations, should I start with a COCO format? But if so, which should I choose of the many formats? COCOs, ImageNets,… ?
Secondly, if I later would write a converter from one to the other (for comparison), I guess it would be best if I already have the most sophisticated formatting, as downgrading is easier than upgrading the information level.
Could you give me a hint, your opinion, or an overview?
Maybe some other points which could give a clearer picture:
- What is, in your opinion, the best format for hand gesture analysis?
(different gestures, different background). Or more general, which
formats suits which tasks the best? - Can you use the classic train-test split function with the COCO
format? - Does using CSV mean in most cases to put the flattened image into it,
while json the image name? - Which most know, preset network architectures from Github use which
formatting. (For using them as pre-trained networks for transfer
learning)
Thanks a lot
p.s. my answer to you might be delayed, thanks for the patience
neural-network dataset image-classification data-formats
New contributor
$endgroup$
add a comment |
$begingroup$
I have a more general question, which I could not answer after quite a bit of google. Actually, I got more confused. I hope you can share some light.
I am familiar with deep learning, Keras but come from time series analysis and now start in image processing.
Background
What I noticed, that most tutorial use open datasets to explain implementations of CNNs. Each dataset (COCO, ImageNet, CIFAR,…) chooses all different formatting of the annotations and representation of the images.
For example, COCO just e.g. uses the name of the image in json while others use the flattened RGB information of the image in csv.
Now I want to create my own dataset with hand-gestures. Generally, I could just give each image one tag and be finished for image classification. However, I see many implementations of complex Network architectures like RetinaNet, which use formats such as COCOs also for object (here hand) recognition.
The Question(s):
Not knowing if I will later need more sophisticated annotations, should I start with a COCO format? But if so, which should I choose of the many formats? COCOs, ImageNets,… ?
Secondly, if I later would write a converter from one to the other (for comparison), I guess it would be best if I already have the most sophisticated formatting, as downgrading is easier than upgrading the information level.
Could you give me a hint, your opinion, or an overview?
Maybe some other points which could give a clearer picture:
- What is, in your opinion, the best format for hand gesture analysis?
(different gestures, different background). Or more general, which
formats suits which tasks the best? - Can you use the classic train-test split function with the COCO
format? - Does using CSV mean in most cases to put the flattened image into it,
while json the image name? - Which most know, preset network architectures from Github use which
formatting. (For using them as pre-trained networks for transfer
learning)
Thanks a lot
p.s. my answer to you might be delayed, thanks for the patience
neural-network dataset image-classification data-formats
New contributor
$endgroup$
I have a more general question, which I could not answer after quite a bit of google. Actually, I got more confused. I hope you can share some light.
I am familiar with deep learning, Keras but come from time series analysis and now start in image processing.
Background
What I noticed, that most tutorial use open datasets to explain implementations of CNNs. Each dataset (COCO, ImageNet, CIFAR,…) chooses all different formatting of the annotations and representation of the images.
For example, COCO just e.g. uses the name of the image in json while others use the flattened RGB information of the image in csv.
Now I want to create my own dataset with hand-gestures. Generally, I could just give each image one tag and be finished for image classification. However, I see many implementations of complex Network architectures like RetinaNet, which use formats such as COCOs also for object (here hand) recognition.
The Question(s):
Not knowing if I will later need more sophisticated annotations, should I start with a COCO format? But if so, which should I choose of the many formats? COCOs, ImageNets,… ?
Secondly, if I later would write a converter from one to the other (for comparison), I guess it would be best if I already have the most sophisticated formatting, as downgrading is easier than upgrading the information level.
Could you give me a hint, your opinion, or an overview?
Maybe some other points which could give a clearer picture:
- What is, in your opinion, the best format for hand gesture analysis?
(different gestures, different background). Or more general, which
formats suits which tasks the best? - Can you use the classic train-test split function with the COCO
format? - Does using CSV mean in most cases to put the flattened image into it,
while json the image name? - Which most know, preset network architectures from Github use which
formatting. (For using them as pre-trained networks for transfer
learning)
Thanks a lot
p.s. my answer to you might be delayed, thanks for the patience
neural-network dataset image-classification data-formats
neural-network dataset image-classification data-formats
New contributor
New contributor
New contributor
asked Mar 22 at 14:05
Florida ManFlorida Man
61
61
New contributor
New contributor
add a comment |
add a comment |
0
active
oldest
votes
Your Answer
StackExchange.ifUsing("editor", function ()
return StackExchange.using("mathjaxEditing", function ()
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
);
);
, "mathjax-editing");
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "557"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Florida Man is a new contributor. Be nice, and check out our Code of Conduct.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47793%2fneed-help-with-confusing-dataset-formats-for-images-and-annotations%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
0
active
oldest
votes
0
active
oldest
votes
active
oldest
votes
active
oldest
votes
Florida Man is a new contributor. Be nice, and check out our Code of Conduct.
Florida Man is a new contributor. Be nice, and check out our Code of Conduct.
Florida Man is a new contributor. Be nice, and check out our Code of Conduct.
Florida Man is a new contributor. Be nice, and check out our Code of Conduct.
Thanks for contributing an answer to Data Science Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47793%2fneed-help-with-confusing-dataset-formats-for-images-and-annotations%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown