(Almost) linearly separable dataset, where can I find one?
I'm implementing the perceptron algorithm and the voted perceptron algorithm for a university assignment.
For that I need to find some decent datasets. I've tried the UCI repository and come up with:
The banknote authentication dataset: https://archive.ics.uci.edu/ml/datasets/banknote+authentication
Occupancy dataset: https://archive.ics.uci.edu/ml/datasets/Occupancy+Detection+
Both work quite nicely: the accuracy is pretty high, which suggests they are close to linearly separable.
However, I need a third dataset to finish my assignment. I've been searching everywhere but can't seem to find any more of them. The dataset has to have more than 1000 examples, and it obviously has to be a binary classification problem.
Does anyone know where I could find such a dataset?
I've been trying the Adult dataset, but it's pretty sparse and far from being linearly separable.
NOTE: I can't use the kernel trick or any other higher-dimensional approach.
machine-learning dataset perceptron
There is a similar question at stats.stackexchange.com/questions/247057/… and one of the answers points out that the Iris dataset has some classes that are linearly separable.
– Pedro Henrique Monforte
Apr 7 at 19:52
Also, this reddit question has a nice proposal for this kind of problem: "If you're just playing with data, another approach would be to find the separation that best separates a set of data, then remove the points or dimensions that violate the linear separation. It wouldn't be valid for publication, but it could still be useful for practicing your ANN work."
– Pedro Henrique Monforte
Apr 7 at 19:56
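A rough sketch of the proposal quoted above, in plain Python. The quote does not say how to find "the separation that best separates" the data, so this assumes a pocket perceptron (keep the weights with the fewest training errors seen so far). The helper names `pocket_perceptron` and `drop_violators` and the toy two-blob data are made up for illustration; a real dataset would replace `X` and `y`.

```python
import random


def dot(w, x):
    return sum(wi * xi for wi, xi in zip(w, x))


def pocket_perceptron(X, y, epochs=20, seed=0):
    """Perceptron that remembers the weights with the fewest training errors."""
    rng = random.Random(seed)
    w, b = [0.0] * len(X[0]), 0.0
    best_w, best_b, best_errors = list(w), b, len(X) + 1
    order = list(range(len(X)))
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            if y[i] * (dot(w, X[i]) + b) <= 0:  # misclassified: update
                w = [wi + y[i] * xi for wi, xi in zip(w, X[i])]
                b += y[i]
        # Count training errors of the current weights after each epoch.
        errors = sum(1 for xi, yi in zip(X, y) if yi * (dot(w, xi) + b) <= 0)
        if errors < best_errors:
            best_w, best_b, best_errors = list(w), b, errors
    return best_w, best_b


def drop_violators(X, y, w, b):
    """Keep only the points strictly on the correct side of the hyperplane."""
    kept = [(xi, yi) for xi, yi in zip(X, y) if yi * (dot(w, xi) + b) > 0]
    return [xi for xi, _ in kept], [yi for _, yi in kept]


# Toy data (an assumption): two overlapping Gaussian blobs, labels +1 / -1.
rng = random.Random(1)
X = [[rng.gauss(1.0, 1.0), rng.gauss(1.0, 1.0)] for _ in range(600)]
X += [[rng.gauss(-1.0, 1.0), rng.gauss(-1.0, 1.0)] for _ in range(600)]
y = [1] * 600 + [-1] * 600

w, b = pocket_perceptron(X, y)
X_clean, y_clean = drop_violators(X, y, w, b)
print(f"kept {len(X_clean)} of {len(X)} points")
```

The filtered set is separable by construction (the hyperplane `(w, b)` classifies every remaining point correctly), so a plain perceptron should converge on it. As the quote says, this is only reasonable for practice, since the removed points are exactly the hard ones.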
This question is likely a better fit at Opendata.SE.
– Stephen Rauch♦
Apr 7 at 20:49
Could you use a synthetic data set? It should be very easy to generate your own dataset that is "almost linearly separable".
– G5W
Apr 7 at 21:05
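One possible way to generate such a synthetic dataset, sketched with only the standard library. The recipe is an assumption, not something the comment specifies: draw points uniformly in a box, label them with a random hyperplane, discard points too close to the boundary, then flip a small fraction of the labels. The function name `make_almost_separable` and its parameters (`noise`, `margin`, `seed`) are hypothetical.

```python
import math
import random


def make_almost_separable(n=1200, dim=2, noise=0.05, margin=0.1, seed=0):
    """Generate n points in [-1, 1]^dim labelled by a random hyperplane.

    A fraction `noise` of the labels is flipped afterwards, so the data
    is linearly separable except for those flipped points.
    """
    rng = random.Random(seed)
    # Random unit-length normal vector and a small offset (ground truth).
    w = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    norm = math.sqrt(sum(wi * wi for wi in w))
    w = [wi / norm for wi in w]
    b = rng.uniform(-0.2, 0.2)

    X, y = [], []
    while len(X) < n:
        x = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
        score = sum(wi * xi for wi, xi in zip(w, x)) + b
        # Reject points closer than `margin` to the separating hyperplane.
        if abs(score) < margin:
            continue
        X.append(x)
        y.append(1 if score > 0 else -1)

    # Flip a fixed number of labels to break perfect separability.
    flipped = rng.sample(range(n), round(noise * n))
    for i in flipped:
        y[i] = -y[i]
    return X, y, w, b, flipped


X, y, w, b, flipped = make_almost_separable()
errors = sum(
    1 for x, label in zip(X, y)
    if label * (sum(wi * xi for wi, xi in zip(w, x)) + b) <= 0
)
print(f"{len(X)} examples, {errors} on the wrong side of the true hyperplane")
```

Setting `noise=0` gives a strictly separable set, on which a plain perceptron is guaranteed to converge; raising `noise` or lowering `margin` makes the problem harder, which may be useful for comparing the perceptron with the voted perceptron.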
asked Apr 7 at 19:19
Edoardo Wijaya Grappolini