(Almost) linearely separable dataset, where can I find one?Dataset for Named Entity Recognition on Informal TextWhere can I find free spatio-temporal dataset for download?Where can I get a comprehensive criminal dataset?Where can I find a >1K dataset of annotated store receipt pictures/scans?Where can I find a crowdsourced dataset for multi-label classification with individual participant labels?Tactics to avoid feeling overwhelmed by machine learningWhere can I find a dataset for long sequence text chunking?Where can I find freely available multi-label datasets online?sklearn.GridSearchCV predict method not providing the best estimate and accuracy scoreWhere can I find IQ data?

Exchange,swap or switch

Unexpected email from Yorkshire Bank

How can I practically buy stocks?

How can the Zone of Truth spell be defeated without the caster knowing?

Was there a Viking Exchange as well as a Columbian one?

What does it mean to express a gate in Dirac notation?

The Defining Moment

Realistic Necromancy?

Is the 5 MB static resource size limit 5,242,880 bytes or 5,000,000 bytes?

How to write a column outside the braces in a matrix?

how to find the equation of a circle given points of the circle

How to pronounce 'C++' in Spanish

Does holding a wand and speaking its command word count as V/S/M spell components?

Minor Revision with suggestion of an alternative proof by reviewer

How did Captain America manage to do this?

Is there a way to get a compiler for the original B programming language?

Normal Map bad shading in Rendered display

Why does processed meat contain preservatives, while canned fish needs not?

A ​Note ​on ​N!

Are Boeing 737-800’s grounded?

Why isn't the definition of absolute value applied when squaring a radical containing a variable?

How come there are so many candidates for the 2020 Democratic party presidential nomination?

How do I deal with a coworker that keeps asking to make small superficial changes to a report, and it is seriously triggering my anxiety?

What is the most expensive material in the world that could be used to create Pun-Pun's lute?



(Almost) linearely separable dataset, where can I find one?


Dataset for Named Entity Recognition on Informal TextWhere can I find free spatio-temporal dataset for download?Where can I get a comprehensive criminal dataset?Where can I find a >1K dataset of annotated store receipt pictures/scans?Where can I find a crowdsourced dataset for multi-label classification with individual participant labels?Tactics to avoid feeling overwhelmed by machine learningWhere can I find a dataset for long sequence text chunking?Where can I find freely available multi-label datasets online?sklearn.GridSearchCV predict method not providing the best estimate and accuracy scoreWhere can I find IQ data?













0












$begingroup$


I'm implementing the perceptron algorithm and the voted perceptron algorithm for an assignment for university.
For that I need to find some decent datasets.. I've tried the UCI repos and I've come up with:



  • The banknote authentication dataset:
    https://archive.ics.uci.edu/ml/datasets/banknote+authentication


  • Occupancy dataset: https://archive.ics.uci.edu/ml/datasets/Occupancy+Detection+


that work quite nicely, the accuracy is pretty high for the linear separability.
HOWEVER I need to find a third dataset to end my assignment, I've been searching everywhere but I can't seem to find anymore of them, the dataset has to have more than 1000 examples, and it has to be binary classifiable obviously..



Does anyone of you know where I could find such a dataset?



I've been trying the Adult dataset, but it's pretty much sparse.. far from being linearly separable..



NOTE: I can't use the kernel trick or any other higher dimensional paradigm.










share|improve this question









$endgroup$











  • $begingroup$
    There is a similar question on stats.stackexchange.com/questions/247057/… and in one of the answers they points taht there is a Iris Dataset with some of the classes been linearly separable
    $endgroup$
    – Pedro Henrique Monforte
    Apr 7 at 19:52






  • 1




    $begingroup$
    Also this reddit question has a nice proposal for this kind of problem > "If you're just playing with data, another approach would be to find the separation that best separates a set of data, then remove the points or dimensions that violate the linear separation. It wouldn't be valid for publication, but it could still be useful for practicing your ANN work."
    $endgroup$
    – Pedro Henrique Monforte
    Apr 7 at 19:56











  • $begingroup$
    This question is likely a better fit at Opendata.SE
    $endgroup$
    – Stephen Rauch
    Apr 7 at 20:49










  • $begingroup$
    Could you use a synthetic data set? It should be very easy to generate your own dataset that is "almost linearly separable".
    $endgroup$
    – G5W
    Apr 7 at 21:05















0












$begingroup$


I'm implementing the perceptron algorithm and the voted perceptron algorithm for an assignment for university.
For that I need to find some decent datasets.. I've tried the UCI repos and I've come up with:



  • The banknote authentication dataset:
    https://archive.ics.uci.edu/ml/datasets/banknote+authentication


  • Occupancy dataset: https://archive.ics.uci.edu/ml/datasets/Occupancy+Detection+


that work quite nicely, the accuracy is pretty high for the linear separability.
HOWEVER I need to find a third dataset to end my assignment, I've been searching everywhere but I can't seem to find anymore of them, the dataset has to have more than 1000 examples, and it has to be binary classifiable obviously..



Does anyone of you know where I could find such a dataset?



I've been trying the Adult dataset, but it's pretty much sparse.. far from being linearly separable..



NOTE: I can't use the kernel trick or any other higher dimensional paradigm.










share|improve this question









$endgroup$











  • $begingroup$
    There is a similar question on stats.stackexchange.com/questions/247057/… and in one of the answers they points taht there is a Iris Dataset with some of the classes been linearly separable
    $endgroup$
    – Pedro Henrique Monforte
    Apr 7 at 19:52






  • 1




    $begingroup$
    Also this reddit question has a nice proposal for this kind of problem > "If you're just playing with data, another approach would be to find the separation that best separates a set of data, then remove the points or dimensions that violate the linear separation. It wouldn't be valid for publication, but it could still be useful for practicing your ANN work."
    $endgroup$
    – Pedro Henrique Monforte
    Apr 7 at 19:56











  • $begingroup$
    This question is likely a better fit at Opendata.SE
    $endgroup$
    – Stephen Rauch
    Apr 7 at 20:49










  • $begingroup$
    Could you use a synthetic data set? It should be very easy to generate your own dataset that is "almost linearly separable".
    $endgroup$
    – G5W
    Apr 7 at 21:05













0












0








0





$begingroup$


I'm implementing the perceptron algorithm and the voted perceptron algorithm for an assignment for university.
For that I need to find some decent datasets.. I've tried the UCI repos and I've come up with:



  • The banknote authentication dataset:
    https://archive.ics.uci.edu/ml/datasets/banknote+authentication


  • Occupancy dataset: https://archive.ics.uci.edu/ml/datasets/Occupancy+Detection+


that work quite nicely, the accuracy is pretty high for the linear separability.
HOWEVER I need to find a third dataset to end my assignment, I've been searching everywhere but I can't seem to find anymore of them, the dataset has to have more than 1000 examples, and it has to be binary classifiable obviously..



Does anyone of you know where I could find such a dataset?



I've been trying the Adult dataset, but it's pretty much sparse.. far from being linearly separable..



NOTE: I can't use the kernel trick or any other higher dimensional paradigm.










share|improve this question









$endgroup$




I'm implementing the perceptron algorithm and the voted perceptron algorithm for an assignment for university.
For that I need to find some decent datasets.. I've tried the UCI repos and I've come up with:



  • The banknote authentication dataset:
    https://archive.ics.uci.edu/ml/datasets/banknote+authentication


  • Occupancy dataset: https://archive.ics.uci.edu/ml/datasets/Occupancy+Detection+


that work quite nicely, the accuracy is pretty high for the linear separability.
HOWEVER I need to find a third dataset to end my assignment, I've been searching everywhere but I can't seem to find anymore of them, the dataset has to have more than 1000 examples, and it has to be binary classifiable obviously..



Does anyone of you know where I could find such a dataset?



I've been trying the Adult dataset, but it's pretty much sparse.. far from being linearly separable..



NOTE: I can't use the kernel trick or any other higher dimensional paradigm.







machine-learning dataset perceptron






share|improve this question













share|improve this question











share|improve this question




share|improve this question










asked Apr 7 at 19:19









Edoardo Wijaya GrappoliniEdoardo Wijaya Grappolini

1




1











  • $begingroup$
    There is a similar question on stats.stackexchange.com/questions/247057/… and in one of the answers they points taht there is a Iris Dataset with some of the classes been linearly separable
    $endgroup$
    – Pedro Henrique Monforte
    Apr 7 at 19:52






  • 1




    $begingroup$
    Also this reddit question has a nice proposal for this kind of problem > "If you're just playing with data, another approach would be to find the separation that best separates a set of data, then remove the points or dimensions that violate the linear separation. It wouldn't be valid for publication, but it could still be useful for practicing your ANN work."
    $endgroup$
    – Pedro Henrique Monforte
    Apr 7 at 19:56











  • $begingroup$
    This question is likely a better fit at Opendata.SE
    $endgroup$
    – Stephen Rauch
    Apr 7 at 20:49










  • $begingroup$
    Could you use a synthetic data set? It should be very easy to generate your own dataset that is "almost linearly separable".
    $endgroup$
    – G5W
    Apr 7 at 21:05
















  • $begingroup$
    There is a similar question on stats.stackexchange.com/questions/247057/… and in one of the answers they points taht there is a Iris Dataset with some of the classes been linearly separable
    $endgroup$
    – Pedro Henrique Monforte
    Apr 7 at 19:52






  • 1




    $begingroup$
    Also this reddit question has a nice proposal for this kind of problem > "If you're just playing with data, another approach would be to find the separation that best separates a set of data, then remove the points or dimensions that violate the linear separation. It wouldn't be valid for publication, but it could still be useful for practicing your ANN work."
    $endgroup$
    – Pedro Henrique Monforte
    Apr 7 at 19:56











  • $begingroup$
    This question is likely a better fit at Opendata.SE
    $endgroup$
    – Stephen Rauch
    Apr 7 at 20:49










  • $begingroup$
    Could you use a synthetic data set? It should be very easy to generate your own dataset that is "almost linearly separable".
    $endgroup$
    – G5W
    Apr 7 at 21:05















$begingroup$
There is a similar question on stats.stackexchange.com/questions/247057/… and in one of the answers they points taht there is a Iris Dataset with some of the classes been linearly separable
$endgroup$
– Pedro Henrique Monforte
Apr 7 at 19:52




$begingroup$
There is a similar question on stats.stackexchange.com/questions/247057/… and in one of the answers they points taht there is a Iris Dataset with some of the classes been linearly separable
$endgroup$
– Pedro Henrique Monforte
Apr 7 at 19:52




1




1




$begingroup$
Also this reddit question has a nice proposal for this kind of problem > "If you're just playing with data, another approach would be to find the separation that best separates a set of data, then remove the points or dimensions that violate the linear separation. It wouldn't be valid for publication, but it could still be useful for practicing your ANN work."
$endgroup$
– Pedro Henrique Monforte
Apr 7 at 19:56





$begingroup$
Also this reddit question has a nice proposal for this kind of problem > "If you're just playing with data, another approach would be to find the separation that best separates a set of data, then remove the points or dimensions that violate the linear separation. It wouldn't be valid for publication, but it could still be useful for practicing your ANN work."
$endgroup$
– Pedro Henrique Monforte
Apr 7 at 19:56













$begingroup$
This question is likely a better fit at Opendata.SE
$endgroup$
– Stephen Rauch
Apr 7 at 20:49




$begingroup$
This question is likely a better fit at Opendata.SE
$endgroup$
– Stephen Rauch
Apr 7 at 20:49












$begingroup$
Could you use a synthetic data set? It should be very easy to generate your own dataset that is "almost linearly separable".
$endgroup$
– G5W
Apr 7 at 21:05




$begingroup$
Could you use a synthetic data set? It should be very easy to generate your own dataset that is "almost linearly separable".
$endgroup$
– G5W
Apr 7 at 21:05










0






active

oldest

votes












Your Answer








StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "557"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);



);













draft saved

draft discarded


















StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f48834%2falmost-linearely-separable-dataset-where-can-i-find-one%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown

























0






active

oldest

votes








0






active

oldest

votes









active

oldest

votes






active

oldest

votes















draft saved

draft discarded
















































Thanks for contributing an answer to Data Science Stack Exchange!


  • Please be sure to answer the question. Provide details and share your research!

But avoid


  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.

Use MathJax to format equations. MathJax reference.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f48834%2falmost-linearely-separable-dataset-where-can-i-find-one%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Adding axes to figuresAdding axes labels to LaTeX figuresLaTeX equivalent of ConTeXt buffersRotate a node but not its content: the case of the ellipse decorationHow to define the default vertical distance between nodes?TikZ scaling graphic and adjust node position and keep font sizeNumerical conditional within tikz keys?adding axes to shapesAlign axes across subfiguresAdding figures with a certain orderLine up nested tikz enviroments or how to get rid of themAdding axes labels to LaTeX figures

Luettelo Yhdysvaltain laivaston lentotukialuksista Lähteet | Navigointivalikko

Gary (muusikko) Sisällysluettelo Historia | Rockin' High | Lähteet | Aiheesta muualla | NavigointivalikkoInfobox OKTuomas "Gary" Keskinen Ancaran kitaristiksiProjekti Rockin' High