Overfitting in K-means

How do you test the results of a k-means run for overfitting? Some people have suggested using a training set. I have about 1500 records and about 20 fields.

  • You can't overfit K-Means. It can, however, be non-robust. It's unsupervised learning, not supervised. Keywords for theory: silhouette analysis, elbow method, gap statistics, mutual information. – Carl Rynegardh, Mar 26 at 20:03

  • @CarlRynegardh although this answer shares the same opinion, this answer seems quite reasonable too. We can settle for a subjective overfitting, I think! – Esmailian, Mar 26 at 20:43

Tags: k-means, overfitting

asked Mar 26 at 19:35 by guest
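
One way to make the "use a training set" suggestion concrete is sketched below. It is only a rough illustration, not a definitive test: fit the centroids on a random subset of the rows, then compare the average squared distance to the nearest centroid on the fitting subset and on the held-out rows, for several values of k. If the held-out error stops improving (or the gap to the training error keeps widening) as k grows, the extra clusters are probably fitting noise. Everything here is an assumption made for illustration: the helper names (`kmeans`, `holdout_curve`, `make_blobs`), the 30% hold-out split, and the synthetic two-dimensional data standing in for the 1500 x 20 table. The silhouette and gap-statistic checks mentioned in the comment are separate techniques and are not implemented here.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(point, centroids):
    """Index of the centroid closest to point."""
    return min(range(len(centroids)), key=lambda j: sq_dist(point, centroids[j]))


def kmeans(points, k, n_iter=50, seed=0):
    """Plain Lloyd's algorithm with random initial centroids; returns the centroids."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    for _ in range(n_iter):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centroids)].append(p)
        new_centroids = []
        for j, group in enumerate(groups):
            if group:
                new_centroids.append([sum(col) / len(group) for col in zip(*group)])
            else:
                new_centroids.append(centroids[j])  # empty cluster: keep old centroid
        if new_centroids == centroids:  # assignments no longer change
            break
        centroids = new_centroids
    return centroids


def mean_sq_error(points, centroids):
    """Average squared distance from each point to its nearest centroid."""
    total = sum(sq_dist(p, centroids[nearest(p, centroids)]) for p in points)
    return total / len(points)


def holdout_curve(points, ks, test_fraction=0.3, seed=0):
    """For each k, fit on a training split and score both splits.

    Returns a list of (k, train_error, held_out_error) tuples.
    """
    rng = random.Random(seed)
    shuffled = points[:]
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    test, train = shuffled[:n_test], shuffled[n_test:]
    rows = []
    for k in ks:
        centroids = kmeans(train, k, seed=seed)
        rows.append((k, mean_sq_error(train, centroids), mean_sq_error(test, centroids)))
    return rows


def make_blobs(n_per_blob=100, centers=((0, 0), (6, 6), (0, 6)), seed=1):
    """Synthetic stand-in data: Gaussian blobs around the given centers."""
    rng = random.Random(seed)
    return [
        [rng.gauss(cx, 1.0), rng.gauss(cy, 1.0)]
        for cx, cy in centers
        for _ in range(n_per_blob)
    ]


if __name__ == "__main__":
    data = make_blobs()
    for k, train_err, test_err in holdout_curve(data, ks=range(1, 9)):
        print(f"k={k}  train={train_err:.3f}  held-out={test_err:.3f}")
```

With real data you would replace `make_blobs()` with your own rows (scaled so that no single field dominates the distance) and repeat the split with several seeds, since a single k-means run depends on its random initialization.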






