Accuracy after selftraining didn't change The Next CEO of Stack Overflow2019 Community Moderator ElectionHow to increase accuracy of classifiers?Improving Naive Bayes accuracy for text classificationWhat is a reasonable way to compare the improvement in accuracy?Accuracy value constant even after different runsIs this a good classified model based confusion matrix and classification report?Max 75% accuracy! help!Coursera ML - Does the choice of optimization algorithm affect the accuracy of multiclass logistic regression?I got 100% accuracy on my test set,is there something wrong?Validation accuracy is always close to training accuracyHow to get probability of classification

A Man With a Stainless Steel Endoskeleton (like The Terminator) Fighting Cloaked Aliens Only He Can See

Plot of histogram similar to output from @risk

Which one is the true statement?

Is wanting to ask what to write an indication that you need to change your story?

How did people program for Consoles with multiple CPUs?

Should I tutor a student who I know has cheated on their homework?

Does soap repel water?

I believe this to be a fraud - hired, then asked to cash check and send cash as Bitcoin

Decomposition of product of two Plucker coordinates

Is there a way to save my career from absolute disaster?

Newlines in BSD sed vs gsed

unclear about Dynamic Binding

Is there a difference between "Fahrstuhl" and "Aufzug"

Is it possible to replace duplicates of a character with one character using tr

Measuring resistivity of dielectric liquid

Why is my new battery behaving weirdly?

What benefits would be gained by using human laborers instead of drones in deep sea mining?

How to get from Geneva Airport to Metabief?

Running a General Election and the European Elections together

Does falling count as part of my movement?

Is this "being" usage is essential?

Can we say or write : "No, it'sn't"?

Why doesn't UK go for the same deal Japan has with EU to resolve Brexit?

Is there a way to bypass a component in series in a circuit if that component fails?



Accuracy after selftraining didn't change



The Next CEO of Stack Overflow
2019 Community Moderator ElectionHow to increase accuracy of classifiers?Improving Naive Bayes accuracy for text classificationWhat is a reasonable way to compare the improvement in accuracy?Accuracy value constant even after different runsIs this a good classified model based confusion matrix and classification report?Max 75% accuracy! help!Coursera ML - Does the choice of optimization algorithm affect the accuracy of multiclass logistic regression?I got 100% accuracy on my test set,is there something wrong?Validation accuracy is always close to training accuracyHow to get probability of classification










1












$begingroup$


I used Decisiton Tree Classifier which I trained with 50 000 samples. I have also set with unlabeled samples, so I decided to use self training algorithm. Unlabeled set has 10 000 samples. I would like to ask if it is normal, that after retrainig model with these 10 000 unlabeled samples, accuracy didn't chaned as well as confusion matrix has same values? I expected some changes (better or worse prediction). Thank you in advance.










share|improve this question









$endgroup$
















    1












    $begingroup$


    I used Decisiton Tree Classifier which I trained with 50 000 samples. I have also set with unlabeled samples, so I decided to use self training algorithm. Unlabeled set has 10 000 samples. I would like to ask if it is normal, that after retrainig model with these 10 000 unlabeled samples, accuracy didn't chaned as well as confusion matrix has same values? I expected some changes (better or worse prediction). Thank you in advance.










    share|improve this question









    $endgroup$














      1












      1








      1





      $begingroup$


      I used Decisiton Tree Classifier which I trained with 50 000 samples. I have also set with unlabeled samples, so I decided to use self training algorithm. Unlabeled set has 10 000 samples. I would like to ask if it is normal, that after retrainig model with these 10 000 unlabeled samples, accuracy didn't chaned as well as confusion matrix has same values? I expected some changes (better or worse prediction). Thank you in advance.










      share|improve this question









      $endgroup$




      I used Decisiton Tree Classifier which I trained with 50 000 samples. I have also set with unlabeled samples, so I decided to use self training algorithm. Unlabeled set has 10 000 samples. I would like to ask if it is normal, that after retrainig model with these 10 000 unlabeled samples, accuracy didn't chaned as well as confusion matrix has same values? I expected some changes (better or worse prediction). Thank you in advance.







      accuracy semi-supervised-learning






      share|improve this question













      share|improve this question











      share|improve this question




      share|improve this question










      asked Mar 23 at 13:10









      SMI9SMI9

      63




      63




















          1 Answer
          1






          active

          oldest

          votes


















          0












          $begingroup$

          Well, that is a bit of a turn down but: your model has limitations.



          If the 50.000 data forms a complete set for your problem that means that more data won't be needed or helpful.



          What do I mean by complete set is: there are enough samples to form a full rank correlation matrix in your feature space. So from your samples you can get a set that can generate all other samples in your feature space by linear combination.



          Also, while your data might represent everything a decision three needs to know for classificating your data in the generated feature space, there may be other feature spaces that benefit from the extra data (such as deeper trees or other models)



          You might try helping you decision tree by providing a few normalizations for data and feature engineering






          share|improve this answer








          New contributor




          Pedro Henrique Monforte is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
          Check out our Code of Conduct.






          $endgroup$













            Your Answer





            StackExchange.ifUsing("editor", function ()
            return StackExchange.using("mathjaxEditing", function ()
            StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
            StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
            );
            );
            , "mathjax-editing");

            StackExchange.ready(function()
            var channelOptions =
            tags: "".split(" "),
            id: "557"
            ;
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function()
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled)
            StackExchange.using("snippets", function()
            createEditor();
            );

            else
            createEditor();

            );

            function createEditor()
            StackExchange.prepareEditor(
            heartbeatType: 'answer',
            autoActivateHeartbeat: false,
            convertImagesToLinks: false,
            noModals: true,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: null,
            bindNavPrevention: true,
            postfix: "",
            imageUploader:
            brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
            contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
            allowUrls: true
            ,
            onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            );



            );













            draft saved

            draft discarded


















            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47845%2faccuracy-after-selftraining-didnt-change%23new-answer', 'question_page');

            );

            Post as a guest















            Required, but never shown

























            1 Answer
            1






            active

            oldest

            votes








            1 Answer
            1






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes









            0












            $begingroup$

            Well, that is a bit of a turn down but: your model has limitations.



            If the 50.000 data forms a complete set for your problem that means that more data won't be needed or helpful.



            What do I mean by complete set is: there are enough samples to form a full rank correlation matrix in your feature space. So from your samples you can get a set that can generate all other samples in your feature space by linear combination.



            Also, while your data might represent everything a decision three needs to know for classificating your data in the generated feature space, there may be other feature spaces that benefit from the extra data (such as deeper trees or other models)



            You might try helping you decision tree by providing a few normalizations for data and feature engineering






            share|improve this answer








            New contributor




            Pedro Henrique Monforte is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
            Check out our Code of Conduct.






            $endgroup$

















              0












              $begingroup$

              Well, that is a bit of a turn down but: your model has limitations.



              If the 50.000 data forms a complete set for your problem that means that more data won't be needed or helpful.



              What do I mean by complete set is: there are enough samples to form a full rank correlation matrix in your feature space. So from your samples you can get a set that can generate all other samples in your feature space by linear combination.



              Also, while your data might represent everything a decision three needs to know for classificating your data in the generated feature space, there may be other feature spaces that benefit from the extra data (such as deeper trees or other models)



              You might try helping you decision tree by providing a few normalizations for data and feature engineering






              share|improve this answer








              New contributor




              Pedro Henrique Monforte is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
              Check out our Code of Conduct.






              $endgroup$















                0












                0








                0





                $begingroup$

                Well, that is a bit of a turn down but: your model has limitations.



                If the 50.000 data forms a complete set for your problem that means that more data won't be needed or helpful.



                What do I mean by complete set is: there are enough samples to form a full rank correlation matrix in your feature space. So from your samples you can get a set that can generate all other samples in your feature space by linear combination.



                Also, while your data might represent everything a decision three needs to know for classificating your data in the generated feature space, there may be other feature spaces that benefit from the extra data (such as deeper trees or other models)



                You might try helping you decision tree by providing a few normalizations for data and feature engineering






                share|improve this answer








                New contributor




                Pedro Henrique Monforte is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                Check out our Code of Conduct.






                $endgroup$



                Well, that is a bit of a turn down but: your model has limitations.



                If the 50.000 data forms a complete set for your problem that means that more data won't be needed or helpful.



                What do I mean by complete set is: there are enough samples to form a full rank correlation matrix in your feature space. So from your samples you can get a set that can generate all other samples in your feature space by linear combination.



                Also, while your data might represent everything a decision three needs to know for classificating your data in the generated feature space, there may be other feature spaces that benefit from the extra data (such as deeper trees or other models)



                You might try helping you decision tree by providing a few normalizations for data and feature engineering







                share|improve this answer








                New contributor




                Pedro Henrique Monforte is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                Check out our Code of Conduct.









                share|improve this answer



                share|improve this answer






                New contributor




                Pedro Henrique Monforte is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                Check out our Code of Conduct.









                answered Mar 24 at 4:17









                Pedro Henrique MonfortePedro Henrique Monforte

                885




                885




                New contributor




                Pedro Henrique Monforte is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                Check out our Code of Conduct.





                New contributor





                Pedro Henrique Monforte is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                Check out our Code of Conduct.






                Pedro Henrique Monforte is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                Check out our Code of Conduct.



























                    draft saved

                    draft discarded
















































                    Thanks for contributing an answer to Data Science Stack Exchange!


                    • Please be sure to answer the question. Provide details and share your research!

                    But avoid


                    • Asking for help, clarification, or responding to other answers.

                    • Making statements based on opinion; back them up with references or personal experience.

                    Use MathJax to format equations. MathJax reference.


                    To learn more, see our tips on writing great answers.




                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function ()
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47845%2faccuracy-after-selftraining-didnt-change%23new-answer', 'question_page');

                    );

                    Post as a guest















                    Required, but never shown





















































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown

































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown







                    Popular posts from this blog

                    Adding axes to figuresAdding axes labels to LaTeX figuresLaTeX equivalent of ConTeXt buffersRotate a node but not its content: the case of the ellipse decorationHow to define the default vertical distance between nodes?TikZ scaling graphic and adjust node position and keep font sizeNumerical conditional within tikz keys?adding axes to shapesAlign axes across subfiguresAdding figures with a certain orderLine up nested tikz enviroments or how to get rid of themAdding axes labels to LaTeX figures

                    Luettelo Yhdysvaltain laivaston lentotukialuksista Lähteet | Navigointivalikko

                    Gary (muusikko) Sisällysluettelo Historia | Rockin' High | Lähteet | Aiheesta muualla | NavigointivalikkoInfobox OKTuomas "Gary" Keskinen Ancaran kitaristiksiProjekti Rockin' High