Accuracy after selftraining didn't change The Next CEO of Stack Overflow2019 Community Moderator ElectionHow to increase accuracy of classifiers?Improving Naive Bayes accuracy for text classificationWhat is a reasonable way to compare the improvement in accuracy?Accuracy value constant even after different runsIs this a good classified model based confusion matrix and classification report?Max 75% accuracy! help!Coursera ML - Does the choice of optimization algorithm affect the accuracy of multiclass logistic regression?I got 100% accuracy on my test set,is there something wrong?Validation accuracy is always close to training accuracyHow to get probability of classification
Multi tool use
A Man With a Stainless Steel Endoskeleton (like The Terminator) Fighting Cloaked Aliens Only He Can See
Plot of histogram similar to output from @risk
Which one is the true statement?
Is wanting to ask what to write an indication that you need to change your story?
How did people program for Consoles with multiple CPUs?
Should I tutor a student who I know has cheated on their homework?
Does soap repel water?
I believe this to be a fraud - hired, then asked to cash check and send cash as Bitcoin
Decomposition of product of two Plucker coordinates
Is there a way to save my career from absolute disaster?
Newlines in BSD sed vs gsed
unclear about Dynamic Binding
Is there a difference between "Fahrstuhl" and "Aufzug"
Is it possible to replace duplicates of a character with one character using tr
Measuring resistivity of dielectric liquid
Why is my new battery behaving weirdly?
What benefits would be gained by using human laborers instead of drones in deep sea mining?
How to get from Geneva Airport to Metabief?
Running a General Election and the European Elections together
Does falling count as part of my movement?
Is this "being" usage is essential?
Can we say or write : "No, it'sn't"?
Why doesn't UK go for the same deal Japan has with EU to resolve Brexit?
Is there a way to bypass a component in series in a circuit if that component fails?
Accuracy after selftraining didn't change
The Next CEO of Stack Overflow2019 Community Moderator ElectionHow to increase accuracy of classifiers?Improving Naive Bayes accuracy for text classificationWhat is a reasonable way to compare the improvement in accuracy?Accuracy value constant even after different runsIs this a good classified model based confusion matrix and classification report?Max 75% accuracy! help!Coursera ML - Does the choice of optimization algorithm affect the accuracy of multiclass logistic regression?I got 100% accuracy on my test set,is there something wrong?Validation accuracy is always close to training accuracyHow to get probability of classification
$begingroup$
I used Decisiton Tree Classifier which I trained with 50 000 samples. I have also set with unlabeled samples, so I decided to use self training algorithm. Unlabeled set has 10 000 samples. I would like to ask if it is normal, that after retrainig model with these 10 000 unlabeled samples, accuracy didn't chaned as well as confusion matrix has same values? I expected some changes (better or worse prediction). Thank you in advance.
accuracy semi-supervised-learning
$endgroup$
add a comment |
$begingroup$
I used Decisiton Tree Classifier which I trained with 50 000 samples. I have also set with unlabeled samples, so I decided to use self training algorithm. Unlabeled set has 10 000 samples. I would like to ask if it is normal, that after retrainig model with these 10 000 unlabeled samples, accuracy didn't chaned as well as confusion matrix has same values? I expected some changes (better or worse prediction). Thank you in advance.
accuracy semi-supervised-learning
$endgroup$
add a comment |
$begingroup$
I used Decisiton Tree Classifier which I trained with 50 000 samples. I have also set with unlabeled samples, so I decided to use self training algorithm. Unlabeled set has 10 000 samples. I would like to ask if it is normal, that after retrainig model with these 10 000 unlabeled samples, accuracy didn't chaned as well as confusion matrix has same values? I expected some changes (better or worse prediction). Thank you in advance.
accuracy semi-supervised-learning
$endgroup$
I used Decisiton Tree Classifier which I trained with 50 000 samples. I have also set with unlabeled samples, so I decided to use self training algorithm. Unlabeled set has 10 000 samples. I would like to ask if it is normal, that after retrainig model with these 10 000 unlabeled samples, accuracy didn't chaned as well as confusion matrix has same values? I expected some changes (better or worse prediction). Thank you in advance.
accuracy semi-supervised-learning
accuracy semi-supervised-learning
asked Mar 23 at 13:10
SMI9SMI9
63
63
add a comment |
add a comment |
1 Answer
1
active
oldest
votes
$begingroup$
Well, that is a bit of a turn down but: your model has limitations.
If the 50.000 data forms a complete set for your problem that means that more data won't be needed or helpful.
What do I mean by complete set is: there are enough samples to form a full rank correlation matrix in your feature space. So from your samples you can get a set that can generate all other samples in your feature space by linear combination.
Also, while your data might represent everything a decision three needs to know for classificating your data in the generated feature space, there may be other feature spaces that benefit from the extra data (such as deeper trees or other models)
You might try helping you decision tree by providing a few normalizations for data and feature engineering
New contributor
$endgroup$
add a comment |
Your Answer
StackExchange.ifUsing("editor", function ()
return StackExchange.using("mathjaxEditing", function ()
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
);
);
, "mathjax-editing");
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "557"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47845%2faccuracy-after-selftraining-didnt-change%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
$begingroup$
Well, that is a bit of a turn down but: your model has limitations.
If the 50.000 data forms a complete set for your problem that means that more data won't be needed or helpful.
What do I mean by complete set is: there are enough samples to form a full rank correlation matrix in your feature space. So from your samples you can get a set that can generate all other samples in your feature space by linear combination.
Also, while your data might represent everything a decision three needs to know for classificating your data in the generated feature space, there may be other feature spaces that benefit from the extra data (such as deeper trees or other models)
You might try helping you decision tree by providing a few normalizations for data and feature engineering
New contributor
$endgroup$
add a comment |
$begingroup$
Well, that is a bit of a turn down but: your model has limitations.
If the 50.000 data forms a complete set for your problem that means that more data won't be needed or helpful.
What do I mean by complete set is: there are enough samples to form a full rank correlation matrix in your feature space. So from your samples you can get a set that can generate all other samples in your feature space by linear combination.
Also, while your data might represent everything a decision three needs to know for classificating your data in the generated feature space, there may be other feature spaces that benefit from the extra data (such as deeper trees or other models)
You might try helping you decision tree by providing a few normalizations for data and feature engineering
New contributor
$endgroup$
add a comment |
$begingroup$
Well, that is a bit of a turn down but: your model has limitations.
If the 50.000 data forms a complete set for your problem that means that more data won't be needed or helpful.
What do I mean by complete set is: there are enough samples to form a full rank correlation matrix in your feature space. So from your samples you can get a set that can generate all other samples in your feature space by linear combination.
Also, while your data might represent everything a decision three needs to know for classificating your data in the generated feature space, there may be other feature spaces that benefit from the extra data (such as deeper trees or other models)
You might try helping you decision tree by providing a few normalizations for data and feature engineering
New contributor
$endgroup$
Well, that is a bit of a turn down but: your model has limitations.
If the 50.000 data forms a complete set for your problem that means that more data won't be needed or helpful.
What do I mean by complete set is: there are enough samples to form a full rank correlation matrix in your feature space. So from your samples you can get a set that can generate all other samples in your feature space by linear combination.
Also, while your data might represent everything a decision three needs to know for classificating your data in the generated feature space, there may be other feature spaces that benefit from the extra data (such as deeper trees or other models)
You might try helping you decision tree by providing a few normalizations for data and feature engineering
New contributor
New contributor
answered Mar 24 at 4:17
Pedro Henrique MonfortePedro Henrique Monforte
885
885
New contributor
New contributor
add a comment |
add a comment |
Thanks for contributing an answer to Data Science Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47845%2faccuracy-after-selftraining-didnt-change%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
HKpNlUXo2UMXacd57n u4,W,D2J6eJl6W36A dtC b,vzCNwNP5OeylJ1lbykrgKg,a,UqBiIcbB4 AohYNi