intercept correction in deep learning The 2019 Stack Overflow Developer Survey Results Are In Announcing the arrival of Valued Associate #679: Cesar Manara Planned maintenance scheduled April 17/18, 2019 at 11:00UTC (8:00pm US/Eastern) 2019 Moderator Election Q&A - Questionnaire 2019 Community Moderator Election ResultsWhy real-world output of my classifier has similar label ratio to training data?Deep learning basicsPolicy network AlphaGo and transferring to other domainsMachine Learning vs Deep LearningBinary Classifier making only one predictionCreating an easy but not trivial datasetSeveral fundamental questions about CNNHow to add non-image features along side images as the input of CNNsRunning two stage classification to predict relatively rare event?Poor performance of SVM after training for rare eventsconvLSTM : how to structure input data

Can each chord in a progression create its own key?

How to politely respond to generic emails requesting a PhD/job in my lab? Without wasting too much time

What information about me do stores get via my credit card?

Word for: a synonym with a positive connotation?

Using `min_active_rowversion` for global temporary tables

Nested ellipses in tikzpicture: Chomsky hierarchy

Word to describe a time interval

Why can't devices on different VLANs, but on the same subnet, communicate?

Single author papers against my advisor's will?

Windows 10: How to Lock (not sleep) laptop on lid close?

Am I ethically obligated to go into work on an off day if the reason is sudden?

How to read αἱμύλιος or when to aspirate

Categorical vs continuous feature selection/engineering

Button changing its text & action. Good or terrible?

How to substitute curly brackets with round brackets in a grid of list

Didn't get enough time to take a Coding Test - what to do now?

What can I do if neighbor is blocking my solar panels intentionally?

What happens to a Warlock's expended Spell Slots when they gain a Level?

How to handle characters who are more educated than the author?

Is every episode of "Where are my Pants?" identical?

Is an up-to-date browser secure on an out-of-date OS?

Why doesn't shell automatically fix "useless use of cat"?

Are spiders unable to hurt humans, especially very small spiders?

Pretty sure I'm over complicating my loops but unsure how to simplify



intercept correction in deep learning



The 2019 Stack Overflow Developer Survey Results Are In
Announcing the arrival of Valued Associate #679: Cesar Manara
Planned maintenance scheduled April 17/18, 2019 at 11:00UTC (8:00pm US/Eastern)
2019 Moderator Election Q&A - Questionnaire
2019 Community Moderator Election ResultsWhy real-world output of my classifier has similar label ratio to training data?Deep learning basicsPolicy network AlphaGo and transferring to other domainsMachine Learning vs Deep LearningBinary Classifier making only one predictionCreating an easy but not trivial datasetSeveral fundamental questions about CNNHow to add non-image features along side images as the input of CNNsRunning two stage classification to predict relatively rare event?Poor performance of SVM after training for rare eventsconvLSTM : how to structure input data










2












$begingroup$


Say I have an imbalanced data set, and I decided to over/undersample it during model training. I would then like to predict on new records but using the original, true imbalance in the classes as an apriori for the model.



i.e my classes are distributed in a ratio of 1:100, I undersample to 1:2, but would like the model to know that class A is very rare so be careful in predicting it.



For logistic regression I am familiar with a method of intercept correction, detailed here:



My question is: what if I use deep leaning instead of logistic regression?
My reasoning is that since the last layer of the NN is basically the same a logistic regression, I can use the same method for the intercept of this layer.
Can you think of any objections? any reason this doesn't make sense? If so, how would you go about correcting the model?



Also, is there a generalized method for intercept correction for a case when I have more than 2 classes?










share|improve this question











$endgroup$











  • $begingroup$
    Welcome to this site! I agree with you, since the proposed correction does not make any assumption about the feature space $boldsymbolx$, therefore, all layers from input $boldsymbolx$ to the last layer, act as a transformation that produces a new feature space $boldsymbolx'$, and the rest is the same.
    $endgroup$
    – Esmailian
    Apr 1 at 17:21
















2












$begingroup$


Say I have an imbalanced data set, and I decided to over/undersample it during model training. I would then like to predict on new records but using the original, true imbalance in the classes as an apriori for the model.



i.e my classes are distributed in a ratio of 1:100, I undersample to 1:2, but would like the model to know that class A is very rare so be careful in predicting it.



For logistic regression I am familiar with a method of intercept correction, detailed here:



My question is: what if I use deep leaning instead of logistic regression?
My reasoning is that since the last layer of the NN is basically the same a logistic regression, I can use the same method for the intercept of this layer.
Can you think of any objections? any reason this doesn't make sense? If so, how would you go about correcting the model?



Also, is there a generalized method for intercept correction for a case when I have more than 2 classes?










share|improve this question











$endgroup$











  • $begingroup$
    Welcome to this site! I agree with you, since the proposed correction does not make any assumption about the feature space $boldsymbolx$, therefore, all layers from input $boldsymbolx$ to the last layer, act as a transformation that produces a new feature space $boldsymbolx'$, and the rest is the same.
    $endgroup$
    – Esmailian
    Apr 1 at 17:21














2












2








2


1



$begingroup$


Say I have an imbalanced data set, and I decided to over/undersample it during model training. I would then like to predict on new records but using the original, true imbalance in the classes as an apriori for the model.



i.e my classes are distributed in a ratio of 1:100, I undersample to 1:2, but would like the model to know that class A is very rare so be careful in predicting it.



For logistic regression I am familiar with a method of intercept correction, detailed here:



My question is: what if I use deep leaning instead of logistic regression?
My reasoning is that since the last layer of the NN is basically the same a logistic regression, I can use the same method for the intercept of this layer.
Can you think of any objections? any reason this doesn't make sense? If so, how would you go about correcting the model?



Also, is there a generalized method for intercept correction for a case when I have more than 2 classes?










share|improve this question











$endgroup$




Say I have an imbalanced data set, and I decided to over/undersample it during model training. I would then like to predict on new records but using the original, true imbalance in the classes as an apriori for the model.



i.e my classes are distributed in a ratio of 1:100, I undersample to 1:2, but would like the model to know that class A is very rare so be careful in predicting it.



For logistic regression I am familiar with a method of intercept correction, detailed here:



My question is: what if I use deep leaning instead of logistic regression?
My reasoning is that since the last layer of the NN is basically the same a logistic regression, I can use the same method for the intercept of this layer.
Can you think of any objections? any reason this doesn't make sense? If so, how would you go about correcting the model?



Also, is there a generalized method for intercept correction for a case when I have more than 2 classes?







deep-learning class-imbalance






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Apr 1 at 19:12









Ethan

701625




701625










asked Apr 1 at 17:08









Oren MatarOren Matar

111




111











  • $begingroup$
    Welcome to this site! I agree with you, since the proposed correction does not make any assumption about the feature space $boldsymbolx$, therefore, all layers from input $boldsymbolx$ to the last layer, act as a transformation that produces a new feature space $boldsymbolx'$, and the rest is the same.
    $endgroup$
    – Esmailian
    Apr 1 at 17:21

















  • $begingroup$
    Welcome to this site! I agree with you, since the proposed correction does not make any assumption about the feature space $boldsymbolx$, therefore, all layers from input $boldsymbolx$ to the last layer, act as a transformation that produces a new feature space $boldsymbolx'$, and the rest is the same.
    $endgroup$
    – Esmailian
    Apr 1 at 17:21
















$begingroup$
Welcome to this site! I agree with you, since the proposed correction does not make any assumption about the feature space $boldsymbolx$, therefore, all layers from input $boldsymbolx$ to the last layer, act as a transformation that produces a new feature space $boldsymbolx'$, and the rest is the same.
$endgroup$
– Esmailian
Apr 1 at 17:21





$begingroup$
Welcome to this site! I agree with you, since the proposed correction does not make any assumption about the feature space $boldsymbolx$, therefore, all layers from input $boldsymbolx$ to the last layer, act as a transformation that produces a new feature space $boldsymbolx'$, and the rest is the same.
$endgroup$
– Esmailian
Apr 1 at 17:21











0






active

oldest

votes












Your Answer








StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "557"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);



);













draft saved

draft discarded


















StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f48367%2fintercept-correction-in-deep-learning%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown

























0






active

oldest

votes








0






active

oldest

votes









active

oldest

votes






active

oldest

votes















draft saved

draft discarded
















































Thanks for contributing an answer to Data Science Stack Exchange!


  • Please be sure to answer the question. Provide details and share your research!

But avoid


  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.

Use MathJax to format equations. MathJax reference.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f48367%2fintercept-correction-in-deep-learning%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Marja Vauras Lähteet | Aiheesta muualla | NavigointivalikkoMarja Vauras Turun yliopiston tutkimusportaalissaInfobox OKSuomalaisen Tiedeakatemian varsinaiset jäsenetKasvatustieteiden tiedekunnan dekaanit ja muu johtoMarja VaurasKoulutusvienti on kestävyys- ja ketteryyslaji (2.5.2017)laajentamallaWorldCat Identities0000 0001 0855 9405n86069603utb201588738523620927

Which is better: GPT or RelGAN for text generation?2019 Community Moderator ElectionWhat is the difference between TextGAN and LM for text generation?GANs (generative adversarial networks) possible for text as well?Generator loss not decreasing- text to image synthesisChoosing a right algorithm for template-based text generationHow should I format input and output for text generation with LSTMsGumbel Softmax vs Vanilla Softmax for GAN trainingWhich neural network to choose for classification from text/speech?NLP text autoencoder that generates text in poetic meterWhat is the interpretation of the expectation notation in the GAN formulation?What is the difference between TextGAN and LM for text generation?How to prepare the data for text generation task

Is this part of the description of the Archfey warlock's Misty Escape feature redundant?When is entropic ward considered “used”?How does the reaction timing work for Wrath of the Storm? Can it potentially prevent the damage from the triggering attack?Does the Dark Arts Archlich warlock patrons's Arcane Invisibility activate every time you cast a level 1+ spell?When attacking while invisible, when exactly does invisibility break?Can I cast Hellish Rebuke on my turn?Do I have to “pre-cast” a reaction spell in order for it to be triggered?What happens if a Player Misty Escapes into an Invisible CreatureCan a reaction interrupt multiattack?Does the Fiend-patron warlock's Hurl Through Hell feature dispel effects that require the target to be on the same plane as the caster?What are you allowed to do while using the Warlock's Eldritch Master feature?