Choosing a suitable learning rate based on validation or testing accuracy?

I trained a neural network with several learning rates, ranging from 0.00001 to 0.1, and recorded the testing and validation accuracy for each. The results are below. Each learning rate was trained for 50 epochs; I noted the validation accuracy at the last epoch, while the training accuracy was computed throughout the process.

Learning rate: 0.00001
Testing accuracy: 0.5850
Validation accuracy at final epoch: 0.5950

Learning rate: 0.0001
Testing accuracy: 0.6550
Validation accuracy at final epoch: 0.6400

Learning rate: 0.001
Testing accuracy: 0.6350
Validation accuracy at final epoch: 0.6900

Learning rate: 0.01
Testing accuracy: 0.6650
Validation accuracy at final epoch: 0.6700

Learning rate: 0.1
Testing accuracy: 0.2500
Validation accuracy at final epoch: 0.2100

How do testing and validation accuracy influence which learning rate is better? Does a higher validation accuracy determine the most suitable learning rate for the model?

In other words, is it correct that 0.001 is the most suitable learning rate, since it has the highest validation accuracy at the last epoch?























      machine-learning neural-network deep-learning






asked Mar 31 at 14:30 by Maxxx (edited Mar 31 at 14:53)




















          1 Answer












You cannot select a hyperparameter based on test accuracy: the moment you do, the test set has influenced the final model, so it is effectively acting as a validation set. Model selection is therefore always based on validation accuracy.

By that criterion, the best learning rate here is 0.001, with the highest validation accuracy of 0.6900. The testing accuracy is deliberately ignored when making this choice; it is only reported afterwards for the selected model.

Also keep in mind that a learning rate that is a loser at epoch 50 might be a winner at epoch 200. In other words, slower convergence may eventually lead to higher accuracy, so the number of training epochs is worth considering too.
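As a minimal sketch of that selection rule, here is the comparison done in Python using only the numbers reported in the question. The dictionary layout and the function name `select_learning_rate` are illustrative assumptions, not something from the original post.

```python
# Results copied from the question:
# learning rate -> (testing accuracy, validation accuracy at epoch 50).
results = {
    0.00001: (0.5850, 0.5950),
    0.0001: (0.6550, 0.6400),
    0.001: (0.6350, 0.6900),
    0.01: (0.6650, 0.6700),
    0.1: (0.2500, 0.2100),
}


def select_learning_rate(results):
    """Return the learning rate with the highest validation accuracy.

    Testing accuracy (index 0 of each tuple) is deliberately not
    consulted during selection. The function name is hypothetical.
    """
    return max(results, key=lambda lr: results[lr][1])


best_lr = select_learning_rate(results)
test_acc, val_acc = results[best_lr]

print(f"selected learning rate: {best_lr}")
print(f"validation accuracy used for selection: {val_acc:.4f}")
print(f"testing accuracy reported afterwards: {test_acc:.4f}")
```

Note that selecting on testing accuracy instead would pick 0.01 (testing accuracy 0.6650), which is exactly the situation described above: the test set would then have influenced the choice and would no longer be an unbiased check.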














































answered Mar 31 at 18:20 by Esmailian (edited Mar 31 at 18:26)


























