


Incrementally train BERT with a minimal set of QnA records to get improved results






We are using Google BERT for question answering. We fine-tuned BERT on the SQuAD v1.1 training set (https://github.com/google-research/bert , https://rajpurkar.github.io/SQuAD-explorer/dataset/train-v1.1.json).
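
For context, custom question/answer pairs generally need to follow the same JSON layout as the SQuAD v1.1 training file before they can be fed to the same fine-tuning script. The following is a minimal sketch of building one such record, assuming the v1.1 layout (`data` → `paragraphs` → `qas` → `answers` with `text` and `answer_start`). The helper name `make_squad_record`, the example text, and the id `custom-0001` are hypothetical and only for illustration.

```python
import json

def make_squad_record(context, question, answer_text, qas_id, title="custom"):
    """Build one SQuAD v1.1-style article holding a single question."""
    # answer_start is the character offset of the answer inside the context.
    answer_start = context.find(answer_text)
    if answer_start < 0:
        raise ValueError("answer text must appear verbatim in the context")
    return {
        "title": title,
        "paragraphs": [{
            "context": context,
            "qas": [{
                "id": qas_id,
                "question": question,
                "answers": [{"text": answer_text, "answer_start": answer_start}],
            }],
        }],
    }

# Hypothetical example data, for illustration only.
context = "The warranty period for the device is 24 months from the date of purchase."
record = make_squad_record(
    context, "How long is the warranty period?", "24 months", "custom-0001"
)
dataset = {"version": "1.1", "data": [record]}
print(json.dumps(dataset, indent=2))
```

The helper raises an error when the answer string does not occur verbatim in the context, since a wrong `answer_start` offset would give the model a wrong training target.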



Fine-tuning produced new checkpoints, and BERT now gives good answers to most of the questions we ask about our text documents. However, it answers some questions incorrectly, so we tried to fine-tune it further using our own questions and their known answers from our text documents. We continued training from the last generated checkpoint and obtained a new checkpoint.
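
One thing that may be worth checking when continuing from a checkpoint with only a handful of records is how many optimizer updates the extra run actually performs. The sketch below is plain arithmetic under an assumption: that the training script derives its step count as examples / batch size × epochs, truncated to an integer, with a warmup fraction on top. That formula, the function name `planned_steps`, and the sizes used are assumptions for illustration and should be checked against the script actually used.

```python
def planned_steps(num_examples, batch_size, num_epochs, warmup_proportion=0.1):
    """Return (total optimizer steps, warmup steps) under the assumed formula."""
    total = int(num_examples / batch_size * num_epochs)
    warmup = int(total * warmup_proportion)
    return total, warmup

# Hypothetical sizes: a SQuAD-sized set versus a made-up custom set of 20 records.
for name, n in [("squad-sized set", 88000), ("small custom set", 20)]:
    total, warmup = planned_steps(n, batch_size=32, num_epochs=3.0)
    print(f"{name}: {total} steps, {warmup} warmup steps")
```

Under these assumptions a very small custom set yields only one or two updates, which could be too few to change a confidently wrong prediction.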



With the new checkpoint, the answer to the same question was not corrected. Previously BERT gave the wrong answer with 99% confidence; now it gives the same wrong answer with 95% confidence.
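
To illustrate how a drop from 99% to 95% confidence can happen without the top answer changing, here is a small sketch. It assumes the reported confidence is a softmax over the scores of the candidate answer spans. The scores below are made up for illustration and are not taken from our model.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical candidate-span scores: index 0 is the wrong span,
# index 1 is the span we want the model to pick.
before = [9.0, 4.0, 3.0]
after = [9.0, 6.0, 3.0]  # the desired span's score rose, but not enough

for label, scores in [("before", before), ("after", after)]:
    probs = softmax(scores)
    best = max(range(len(probs)), key=probs.__getitem__)
    print(label, "top index:", best, "confidence: %.3f" % probs[best])
```

With these made-up scores the wrong span stays on top while its probability falls from about 0.99 to about 0.95, so a small confidence drop may mean the extra training moved the model in the right direction but not far enough.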



Has anyone had the same or a similar experience, and can you suggest how to address it?

The following related issues on the BERT GitHub repository have been unanswered for quite some time:

  • BERT accuracy reduced after providing custom training; the answer is also not correct: https://github.com/google-research/bert/issues/492

  • Unable to incrementally train BERT with custom training: https://github.com/google-research/bert/issues/482

  • Little training has no impact: https://github.com/google-research/bert/issues/481

  • Custom Domain Training: https://github.com/google-research/bert/issues/498






      machine-learning training transformer bert














Asked yesterday by user10557045; edited yesterday.



















