My question is about dependency between hidden states for Back Propagation Through Time in RNN2019 Community Moderator ElectionBack-propagation through max pooling layersBasic backpropagation questionHow/What to initialize the hidden states in RNN sequence-to-sequence models?A good reference for the back propagation algorithm?Back Propagation in time for tf.nn.dynamic_rnn for sequential input (from batch)Alternative method for RNN backpropagation through timeTypes of Recurrent Neural NetworksLoss function for an RNN used for binary classificationQuestions about Neural Network training (back propagation) in the book PRML (Pattern Recognition and Machine Learning)

Is a tag line useful on a cover?

tikz: show 0 at the axis origin

I’m planning on buying a laser printer but concerned about the life cycle of toner in the machine

Finding angle with pure Geometry.

Dragon forelimb placement

Modeling an IP Address

Why not use SQL instead of GraphQL?

How do we improve the relationship with a client software team that performs poorly and is becoming less collaborative?

Theorems that impeded progress

Why do I get two different answers for this counting problem?

What does it mean to describe someone as a butt steak?

Can an x86 CPU running in real mode be considered to be basically an 8086 CPU?

Is it important to consider tone, melody, and musical form while writing a song?

Is a conference paper whose proceedings will be published in IEEE Xplore counted as a publication?

Watching something be written to a file live with tail

What are these boxed doors outside store fronts in New York?

Fencing style for blades that can attack from a distance

LaTeX closing $ signs makes cursor jump

How to find program name(s) of an installed package?

String Manipulation Interpreter

What do the dots in this tr command do: tr .............A-Z A-ZA-Z <<< "JVPQBOV" (with 13 dots)

How can I prevent hyper evolved versions of regular creatures from wiping out their cousins?

Is it possible to do 50 km distance without any previous training?

Arthur Somervell: 1000 Exercises - Meaning of this notation



My question is about dependency between hidden states for Back Propagation Through Time in RNN



2019 Community Moderator ElectionBack-propagation through max pooling layersBasic backpropagation questionHow/What to initialize the hidden states in RNN sequence-to-sequence models?A good reference for the back propagation algorithm?Back Propagation in time for tf.nn.dynamic_rnn for sequential input (from batch)Alternative method for RNN backpropagation through timeTypes of Recurrent Neural NetworksLoss function for an RNN used for binary classificationQuestions about Neural Network training (back propagation) in the book PRML (Pattern Recognition and Machine Learning)










0












$begingroup$


In one video lecture, professor Ali Ghodsi of University of Waterloo says that the first node of S(t)(hidden state of RNN at time t) has an effect only on the first node of S(t+1)(hidden state of RNN at time t+1), the second node of S(t) has an effect only on the second node of S(t+1), and so on. I do not see how that is correct cause the weight matrix W connecting the state at one time step to the state at the next time step is (p,p) dimensional which means each node of S(t) is connected to every node of S(t+1). What I have come to understand from various blogs on the internet about RNN is that nodes in the hidden state of some time step are connected to every node in the hidden state corresponding to the next time step which is contradictory to what professor Ghodsi is stating in the lecture.



What I want to clarify is whether the hidden nodes in RNN are usually connected in the same way as fully connected networks throughout the temporal dimension, or are they connected elementwise as the professor says.



I am providing a link to the video https://www.youtube.com/watch?v=AvyhbrQptHk&t=1704s where the claim is made a 43:02.



I had posted this question on StackOverflow, but that doesn't seem to be the correct platform for such topics.



I am new to this site. So, please be a little patient if you do not like the format of the question and let me know how I can improve the quality of my question.



Thanks in advance.










share|improve this question











$endgroup$
















    0












    $begingroup$


    In one video lecture, professor Ali Ghodsi of University of Waterloo says that the first node of S(t)(hidden state of RNN at time t) has an effect only on the first node of S(t+1)(hidden state of RNN at time t+1), the second node of S(t) has an effect only on the second node of S(t+1), and so on. I do not see how that is correct cause the weight matrix W connecting the state at one time step to the state at the next time step is (p,p) dimensional which means each node of S(t) is connected to every node of S(t+1). What I have come to understand from various blogs on the internet about RNN is that nodes in the hidden state of some time step are connected to every node in the hidden state corresponding to the next time step which is contradictory to what professor Ghodsi is stating in the lecture.



    What I want to clarify is whether the hidden nodes in RNN are usually connected in the same way as fully connected networks throughout the temporal dimension, or are they connected elementwise as the professor says.



    I am providing a link to the video https://www.youtube.com/watch?v=AvyhbrQptHk&t=1704s where the claim is made a 43:02.



    I had posted this question on StackOverflow, but that doesn't seem to be the correct platform for such topics.



    I am new to this site. So, please be a little patient if you do not like the format of the question and let me know how I can improve the quality of my question.



    Thanks in advance.










    share|improve this question











    $endgroup$














      0












      0








      0





      $begingroup$


      In one video lecture, professor Ali Ghodsi of University of Waterloo says that the first node of S(t)(hidden state of RNN at time t) has an effect only on the first node of S(t+1)(hidden state of RNN at time t+1), the second node of S(t) has an effect only on the second node of S(t+1), and so on. I do not see how that is correct cause the weight matrix W connecting the state at one time step to the state at the next time step is (p,p) dimensional which means each node of S(t) is connected to every node of S(t+1). What I have come to understand from various blogs on the internet about RNN is that nodes in the hidden state of some time step are connected to every node in the hidden state corresponding to the next time step which is contradictory to what professor Ghodsi is stating in the lecture.



      What I want to clarify is whether the hidden nodes in RNN are usually connected in the same way as fully connected networks throughout the temporal dimension, or are they connected elementwise as the professor says.



      I am providing a link to the video https://www.youtube.com/watch?v=AvyhbrQptHk&t=1704s where the claim is made a 43:02.



      I had posted this question on StackOverflow, but that doesn't seem to be the correct platform for such topics.



      I am new to this site. So, please be a little patient if you do not like the format of the question and let me know how I can improve the quality of my question.



      Thanks in advance.










      share|improve this question











      $endgroup$




      In one video lecture, professor Ali Ghodsi of University of Waterloo says that the first node of S(t)(hidden state of RNN at time t) has an effect only on the first node of S(t+1)(hidden state of RNN at time t+1), the second node of S(t) has an effect only on the second node of S(t+1), and so on. I do not see how that is correct cause the weight matrix W connecting the state at one time step to the state at the next time step is (p,p) dimensional which means each node of S(t) is connected to every node of S(t+1). What I have come to understand from various blogs on the internet about RNN is that nodes in the hidden state of some time step are connected to every node in the hidden state corresponding to the next time step which is contradictory to what professor Ghodsi is stating in the lecture.



      What I want to clarify is whether the hidden nodes in RNN are usually connected in the same way as fully connected networks throughout the temporal dimension, or are they connected elementwise as the professor says.



      I am providing a link to the video https://www.youtube.com/watch?v=AvyhbrQptHk&t=1704s where the claim is made a 43:02.



      I had posted this question on StackOverflow, but that doesn't seem to be the correct platform for such topics.



      I am new to this site. So, please be a little patient if you do not like the format of the question and let me know how I can improve the quality of my question.



      Thanks in advance.







      backpropagation recurrent-neural-net






      share|improve this question















      share|improve this question













      share|improve this question




      share|improve this question








      edited Mar 27 at 10:33







      Sajid Ahmed

















      asked Mar 27 at 10:09









      Sajid AhmedSajid Ahmed

      314




      314




















          0






          active

          oldest

          votes












          Your Answer





          StackExchange.ifUsing("editor", function ()
          return StackExchange.using("mathjaxEditing", function ()
          StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
          StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
          );
          );
          , "mathjax-editing");

          StackExchange.ready(function()
          var channelOptions =
          tags: "".split(" "),
          id: "557"
          ;
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function()
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled)
          StackExchange.using("snippets", function()
          createEditor();
          );

          else
          createEditor();

          );

          function createEditor()
          StackExchange.prepareEditor(
          heartbeatType: 'answer',
          autoActivateHeartbeat: false,
          convertImagesToLinks: false,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: null,
          bindNavPrevention: true,
          postfix: "",
          imageUploader:
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          ,
          onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          );



          );













          draft saved

          draft discarded


















          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f48076%2fmy-question-is-about-dependency-between-hidden-states-for-back-propagation-throu%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown

























          0






          active

          oldest

          votes








          0






          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes















          draft saved

          draft discarded
















































          Thanks for contributing an answer to Data Science Stack Exchange!


          • Please be sure to answer the question. Provide details and share your research!

          But avoid


          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.

          Use MathJax to format equations. MathJax reference.


          To learn more, see our tips on writing great answers.




          draft saved


          draft discarded














          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f48076%2fmy-question-is-about-dependency-between-hidden-states-for-back-propagation-throu%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown





















































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown

































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown







          Popular posts from this blog

          Marja Vauras Lähteet | Aiheesta muualla | NavigointivalikkoMarja Vauras Turun yliopiston tutkimusportaalissaInfobox OKSuomalaisen Tiedeakatemian varsinaiset jäsenetKasvatustieteiden tiedekunnan dekaanit ja muu johtoMarja VaurasKoulutusvienti on kestävyys- ja ketteryyslaji (2.5.2017)laajentamallaWorldCat Identities0000 0001 0855 9405n86069603utb201588738523620927

          Which is better: GPT or RelGAN for text generation?2019 Community Moderator ElectionWhat is the difference between TextGAN and LM for text generation?GANs (generative adversarial networks) possible for text as well?Generator loss not decreasing- text to image synthesisChoosing a right algorithm for template-based text generationHow should I format input and output for text generation with LSTMsGumbel Softmax vs Vanilla Softmax for GAN trainingWhich neural network to choose for classification from text/speech?NLP text autoencoder that generates text in poetic meterWhat is the interpretation of the expectation notation in the GAN formulation?What is the difference between TextGAN and LM for text generation?How to prepare the data for text generation task

          Is this part of the description of the Archfey warlock's Misty Escape feature redundant?When is entropic ward considered “used”?How does the reaction timing work for Wrath of the Storm? Can it potentially prevent the damage from the triggering attack?Does the Dark Arts Archlich warlock patrons's Arcane Invisibility activate every time you cast a level 1+ spell?When attacking while invisible, when exactly does invisibility break?Can I cast Hellish Rebuke on my turn?Do I have to “pre-cast” a reaction spell in order for it to be triggered?What happens if a Player Misty Escapes into an Invisible CreatureCan a reaction interrupt multiattack?Does the Fiend-patron warlock's Hurl Through Hell feature dispel effects that require the target to be on the same plane as the caster?What are you allowed to do while using the Warlock's Eldritch Master feature?