Accurately choosing a model with sequential data



The dataset I'm working with describes journeys. Each journey is broken down into parts, and each part has entry and exit coordinates and entry and exit times. My goal is to predict the final exit coordinates, given the final time (though I'm not 100% sure time matters).



I'm having trouble finding an appropriate model that takes the time features into account. At the moment, rather than predicting the final location as an (x, y) coordinate, I'm using a CatBoost classifier to tell me whether each user's final location will be in a given area or not, but I'm not sure if I'm barking up the wrong tree. One problem is that when I flatten the data (which I feel I need to?), I end up with a lot of NaN values, because each journey consists of a different number of trajectories (up to 20).
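To make the flattening issue concrete, here is a minimal sketch of what is described above. The field names, the toy numbers and the rectangular area are all hypothetical, since the real column names and area definition are not given. It hides the final exit position because that is the prediction target.

```python
import math

MAX_SEGMENTS = 20  # upper bound on trajectories per journey, from the question
# Hypothetical field names; the real dataset's columns are not given.
FIELDS = ("x_entry", "y_entry", "t_entry", "x_exit", "y_exit", "t_exit")
TARGET_FIELDS = ("x_exit", "y_exit")


def flatten_journey(segments, max_segments=MAX_SEGMENTS):
    """Turn a list of per-segment dicts into one fixed-width row, padded with NaN."""
    row = []
    for i in range(max_segments):
        if i < len(segments):
            values = [float(segments[i][f]) for f in FIELDS]
            if i == len(segments) - 1:
                # The final exit position is the target, so hide it from the features.
                for name in TARGET_FIELDS:
                    values[FIELDS.index(name)] = math.nan
            row.extend(values)
        else:
            # Journeys shorter than max_segments are padded with NaN.
            row.extend([math.nan] * len(FIELDS))
    return row


def in_area(x, y, bounds):
    """Binary label: 1 if (x, y) lies inside an axis-aligned rectangle, else 0."""
    x_min, x_max, y_min, y_max = bounds
    return int(x_min <= x <= x_max and y_min <= y <= y_max)


# Toy journey with two segments (made-up numbers).
journey = [
    {"x_entry": 0.0, "y_entry": 0.0, "t_entry": 0, "x_exit": 1.0, "y_exit": 2.0, "t_exit": 60},
    {"x_entry": 1.0, "y_entry": 2.0, "t_entry": 90, "x_exit": 3.0, "y_exit": 4.0, "t_exit": 200},
]

row = flatten_journey(journey)
label = in_area(journey[-1]["x_exit"], journey[-1]["y_exit"], bounds=(2.0, 5.0, 3.0, 5.0))
n_missing = sum(math.isnan(v) for v in row)
print(len(row), n_missing, label)  # 120 110 1
```

With only two of the 20 possible segments present, 110 of the 120 columns are NaN, which is where the missing values come from.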



I did a little research and found some papers on applying neural networks (specifically RNNs) to this kind of data, but my knowledge of neural networks is rather incomplete.
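For context on how sequence models usually deal with variable-length input: rather than one wide flattened row, each journey is typically kept as a sequence of steps, padded to a common length, with a mask or length vector marking the real steps. Below is a framework-free sketch of that representation only, not an RNN. The function name `pad_journeys` and the six-feature layout are assumptions made for illustration.

```python
def pad_journeys(journeys, max_len, n_features, pad_value=0.0):
    """Pad variable-length journeys to max_len steps.

    Returns (padded, mask, lengths), where padded has shape
    (n_journeys, max_len, n_features) as nested lists and mask marks
    real steps with 1 and padding with 0.
    """
    padded, mask, lengths = [], [], []
    for steps in journeys:
        steps = steps[-max_len:]  # keep the most recent steps if too long
        n = len(steps)
        filler = [[pad_value] * n_features for _ in range(max_len - n)]
        padded.append([list(s) for s in steps] + filler)
        mask.append([1] * n + [0] * (max_len - n))
        lengths.append(n)
    return padded, mask, lengths


N_FEATURES = 6  # hypothetical: entry x, y, t and exit x, y, t per step
MAX_LEN = 4     # small for the demo; the journeys in the question have up to 20 parts

# Two toy journeys with 2 and 3 steps (made-up numbers).
batch = [
    [[0.0, 0.0, 0.0, 1.0, 2.0, 60.0], [1.0, 2.0, 90.0, 3.0, 4.0, 200.0]],
    [[5.0, 5.0, 0.0, 6.0, 5.0, 30.0], [6.0, 5.0, 40.0, 6.0, 7.0, 80.0],
     [6.0, 7.0, 95.0, 8.0, 8.0, 150.0]],
]

padded, mask, lengths = pad_journeys(batch, MAX_LEN, N_FEATURES)
print(lengths)  # [2, 3]
print(mask[0])  # [1, 1, 0, 0]
```

Deep-learning frameworks generally provide their own masking or packing utilities that consume this kind of padded batch. In a real setup the final exit coordinates would also have to be held out of the inputs, since they are the target.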



What sort of model might I try to better fit my data? Would I be best off getting to grips with RNNs?







Tags: machine-learning, dataset, data-cleaning






Asked Apr 5 at 10:57 by A Berry; edited Apr 5 at 11:13 by A Berry.



















