Accurately choosing a model with sequential data
The dataset I'm working on maps journeys, breaking each one down into entry and exit coordinates and entry and exit times for each part of the journey. My goal is to predict the final exit coordinates, given the final time (though I'm not 100% sure time matters).
I'm having trouble finding an appropriate model that takes the time features into account. At the moment, rather than predicting the final location (x, y coordinates), I'm using a CatBoost classifier to tell me whether each user's final location will be in a given area or not, but I'm not sure if I'm barking up the wrong tree. One problem is that when I flatten the data (which I feel I need to?), I get a lot of NaN values, because each journey is made up of a different number of trajectories (up to 20).
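To make the flattening problem concrete, here is a minimal sketch on made-up data. The journey names, the six-value trajectory layout, and both helper functions are assumptions for illustration only, not the actual dataset or code. It shows how one fixed-width row per journey fills up with NaN, and, for comparison, one common alternative that keeps the sequence shape by padding and recording a mask.

```python
import math

# Hypothetical toy data: each journey is a list of trajectories, and each
# trajectory is (entry_x, entry_y, exit_x, exit_y, entry_t, exit_t).
journeys = {
    "journey_a": [(0.0, 0.0, 1.0, 1.0, 0, 5), (1.0, 1.0, 2.0, 3.0, 6, 9)],
    "journey_b": [(4.0, 4.0, 5.0, 5.0, 2, 7)],
    "journey_c": [
        (0.0, 1.0, 0.5, 1.5, 0, 3),
        (0.5, 1.5, 1.0, 2.0, 4, 8),
        (1.0, 2.0, 2.0, 2.0, 9, 12),
    ],
}

N_FEATURES = 6  # values per trajectory in this toy layout


def flatten(trajectories, max_len):
    """Lay one journey out as a single fixed-width row, filling unused slots with NaN."""
    row = [value for traj in trajectories for value in traj]
    row += [math.nan] * (N_FEATURES * max_len - len(row))
    return row


def pad_with_mask(trajectories, max_len):
    """Keep the sequence shape: pad with all-zero trajectories and mark which steps are real."""
    n_missing = max_len - len(trajectories)
    padded = list(trajectories) + [(0.0,) * N_FEATURES] * n_missing
    mask = [1] * len(trajectories) + [0] * n_missing
    return padded, mask


max_len = max(len(t) for t in journeys.values())

# Flattened view: shorter journeys carry a block of NaN columns.
flat = {name: flatten(t, max_len) for name, t in journeys.items()}
nan_counts = {name: sum(math.isnan(v) for v in row) for name, row in flat.items()}
print(nan_counts)

# Padded view: same length for every journey, plus a mask instead of NaN.
padded_b, mask_b = pad_with_mask(journeys["journey_b"], max_len)
print(mask_b)
```

With up to 20 trajectories per journey, the flattened row would have 20 such blocks, so short journeys end up mostly NaN. The padded-plus-mask layout is the shape that sequence models typically expect, though whether it suits this data is exactly the open question here.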
I was doing a little research and found some papers on applying neural nets (specifically RNNs) to this kind of data, but my knowledge of NNs is rather incomplete.
What sort of model might I try to better fit my data? Would I be best off getting to grips with RNNs?
machine-learning dataset data-cleaning
asked Apr 5 at 10:57 by A Berry (edited Apr 5 at 11:13)