Binary classification with time-series features2019 Community Moderator ElectionWhat are the best ways to use a time series data for binary classificationTime series classificationbinary longitudinal time seriesBuilding a machine learning model based on a set of timestamped features to predict/classify a label/value?Keras LSTM model for binary classification with sequencesk-Nearest Neighbours with time series data - how to obtain whole-time-period estimatorsTime series binary classificaiton with labelling issuesBinary classification model with time series as variablesMultivariate Time Series Binary Classificationhow to predict content based demandClassification of keystrokes

Pre-amplifier input protection

Two monoidal structures and copowering

Do sorcerers' Subtle Spells require a skill check to be unseen?

How does it work when somebody invests in my business?

Sort a list by elements of another list

How to write papers efficiently when English isn't my first language?

How does Loki do this?

Why not increase contact surface when reentering the atmosphere?

Why Were Madagascar and New Zealand Discovered So Late?

Hostile work environment after whistle-blowing on coworker and our boss. What do I do?

Escape a backup date in a file name

Is expanding the research of a group into machine learning as a PhD student risky?

Large drywall patch supports

Is the destination of a commercial flight important for the pilot?

Purchasing a ticket for someone else in another country?

What can we do to stop prior company from asking us questions?

Is there a problem with hiding "forgot password" until it's needed?

What does "I’d sit this one out, Cap," imply or mean in the context?

Detecting if an element is found inside a container

Pole-zeros of a real-valued causal FIR system

How do I rename a Linux host without needing to reboot for the rename to take effect?

Go Pregnant or Go Home

Different result between scanning in Epson's "color negative film" mode and scanning in positive -> invert curve in post?

Type int? vs type int



Binary classification with time-series features



2019 Community Moderator ElectionWhat are the best ways to use a time series data for binary classificationTime series classificationbinary longitudinal time seriesBuilding a machine learning model based on a set of timestamped features to predict/classify a label/value?Keras LSTM model for binary classification with sequencesk-Nearest Neighbours with time series data - how to obtain whole-time-period estimatorsTime series binary classificaiton with labelling issuesBinary classification model with time series as variablesMultivariate Time Series Binary Classificationhow to predict content based demandClassification of keystrokes










0












$begingroup$


I have the following time-series features: Diastolic Blood Pressure, Systolic Blood Pressure, Heart Rate, RR variability and Arterial Blood Pressure. Each of these clinical parameters was measured for 900 seconds during a surgical procedure and after the surgery, the patient was assessed for acute kidney injury: 1(yes) or 0(no).



My training data kind of looks like this: (see below for screenshot)



Patient 1 Time(s) Features AKI



Patient 2 Time(s) Features AKI



and so on.



What approach would I take to utilize this data for the binary classification task?



Sample view of Training Data










share|improve this question











$endgroup$











  • $begingroup$
    try lstm for your ptoblem
    $endgroup$
    – Andreas Look
    Mar 20 at 21:23










  • $begingroup$
    Is there any machine-learning approaches to this? Not too familiar with RNNs.
    $endgroup$
    – John Spanos
    Mar 20 at 21:41










  • $begingroup$
    Somehow related question / answer datascience.stackexchange.com/a/25518/29781
    $endgroup$
    – aivanov
    Mar 22 at 13:52
















0












$begingroup$


I have the following time-series features: Diastolic Blood Pressure, Systolic Blood Pressure, Heart Rate, RR variability and Arterial Blood Pressure. Each of these clinical parameters was measured for 900 seconds during a surgical procedure and after the surgery, the patient was assessed for acute kidney injury: 1(yes) or 0(no).



My training data kind of looks like this: (see below for screenshot)



Patient 1 Time(s) Features AKI



Patient 2 Time(s) Features AKI



and so on.



What approach would I take to utilize this data for the binary classification task?



Sample view of Training Data










share|improve this question











$endgroup$











  • $begingroup$
    try lstm for your ptoblem
    $endgroup$
    – Andreas Look
    Mar 20 at 21:23










  • $begingroup$
    Is there any machine-learning approaches to this? Not too familiar with RNNs.
    $endgroup$
    – John Spanos
    Mar 20 at 21:41










  • $begingroup$
    Somehow related question / answer datascience.stackexchange.com/a/25518/29781
    $endgroup$
    – aivanov
    Mar 22 at 13:52














0












0








0


1



$begingroup$


I have the following time-series features: Diastolic Blood Pressure, Systolic Blood Pressure, Heart Rate, RR variability and Arterial Blood Pressure. Each of these clinical parameters was measured for 900 seconds during a surgical procedure and after the surgery, the patient was assessed for acute kidney injury: 1(yes) or 0(no).



My training data kind of looks like this: (see below for screenshot)



Patient 1 Time(s) Features AKI



Patient 2 Time(s) Features AKI



and so on.



What approach would I take to utilize this data for the binary classification task?



Sample view of Training Data










share|improve this question











$endgroup$




I have the following time-series features: Diastolic Blood Pressure, Systolic Blood Pressure, Heart Rate, RR variability and Arterial Blood Pressure. Each of these clinical parameters was measured for 900 seconds during a surgical procedure and after the surgery, the patient was assessed for acute kidney injury: 1(yes) or 0(no).



My training data kind of looks like this: (see below for screenshot)



Patient 1 Time(s) Features AKI



Patient 2 Time(s) Features AKI



and so on.



What approach would I take to utilize this data for the binary classification task?



Sample view of Training Data







classification time-series






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Mar 20 at 21:19







John Spanos

















asked Mar 20 at 21:10









John SpanosJohn Spanos

12




12











  • $begingroup$
    try lstm for your ptoblem
    $endgroup$
    – Andreas Look
    Mar 20 at 21:23










  • $begingroup$
    Is there any machine-learning approaches to this? Not too familiar with RNNs.
    $endgroup$
    – John Spanos
    Mar 20 at 21:41










  • $begingroup$
    Somehow related question / answer datascience.stackexchange.com/a/25518/29781
    $endgroup$
    – aivanov
    Mar 22 at 13:52

















  • $begingroup$
    try lstm for your ptoblem
    $endgroup$
    – Andreas Look
    Mar 20 at 21:23










  • $begingroup$
    Is there any machine-learning approaches to this? Not too familiar with RNNs.
    $endgroup$
    – John Spanos
    Mar 20 at 21:41










  • $begingroup$
    Somehow related question / answer datascience.stackexchange.com/a/25518/29781
    $endgroup$
    – aivanov
    Mar 22 at 13:52
















$begingroup$
try lstm for your ptoblem
$endgroup$
– Andreas Look
Mar 20 at 21:23




$begingroup$
try lstm for your ptoblem
$endgroup$
– Andreas Look
Mar 20 at 21:23












$begingroup$
Is there any machine-learning approaches to this? Not too familiar with RNNs.
$endgroup$
– John Spanos
Mar 20 at 21:41




$begingroup$
Is there any machine-learning approaches to this? Not too familiar with RNNs.
$endgroup$
– John Spanos
Mar 20 at 21:41












$begingroup$
Somehow related question / answer datascience.stackexchange.com/a/25518/29781
$endgroup$
– aivanov
Mar 22 at 13:52





$begingroup$
Somehow related question / answer datascience.stackexchange.com/a/25518/29781
$endgroup$
– aivanov
Mar 22 at 13:52











1 Answer
1






active

oldest

votes


















0












$begingroup$

I would plot the measurements time $t$ and the corresponding measurement 5 to 10 samples for each category. Try to detect some patterns. Possible patterns are the trend (Is the cure growing? Is it linearly? Exponentially?), frequencies of oscillations (Does one category have oscillations with higher magnitude or frequencies? You can use Fast Fourier Transform for this) self similarity of signal (autocorrelation) Then look at means, median, standard deviation, skewness and kurtosis of your signals.



After having extracted all these features I would try to calculate the correlations of your features with the target variable. Then you can eliminate variables that are not very highly correlated with your target variable. In the next step, I would look at the correlations between your features and eliminate the variables that are highly correlated by eliminating the one variable which less correlated with the target variable. Then I would use some classical binary classifiers like discriminant analysis or logistic regression.



If you see that this method will not lead to sufficient results then you should try more sophisticated methods like neural networks/decision trees for the features that you extracted.






share|improve this answer











$endgroup$












  • $begingroup$
    Hi, thanks for the answer. To clarify, the target variable isn't numerical. It is a categorical value: either the patient has aki or not.
    $endgroup$
    – John Spanos
    Mar 20 at 23:08










  • $begingroup$
    My explanation is still applicable for these cases. The correlation will be the point-biserial correlation. Everything else can be applied as proposed.
    $endgroup$
    – MachineLearner
    Mar 21 at 7:10










Your Answer





StackExchange.ifUsing("editor", function ()
return StackExchange.using("mathjaxEditing", function ()
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
);
);
, "mathjax-editing");

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "557"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);



);













draft saved

draft discarded


















StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47699%2fbinary-classification-with-time-series-features%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown

























1 Answer
1






active

oldest

votes








1 Answer
1






active

oldest

votes









active

oldest

votes






active

oldest

votes









0












$begingroup$

I would plot the measurements time $t$ and the corresponding measurement 5 to 10 samples for each category. Try to detect some patterns. Possible patterns are the trend (Is the cure growing? Is it linearly? Exponentially?), frequencies of oscillations (Does one category have oscillations with higher magnitude or frequencies? You can use Fast Fourier Transform for this) self similarity of signal (autocorrelation) Then look at means, median, standard deviation, skewness and kurtosis of your signals.



After having extracted all these features I would try to calculate the correlations of your features with the target variable. Then you can eliminate variables that are not very highly correlated with your target variable. In the next step, I would look at the correlations between your features and eliminate the variables that are highly correlated by eliminating the one variable which less correlated with the target variable. Then I would use some classical binary classifiers like discriminant analysis or logistic regression.



If you see that this method will not lead to sufficient results then you should try more sophisticated methods like neural networks/decision trees for the features that you extracted.






share|improve this answer











$endgroup$












  • $begingroup$
    Hi, thanks for the answer. To clarify, the target variable isn't numerical. It is a categorical value: either the patient has aki or not.
    $endgroup$
    – John Spanos
    Mar 20 at 23:08










  • $begingroup$
    My explanation is still applicable for these cases. The correlation will be the point-biserial correlation. Everything else can be applied as proposed.
    $endgroup$
    – MachineLearner
    Mar 21 at 7:10















0












$begingroup$

I would plot the measurements time $t$ and the corresponding measurement 5 to 10 samples for each category. Try to detect some patterns. Possible patterns are the trend (Is the cure growing? Is it linearly? Exponentially?), frequencies of oscillations (Does one category have oscillations with higher magnitude or frequencies? You can use Fast Fourier Transform for this) self similarity of signal (autocorrelation) Then look at means, median, standard deviation, skewness and kurtosis of your signals.



After having extracted all these features I would try to calculate the correlations of your features with the target variable. Then you can eliminate variables that are not very highly correlated with your target variable. In the next step, I would look at the correlations between your features and eliminate the variables that are highly correlated by eliminating the one variable which less correlated with the target variable. Then I would use some classical binary classifiers like discriminant analysis or logistic regression.



If you see that this method will not lead to sufficient results then you should try more sophisticated methods like neural networks/decision trees for the features that you extracted.






share|improve this answer











$endgroup$












  • $begingroup$
    Hi, thanks for the answer. To clarify, the target variable isn't numerical. It is a categorical value: either the patient has aki or not.
    $endgroup$
    – John Spanos
    Mar 20 at 23:08










  • $begingroup$
    My explanation is still applicable for these cases. The correlation will be the point-biserial correlation. Everything else can be applied as proposed.
    $endgroup$
    – MachineLearner
    Mar 21 at 7:10













0












0








0





$begingroup$

I would plot the measurements time $t$ and the corresponding measurement 5 to 10 samples for each category. Try to detect some patterns. Possible patterns are the trend (Is the cure growing? Is it linearly? Exponentially?), frequencies of oscillations (Does one category have oscillations with higher magnitude or frequencies? You can use Fast Fourier Transform for this) self similarity of signal (autocorrelation) Then look at means, median, standard deviation, skewness and kurtosis of your signals.



After having extracted all these features I would try to calculate the correlations of your features with the target variable. Then you can eliminate variables that are not very highly correlated with your target variable. In the next step, I would look at the correlations between your features and eliminate the variables that are highly correlated by eliminating the one variable which less correlated with the target variable. Then I would use some classical binary classifiers like discriminant analysis or logistic regression.



If you see that this method will not lead to sufficient results then you should try more sophisticated methods like neural networks/decision trees for the features that you extracted.






share|improve this answer











$endgroup$



I would plot the measurements time $t$ and the corresponding measurement 5 to 10 samples for each category. Try to detect some patterns. Possible patterns are the trend (Is the cure growing? Is it linearly? Exponentially?), frequencies of oscillations (Does one category have oscillations with higher magnitude or frequencies? You can use Fast Fourier Transform for this) self similarity of signal (autocorrelation) Then look at means, median, standard deviation, skewness and kurtosis of your signals.



After having extracted all these features I would try to calculate the correlations of your features with the target variable. Then you can eliminate variables that are not very highly correlated with your target variable. In the next step, I would look at the correlations between your features and eliminate the variables that are highly correlated by eliminating the one variable which less correlated with the target variable. Then I would use some classical binary classifiers like discriminant analysis or logistic regression.



If you see that this method will not lead to sufficient results then you should try more sophisticated methods like neural networks/decision trees for the features that you extracted.







share|improve this answer














share|improve this answer



share|improve this answer








edited Mar 20 at 21:50

























answered Mar 20 at 21:44









MachineLearnerMachineLearner

34410




34410











  • $begingroup$
    Hi, thanks for the answer. To clarify, the target variable isn't numerical. It is a categorical value: either the patient has aki or not.
    $endgroup$
    – John Spanos
    Mar 20 at 23:08










  • $begingroup$
    My explanation is still applicable for these cases. The correlation will be the point-biserial correlation. Everything else can be applied as proposed.
    $endgroup$
    – MachineLearner
    Mar 21 at 7:10
















  • $begingroup$
    Hi, thanks for the answer. To clarify, the target variable isn't numerical. It is a categorical value: either the patient has aki or not.
    $endgroup$
    – John Spanos
    Mar 20 at 23:08










  • $begingroup$
    My explanation is still applicable for these cases. The correlation will be the point-biserial correlation. Everything else can be applied as proposed.
    $endgroup$
    – MachineLearner
    Mar 21 at 7:10















$begingroup$
Hi, thanks for the answer. To clarify, the target variable isn't numerical. It is a categorical value: either the patient has aki or not.
$endgroup$
– John Spanos
Mar 20 at 23:08




$begingroup$
Hi, thanks for the answer. To clarify, the target variable isn't numerical. It is a categorical value: either the patient has aki or not.
$endgroup$
– John Spanos
Mar 20 at 23:08












$begingroup$
My explanation is still applicable for these cases. The correlation will be the point-biserial correlation. Everything else can be applied as proposed.
$endgroup$
– MachineLearner
Mar 21 at 7:10




$begingroup$
My explanation is still applicable for these cases. The correlation will be the point-biserial correlation. Everything else can be applied as proposed.
$endgroup$
– MachineLearner
Mar 21 at 7:10

















draft saved

draft discarded
















































Thanks for contributing an answer to Data Science Stack Exchange!


  • Please be sure to answer the question. Provide details and share your research!

But avoid


  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.

Use MathJax to format equations. MathJax reference.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47699%2fbinary-classification-with-time-series-features%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Adding axes to figuresAdding axes labels to LaTeX figuresLaTeX equivalent of ConTeXt buffersRotate a node but not its content: the case of the ellipse decorationHow to define the default vertical distance between nodes?TikZ scaling graphic and adjust node position and keep font sizeNumerical conditional within tikz keys?adding axes to shapesAlign axes across subfiguresAdding figures with a certain orderLine up nested tikz enviroments or how to get rid of themAdding axes labels to LaTeX figures

Luettelo Yhdysvaltain laivaston lentotukialuksista Lähteet | Navigointivalikko

Gary (muusikko) Sisällysluettelo Historia | Rockin' High | Lähteet | Aiheesta muualla | NavigointivalikkoInfobox OKTuomas "Gary" Keskinen Ancaran kitaristiksiProjekti Rockin' High