One attribute includes another attributePre-processing (center, scale, impute) among training sets (different forms) and the test set - what is a good approach?What is the best way to scale a numerical datasetSelecting the number of hashes for minhash? Working with extremely sparse data and want more collisionsPython: Handling imbalance Classes in python Machine LearningChoice of replacing missing values based on the data distributionNested features with one to many relationshipsHow to use a dataset where attribute names are changed?Dealing with a dataset where a subset of points live in a higher dimensional spaceDifferent approaches of creating the test setHow to deal with attributes that can vary arbitrarily for each sample?
A non-technological, repeating, visible object in the sky, holding its position in the sky for hours
Examples of non trivial equivalence relations , I mean equivalence relations without the expression " same ... as" in their definition?
Did Henry V’s archers at Agincourt fight with no pants / breeches on because of dysentery?
Why is the origin of “threshold” uncertain?
Where does the labelling of extrinsic semiconductors as "n" and "p" come from?
Can a creature tell when it has been affected by a Divination wizard's Portent?
Confusion about capacitors
Transfer over $10k
How to back up a running remote server?
Possible to set `foldexpr` using a function reference?
Need help understanding harmonic series and intervals
Is it possible to measure lightning discharges as Nikola Tesla?
Pulling the rope with one hand is as heavy as with two hands?
Why do computer-science majors learn calculus?
When did stoichiometry begin to be taught in U.S. high schools?
Subtleties of choosing the sequence of tenses in Russian
Can fracking help reduce CO2?
Minimum value of 4 digit number divided by sum of its digits
Why is current rating for multicore cable lower than single core with the same cross section?
Help, my Death Star suffers from Kessler syndrome!
Build a trail cart
Phrase for the opposite of "foolproof"
A question regarding using the definite article
How to figure out whether the data is sample data or population data apart from the client's information?
One attribute includes another attribute
Pre-processing (center, scale, impute) among training sets (different forms) and the test set - what is a good approach?What is the best way to scale a numerical datasetSelecting the number of hashes for minhash? Working with extremely sparse data and want more collisionsPython: Handling imbalance Classes in python Machine LearningChoice of replacing missing values based on the data distributionNested features with one to many relationshipsHow to use a dataset where attribute names are changed?Dealing with a dataset where a subset of points live in a higher dimensional spaceDifferent approaches of creating the test setHow to deal with attributes that can vary arbitrarily for each sample?
$begingroup$
I have a telecom dataset that has many attributes, among these attributes, there is "Voice mail plan" attribute that takes yes or no, and another attribute is "voice mail calls" which has many values, but always zero when "Voice mail plan" is no. When removing "Voice mail plan" from the dataset the accuracy of classifiers is lowered, so how can we inform the classifier that No is impeded in zero voice calls
data-mining preprocessing
$endgroup$
add a comment |
$begingroup$
I have a telecom dataset that has many attributes, among these attributes, there is "Voice mail plan" attribute that takes yes or no, and another attribute is "voice mail calls" which has many values, but always zero when "Voice mail plan" is no. When removing "Voice mail plan" from the dataset the accuracy of classifiers is lowered, so how can we inform the classifier that No is impeded in zero voice calls
data-mining preprocessing
$endgroup$
add a comment |
$begingroup$
I have a telecom dataset that has many attributes, among these attributes, there is "Voice mail plan" attribute that takes yes or no, and another attribute is "voice mail calls" which has many values, but always zero when "Voice mail plan" is no. When removing "Voice mail plan" from the dataset the accuracy of classifiers is lowered, so how can we inform the classifier that No is impeded in zero voice calls
data-mining preprocessing
$endgroup$
I have a telecom dataset that has many attributes, among these attributes, there is "Voice mail plan" attribute that takes yes or no, and another attribute is "voice mail calls" which has many values, but always zero when "Voice mail plan" is no. When removing "Voice mail plan" from the dataset the accuracy of classifiers is lowered, so how can we inform the classifier that No is impeded in zero voice calls
data-mining preprocessing
data-mining preprocessing
asked Apr 8 at 16:52
AymanAyman
1
1
add a comment |
add a comment |
1 Answer
1
active
oldest
votes
$begingroup$
The two features "voice mail calls" and "voice mail plan" are related but they are not linearly correlated. "Voice mail plan" still contains some information that is not available from other features. Why do you want to remove "Voice mail plan" in first place? If you need to decrease your number of features, you can try dimension reduction (linear or non-linear), this way you make sure most of the variance of your feature set is considered for building your model.
$endgroup$
$begingroup$
I already applied PCA but it didn't distinguish between both, also it gave plan high principal
$endgroup$
– Ayman
Apr 11 at 16:21
add a comment |
Your Answer
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "557"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f48890%2fone-attribute-includes-another-attribute%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
$begingroup$
The two features "voice mail calls" and "voice mail plan" are related but they are not linearly correlated. "Voice mail plan" still contains some information that is not available from other features. Why do you want to remove "Voice mail plan" in first place? If you need to decrease your number of features, you can try dimension reduction (linear or non-linear), this way you make sure most of the variance of your feature set is considered for building your model.
$endgroup$
$begingroup$
I already applied PCA but it didn't distinguish between both, also it gave plan high principal
$endgroup$
– Ayman
Apr 11 at 16:21
add a comment |
$begingroup$
The two features "voice mail calls" and "voice mail plan" are related but they are not linearly correlated. "Voice mail plan" still contains some information that is not available from other features. Why do you want to remove "Voice mail plan" in first place? If you need to decrease your number of features, you can try dimension reduction (linear or non-linear), this way you make sure most of the variance of your feature set is considered for building your model.
$endgroup$
$begingroup$
I already applied PCA but it didn't distinguish between both, also it gave plan high principal
$endgroup$
– Ayman
Apr 11 at 16:21
add a comment |
$begingroup$
The two features "voice mail calls" and "voice mail plan" are related but they are not linearly correlated. "Voice mail plan" still contains some information that is not available from other features. Why do you want to remove "Voice mail plan" in first place? If you need to decrease your number of features, you can try dimension reduction (linear or non-linear), this way you make sure most of the variance of your feature set is considered for building your model.
$endgroup$
The two features "voice mail calls" and "voice mail plan" are related but they are not linearly correlated. "Voice mail plan" still contains some information that is not available from other features. Why do you want to remove "Voice mail plan" in first place? If you need to decrease your number of features, you can try dimension reduction (linear or non-linear), this way you make sure most of the variance of your feature set is considered for building your model.
answered Apr 8 at 17:11
Mina NaghshnejadMina Naghshnejad
1
1
$begingroup$
I already applied PCA but it didn't distinguish between both, also it gave plan high principal
$endgroup$
– Ayman
Apr 11 at 16:21
add a comment |
$begingroup$
I already applied PCA but it didn't distinguish between both, also it gave plan high principal
$endgroup$
– Ayman
Apr 11 at 16:21
$begingroup$
I already applied PCA but it didn't distinguish between both, also it gave plan high principal
$endgroup$
– Ayman
Apr 11 at 16:21
$begingroup$
I already applied PCA but it didn't distinguish between both, also it gave plan high principal
$endgroup$
– Ayman
Apr 11 at 16:21
add a comment |
Thanks for contributing an answer to Data Science Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f48890%2fone-attribute-includes-another-attribute%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown