Multi-label text classification (3 targetted result columns) The 2019 Stack Overflow Developer Survey Results Are InText-Classification-Problem, what is the right approach?RTextTools multi-label classificationMulti-label Text ClassificationHow to use binary relevance for multi-label text classification?Naive Bayes for Multi label text classificationExtract Pattern using Short Text ProcessingBuild train data set for natural language text classification?Large Numpy.Array for Multi-label Image Classification (CelebA Dataset)Multi Class Classification on large dataset with over 600 classesHow to use sklearn train_test_split to stratify data for multi-label classification?
What do the Banks children have against barley water?
How to Override Magento 2 vendor files
If the Wish spell is used to duplicate the effect of Simulacrum, are existing duplicates destroyed?
I am seven letter word. Find me Who Am I?
Differentiate between line ending within polygon and line passing all the way through polygon - QGIS
Does duplicating a spell with Wish count as casting that spell?
Is there a general name for the setup in which payoffs are not known exactly but players try to influence each other's perception of the payoffs?
What can other administrators access on my machine?
Loose spokes after only a few rides
What do hard-Brexiteers want with respect to the Irish border?
Worn-tile Scrabble
How to reverse every other sublist of a list?
Walkie-talkie and its origin
Landlord wants to switch my lease to a "Land contract" to "get back at the city"
Aging parents with no investments
How can I make payments on the Internet without leaving a money trail?
What is the meaning of Triage in Cybersec world?
Time travel alters history but people keep saying nothing's changed
Falsification in Math vs Science
Is this food a bread or a loaf?
What is the motivation for a law requiring 2 parties to consent for recording a conversation
Protecting Dualbooting Windows from dangerous code (like rm -rf)
Output the Arecibo Message
Monty Hall variation
Multi-label text classification (3 targetted result columns)
The 2019 Stack Overflow Developer Survey Results Are InText-Classification-Problem, what is the right approach?RTextTools multi-label classificationMulti-label Text ClassificationHow to use binary relevance for multi-label text classification?Naive Bayes for Multi label text classificationExtract Pattern using Short Text ProcessingBuild train data set for natural language text classification?Large Numpy.Array for Multi-label Image Classification (CelebA Dataset)Multi Class Classification on large dataset with over 600 classesHow to use sklearn train_test_split to stratify data for multi-label classification?
$begingroup$
I am trying to build a multi-label text classifier
for suggesting "AC_location", "Issue" and "Part_Affected" on the generated events.
The textual data is full of noise.
The approach I have been following until now was a BOW approach with Tf-idf weighting (almost all the preprocessing steps i used so far like; stopwords removal, stemming, etc.). However, I could only figure it out how to apply a multi-class text classification approach on targetted result column "Issue" and reached an accuracy of around 0.55
on the test set using a random forest approach
.
How to apply machine learning for other targetted result columns simultaneously?
I am looking specifically on how to apply machine learning for multi-label.
I have 20, 25 and 30 labels respectively and around 20000 data points. Data-set is not well-balanced but i will make it balance in future.
How to apply text classification for multiple target result columns?
PS: - i'm writing code in R by using the following libraries; tm, mlr, ranger, xgboost, etc.
machine-learning r text-mining random-forest multilabel-classification
$endgroup$
add a comment |
$begingroup$
I am trying to build a multi-label text classifier
for suggesting "AC_location", "Issue" and "Part_Affected" on the generated events.
The textual data is full of noise.
The approach I have been following until now was a BOW approach with Tf-idf weighting (almost all the preprocessing steps i used so far like; stopwords removal, stemming, etc.). However, I could only figure it out how to apply a multi-class text classification approach on targetted result column "Issue" and reached an accuracy of around 0.55
on the test set using a random forest approach
.
How to apply machine learning for other targetted result columns simultaneously?
I am looking specifically on how to apply machine learning for multi-label.
I have 20, 25 and 30 labels respectively and around 20000 data points. Data-set is not well-balanced but i will make it balance in future.
How to apply text classification for multiple target result columns?
PS: - i'm writing code in R by using the following libraries; tm, mlr, ranger, xgboost, etc.
machine-learning r text-mining random-forest multilabel-classification
$endgroup$
$begingroup$
have you tried Deep Learning based approach ? With tools like Ludwig, you can try this approach without significant amount of coding. github.com/uber/ludwig . user guide has an example on text classification : uber.github.io/ludwig/user_guide
$endgroup$
– Shamit Verma
Mar 29 at 13:25
add a comment |
$begingroup$
I am trying to build a multi-label text classifier
for suggesting "AC_location", "Issue" and "Part_Affected" on the generated events.
The textual data is full of noise.
The approach I have been following until now was a BOW approach with Tf-idf weighting (almost all the preprocessing steps i used so far like; stopwords removal, stemming, etc.). However, I could only figure it out how to apply a multi-class text classification approach on targetted result column "Issue" and reached an accuracy of around 0.55
on the test set using a random forest approach
.
How to apply machine learning for other targetted result columns simultaneously?
I am looking specifically on how to apply machine learning for multi-label.
I have 20, 25 and 30 labels respectively and around 20000 data points. Data-set is not well-balanced but i will make it balance in future.
How to apply text classification for multiple target result columns?
PS: - i'm writing code in R by using the following libraries; tm, mlr, ranger, xgboost, etc.
machine-learning r text-mining random-forest multilabel-classification
$endgroup$
I am trying to build a multi-label text classifier
for suggesting "AC_location", "Issue" and "Part_Affected" on the generated events.
The textual data is full of noise.
The approach I have been following until now was a BOW approach with Tf-idf weighting (almost all the preprocessing steps i used so far like; stopwords removal, stemming, etc.). However, I could only figure it out how to apply a multi-class text classification approach on targetted result column "Issue" and reached an accuracy of around 0.55
on the test set using a random forest approach
.
How to apply machine learning for other targetted result columns simultaneously?
I am looking specifically on how to apply machine learning for multi-label.
I have 20, 25 and 30 labels respectively and around 20000 data points. Data-set is not well-balanced but i will make it balance in future.
How to apply text classification for multiple target result columns?
PS: - i'm writing code in R by using the following libraries; tm, mlr, ranger, xgboost, etc.
machine-learning r text-mining random-forest multilabel-classification
machine-learning r text-mining random-forest multilabel-classification
edited Mar 30 at 2:47
Damini Jain
1136
1136
asked Mar 29 at 13:08
Ahsan NawazAhsan Nawaz
62
62
$begingroup$
have you tried Deep Learning based approach ? With tools like Ludwig, you can try this approach without significant amount of coding. github.com/uber/ludwig . user guide has an example on text classification : uber.github.io/ludwig/user_guide
$endgroup$
– Shamit Verma
Mar 29 at 13:25
add a comment |
$begingroup$
have you tried Deep Learning based approach ? With tools like Ludwig, you can try this approach without significant amount of coding. github.com/uber/ludwig . user guide has an example on text classification : uber.github.io/ludwig/user_guide
$endgroup$
– Shamit Verma
Mar 29 at 13:25
$begingroup$
have you tried Deep Learning based approach ? With tools like Ludwig, you can try this approach without significant amount of coding. github.com/uber/ludwig . user guide has an example on text classification : uber.github.io/ludwig/user_guide
$endgroup$
– Shamit Verma
Mar 29 at 13:25
$begingroup$
have you tried Deep Learning based approach ? With tools like Ludwig, you can try this approach without significant amount of coding. github.com/uber/ludwig . user guide has an example on text classification : uber.github.io/ludwig/user_guide
$endgroup$
– Shamit Verma
Mar 29 at 13:25
add a comment |
0
active
oldest
votes
Your Answer
StackExchange.ifUsing("editor", function ()
return StackExchange.using("mathjaxEditing", function ()
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
);
);
, "mathjax-editing");
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "557"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f48217%2fmulti-label-text-classification-3-targetted-result-columns%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
0
active
oldest
votes
0
active
oldest
votes
active
oldest
votes
active
oldest
votes
Thanks for contributing an answer to Data Science Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f48217%2fmulti-label-text-classification-3-targetted-result-columns%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
$begingroup$
have you tried Deep Learning based approach ? With tools like Ludwig, you can try this approach without significant amount of coding. github.com/uber/ludwig . user guide has an example on text classification : uber.github.io/ludwig/user_guide
$endgroup$
– Shamit Verma
Mar 29 at 13:25