Multi-label text classification (3 targetted result columns) The 2019 Stack Overflow Developer Survey Results Are InText-Classification-Problem, what is the right approach?RTextTools multi-label classificationMulti-label Text ClassificationHow to use binary relevance for multi-label text classification?Naive Bayes for Multi label text classificationExtract Pattern using Short Text ProcessingBuild train data set for natural language text classification?Large Numpy.Array for Multi-label Image Classification (CelebA Dataset)Multi Class Classification on large dataset with over 600 classesHow to use sklearn train_test_split to stratify data for multi-label classification?

What do the Banks children have against barley water?

How to Override Magento 2 vendor files

If the Wish spell is used to duplicate the effect of Simulacrum, are existing duplicates destroyed?

I am seven letter word. Find me Who Am I?

Differentiate between line ending within polygon and line passing all the way through polygon - QGIS

Does duplicating a spell with Wish count as casting that spell?

Is there a general name for the setup in which payoffs are not known exactly but players try to influence each other's perception of the payoffs?

What can other administrators access on my machine?

Loose spokes after only a few rides

What do hard-Brexiteers want with respect to the Irish border?

Worn-tile Scrabble

How to reverse every other sublist of a list?

Walkie-talkie and its origin

Landlord wants to switch my lease to a "Land contract" to "get back at the city"

Aging parents with no investments

How can I make payments on the Internet without leaving a money trail?

What is the meaning of Triage in Cybersec world?

Time travel alters history but people keep saying nothing's changed

Falsification in Math vs Science

Is this food a bread or a loaf?

What is the motivation for a law requiring 2 parties to consent for recording a conversation

Protecting Dualbooting Windows from dangerous code (like rm -rf)

Output the Arecibo Message

Monty Hall variation



Multi-label text classification (3 targetted result columns)



The 2019 Stack Overflow Developer Survey Results Are InText-Classification-Problem, what is the right approach?RTextTools multi-label classificationMulti-label Text ClassificationHow to use binary relevance for multi-label text classification?Naive Bayes for Multi label text classificationExtract Pattern using Short Text ProcessingBuild train data set for natural language text classification?Large Numpy.Array for Multi-label Image Classification (CelebA Dataset)Multi Class Classification on large dataset with over 600 classesHow to use sklearn train_test_split to stratify data for multi-label classification?










0












$begingroup$


I am trying to build a multi-label text classifier for suggesting "AC_location", "Issue" and "Part_Affected" on the generated events.



The textual data is full of noise.



Example Data Set



The approach I have been following until now was a BOW approach with Tf-idf weighting (almost all the preprocessing steps i used so far like; stopwords removal, stemming, etc.). However, I could only figure it out how to apply a multi-class text classification approach on targetted result column "Issue" and reached an accuracy of around 0.55 on the test set using a random forest approach.



How to apply machine learning for other targetted result columns simultaneously?



I am looking specifically on how to apply machine learning for multi-label.



I have 20, 25 and 30 labels respectively and around 20000 data points. Data-set is not well-balanced but i will make it balance in future.



How to apply text classification for multiple target result columns?



PS: - i'm writing code in R by using the following libraries; tm, mlr, ranger, xgboost, etc.










share|improve this question











$endgroup$











  • $begingroup$
    have you tried Deep Learning based approach ? With tools like Ludwig, you can try this approach without significant amount of coding. github.com/uber/ludwig . user guide has an example on text classification : uber.github.io/ludwig/user_guide
    $endgroup$
    – Shamit Verma
    Mar 29 at 13:25















0












$begingroup$


I am trying to build a multi-label text classifier for suggesting "AC_location", "Issue" and "Part_Affected" on the generated events.



The textual data is full of noise.



Example Data Set



The approach I have been following until now was a BOW approach with Tf-idf weighting (almost all the preprocessing steps i used so far like; stopwords removal, stemming, etc.). However, I could only figure it out how to apply a multi-class text classification approach on targetted result column "Issue" and reached an accuracy of around 0.55 on the test set using a random forest approach.



How to apply machine learning for other targetted result columns simultaneously?



I am looking specifically on how to apply machine learning for multi-label.



I have 20, 25 and 30 labels respectively and around 20000 data points. Data-set is not well-balanced but i will make it balance in future.



How to apply text classification for multiple target result columns?



PS: - i'm writing code in R by using the following libraries; tm, mlr, ranger, xgboost, etc.










share|improve this question











$endgroup$











  • $begingroup$
    have you tried Deep Learning based approach ? With tools like Ludwig, you can try this approach without significant amount of coding. github.com/uber/ludwig . user guide has an example on text classification : uber.github.io/ludwig/user_guide
    $endgroup$
    – Shamit Verma
    Mar 29 at 13:25













0












0








0





$begingroup$


I am trying to build a multi-label text classifier for suggesting "AC_location", "Issue" and "Part_Affected" on the generated events.



The textual data is full of noise.



Example Data Set



The approach I have been following until now was a BOW approach with Tf-idf weighting (almost all the preprocessing steps i used so far like; stopwords removal, stemming, etc.). However, I could only figure it out how to apply a multi-class text classification approach on targetted result column "Issue" and reached an accuracy of around 0.55 on the test set using a random forest approach.



How to apply machine learning for other targetted result columns simultaneously?



I am looking specifically on how to apply machine learning for multi-label.



I have 20, 25 and 30 labels respectively and around 20000 data points. Data-set is not well-balanced but i will make it balance in future.



How to apply text classification for multiple target result columns?



PS: - i'm writing code in R by using the following libraries; tm, mlr, ranger, xgboost, etc.










share|improve this question











$endgroup$




I am trying to build a multi-label text classifier for suggesting "AC_location", "Issue" and "Part_Affected" on the generated events.



The textual data is full of noise.



Example Data Set



The approach I have been following until now was a BOW approach with Tf-idf weighting (almost all the preprocessing steps i used so far like; stopwords removal, stemming, etc.). However, I could only figure it out how to apply a multi-class text classification approach on targetted result column "Issue" and reached an accuracy of around 0.55 on the test set using a random forest approach.



How to apply machine learning for other targetted result columns simultaneously?



I am looking specifically on how to apply machine learning for multi-label.



I have 20, 25 and 30 labels respectively and around 20000 data points. Data-set is not well-balanced but i will make it balance in future.



How to apply text classification for multiple target result columns?



PS: - i'm writing code in R by using the following libraries; tm, mlr, ranger, xgboost, etc.







machine-learning r text-mining random-forest multilabel-classification






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Mar 30 at 2:47









Damini Jain

1136




1136










asked Mar 29 at 13:08









Ahsan NawazAhsan Nawaz

62




62











  • $begingroup$
    have you tried Deep Learning based approach ? With tools like Ludwig, you can try this approach without significant amount of coding. github.com/uber/ludwig . user guide has an example on text classification : uber.github.io/ludwig/user_guide
    $endgroup$
    – Shamit Verma
    Mar 29 at 13:25
















  • $begingroup$
    have you tried Deep Learning based approach ? With tools like Ludwig, you can try this approach without significant amount of coding. github.com/uber/ludwig . user guide has an example on text classification : uber.github.io/ludwig/user_guide
    $endgroup$
    – Shamit Verma
    Mar 29 at 13:25















$begingroup$
have you tried Deep Learning based approach ? With tools like Ludwig, you can try this approach without significant amount of coding. github.com/uber/ludwig . user guide has an example on text classification : uber.github.io/ludwig/user_guide
$endgroup$
– Shamit Verma
Mar 29 at 13:25




$begingroup$
have you tried Deep Learning based approach ? With tools like Ludwig, you can try this approach without significant amount of coding. github.com/uber/ludwig . user guide has an example on text classification : uber.github.io/ludwig/user_guide
$endgroup$
– Shamit Verma
Mar 29 at 13:25










0






active

oldest

votes












Your Answer





StackExchange.ifUsing("editor", function ()
return StackExchange.using("mathjaxEditing", function ()
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
);
);
, "mathjax-editing");

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "557"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);



);













draft saved

draft discarded


















StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f48217%2fmulti-label-text-classification-3-targetted-result-columns%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown

























0






active

oldest

votes








0






active

oldest

votes









active

oldest

votes






active

oldest

votes















draft saved

draft discarded
















































Thanks for contributing an answer to Data Science Stack Exchange!


  • Please be sure to answer the question. Provide details and share your research!

But avoid


  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.

Use MathJax to format equations. MathJax reference.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f48217%2fmulti-label-text-classification-3-targetted-result-columns%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Adding axes to figuresAdding axes labels to LaTeX figuresLaTeX equivalent of ConTeXt buffersRotate a node but not its content: the case of the ellipse decorationHow to define the default vertical distance between nodes?TikZ scaling graphic and adjust node position and keep font sizeNumerical conditional within tikz keys?adding axes to shapesAlign axes across subfiguresAdding figures with a certain orderLine up nested tikz enviroments or how to get rid of themAdding axes labels to LaTeX figures

Luettelo Yhdysvaltain laivaston lentotukialuksista Lähteet | Navigointivalikko

Gary (muusikko) Sisällysluettelo Historia | Rockin' High | Lähteet | Aiheesta muualla | NavigointivalikkoInfobox OKTuomas "Gary" Keskinen Ancaran kitaristiksiProjekti Rockin' High