Variable Importance in unsupervised anomaly detection algorithmsOutlier detection for unbalanced classesNetwork Anomaly detectionhow to compare different sets of time series dataUnsupervised Anomaly Detection in ImagesUsing local outlier factor score to detect outliers at run timeHow would I apply anomaly detection to time series data in LSTM?Anomaly Detection: Model Creation & ImplementationKnowing Feature Importance from Sparse Matrix

Why did the HMS Bounty go back to a time when whales are already rare?

Why is so much work done on numerical verification of the Riemann Hypothesis?

What should you do when eye contact makes your subordinate uncomfortable?

Is it better practice to read straight from sheet music rather than memorize it?

Does an advisor owe his/her student anything? Will an advisor keep a PhD student only out of pity?

Offered money to buy a house, seller is asking for more to cover gap between their listing and mortgage owed

Is a model fitted to data or is data fitted to a model?

Do Legal Documents Require Signing In Standard Pen Colors?

What spells are affected by the size of the caster?

Is there a single word describing earning money through any means?

Varistor? Purpose and principle

Freedom of speech and where it applies

Will the technology I first learn determine the direction of my future career?

What is the difference between Reference and Background image in 2.8

Biological Blimps: Propulsion

Reply 'no position' while the job posting is still there

What if a revenant (monster) gains fire resistance?

Non-trope happy ending?

Bob has never been a M before

How should I respond when I lied about my education and the company finds out through background check?

Pre-mixing cryogenic fuels and using only one fuel tank

The IT department bottlenecks progress. How should I handle this?

How is flyblackbird.com operating under Part 91K?

What prevents the use of a multi-segment ILS for non-straight approaches?



Variable Importance in unsupervised anomaly detection algorithms


Outlier detection for unbalanced classesNetwork Anomaly detectionhow to compare different sets of time series dataUnsupervised Anomaly Detection in ImagesUsing local outlier factor score to detect outliers at run timeHow would I apply anomaly detection to time series data in LSTM?Anomaly Detection: Model Creation & ImplementationKnowing Feature Importance from Sparse Matrix













0












$begingroup$


I am working on an anomaly detection problem to detect fraud in insurance claims. I have used the PyOD package and used algorithms like ABOD, CBLOF, Isolation Forest, and AutoEncoder. I couldn't find any way to identify the important features which make the data points anomalies ( like variable Importance in Random Forest). Is there a way to identify the important features in unsupervised anomaly detection?










share|improve this question











$endgroup$
















    0












    $begingroup$


    I am working on an anomaly detection problem to detect fraud in insurance claims. I have used the PyOD package and used algorithms like ABOD, CBLOF, Isolation Forest, and AutoEncoder. I couldn't find any way to identify the important features which make the data points anomalies ( like variable Importance in Random Forest). Is there a way to identify the important features in unsupervised anomaly detection?










    share|improve this question











    $endgroup$














      0












      0








      0





      $begingroup$


      I am working on an anomaly detection problem to detect fraud in insurance claims. I have used the PyOD package and used algorithms like ABOD, CBLOF, Isolation Forest, and AutoEncoder. I couldn't find any way to identify the important features which make the data points anomalies ( like variable Importance in Random Forest). Is there a way to identify the important features in unsupervised anomaly detection?










      share|improve this question











      $endgroup$




      I am working on an anomaly detection problem to detect fraud in insurance claims. I have used the PyOD package and used algorithms like ABOD, CBLOF, Isolation Forest, and AutoEncoder. I couldn't find any way to identify the important features which make the data points anomalies ( like variable Importance in Random Forest). Is there a way to identify the important features in unsupervised anomaly detection?







      python clustering anomaly-detection






      share|improve this question















      share|improve this question













      share|improve this question




      share|improve this question








      edited Mar 19 at 17:01









      Ethan

      564223




      564223










      asked Mar 19 at 13:36









      GokulramGokulram

      62




      62




















          2 Answers
          2






          active

          oldest

          votes


















          1












          $begingroup$

          This question has been asked so many times, yet I believe no widely accepted answer exists, especially in the case of black box models such as neural networks.



          A way to go may be sensitivity analysis, i.e. evaluate the change in the output of the model for small changes in the individual inputs. The higher the change in the output, the more important the feature.






          share|improve this answer









          $endgroup$












          • $begingroup$
            Thank you for the answer
            $endgroup$
            – Gokulram
            Mar 20 at 12:37


















          0












          $begingroup$

          There have been workshops dedicated to "outlier detection and description" (ODD), but there came out nothing from them that convinced me, unfortunately. but YMMV.
          It definitely won't be enough to just use some library! You'll need to read and implement papers. There are subspace outlier detectors and correlation outliers, for example, that will tell you which features were relevant for a particular outlier.



          But in general I believe you'll quickly run into multiple testing problems: in real data, every point is anomalous if you just try attribute combinations hard enough. Donald Trump is anomalous because of his orange skin, fake hair, and small hands, for example. If you only look at his xenophobia, he probably is pretty normal, unfortunately.






          share|improve this answer









          $endgroup$












          • $begingroup$
            Thank you for the hints.
            $endgroup$
            – Gokulram
            Mar 20 at 12:38










          Your Answer





          StackExchange.ifUsing("editor", function ()
          return StackExchange.using("mathjaxEditing", function ()
          StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
          StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
          );
          );
          , "mathjax-editing");

          StackExchange.ready(function()
          var channelOptions =
          tags: "".split(" "),
          id: "557"
          ;
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function()
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled)
          StackExchange.using("snippets", function()
          createEditor();
          );

          else
          createEditor();

          );

          function createEditor()
          StackExchange.prepareEditor(
          heartbeatType: 'answer',
          autoActivateHeartbeat: false,
          convertImagesToLinks: false,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: null,
          bindNavPrevention: true,
          postfix: "",
          imageUploader:
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          ,
          onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          );



          );













          draft saved

          draft discarded


















          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47616%2fvariable-importance-in-unsupervised-anomaly-detection-algorithms%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown

























          2 Answers
          2






          active

          oldest

          votes








          2 Answers
          2






          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes









          1












          $begingroup$

          This question has been asked so many times, yet I believe no widely accepted answer exists, especially in the case of black box models such as neural networks.



          A way to go may be sensitivity analysis, i.e. evaluate the change in the output of the model for small changes in the individual inputs. The higher the change in the output, the more important the feature.






          share|improve this answer









          $endgroup$












          • $begingroup$
            Thank you for the answer
            $endgroup$
            – Gokulram
            Mar 20 at 12:37















          1












          $begingroup$

          This question has been asked so many times, yet I believe no widely accepted answer exists, especially in the case of black box models such as neural networks.



          A way to go may be sensitivity analysis, i.e. evaluate the change in the output of the model for small changes in the individual inputs. The higher the change in the output, the more important the feature.






          share|improve this answer









          $endgroup$












          • $begingroup$
            Thank you for the answer
            $endgroup$
            – Gokulram
            Mar 20 at 12:37













          1












          1








          1





          $begingroup$

          This question has been asked so many times, yet I believe no widely accepted answer exists, especially in the case of black box models such as neural networks.



          A way to go may be sensitivity analysis, i.e. evaluate the change in the output of the model for small changes in the individual inputs. The higher the change in the output, the more important the feature.






          share|improve this answer









          $endgroup$



          This question has been asked so many times, yet I believe no widely accepted answer exists, especially in the case of black box models such as neural networks.



          A way to go may be sensitivity analysis, i.e. evaluate the change in the output of the model for small changes in the individual inputs. The higher the change in the output, the more important the feature.







          share|improve this answer












          share|improve this answer



          share|improve this answer










          answered Mar 19 at 18:50









          pcko1pcko1

          1,581417




          1,581417











          • $begingroup$
            Thank you for the answer
            $endgroup$
            – Gokulram
            Mar 20 at 12:37
















          • $begingroup$
            Thank you for the answer
            $endgroup$
            – Gokulram
            Mar 20 at 12:37















          $begingroup$
          Thank you for the answer
          $endgroup$
          – Gokulram
          Mar 20 at 12:37




          $begingroup$
          Thank you for the answer
          $endgroup$
          – Gokulram
          Mar 20 at 12:37











          0












          $begingroup$

          There have been workshops dedicated to "outlier detection and description" (ODD), but there came out nothing from them that convinced me, unfortunately. but YMMV.
          It definitely won't be enough to just use some library! You'll need to read and implement papers. There are subspace outlier detectors and correlation outliers, for example, that will tell you which features were relevant for a particular outlier.



          But in general I believe you'll quickly run into multiple testing problems: in real data, every point is anomalous if you just try attribute combinations hard enough. Donald Trump is anomalous because of his orange skin, fake hair, and small hands, for example. If you only look at his xenophobia, he probably is pretty normal, unfortunately.






          share|improve this answer









          $endgroup$












          • $begingroup$
            Thank you for the hints.
            $endgroup$
            – Gokulram
            Mar 20 at 12:38















          0












          $begingroup$

          There have been workshops dedicated to "outlier detection and description" (ODD), but there came out nothing from them that convinced me, unfortunately. but YMMV.
          It definitely won't be enough to just use some library! You'll need to read and implement papers. There are subspace outlier detectors and correlation outliers, for example, that will tell you which features were relevant for a particular outlier.



          But in general I believe you'll quickly run into multiple testing problems: in real data, every point is anomalous if you just try attribute combinations hard enough. Donald Trump is anomalous because of his orange skin, fake hair, and small hands, for example. If you only look at his xenophobia, he probably is pretty normal, unfortunately.






          share|improve this answer









          $endgroup$












          • $begingroup$
            Thank you for the hints.
            $endgroup$
            – Gokulram
            Mar 20 at 12:38













          0












          0








          0





          $begingroup$

          There have been workshops dedicated to "outlier detection and description" (ODD), but there came out nothing from them that convinced me, unfortunately. but YMMV.
          It definitely won't be enough to just use some library! You'll need to read and implement papers. There are subspace outlier detectors and correlation outliers, for example, that will tell you which features were relevant for a particular outlier.



          But in general I believe you'll quickly run into multiple testing problems: in real data, every point is anomalous if you just try attribute combinations hard enough. Donald Trump is anomalous because of his orange skin, fake hair, and small hands, for example. If you only look at his xenophobia, he probably is pretty normal, unfortunately.






          share|improve this answer









          $endgroup$



          There have been workshops dedicated to "outlier detection and description" (ODD), but there came out nothing from them that convinced me, unfortunately. but YMMV.
          It definitely won't be enough to just use some library! You'll need to read and implement papers. There are subspace outlier detectors and correlation outliers, for example, that will tell you which features were relevant for a particular outlier.



          But in general I believe you'll quickly run into multiple testing problems: in real data, every point is anomalous if you just try attribute combinations hard enough. Donald Trump is anomalous because of his orange skin, fake hair, and small hands, for example. If you only look at his xenophobia, he probably is pretty normal, unfortunately.







          share|improve this answer












          share|improve this answer



          share|improve this answer










          answered Mar 19 at 19:35









          Anony-MousseAnony-Mousse

          5,010624




          5,010624











          • $begingroup$
            Thank you for the hints.
            $endgroup$
            – Gokulram
            Mar 20 at 12:38
















          • $begingroup$
            Thank you for the hints.
            $endgroup$
            – Gokulram
            Mar 20 at 12:38















          $begingroup$
          Thank you for the hints.
          $endgroup$
          – Gokulram
          Mar 20 at 12:38




          $begingroup$
          Thank you for the hints.
          $endgroup$
          – Gokulram
          Mar 20 at 12:38

















          draft saved

          draft discarded
















































          Thanks for contributing an answer to Data Science Stack Exchange!


          • Please be sure to answer the question. Provide details and share your research!

          But avoid


          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.

          Use MathJax to format equations. MathJax reference.


          To learn more, see our tips on writing great answers.




          draft saved


          draft discarded














          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47616%2fvariable-importance-in-unsupervised-anomaly-detection-algorithms%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown





















































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown

































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown







          Popular posts from this blog

          Luettelo Yhdysvaltain laivaston lentotukialuksista Lähteet | Navigointivalikko

          Adding axes to figuresAdding axes labels to LaTeX figuresLaTeX equivalent of ConTeXt buffersRotate a node but not its content: the case of the ellipse decorationHow to define the default vertical distance between nodes?TikZ scaling graphic and adjust node position and keep font sizeNumerical conditional within tikz keys?adding axes to shapesAlign axes across subfiguresAdding figures with a certain orderLine up nested tikz enviroments or how to get rid of themAdding axes labels to LaTeX figures

          Gary (muusikko) Sisällysluettelo Historia | Rockin' High | Lähteet | Aiheesta muualla | NavigointivalikkoInfobox OKTuomas "Gary" Keskinen Ancaran kitaristiksiProjekti Rockin' High