rule generation in a big dataset Announcing the arrival of Valued Associate #679: Cesar Manara Planned maintenance scheduled April 17/18, 2019 at 00:00UTC (8:00pm US/Eastern) 2019 Moderator Election Q&A - Questionnaire 2019 Community Moderator Election ResultsSequence pattern mining on continuous datasetWhat is the relationship between clustering and association rule mining?Association rule mining interpretationMapping xml tags by Rule Learning/Generation AlgorithmsAssociation Rule Learning for Home Electricity or Water Data?Can inferencing come from incomplete rule sets?Categorical data with order and blanks, is frequent dataset or k-modes a better option?How is FP-Tree used in FP-Growth maintained for large datasetSequence extraction in a dataset

What would be the ideal power source for a cybernetic eye?

What are the possible ways to detect skin while classifying diseases?

iPhone Wallpaper?

How do I stop a creek from eroding my steep embankment?

What are the motives behind Cersei's orders given to Bronn?

How to find all the available tools in macOS terminal?

What LEGO pieces have "real-world" functionality?

Is a manifold-with-boundary with given interior and non-empty boundary essentially unique?

Dominant seventh chord in the major scale contains diminished triad of the seventh?

How to recreate this effect in Photoshop?

What causes the vertical darker bands in my photo?

How to do this path/lattice with tikz

Can a non-EU citizen traveling with me come with me through the EU passport line?

Storing hydrofluoric acid before the invention of plastics

Proof involving the spectral radius and the Jordan canonical form

I am not a queen, who am I?

What happens to sewage if there is no river near by?

Do you forfeit tax refunds/credits if you aren't required to and don't file by April 15?

Is there a documented rationale why the House Ways and Means chairman can demand tax info?

Why does Python start at index -1 (as opposed to 0) when indexing a list from the end?

Why aren't air breathing engines used as small first stages

Do I really need recursive chmod to restrict access to a folder?

List *all* the tuples!

Is there a concise way to say "all of the X, one of each"?



rule generation in a big dataset



Announcing the arrival of Valued Associate #679: Cesar Manara
Planned maintenance scheduled April 17/18, 2019 at 00:00UTC (8:00pm US/Eastern)
2019 Moderator Election Q&A - Questionnaire
2019 Community Moderator Election ResultsSequence pattern mining on continuous datasetWhat is the relationship between clustering and association rule mining?Association rule mining interpretationMapping xml tags by Rule Learning/Generation AlgorithmsAssociation Rule Learning for Home Electricity or Water Data?Can inferencing come from incomplete rule sets?Categorical data with order and blanks, is frequent dataset or k-modes a better option?How is FP-Tree used in FP-Growth maintained for large datasetSequence extraction in a dataset










1












$begingroup$


Given a dataset with 30 fields and 25000 instances,



1) what are your suggestions for novel methods of rule extraction?



2) Can I use association rule mining in addition to sequential rule mining?



3) which method can be more appropriate for such a big dataset?Apriori-based like SPIRIT, SPADE, SPAM, IBM or pattern growth ones like FreeSpan, PrefixSpan, SLPMiner?



*the output field (risk) is labelled as very low, low, medium, high, very high. Also, there is a temporal field (date) for each instance. An example is given below.



$$
beginarrayc
hline
mathbfdate& mathbftemperature & mathbfdensity & mathbfrisk \ hline
2018/1/2 & 15 & 100 & textvery high\ hline
& & &\
endarray
$$










share|improve this question











$endgroup$











  • $begingroup$
    Hi Anna, What do you mean by novel? There are methods to extract rules, yes. Without any given target variable you could use both methods to find correlations in the dataset.
    $endgroup$
    – S van Balen
    Apr 1 at 22:15










  • $begingroup$
    Hi Balen, by the novel, I meant, for example, using deep neural networks or those that can work with my large dataset.
    $endgroup$
    – anna
    Apr 2 at 17:08















1












$begingroup$


Given a dataset with 30 fields and 25000 instances,



1) what are your suggestions for novel methods of rule extraction?



2) Can I use association rule mining in addition to sequential rule mining?



3) which method can be more appropriate for such a big dataset?Apriori-based like SPIRIT, SPADE, SPAM, IBM or pattern growth ones like FreeSpan, PrefixSpan, SLPMiner?



*the output field (risk) is labelled as very low, low, medium, high, very high. Also, there is a temporal field (date) for each instance. An example is given below.



$$
beginarrayc
hline
mathbfdate& mathbftemperature & mathbfdensity & mathbfrisk \ hline
2018/1/2 & 15 & 100 & textvery high\ hline
& & &\
endarray
$$










share|improve this question











$endgroup$











  • $begingroup$
    Hi Anna, What do you mean by novel? There are methods to extract rules, yes. Without any given target variable you could use both methods to find correlations in the dataset.
    $endgroup$
    – S van Balen
    Apr 1 at 22:15










  • $begingroup$
    Hi Balen, by the novel, I meant, for example, using deep neural networks or those that can work with my large dataset.
    $endgroup$
    – anna
    Apr 2 at 17:08













1












1








1





$begingroup$


Given a dataset with 30 fields and 25000 instances,



1) what are your suggestions for novel methods of rule extraction?



2) Can I use association rule mining in addition to sequential rule mining?



3) which method can be more appropriate for such a big dataset?Apriori-based like SPIRIT, SPADE, SPAM, IBM or pattern growth ones like FreeSpan, PrefixSpan, SLPMiner?



*the output field (risk) is labelled as very low, low, medium, high, very high. Also, there is a temporal field (date) for each instance. An example is given below.



$$
beginarrayc
hline
mathbfdate& mathbftemperature & mathbfdensity & mathbfrisk \ hline
2018/1/2 & 15 & 100 & textvery high\ hline
& & &\
endarray
$$










share|improve this question











$endgroup$




Given a dataset with 30 fields and 25000 instances,



1) what are your suggestions for novel methods of rule extraction?



2) Can I use association rule mining in addition to sequential rule mining?



3) which method can be more appropriate for such a big dataset?Apriori-based like SPIRIT, SPADE, SPAM, IBM or pattern growth ones like FreeSpan, PrefixSpan, SLPMiner?



*the output field (risk) is labelled as very low, low, medium, high, very high. Also, there is a temporal field (date) for each instance. An example is given below.



$$
beginarrayc
hline
mathbfdate& mathbftemperature & mathbfdensity & mathbfrisk \ hline
2018/1/2 & 15 & 100 & textvery high\ hline
& & &\
endarray
$$







association-rules sequential-pattern-mining






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Apr 2 at 17:12







anna

















asked Apr 1 at 14:56









annaanna

63




63











  • $begingroup$
    Hi Anna, What do you mean by novel? There are methods to extract rules, yes. Without any given target variable you could use both methods to find correlations in the dataset.
    $endgroup$
    – S van Balen
    Apr 1 at 22:15










  • $begingroup$
    Hi Balen, by the novel, I meant, for example, using deep neural networks or those that can work with my large dataset.
    $endgroup$
    – anna
    Apr 2 at 17:08
















  • $begingroup$
    Hi Anna, What do you mean by novel? There are methods to extract rules, yes. Without any given target variable you could use both methods to find correlations in the dataset.
    $endgroup$
    – S van Balen
    Apr 1 at 22:15










  • $begingroup$
    Hi Balen, by the novel, I meant, for example, using deep neural networks or those that can work with my large dataset.
    $endgroup$
    – anna
    Apr 2 at 17:08















$begingroup$
Hi Anna, What do you mean by novel? There are methods to extract rules, yes. Without any given target variable you could use both methods to find correlations in the dataset.
$endgroup$
– S van Balen
Apr 1 at 22:15




$begingroup$
Hi Anna, What do you mean by novel? There are methods to extract rules, yes. Without any given target variable you could use both methods to find correlations in the dataset.
$endgroup$
– S van Balen
Apr 1 at 22:15












$begingroup$
Hi Balen, by the novel, I meant, for example, using deep neural networks or those that can work with my large dataset.
$endgroup$
– anna
Apr 2 at 17:08




$begingroup$
Hi Balen, by the novel, I meant, for example, using deep neural networks or those that can work with my large dataset.
$endgroup$
– anna
Apr 2 at 17:08










0






active

oldest

votes












Your Answer








StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "557"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);



);













draft saved

draft discarded


















StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f48360%2frule-generation-in-a-big-dataset%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown

























0






active

oldest

votes








0






active

oldest

votes









active

oldest

votes






active

oldest

votes















draft saved

draft discarded
















































Thanks for contributing an answer to Data Science Stack Exchange!


  • Please be sure to answer the question. Provide details and share your research!

But avoid


  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.

Use MathJax to format equations. MathJax reference.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f48360%2frule-generation-in-a-big-dataset%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Marja Vauras Lähteet | Aiheesta muualla | NavigointivalikkoMarja Vauras Turun yliopiston tutkimusportaalissaInfobox OKSuomalaisen Tiedeakatemian varsinaiset jäsenetKasvatustieteiden tiedekunnan dekaanit ja muu johtoMarja VaurasKoulutusvienti on kestävyys- ja ketteryyslaji (2.5.2017)laajentamallaWorldCat Identities0000 0001 0855 9405n86069603utb201588738523620927

Which is better: GPT or RelGAN for text generation?2019 Community Moderator ElectionWhat is the difference between TextGAN and LM for text generation?GANs (generative adversarial networks) possible for text as well?Generator loss not decreasing- text to image synthesisChoosing a right algorithm for template-based text generationHow should I format input and output for text generation with LSTMsGumbel Softmax vs Vanilla Softmax for GAN trainingWhich neural network to choose for classification from text/speech?NLP text autoencoder that generates text in poetic meterWhat is the interpretation of the expectation notation in the GAN formulation?What is the difference between TextGAN and LM for text generation?How to prepare the data for text generation task

Is this part of the description of the Archfey warlock's Misty Escape feature redundant?When is entropic ward considered “used”?How does the reaction timing work for Wrath of the Storm? Can it potentially prevent the damage from the triggering attack?Does the Dark Arts Archlich warlock patrons's Arcane Invisibility activate every time you cast a level 1+ spell?When attacking while invisible, when exactly does invisibility break?Can I cast Hellish Rebuke on my turn?Do I have to “pre-cast” a reaction spell in order for it to be triggered?What happens if a Player Misty Escapes into an Invisible CreatureCan a reaction interrupt multiattack?Does the Fiend-patron warlock's Hurl Through Hell feature dispel effects that require the target to be on the same plane as the caster?What are you allowed to do while using the Warlock's Eldritch Master feature?