ways to represent document by its keyword vectors

A Cautionary Suggestion

Python if-else code style for reduced code for rounding floats

Describing a chess game in a novel

Why did it take so long to abandon sail after steamships were demonstrated?

What options are left, if Britain cannot decide?

Can a druid choose the size of its wild shape beast?

Shortcut for setting origin to vertice

How to terminate ping <dest> &

What did Alexander Pope mean by "Expletives their feeble Aid do join"?

How well should I expect Adam to work?

What do you call the act of removing a part of a word and replacing it with an apostrophe

How to explain that I do not want to visit a country due to personal safety concern?

Tikz diagrams and node placements

Convergence in probability and convergence in distribution

Is it normal that my co-workers at a fitness company criticize my food choices?

Why no Iridium-level flares from other satellites?

How to write cleanly even if my character uses expletive language?

What's the meaning of “spike” in the context of “adrenaline spike”?

Unexpected result from ArcLength

Knife as defense against stray dogs

group theory by geometry

How do I parse a string to number while destructuring?

What are substitutions for coconut in curry?

Pauli exclusion principle



ways to represent document by its keyword vectors














0












$begingroup$


I have documents, say for example, D1, D2, D3... Dm.



Every Di has its individual components or keywords k1, k2, k3,... kn, where ki is an n-dimensional vector. The number of individual components varies between documents.



What are the ways to find how close Di are? Or what is the best way to represent a document using its keyword vectors? Please note that I'm using a custom embedding here.









share









$endgroup$
















    0












    $begingroup$


    I have documents, say for example, D1, D2, D3... Dm.



    Every Di has its individual components or keywords k1, k2, k3,... kn, where ki is an n-dimensional vector. The number of individual components varies between documents.



    What are the ways to find how close Di are? Or what is the best way to represent a document using its keyword vectors? Please note that I'm using a custom embedding here.









    share









    $endgroup$














      0












      0








      0





      $begingroup$


      I have documents, say for example, D1, D2, D3... Dm.



      Every Di has its individual components or keywords k1, k2, k3,... kn, where ki is an n-dimensional vector. The number of individual components varies between documents.



      What are the ways to find how close Di are? Or what is the best way to represent a document using its keyword vectors? Please note that I'm using a custom embedding here.









      share









      $endgroup$




      I have documents, say for example, D1, D2, D3... Dm.



      Every Di has its individual components or keywords k1, k2, k3,... kn, where ki is an n-dimensional vector. The number of individual components varies between documents.



      What are the ways to find how close Di are? Or what is the best way to represent a document using its keyword vectors? Please note that I'm using a custom embedding here.







      nlp word-embeddings similar-documents vector-space-models





      share












      share










      share



      share










      asked 1 min ago









      Van PeerVan Peer

      103117




      103117




















          0






          active

          oldest

          votes











          Your Answer





          StackExchange.ifUsing("editor", function ()
          return StackExchange.using("mathjaxEditing", function ()
          StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
          StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
          );
          );
          , "mathjax-editing");

          StackExchange.ready(function()
          var channelOptions =
          tags: "".split(" "),
          id: "557"
          ;
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function()
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled)
          StackExchange.using("snippets", function()
          createEditor();
          );

          else
          createEditor();

          );

          function createEditor()
          StackExchange.prepareEditor(
          heartbeatType: 'answer',
          autoActivateHeartbeat: false,
          convertImagesToLinks: false,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: null,
          bindNavPrevention: true,
          postfix: "",
          imageUploader:
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          ,
          onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          );



          );













          draft saved

          draft discarded


















          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47429%2fways-to-represent-document-by-its-keyword-vectors%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown

























          0






          active

          oldest

          votes








          0






          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes















          draft saved

          draft discarded
















































          Thanks for contributing an answer to Data Science Stack Exchange!


          • Please be sure to answer the question. Provide details and share your research!

          But avoid


          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.

          Use MathJax to format equations. MathJax reference.


          To learn more, see our tips on writing great answers.




          draft saved


          draft discarded














          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47429%2fways-to-represent-document-by-its-keyword-vectors%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown





















































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown

































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown







          Popular posts from this blog

          Is flight data recorder erased after every flight?When are black boxes used?What protects the location beacon (pinger) of a flight data recorder?Is there anywhere I can pick up raw flight data recorder information?Who legally owns the Flight Data Recorder?Constructing flight recorder dataWhy are FDRs and CVRs still two separate physical devices?What are the data elements shown on the GE235 flight data recorder (FDR) plot?Are CVR and FDR reset after every flight?What is the format of data stored by a Flight Data Recorder?How much data is stored in the flight data recorder per hour in a typical flight of an A380?Is a smart flight data recorder possible?

          Which is better: GPT or RelGAN for text generation?2019 Community Moderator ElectionWhat is the difference between TextGAN and LM for text generation?GANs (generative adversarial networks) possible for text as well?Generator loss not decreasing- text to image synthesisChoosing a right algorithm for template-based text generationHow should I format input and output for text generation with LSTMsGumbel Softmax vs Vanilla Softmax for GAN trainingWhich neural network to choose for classification from text/speech?NLP text autoencoder that generates text in poetic meterWhat is the interpretation of the expectation notation in the GAN formulation?What is the difference between TextGAN and LM for text generation?How to prepare the data for text generation task

          Is there a general name for the setup in which payoffs are not known exactly but players try to influence each other's perception of the payoffs?Osborne, Nash equilibria and the correctness of beliefsIs there a name for this family of games (Binomial games?)?Perfect Bayesian EquilibriumCalculating mixed strategy equilibrium in battle of sexesPure Strategy SPNEIs there a commitment mechanism which allows players to achieve pareto optimal solutions?Extensive Form GamesAn $n$-player prisoner's dilemma where a coalition of 2 players is better off defectingTit-For-Stat Strategy Best RepliesPotential solutions of the $n$-player Prisoner's Dilemma