Suggestion about Minhash implementation with n permutation










1














I'm trying to understand LSH implementation. I found this on stackoverflow



Can you suggest a good minhash implementation?



and I try to follow the Duhaime's implementation.



In my case, i wish apply a permutation on the minhash(like in datasketch tool), and i think this implementation isn't good for me.
I already start from sparse matrix.



Someone can give some suggestion about this tecnique? isn't very diffuse so i don't find more material about implementation with Python.



I hope in you help.










share|improve this question


























    1














    I'm trying to understand LSH implementation. I found this on stackoverflow



    Can you suggest a good minhash implementation?



    and I try to follow the Duhaime's implementation.



    In my case, i wish apply a permutation on the minhash(like in datasketch tool), and i think this implementation isn't good for me.
    I already start from sparse matrix.



    Someone can give some suggestion about this tecnique? isn't very diffuse so i don't find more material about implementation with Python.



    I hope in you help.










    share|improve this question
























      1












      1








      1







      I'm trying to understand LSH implementation. I found this on stackoverflow



      Can you suggest a good minhash implementation?



      and I try to follow the Duhaime's implementation.



      In my case, i wish apply a permutation on the minhash(like in datasketch tool), and i think this implementation isn't good for me.
      I already start from sparse matrix.



      Someone can give some suggestion about this tecnique? isn't very diffuse so i don't find more material about implementation with Python.



      I hope in you help.










      share|improve this question













      I'm trying to understand LSH implementation. I found this on stackoverflow



      Can you suggest a good minhash implementation?



      and I try to follow the Duhaime's implementation.



      In my case, i wish apply a permutation on the minhash(like in datasketch tool), and i think this implementation isn't good for me.
      I already start from sparse matrix.



      Someone can give some suggestion about this tecnique? isn't very diffuse so i don't find more material about implementation with Python.



      I hope in you help.







      python matrix dataset bigdata data-mining






      share|improve this question













      share|improve this question











      share|improve this question




      share|improve this question










      asked Nov 10 '18 at 15:43









      theantomctheantomc

      1008




      1008






















          1 Answer
          1






          active

          oldest

          votes


















          0














          Don't just look for example code. Try to understand the math behind it.



          Obviously, maxhash should work similar. Or you could omit 0 values. But then you should double check the math.






          share|improve this answer




















            Your Answer






            StackExchange.ifUsing("editor", function ()
            StackExchange.using("externalEditor", function ()
            StackExchange.using("snippets", function ()
            StackExchange.snippets.init();
            );
            );
            , "code-snippets");

            StackExchange.ready(function()
            var channelOptions =
            tags: "".split(" "),
            id: "1"
            ;
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function()
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled)
            StackExchange.using("snippets", function()
            createEditor();
            );

            else
            createEditor();

            );

            function createEditor()
            StackExchange.prepareEditor(
            heartbeatType: 'answer',
            autoActivateHeartbeat: false,
            convertImagesToLinks: true,
            noModals: true,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: 10,
            bindNavPrevention: true,
            postfix: "",
            imageUploader:
            brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
            contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
            allowUrls: true
            ,
            onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            );



            );













            draft saved

            draft discarded


















            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53240565%2fsuggestion-about-minhash-implementation-with-n-permutation%23new-answer', 'question_page');

            );

            Post as a guest















            Required, but never shown

























            1 Answer
            1






            active

            oldest

            votes








            1 Answer
            1






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes









            0














            Don't just look for example code. Try to understand the math behind it.



            Obviously, maxhash should work similar. Or you could omit 0 values. But then you should double check the math.






            share|improve this answer

























              0














              Don't just look for example code. Try to understand the math behind it.



              Obviously, maxhash should work similar. Or you could omit 0 values. But then you should double check the math.






              share|improve this answer























                0












                0








                0






                Don't just look for example code. Try to understand the math behind it.



                Obviously, maxhash should work similar. Or you could omit 0 values. But then you should double check the math.






                share|improve this answer












                Don't just look for example code. Try to understand the math behind it.



                Obviously, maxhash should work similar. Or you could omit 0 values. But then you should double check the math.







                share|improve this answer












                share|improve this answer



                share|improve this answer










                answered Nov 30 '18 at 8:13









                Anony-MousseAnony-Mousse

                57.5k796159




                57.5k796159



























                    draft saved

                    draft discarded
















































                    Thanks for contributing an answer to Stack Overflow!


                    • Please be sure to answer the question. Provide details and share your research!

                    But avoid


                    • Asking for help, clarification, or responding to other answers.

                    • Making statements based on opinion; back them up with references or personal experience.

                    To learn more, see our tips on writing great answers.




                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function ()
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53240565%2fsuggestion-about-minhash-implementation-with-n-permutation%23new-answer', 'question_page');

                    );

                    Post as a guest















                    Required, but never shown





















































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown

































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown







                    Popular posts from this blog

                    𛂒𛀶,𛀽𛀑𛂀𛃧𛂓𛀙𛃆𛃑𛃷𛂟𛁡𛀢𛀟𛁤𛂽𛁕𛁪𛂟𛂯,𛁞𛂧𛀴𛁄𛁠𛁼𛂿𛀤 𛂘,𛁺𛂾𛃭𛃭𛃵𛀺,𛂣𛃍𛂖𛃶 𛀸𛃀𛂖𛁶𛁏𛁚 𛂢𛂞 𛁰𛂆𛀔,𛁸𛀽𛁓𛃋𛂇𛃧𛀧𛃣𛂐𛃇,𛂂𛃻𛃲𛁬𛃞𛀧𛃃𛀅 𛂭𛁠𛁡𛃇𛀷𛃓𛁥,𛁙𛁘𛁞𛃸𛁸𛃣𛁜,𛂛,𛃿,𛁯𛂘𛂌𛃛𛁱𛃌𛂈𛂇 𛁊𛃲,𛀕𛃴𛀜 𛀶𛂆𛀶𛃟𛂉𛀣,𛂐𛁞𛁾 𛁷𛂑𛁳𛂯𛀬𛃅,𛃶𛁼

                    Edmonton

                    Crossroads (UK TV series)