How to find the data attribute of an <object> element with XPath









I am trying to read the data attribute of an <object> element on a website using XPath, but the selector does not find the object.



I am using the following, which works for divs but returns nothing when I try to find object elements:



response.xpath("//object").extract()


This is the object I am trying to extract data from (I need the URL in its data attribute):



<object id="swfobject_embed" type="application/x-shockwave-flash" data="https://uploads.ungrounded.net/575000/575163_Superfighters.swf?123" style="visibility: visible;" width="800" height="600"><param name="wmode" value="direct"><param name="allowscriptaccess" value="never"><param name="allowfullscreen" value="true"><param name="allowfullscreeninteractive" value="true"><param name="flashvars" value="NewgroundsAPI_PublisherID=1&amp;NewgroundsAPI_SandboxID=5be5746a96d56&amp;NewgroundsAPI_SessionID=&amp;NewgroundsAPI_UserName=&amp;lt;deleted&amp;gt;&amp;NewgroundsAPI_UserID=0&amp;ng_username=&amp;lt;deleted&amp;gt;"></object>









  • If //object returns nothing, then there is no object in the page source. Can you share the page URL?
    – Andersson
    Nov 9 at 12:05










  • @Andersson I am trying to get the swf files for games on Newgrounds. I was using this game to test newgrounds.com/portal/view/575163
    – Gus
    Nov 10 at 3:27














html object url xpath web-scraping






edited Nov 10 at 9:44 by Andersson










asked Nov 9 at 11:57 by Gus











1 Answer (accepted)

There is no <object> node in the page source: it is generated dynamically by JavaScript. You can parse the JavaScript call directly to get the required SWF file URL:



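import re

import requests

# Fetch the raw HTML; the <object> element is not in it because JavaScript adds it later.
response = requests.get('https://www.newgrounds.com/portal/view/575163').text
# The SWF URL is the first argument of the swfobject.embedSWF(...) call in the page's script.
# The dot and the opening parenthesis are escaped so they match literally.
swf_url = re.search(r'swfobject\.embedSWF\("(.+?)",', response).group(1)
print(swf_url)

# https://uploads.ungrounded.net/575000/575163_Superfighters.swf?123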
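
If the page does not contain the embed call (for example, the markup changed or the game is not a Flash file), re.search returns None and .group(1) raises AttributeError. A minimal sketch of a more defensive variant follows. The helper name extract_swf_url and the sample HTML fragment are made up for illustration, and the sketch assumes the URL is passed as a double-quoted first argument, as in the snippet above.

```python
import re
from typing import Optional

# Matches the first double-quoted argument of a swfobject.embedSWF("...", ...) call.
# The dot and the opening parenthesis are escaped so they match literally.
EMBED_SWF_RE = re.compile(r'swfobject\.embedSWF\(\s*"(.+?)"\s*,')


def extract_swf_url(html: str) -> Optional[str]:
    """Return the URL passed to swfobject.embedSWF, or None if no call is found."""
    match = EMBED_SWF_RE.search(html)
    return match.group(1) if match else None


# Hypothetical page fragment, invented for this example (not real Newgrounds markup).
sample_html = """
<div id="flash_container"></div>
<script>
swfobject.embedSWF("https://example.com/games/demo.swf?123", "flash_container", "800", "600");
</script>
"""

print(extract_swf_url(sample_html))             # https://example.com/games/demo.swf?123
print(extract_swf_url("<p>no embed here</p>"))  # None
```

Returning None instead of raising lets the caller decide whether to skip the page or log it, which tends to be more convenient when looping over many game pages.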





answered Nov 10 at 9:42 by Andersson


























