{"id":12976,"date":"2022-10-02T15:31:28","date_gmt":"2022-10-02T10:01:28","guid":{"rendered":"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/"},"modified":"2022-10-02T15:31:28","modified_gmt":"2022-10-02T10:01:28","slug":"solved-extracting-variables-from-javascript-inside-html","status":"publish","type":"post","link":"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/","title":{"rendered":"[Solved] Extracting variables from Javascript inside HTML"},"content":{"rendered":"<p> [ad_1]<br \/>\n<\/p>\n<div id=\"answer-52523377\" class=\"answer js-answer accepted-answer js-accepted-answer\" data-answerid=\"52523377\" data-parentid=\"52522884\" data-score=\"0\" data-position-on-page=\"1\" data-highest-scored=\"1\" data-question-has-accepted-highest-score=\"1\" itemprop=\"acceptedAnswer\" itemscope itemtype=\"https:\/\/schema.org\/Answer\">\n<div class=\"post-layout\">\n<div class=\"votecell post-layout--left\"><\/div>\n<div class=\"answercell post-layout--right\">\n<div class=\"s-prose js-post-body\" itemprop=\"text\">\n<p>You could use BeautifulSoup to extract the <code>&lt;script&gt;<\/code> tag, but you would still need an alternative approach to extract the information inside.<\/p>\n<p>Some Python can be used to first extract <code>flashvars<\/code> and then pass this to <code>demjson<\/code> to convert the Javascript dictionary into a Python one. For example:<\/p>\n<pre><code>import demjson\n\ncontent = \"\"\"&lt;script type=\"text\/javascript\"&gt;\/* &lt;![CDATA[ *\/ \n... \n...\n&lt;\/script&gt;\"\"\"\n\nscript_var = content.split('var flashvars=\")[1]\nscript_var = script_var[:script_var.find(\"};') + 1]\ndata = demjson.decode(script_var)\n\nprint(data['video_url'])\nprint(data['video_alt_url'])\n<\/code><\/pre>\n<p>This would then display:<\/p>\n<pre class=\"lang-none prettyprint-override\"><code>https:\/\/www.ptrex.com\/get_file\/4\/996a9088fdf801992d24457cd51469f3f7aaaee6a0\/33000\/33247\/33247.mp4\/\nhttps:\/\/www.ptrex.com\/get_file\/4\/774833c428771edee2cf401ef2264e746a06f9f370\/33000\/33247\/33247_720p.mp4\/\n<\/code><\/pre>\n<p><code>demjson<\/code> is an alternative JSON decoder which can be installed via PIP<\/p>\n<pre><code>pip install demjson\n<\/code><\/pre>\n<\/p><\/div>\n<div class=\"mt24\"><\/div>\n<\/div>\n<p>            <span class=\"d-none\" itemprop=\"commentCount\">4<\/span> <\/p><\/div>\n<\/div>\n<p>[ad_2]<\/p>\n<p>solved Extracting variables from Javascript inside HTML <\/p>\n","protected":false},"excerpt":{"rendered":"<p>[ad_1] You could use BeautifulSoup to extract the &lt;script&gt; tag, but you would still need an alternative approach to extract the information inside. Some Python can be used to first extract flashvars and then pass this to demjson to convert the Javascript dictionary into a Python one. For example: import demjson content = &#8220;&#8221;&#8221;&lt;script type=&#8221;text\/javascript&#8221;&gt;\/* &#8230; <a title=\"[Solved] Extracting variables from Javascript inside HTML\" class=\"read-more\" href=\"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/\" aria-label=\"More on [Solved] Extracting variables from Javascript inside HTML\">Read more<\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[320],"tags":[622,349,760],"class_list":["post-12976","post","type-post","status-publish","format-standard","hentry","category-solved","tag-beautifulsoup","tag-python","tag-web-scraping"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v26.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>[Solved] Extracting variables from Javascript inside HTML - JassWeb<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"[Solved] Extracting variables from Javascript inside HTML - JassWeb\" \/>\n<meta property=\"og:description\" content=\"[ad_1] You could use BeautifulSoup to extract the &lt;script&gt; tag, but you would still need an alternative approach to extract the information inside. Some Python can be used to first extract flashvars and then pass this to demjson to convert the Javascript dictionary into a Python one. For example: import demjson content = &quot;&quot;&quot;&lt;script type=&quot;text\/javascript&quot;&gt;\/* ... Read more\" \/>\n<meta property=\"og:url\" content=\"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/\" \/>\n<meta property=\"og:site_name\" content=\"JassWeb\" \/>\n<meta property=\"article:published_time\" content=\"2022-10-02T10:01:28+00:00\" \/>\n<meta name=\"author\" content=\"Kirat\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Kirat\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/\"},\"author\":{\"name\":\"Kirat\",\"@id\":\"https:\/\/jassweb.com\/solved\/#\/schema\/person\/65c9c7b7958150c0dc8371fa35dd7c31\"},\"headline\":\"[Solved] Extracting variables from Javascript inside HTML\",\"datePublished\":\"2022-10-02T10:01:28+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/\"},\"wordCount\":76,\"publisher\":{\"@id\":\"https:\/\/jassweb.com\/solved\/#organization\"},\"keywords\":[\"beautifulsoup\",\"python\",\"web-scraping\"],\"articleSection\":[\"Solved\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/\",\"url\":\"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/\",\"name\":\"[Solved] Extracting variables from Javascript inside HTML - JassWeb\",\"isPartOf\":{\"@id\":\"https:\/\/jassweb.com\/solved\/#website\"},\"datePublished\":\"2022-10-02T10:01:28+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/jassweb.com\/solved\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"[Solved] Extracting variables from Javascript inside HTML\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/jassweb.com\/solved\/#website\",\"url\":\"https:\/\/jassweb.com\/solved\/\",\"name\":\"JassWeb\",\"description\":\"Build High-quality Websites\",\"publisher\":{\"@id\":\"https:\/\/jassweb.com\/solved\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/jassweb.com\/solved\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/jassweb.com\/solved\/#organization\",\"name\":\"Jass Web\",\"url\":\"https:\/\/jassweb.com\/solved\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/jassweb.com\/solved\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/jassweb.com\/wp-content\/uploads\/2021\/02\/jass-website-logo-1.png\",\"contentUrl\":\"https:\/\/jassweb.com\/wp-content\/uploads\/2021\/02\/jass-website-logo-1.png\",\"width\":693,\"height\":132,\"caption\":\"Jass Web\"},\"image\":{\"@id\":\"https:\/\/jassweb.com\/solved\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/jassweb.com\/solved\/#\/schema\/person\/65c9c7b7958150c0dc8371fa35dd7c31\",\"name\":\"Kirat\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/jassweb.com\/solved\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/jassweb.com\/solved\/wp-content\/litespeed\/avatar\/1261af3c9451399fa1336d28b98ea3bb.jpg?ver=1775798750\",\"contentUrl\":\"https:\/\/jassweb.com\/solved\/wp-content\/litespeed\/avatar\/1261af3c9451399fa1336d28b98ea3bb.jpg?ver=1775798750\",\"caption\":\"Kirat\"},\"sameAs\":[\"http:\/\/jassweb.com\"],\"url\":\"https:\/\/jassweb.com\/solved\/author\/jaspritsinghghumangmail-com\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"[Solved] Extracting variables from Javascript inside HTML - JassWeb","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/","og_locale":"en_US","og_type":"article","og_title":"[Solved] Extracting variables from Javascript inside HTML - JassWeb","og_description":"[ad_1] You could use BeautifulSoup to extract the &lt;script&gt; tag, but you would still need an alternative approach to extract the information inside. Some Python can be used to first extract flashvars and then pass this to demjson to convert the Javascript dictionary into a Python one. For example: import demjson content = \"\"\"&lt;script type=\"text\/javascript\"&gt;\/* ... Read more","og_url":"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/","og_site_name":"JassWeb","article_published_time":"2022-10-02T10:01:28+00:00","author":"Kirat","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Kirat","Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/#article","isPartOf":{"@id":"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/"},"author":{"name":"Kirat","@id":"https:\/\/jassweb.com\/solved\/#\/schema\/person\/65c9c7b7958150c0dc8371fa35dd7c31"},"headline":"[Solved] Extracting variables from Javascript inside HTML","datePublished":"2022-10-02T10:01:28+00:00","mainEntityOfPage":{"@id":"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/"},"wordCount":76,"publisher":{"@id":"https:\/\/jassweb.com\/solved\/#organization"},"keywords":["beautifulsoup","python","web-scraping"],"articleSection":["Solved"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/","url":"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/","name":"[Solved] Extracting variables from Javascript inside HTML - JassWeb","isPartOf":{"@id":"https:\/\/jassweb.com\/solved\/#website"},"datePublished":"2022-10-02T10:01:28+00:00","breadcrumb":{"@id":"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/jassweb.com\/solved\/solved-extracting-variables-from-javascript-inside-html\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/jassweb.com\/solved\/"},{"@type":"ListItem","position":2,"name":"[Solved] Extracting variables from Javascript inside HTML"}]},{"@type":"WebSite","@id":"https:\/\/jassweb.com\/solved\/#website","url":"https:\/\/jassweb.com\/solved\/","name":"JassWeb","description":"Build High-quality Websites","publisher":{"@id":"https:\/\/jassweb.com\/solved\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/jassweb.com\/solved\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/jassweb.com\/solved\/#organization","name":"Jass Web","url":"https:\/\/jassweb.com\/solved\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/jassweb.com\/solved\/#\/schema\/logo\/image\/","url":"https:\/\/jassweb.com\/wp-content\/uploads\/2021\/02\/jass-website-logo-1.png","contentUrl":"https:\/\/jassweb.com\/wp-content\/uploads\/2021\/02\/jass-website-logo-1.png","width":693,"height":132,"caption":"Jass Web"},"image":{"@id":"https:\/\/jassweb.com\/solved\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/jassweb.com\/solved\/#\/schema\/person\/65c9c7b7958150c0dc8371fa35dd7c31","name":"Kirat","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/jassweb.com\/solved\/#\/schema\/person\/image\/","url":"https:\/\/jassweb.com\/solved\/wp-content\/litespeed\/avatar\/1261af3c9451399fa1336d28b98ea3bb.jpg?ver=1775798750","contentUrl":"https:\/\/jassweb.com\/solved\/wp-content\/litespeed\/avatar\/1261af3c9451399fa1336d28b98ea3bb.jpg?ver=1775798750","caption":"Kirat"},"sameAs":["http:\/\/jassweb.com"],"url":"https:\/\/jassweb.com\/solved\/author\/jaspritsinghghumangmail-com\/"}]}},"_links":{"self":[{"href":"https:\/\/jassweb.com\/solved\/wp-json\/wp\/v2\/posts\/12976","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/jassweb.com\/solved\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/jassweb.com\/solved\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/jassweb.com\/solved\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/jassweb.com\/solved\/wp-json\/wp\/v2\/comments?post=12976"}],"version-history":[{"count":0,"href":"https:\/\/jassweb.com\/solved\/wp-json\/wp\/v2\/posts\/12976\/revisions"}],"wp:attachment":[{"href":"https:\/\/jassweb.com\/solved\/wp-json\/wp\/v2\/media?parent=12976"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/jassweb.com\/solved\/wp-json\/wp\/v2\/categories?post=12976"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/jassweb.com\/solved\/wp-json\/wp\/v2\/tags?post=12976"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}