Web Grabber WordPress Plugin

This is a live demonstration post for WiseLoop Web Grabber WordPress Plugin.
Below, in the red squares, are shown various extracted contents from different web site pages grabbed by this amazing WordPress Plugin using the provided “webgrab” shortcode.

Example 1: Very simple grabbing of first paragraph from the WordPress website
Shortcode: [ webgrab url='http://wordpress.org/about/' tag='{p class="intro"}']

URL "http://wordpress.org/about/" does not exist, is not readable or is protected against scraping.
Check if your IP address "212.146.85.23" has access permission to this URL.
Headers received:
Array
(
[0] => HTTP/1.1 301 Moved Permanently
[1] => Server: nginx
[2] => Date: Fri, 26 Nov 2021 23:05:20 GMT
[3] => Content-Type: text/html
[4] => Content-Length: 162
[5] => Connection: close
[6] => Location: https://wordpress.org/about/
[7] =>
[8] => HTTP/2 200
[9] => server: nginx
[10] => date: Fri, 26 Nov 2021 23:05:21 GMT
[11] => content-type: text/html; charset=utf-8
[12] => vary: Accept-Encoding
[13] => strict-transport-security: max-age=360
[14] => link: ; rel="https://api.w.org/"
[15] => link: ; rel="alternate"; type="application/json"
[16] => link: ; rel=shortlink
[17] => x-olaf: ⛄
[18] => x-frame-options: SAMEORIGIN
[19] => x-nc: HIT ord 2
[20] => content-encoding: br
[21] =>
[22] =>
)

Example 2: Complex grabbing with tags removal, string replacements and tag instance filtering
Shortcode: [ webgrab url='http://www.freewebsitetemplates.com' tag='{ul}' contains='images/templates' rtag1='{div class="option"' srch1='/images/templates/' repl1='http://www.freewebsitetemplates.com/images/templates/' srch2='/images/ads/templates/' repl2='http://www.freewebsitetemplates.com/images/ads/templates/']

Example 3: Requested by HammyHavoc
Shortcodes:
Getting MySpace friends number: [ webgrab url='http://www.myspace.com/psihamster' tag='span class="toolbarCount"' cache='0']
Getting Last.Fm plays: [ webgrab url='http://www.last.fm/user/DominarHammy' tag='span class="count"' cache='0']

URL "http://www.myspace.com/psihamster" does not exist, is not readable or is protected against scraping.
Check if your IP address "212.146.85.23" has access permission to this URL.
Headers received:
Array
(
[0] => HTTP/1.1 301 Moved Permanently
[1] => Location: https://myspace.com/psihamster
[2] => Connection: close
[3] => Cache-Control: no-cache
[4] => Pragma: no-cache
[5] =>
[6] => HTTP/1.1 404 Not Found
[7] => Vary: Accept-Encoding
[8] => Set-Cookie: persistent_id=pid%3D429de7d6-e6d8-4837-bc7a-eba032454dad%26llid%3D%26lprid%3D%26lltime%3D; domain=.myspace.com; path=/; expires=Thu, 21 Nov 2041 23:05:22 GMT; httpOnly
[9] => Set-Cookie: visit_id=01946f58-7959-43f1-a838-9b383d56ac78; domain=.myspace.com; path=/; expires=Fri, 26 Nov 2021 23:35:22 GMT; httpOnly
[10] => Set-Cookie: beacons_enabled=true; domain=.myspace.com; path=/; expires=Fri, 26 Nov 2021 23:35:22 GMT
[11] => Set-Cookie: player=sequenceId%3D-1%26paused%3Dtrue%26currentTime%3D0%26volume%3D0.5%26mute%3Dfalse%26shuffled%3Dfalse%26repeat%3Doff%26mode%3Dqueue%26pinned%3Dfalse%26streamStartDateTime%3D%26at%3D360%26incognito%3Dfalse%26allowSkips%3Dtrue%26ccOn%3Dfalse; domain=.myspace.com; path=/; expires=Sun, 26 Dec 2021 23:05:22 GMT
[12] => X-TrackingId: abe936c0-b405-48ae-b15c-460e1cde07c1
[13] => Cache-Control: no-cache
[14] => Strict-Transport-Security: max-age=31536000
[15] => X-Frame-Options: SAMEORIGIN
[16] => Content-Security-Policy: frame-ancestors 'self'
[17] => Content-Type: text/html; charset=utf-8
[18] => X-Response-Time: 371ms
[19] => Content-Encoding: gzip
[20] => Date: Fri, 26 Nov 2021 23:05:22 GMT
[21] => Connection: keep-alive
[22] => Transfer-Encoding: chunked
[23] =>
[24] =>
)

Last.Fm plays:
URL "http://www.last.fm/user/DominarHammy" does not exist, is not readable or is protected against scraping.
Check if your IP address "212.146.85.23" has access permission to this URL.
Headers received:
Array
(
[0] => HTTP/1.1 301 Moved Permanently
[1] => Server: Varnish
[2] => Retry-After: 0
[3] => Location: https://www.last.fm/user/DominarHammy
[4] => Content-Length: 0
[5] => Accept-Ranges: bytes
[6] => Date: Fri, 26 Nov 2021 23:05:23 GMT
[7] => Via: 1.1 varnish
[8] => Connection: close
[9] => X-Served-By: cache-vie6323-VIE
[10] => X-Cache: HIT
[11] => X-Cache-Hits: 0
[12] => X-Timer: S1637967923.402028,VS0,VE0
[13] => Strict-Transport-Security: max-age=300
[14] =>
[15] => HTTP/2 200
[16] => server: nginx
[17] => content-type: text/html; charset=utf-8
[18] => content-security-policy: upgrade-insecure-requests;
[19] => content-security-policy-report-only: default-src https: 'unsafe-inline' 'unsafe-eval' wss: ;img-src https: data: blob: ; font-src https: data:; form-action https: http://www.last.fm; report-uri https://cbsi.report-uri.io/r/default/csp/enforce
[20] => x-pjax-url: https://www.last.fm/user/DominarHammy
[21] => etag: W/"627f596d40cb14b0635e4ef4a63d147a"
[22] => x-frame-options: SAMEORIGIN
[23] => content-language: en
[24] => set-cookie: lfmanon=1; Path=/
[25] => set-cookie: not_first_visit=1; Path=/
[26] => set-cookie: sessionid=eyJfYXV0aF91c2VyX2hhc2giOiJkZWZhdWx0Iiwic2Vzc2lvbl9pZCI6IjdhMDBlMGJlLWVlYmUtNGM5Yy05OTI5LTJiOTA1NDU1YTc5YiJ9:1mqkHU:oNM4l-Ah-aWycPPNT2NX3jdKKy0; Domain=.last.fm; expires=Sat, 26-Nov-2022 23:05:24 GMT; HttpOnly; Max-Age=31536000; Path=/; Secure
[27] => content-encoding: gzip
[28] => via: 1.1 google, 1.1 varnish
[29] => accept-ranges: bytes
[30] => date: Fri, 26 Nov 2021 23:05:24 GMT
[31] => x-served-by: prod-lfm-web-57876fcc7b-fsq6k, cache-vie6364-VIE
[32] => x-cache: MISS
[33] => x-cache-hits: 0
[34] => x-timer: S1637967923.485477,VS0,VE1143
[35] => vary: Accept-Encoding, Accept-Language, Cookie
[36] => strict-transport-security: max-age=300
[37] =>
[38] =>
)

Example 4: Requested by danialhr
Shortcode: [ webgrab url='http://www.raymondphang.com/blog/2011/kok-keong-charmaine-actual-day-wedding' tag='{div id="post-' cache='0']

URL "http://www.raymondphang.com/blog/2011/kok-keong-charmaine-actual-day-wedding" does not exist, is not readable or is protected against scraping.
Check if your IP address "212.146.85.23" has access permission to this URL.
Headers received:
Array
(
[0] => HTTP/1.1 500 Internal Server Error
[1] => Date: Fri, 26 Nov 2021 23:08:45 GMT
[2] => Server: Apache
[3] => Expires: Wed, 11 Jan 1984 05:00:00 GMT
[4] => Cache-Control: no-cache, must-revalidate, max-age=0
[5] => Upgrade: h2
[6] => Connection: Upgrade, close
[7] => Vary: Accept-Encoding
[8] => Content-Encoding: gzip
[9] => X-Endurance-Cache-Level: 2
[10] => Content-Length: 1233
[11] => Content-Type: text/html; charset=UTF-8
[12] =>
[13] =>
)

Leave a Reply