<?xml version="1.0" encoding="iso-8859-1"?>
<rss version="2.0">
<channel>
  <title>Linux Format forums</title>
  <link>http://linuxformat.com/forums/index.php</link>
  <description>Help, discussion, magazine feedback and more</description>
  <language>en</language>
  <copyright>(c) Copyright Thu May 23, 2013 11:45 pm by Linux Format forums</copyright>
  <managingEditor>webmaster@linuxformat.com</managingEditor>
  <webMaster>webmaster@linuxformat.com</webMaster>
  <pubDate>Thu May 23, 2013 11:45 pm</pubDate>
  <lastBuildDate>Thu May 23, 2013 11:45 pm</lastBuildDate>
  <docs>http://backend.userland.com/rss</docs>
  <generator>phpBB2 RSS Syndication Mod by Lucas</generator>
  <ttl>1</ttl>

  <image>
    <title>Linux Format forums</title>
    <url></url>
    <link>http://linuxformat.com/forums/</link>
    <description>Help, discussion, magazine feedback and more</description>
  </image>

                                      <item>
                                        <title>Re: De-internet-archive-scripting webpages</title>
                                        <link>http://linuxformat.com/forums/viewtopic.php?p=106489#106489</link>
                                        <description>&lt;br /&gt;
                                      Author: &lt;a href='http://linuxformat.com/forums/profile.php?mode=viewprofile&amp;u=14187'&gt;Dutch_Master&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;
                                      Posted: Thu Jun 21, 2012 3:07 pm&lt;br /&gt;&lt;br /&gt;
                                      &lt;br /&gt;&lt;br /&gt;
                                      And again I find a familiar question, in LXF159 this time &lt;img src=&quot;images/smiles/icon_biggrin.gif&quot; alt=&quot;Very Happy&quot; border=&quot;0&quot; /&gt; Thx guys!&lt;br /&gt;
(haven't pursued it yet, I had some RSI trouble with my wrist back then... &lt;img src=&quot;images/smiles/icon_sad.gif&quot; alt=&quot;Sad&quot; border=&quot;0&quot; /&gt;)</description>
                                        <comments>http://linuxformat.com/forums/viewtopic.php?p=106489#106489</comments>
                                        <author>Dutch_Master</author>
                                        <pubDate>Thu Jun 21, 2012 3:07 pm</pubDate>
                                        <guid isPermaLink="true">http://linuxformat.com/forums/viewtopic.php?p=106489#106489</guid>
                                      </item>
                                      <item>
                                        <title>Re: De-internet-archive-scripting webpages</title>
                                        <link>http://linuxformat.com/forums/viewtopic.php?p=105081#105081</link>
                                        <description>&lt;br /&gt;
                                      Author: &lt;a href='http://linuxformat.com/forums/profile.php?mode=viewprofile&amp;u=14187'&gt;Dutch_Master&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;
                                      Posted: Mon Mar 26, 2012 2:06 pm&lt;br /&gt;&lt;br /&gt;
                                      &lt;br /&gt;&lt;br /&gt;
                                      Thanks Nelz, I'll give it a try later. &lt;img src=&quot;images/smiles/icon_smile.gif&quot; alt=&quot;Smile&quot; border=&quot;0&quot; /&gt;</description>
                                        <comments>http://linuxformat.com/forums/viewtopic.php?p=105081#105081</comments>
                                        <author>Dutch_Master</author>
                                        <pubDate>Mon Mar 26, 2012 2:06 pm</pubDate>
                                        <guid isPermaLink="true">http://linuxformat.com/forums/viewtopic.php?p=105081#105081</guid>
                                      </item>
                                      <item>
                                        <title>Re: De-internet-archive-scripting webpages</title>
                                        <link>http://linuxformat.com/forums/viewtopic.php?p=105079#105079</link>
                                        <description>&lt;br /&gt;
                                      Author: &lt;a href='http://linuxformat.com/forums/profile.php?mode=viewprofile&amp;u=5'&gt;nelz&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;
                                      Posted: Mon Mar 26, 2012 10:34 am&lt;br /&gt;&lt;br /&gt;
                                      &lt;br /&gt;&lt;br /&gt;
                                      My bad, I was working from failing memory: it is x,yd, not x-yd, which means something completely different.&lt;br /&gt;
&lt;br /&gt;
You can also give an extension to -i and sed will create backups of the original files with that extension. This will remove the toolbar stuff and save a backup:&lt;br /&gt;
&lt;br /&gt;
&lt;/span&gt;&lt;table width=&quot;90%&quot; cellspacing=&quot;1&quot; cellpadding=&quot;3&quot; border=&quot;0&quot; align=&quot;center&quot;&gt;&lt;tr&gt; 	  &lt;td&gt;&lt;span class=&quot;genmed&quot;&gt;&lt;b&gt;Code:&lt;/b&gt;&lt;/span&gt;&lt;/td&gt;	&lt;/tr&gt;	&lt;tr&gt;	  &lt;td class=&quot;code&quot;&gt;sed -i.bak /BEGIN\ WAYBACK\ TOOLBAR/,/END\ WAYBACK\ TOOLBAR/d Start\ page.html&lt;/td&gt;	&lt;/tr&gt;&lt;/table&gt;&lt;span class=&quot;postbody&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
You can change the links to relative with something like&lt;br /&gt;
&lt;br /&gt;
&lt;/span&gt;&lt;table width=&quot;90%&quot; cellspacing=&quot;1&quot; cellpadding=&quot;3&quot; border=&quot;0&quot; align=&quot;center&quot;&gt;&lt;tr&gt; 	  &lt;td&gt;&lt;span class=&quot;genmed&quot;&gt;&lt;b&gt;Code:&lt;/b&gt;&lt;/span&gt;&lt;/td&gt;	&lt;/tr&gt;	&lt;tr&gt;	  &lt;td class=&quot;code&quot;&gt;sed 's/http&amp;#58;\/\/web.archive.org\/web\/20041103050546\/http&amp;#58;\/\/web.utanet.at\/smiderkr\/asr\///g'&lt;/td&gt;	&lt;/tr&gt;&lt;/table&gt;&lt;span class=&quot;postbody&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
on all files in the same directory, but it gets messy if the pages are stored in multiple subdirectories.</description>
                                        <comments>http://linuxformat.com/forums/viewtopic.php?p=105079#105079</comments>
                                        <author>nelz</author>
                                        <pubDate>Mon Mar 26, 2012 10:34 am</pubDate>
                                        <guid isPermaLink="true">http://linuxformat.com/forums/viewtopic.php?p=105079#105079</guid>
                                      </item>
                                      <item>
                                        <title>Re: De-internet-archive-scripting webpages</title>
                                        <link>http://linuxformat.com/forums/viewtopic.php?p=105074#105074</link>
                                        <description>&lt;br /&gt;
                                      Author: &lt;a href='http://linuxformat.com/forums/profile.php?mode=viewprofile&amp;u=14187'&gt;Dutch_Master&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;
                                      Posted: Mon Mar 26, 2012 2:32 am&lt;br /&gt;&lt;br /&gt;
                                      &lt;br /&gt;&lt;br /&gt;
                                      The problem is that although the archive script puts a lot of files in the same place, it also hard-codes every link in a page as an absolute link (including the http:// prefix). On top of that, I introduced some issues of my own when storing the files, which only became apparent when I opened the html code.... Right now, I think your script will remove the bulk of the added code and I'd have to manually edit the hard-coded links back to relative links. Thanks again Nelz!&lt;br /&gt;
&lt;br /&gt;
[edit: cried victory too soon, after replacing the x and y with numerical values I got the error &lt;/span&gt;&lt;table width=&quot;90%&quot; cellspacing=&quot;1&quot; cellpadding=&quot;3&quot; border=&quot;0&quot; align=&quot;center&quot;&gt;&lt;tr&gt; 	  &lt;td&gt;&lt;span class=&quot;genmed&quot;&gt;&lt;b&gt;Code:&lt;/b&gt;&lt;/span&gt;&lt;/td&gt;	&lt;/tr&gt;	&lt;tr&gt;	  &lt;td class=&quot;code&quot;&gt;sed&amp;#58; -e expression #1, char 3&amp;#58; unknown command&amp;#58; `-'&lt;/td&gt;	&lt;/tr&gt;&lt;/table&gt;&lt;span class=&quot;postbody&quot;&gt; I've got as far as &lt;/span&gt;&lt;table width=&quot;90%&quot; cellspacing=&quot;1&quot; cellpadding=&quot;3&quot; border=&quot;0&quot; align=&quot;center&quot;&gt;&lt;tr&gt; 	  &lt;td&gt;&lt;span class=&quot;genmed&quot;&gt;&lt;b&gt;Code:&lt;/b&gt;&lt;/span&gt;&lt;/td&gt;	&lt;/tr&gt;	&lt;tr&gt;	  &lt;td class=&quot;code&quot;&gt;for i in *.html; do sed i\ &amp;#123;14-216&amp;#125;d $i; done&lt;/td&gt;	&lt;/tr&gt;&lt;/table&gt;&lt;span class=&quot;postbody&quot;&gt; This clears the files completely. Luckily I got a backup.... &lt;img src=&quot;images/smiles/icon_wink.gif&quot; alt=&quot;Wink&quot; border=&quot;0&quot; /&gt;]&lt;br /&gt;
&lt;br /&gt;
[edit2: here's a simple &lt;a href=&quot;http://web.archive.org/web/20041103050546/http://web.utanet.at/smiderkr/asr/enidx.html&quot; rel=&quot;nofollow&quot; target=&quot;_blank&quot; class=&quot;postlink&quot;&gt;sample&lt;/a&gt;]</description>
                                        <comments>http://linuxformat.com/forums/viewtopic.php?p=105074#105074</comments>
                                        <author>Dutch_Master</author>
                                        <pubDate>Mon Mar 26, 2012 2:32 am</pubDate>
                                        <guid isPermaLink="true">http://linuxformat.com/forums/viewtopic.php?p=105074#105074</guid>
                                      </item>
                                      <item>
                                        <title>Re: De-internet-archive-scripting webpages</title>
                                        <link>http://linuxformat.com/forums/viewtopic.php?p=105070#105070</link>
                                        <description>&lt;br /&gt;
                                      Author: &lt;a href='http://linuxformat.com/forums/profile.php?mode=viewprofile&amp;u=5'&gt;nelz&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;
                                      Posted: Mon Mar 26, 2012 1:55 am&lt;br /&gt;&lt;br /&gt;
                                      &lt;br /&gt;&lt;br /&gt;
                                      If the code is in the same place in each file, you can remove a range of lines with&lt;br /&gt;
&lt;br /&gt;
&lt;/span&gt;&lt;table width=&quot;90%&quot; cellspacing=&quot;1&quot; cellpadding=&quot;3&quot; border=&quot;0&quot; align=&quot;center&quot;&gt;&lt;tr&gt; 	  &lt;td&gt;&lt;span class=&quot;genmed&quot;&gt;&lt;b&gt;Code:&lt;/b&gt;&lt;/span&gt;&lt;/td&gt;	&lt;/tr&gt;	&lt;tr&gt;	  &lt;td class=&quot;code&quot;&gt;for i in *.html; do&lt;br /&gt;
&amp;nbsp; sed -i x-yd $i&lt;br /&gt;
done&lt;/td&gt;	&lt;/tr&gt;&lt;/table&gt;&lt;span class=&quot;postbody&quot;&gt;&lt;br /&gt;
&lt;br /&gt;
Where x and y are the first and last lines of the script. Otherwise we'd need to see an example to see how to identify the lines to be deleted.</description>
                                        <comments>http://linuxformat.com/forums/viewtopic.php?p=105070#105070</comments>
                                        <author>nelz</author>
                                        <pubDate>Mon Mar 26, 2012 1:55 am</pubDate>
                                        <guid isPermaLink="true">http://linuxformat.com/forums/viewtopic.php?p=105070#105070</guid>
                                      </item>
                                      <item>
                                        <title>De-internet-archive-scripting webpages</title>
                                        <link>http://linuxformat.com/forums/viewtopic.php?p=105064#105064</link>
                                        <description>&lt;br /&gt;
                                      Author: &lt;a href='http://linuxformat.com/forums/profile.php?mode=viewprofile&amp;u=14187'&gt;Dutch_Master&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;
                                      Posted: Sun Mar 25, 2012 10:18 pm&lt;br /&gt;&lt;br /&gt;
                                      &lt;br /&gt;&lt;br /&gt;
                                      Some time ago I downloaded a website from the internet archive pages as it turned out the owner had discontinued it. The archive inserts some ***** scripting that I greatly dislike, and it keeps on linking to the archive. I could remove all instances by hand in the html code, but with 100+ pages, I'd think there's a better (quicker!) solution. I assume sed or awk is required, but knowing nought about either, what's the best oneliner (or script, that's fine) that gets me running? (s'cuse the pun &lt;img src=&quot;images/smiles/icon_wink.gif&quot; alt=&quot;Wink&quot; border=&quot;0&quot; /&gt;)</description>
                                        <comments>http://linuxformat.com/forums/viewtopic.php?p=105064#105064</comments>
                                        <author>Dutch_Master</author>
                                        <pubDate>Sun Mar 25, 2012 10:18 pm</pubDate>
                                        <guid isPermaLink="true">http://linuxformat.com/forums/viewtopic.php?p=105064#105064</guid>
                                      </item></channel></rss>