Copy-protected PDF File

April 23, 2006 at 4:15 pm | In 電腦應用 (Computer Applications)

Sometimes one wants to extract text from a PDF file for further editing. This cannot be done directly if the PDF file is protected against copying or extracting. There are, however, two ways to bypass this restriction:

(1) send the PDF file as an attachment to one’s Gmail account; Gmail provides an option to view the file as HTML, and the text can be extracted from there (I learned this trick from here);

(2) open the PDF file with Evince (http://www.gnome.org/projects/evince), a document viewer that runs on Linux.

Both methods work for PDF files containing Chinese text. Print restrictions can also be bypassed this way. Neither method works, however, for PDF files that need a password to open.

Updated on 20 May 2006
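To see what a copy or print restriction actually is, it may help to look at how it is stored. As far as I understand the PDF specification, these permissions are bit flags in the /P entry of the file's encryption dictionary, and it is up to the viewing program to honor them. The Python sketch below decodes such a value; the function name decode_permissions and the sample value -3904 are my own illustrations and are not taken from any particular file or tool.

```python
# Permission bits of the /P entry (bit positions are 1-based, as in the PDF spec).
PERMISSION_BITS = {
    "print": 3,
    "modify": 4,
    "copy_or_extract": 5,
    "annotate": 6,
    "fill_forms": 9,
    "extract_for_accessibility": 10,
    "assemble": 11,
    "print_high_quality": 12,
}


def decode_permissions(p: int) -> dict:
    """Map a /P value (a signed 32-bit integer) to named permission flags."""
    unsigned = p & 0xFFFFFFFF  # reinterpret the signed value as 32 raw bits
    return {
        name: bool(unsigned & (1 << (bit - 1)))
        for name, bit in PERMISSION_BITS.items()
    }


if __name__ == "__main__":
    # -3904 is a hypothetical /P value with every listed permission cleared.
    for name, allowed in decode_permissions(-3904).items():
        print(f"{name}: {'allowed' if allowed else 'denied'}")
```

A file that needs a password to open is a different case: its contents are encrypted with a key derived from that password, which fits the observation above that such files cannot be handled by either method.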