 |
Your forum announcement here! |
|
 |

01-11-2008, 10:29
|
 |
Senior Member
|
|
Join Date: Nov 2005
Location: Virtual Private Server
Posts: 2,033
|
|
Google Now Searches through Scanned PDF
Google’s search bot can now scrub through all PDF documents which were produced through scanning, using OCR technology it converts picture into words that can be searched and indexed. You need to just make PDF searchable through the OCR process which is the functionality of the Adobe Acrobat Professional.
[ Source; Googleblog ]
|

01-11-2008, 14:19
|
|
Junior Member
|
|
Join Date: Nov 2008
Posts: 23
|
|
Wow! this is a great announcement by Google that will definitely help Search Engine Optimization efforts across the board.
|

01-11-2008, 16:38
|
 |
Premium Member
|
|
Join Date: Apr 2007
Location: Manchester, United Kingdom
Posts: 6,494
|
|
Cool, very interesting. How many people have access to Acrobat Pro though? This could be a limit till its usefulness temporarily. At the moment the only people I know with Acrobat Pro are developers that have suites like CS3.
|

02-11-2008, 18:58
|
 |
Senior Member
|
|
Join Date: May 2008
Location: Swansea, Wales
Posts: 120
|
|
Although potentially a great addition to Google searches, I do wonder what on earth it will uncover when it comes to crawling government websites for whom document image processing (and publishing) is the way of the world
Now where did I put that robots.txt reference manual 
|

02-11-2008, 18:59
|
 |
Premium Member
|
|
Join Date: Mar 2007
Location: 127.0.0.1
Posts: 1,556
|
|
Quote:
Originally Posted by DPS Computing
Cool, very interesting. How many people have access to Acrobat Pro though? This could be a limit till its usefulness temporarily. At the moment the only people I know with Acrobat Pro are developers that have suites like CS3.
|
Free tools can be used that allow you to save Word documents as PDFs AFAIK.
__________________
Regards,
Josh Hold
eUKhost Blog: Over 1000 Computer Related Articles to Sink Your Teeth Into!
LDN GIGS - Gig Listings for London
Super Moderator
I'm only a forum gremlin (moderator), and do not work for eUKhost in any way. Opinions expressed by me are mine only, and do not reflect those of either eUKhost or any company that may be listed above.
I don't bite, honest.
|

02-11-2008, 22:32
|
 |
Premium Member
|
|
Join Date: Apr 2007
Location: Manchester, United Kingdom
Posts: 6,494
|
|
Quote:
Originally Posted by flesso
Free tools can be used that allow you to save Word documents as PDFs AFAIK.
|
Please share the names and / or locations  .
|

02-11-2008, 22:53
|
 |
Senior Member
|
|
Join Date: May 2008
Location: Swansea, Wales
Posts: 120
|
|
Personally I've always found Bullzip's PDF Printer to be very good and free
FREE PDF Printer
|

02-11-2008, 23:05
|
 |
Premium Member
|
|
Join Date: Apr 2007
Location: Manchester, United Kingdom
Posts: 6,494
|
|
Quote:
Originally Posted by ddwt
Personally I've always found Bullzip's PDF Printer to be very good and free
FREE PDF Printer
|
Thank you for that  .
|

03-11-2008, 09:27
|
 |
Senior Member
|
|
Join Date: Nov 2005
Location: Virtual Private Server
Posts: 2,033
|
|
Quote:
Originally Posted by ddwt
Although potentially a great addition to Google searches, I do wonder what on earth it will uncover when it comes to crawling government websites for whom document image processing (and publishing) is the way of the world
Now where did I put that robots.txt reference manual 
|
I hope Goolge might have partnership with various Government agencies to streamline government information and make them available for their users to access hard-to-find public information.
I think a basic robots.txt file is just enough however to get indexed the pdf documents throguhly you may need to specify different rules for different type of search.
Last edited by paul; 03-11-2008 at 10:34.
|

03-11-2008, 11:48
|
 |
Premium Member
|
|
Join Date: Apr 2007
Location: Manchester, United Kingdom
Posts: 6,494
|
|
Quote:
Originally Posted by paul
I hope Goolge might have partnership with various Government agencies to streamline government information and make them available for their users to access hard-to-find public information.
I think a basic robots.txt file is just enough however to get indexed the pdf documents throguhly you may need to specify different rules for different type of search.
|
Yes, just hopefully they won't cock up and start letting Google search all the private information that the government likes to "lose" these days!
Again, I hear another memory stick full of Government Gateway passwords has gone missing and been found again. I sincerly hope that mine wasn't on there or I may become sue-happy lol  .
|

03-11-2008, 12:01
|
 |
Senior Member
|
|
Join Date: May 2008
Location: Swansea, Wales
Posts: 120
|
|
Hmmm, I wonder if Google will also enable this 'functionality' on their Search Appliaces... now that REALLY would open a can of worms.
Just imagine all of those wannabe Google experts tweaking this and that on their big company/government agency search appliances - adds a whole new dimension to the phrase 'Freedom of Information'
Geez, I sound a right old merchant of doom don't I 
|

03-11-2008, 12:31
|
 |
Senior Member
|
|
Join Date: Nov 2005
Location: Virtual Private Server
Posts: 2,033
|
|
Depend on how Government agency balance the interests between the freedom of information and their right to privacy. However it is sure that Google is trying to create a "World Without Privacy", the privacy concerns will have potential risk factor, no doubt. 
|

03-11-2008, 12:56
|
 |
Premium Member
|
|
Join Date: Apr 2007
Location: Manchester, United Kingdom
Posts: 6,494
|
|
Quote:
Originally Posted by paul
Depend on how Government agency balance the interests between the freedom of information and their right to privacy. However it is sure that Google is trying to create a "World Without Privacy", the privacy concerns will have potential risk factor, no doubt. 
|
I bet Google wouldn't have the same policy if it was their trade secrets or their directors home addresses and phone numbers  .
|

03-11-2008, 15:14
|
 |
Premium Member
|
|
Join Date: Apr 2007
Location: Manchester, United Kingdom
Posts: 6,494
|
|
Quote:
Originally Posted by ddwt
|
Sounds like an interesting book. Am going to look to get that and have a read. Have already read the Google book about its begginings, history and rise to fame so the next logical step is to read a book how to abuse and dismantle it  .
|

03-11-2008, 15:21
|
 |
Senior Member
|
|
Join Date: May 2008
Location: Swansea, Wales
Posts: 120
|
|
Rumour has it that the book is available as a downloadable ebook "somewhere" and can be found by doing a search on.....yes you've guessed it Google 
|

03-11-2008, 18:00
|
 |
Premium Member
|
|
Join Date: Apr 2007
Location: Manchester, United Kingdom
Posts: 6,494
|
|
Quote:
Originally Posted by ddwt
Rumour has it that the book is available as a downloadable ebook "somewhere" and can be found by doing a search on.....yes you've guessed it Google 
|
Ah. *wink* | |