When is a photocopy not a photocopy?

footflaps

Posts: 13594

Free Member

Topic starter

http://www.theverge.com/2013/8/6/4594482/xerox-copiers-randomly-replacing-numbers-in-documents

Posted : 07/08/2013 9:31 am

CountZero

Posts: 33873

Full Member

It's really taken somebody all this time to notice that photocopiers distort characters, especially at small sizes, causing certain figures and letters to look like different ones?
I was aware of that thirty years ago... 🙄

Posted : 07/08/2013 10:39 am

bencooper

Posts: 0

Free Member

It's more than distortion, it seems that it's actually doing OCR on the image and then reconstituting it.

Posted : 07/08/2013 10:55 am

footflaps

Posts: 13594

Free Member

Topic starter

[i]It's more than distortion, it seems that it's actually doing OCR on the image and then reconstituting it.[/i]

They're all scanners and printers now, with a processor in between, hence data gets corrupted in the lossy process between the two ends.

Posted : 07/08/2013 10:58 am

DezB

Posts: 54367

Free Member

[i]When is a photocopy not a photocopy?[/i]

When it's taken on an iBum chair?

Posted : 07/08/2013 11:00 am

theroadwarrior

Posts: 0

Free Member

Welcome to the digital world.

The only people surprised by this are really failing to understand exactly what is going on inside a scanner - it's not OCR'ing and then magically substituting characters, it's simply a compression artefact.

Granted it's rather an unfortunate one - one that should probably be avoided by Xerox choosing an appropriate amount of compression in the software!

Posted : 07/08/2013 11:12 am

plyphon

Posts: 0

Free Member

[i]Numbers were randomly being altered, with 6 and 8 proving especially susceptible.[/i]

That's not random then, is it?

Posted : 07/08/2013 11:27 am

retro83

Posts: 621

Free Member

theroadwarrior - Member

[i]Welcome to the digital world.[/i]

[i]The only people surprised by this are really failing to understand exactly what is going on inside a scanner - it's not OCR'ing and then magically substituting characters, it's simply a compression artefact.[/i]

[i]Granted it's rather an unfortunate one - one that should probably be avoided by Xerox choosing an appropriate amount of compression in the software![/i]

It's a strange artifact though. I reckon they are doing pattern matching: the scanner effectively learns the font as it goes through the document, replacing subsequent uses of the same(!) letters with just a pointer back to the first instance.

So in a way, it's closer to deduping than to a traditional lossy compression algorithm like JPEG. Very clever, but obviously needs more tuning!
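
To make the pointer idea concrete, here's a toy sketch in Python. It's only a guess at the general technique, not Xerox's firmware or any real file format: the 3x5 glyph bitmaps, the `max_diff` threshold and the `hamming`/`encode`/`decode` names are all made up for illustration.

```python
# Toy symbol-dictionary compression. Hypothetical sketch only:
# glyphs, threshold and function names are illustrative assumptions.

def hamming(a, b):
    """Count the pixels that differ between two same-sized glyph bitmaps."""
    return sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))

def encode(glyphs, max_diff):
    """Store each distinct glyph once; later near-matches become pointers."""
    dictionary = []  # glyph bitmaps actually stored
    indices = []     # one pointer into the dictionary per glyph on the page
    for glyph in glyphs:
        for i, known in enumerate(dictionary):
            if hamming(glyph, known) <= max_diff:
                indices.append(i)  # "close enough": reuse the earlier glyph
                break
        else:
            dictionary.append(glyph)  # no match: store a new symbol
            indices.append(len(dictionary) - 1)
    return dictionary, indices

def decode(dictionary, indices):
    """Rebuild the page by following each pointer."""
    return [dictionary[i] for i in indices]

EIGHT = ("###",
         "#.#",
         "###",
         "#.#",
         "###")
SIX = ("###",
       "#..",
       "###",
       "#.#",
       "###")  # differs from EIGHT by a single pixel

page = [EIGHT, SIX, EIGHT]

strict = decode(*encode(page, max_diff=0))  # exact matches only
loose = decode(*encode(page, max_diff=1))   # tolerate one differing pixel

print(encode(page, max_diff=0)[1])  # [0, 1, 0]
print(encode(page, max_diff=1)[1])  # [0, 0, 0]
print(loose[1] == EIGHT)            # True: the 6 came back as a crisp 8
```

With `max_diff=0` the page round-trips exactly. With `max_diff=1` the 6 is stored as a pointer to the earlier 8, so it comes back as a perfectly sharp 8 rather than a smudged 6.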

Posted : 07/08/2013 11:34 am

joemarshall

Posts: 0

Free Member

[i]The only people surprised by this are really failing to understand exactly what is going on inside a scanner - it's not OCR'ing and then magically substituting characters, it's simply a compression artefact.[/i]

No, that's not right. The compression format involved actually does OCR to find characters, so if the OCR confuses similar characters like 6 and 8, a completely wrong character gets put in. It is a symbol-based compression format, nothing like what is traditionally thought of as a compression artifact.

http://en.wikipedia.org/wiki/JBIG2

Posted : 07/08/2013 11:38 am

joemarshall

Posts: 0

Free Member

The reason it is worse than blurred figures and traditional compression artefacts is that an error is not obviously blurred and hard to read; it comes out as a perfectly rendered but wrong symbol.

Posted : 07/08/2013 11:42 am

bencooper

Posts: 0

Free Member

[img] [/img]

Yup, see the 3rd line - that's not a blurry 6, that's an 8. That's not a traditional compression artifact, it's an OCR error.

Posted : 07/08/2013 12:21 pm

5thElefant

Posts: 151

Free Member

No good for photocopying bank notes then...

Posted : 07/08/2013 12:26 pm

bencooper

Posts: 0

Free Member

I've never had much luck with my fake £6 notes anyway...

Posted : 07/08/2013 12:29 pm

woody2000

Posts: 8177

Free Member

My god, what if someone photocopies an Excel spreadsheet......!

Posted : 07/08/2013 12:39 pm

midlifecrashes

Posts: 8396

Full Member

[i]My god, what if someone photocopies an Excel spreadsheet......![/i]

Don't be silly, you can never get the laptop screen to sit flat enough on the copier.

Posted : 07/08/2013 2:30 pm

thomthumb

Posts: 1014

Free Member

[i]I've never had much luck with my fake £6 notes anyway...[/i]

You'd do better to copy them into £8 notes 😉

Posted : 07/08/2013 2:37 pm

[Closed] When is a photocopy not a photocopy?