IT Jobs

Did you know? Techworld now offers an IT Jobs section with hundreds of jobs! Current job listings are now available for Software Developers, Web Developers, Application Engineers, Project Managers, Graduate opportunities and more. Apply for your new IT job today!

Can you get 256GB on an A4 sheet? No way!

Bold claim torn to shreds.

Is it really possible to store 256GB of data on a plain A4 sheet of paper?

Yes, according to Sainul Abideen, an engineering student in India who claims his Rainbow format encoding can do it by turning the basic data into coloured geometric shapes. Abideen stated that he could store 2.7GB of data in a square inch, and up to 450GB on a large sheet of paper - which he was then photographed with.

Rainbow technology claims debunked

But following huge interest in the story, Mr Abideen's claims have come under determined scrutiny across the Internet, with a broad consensus that it is in fact impossible to store that amount of information with the apparatus he has outlined. One expert has called the claims "the storage equivalent of perpetual motion".

The feasibility of the entire method boils down to the encoding system used and the technology behind a special scanner that Abideen has demonstrated. However, Abideen has so far refused to divulge any details of either, adding to the general mood of scepticism.

Techworld has spoken twice with Mr Abideen and sent a lengthy email asking questions about his claimed technology but we are, as yet, unsatisfied with the response. We're not the only ones.

Expert opinion

Regarding the printing and scanning technology, Alex Young, director of technical marketing at RAID specialist Infortrend Europe said: "With today's laser printer technology (600 dpi or 1200 dpi), it will be hard to achieve this. Also from today's scanner technology, the scanner has to be very precise and free of dust. From the colour-matching/correction software technology, the colours have to be precise or the data might be misinterpreted."

A storage expert, Robin Harris, agreed: "Mr. Abideen has misplaced a few decimal places. Modern offset presses used to print magazines, which he suggests as a medium operate at about 300 dpi, or about 8.7 million dots on an A4 sheet, or 35 million dots with four color printing. There is no lossless way to compress 256 GB (or 2 trillion bits) of data into 35 million dots, or even 35 billion dots. It is the storage equivalent of perpetual motion."

Large numbers of Net users have also raised doubts.

An article focuses on scanning/printing technology and compression difficulties of the method outlined. Daily Tech forum commentators list a large number of problems and issues they see with the reported claims. So too do Digg, Slashdot and ITsoup.

The postings point to three types of problem overall:

  1. If the stored data is simply represented as bits on the page then there aren't enough reliably detectable bits, using current scanning and printing technology, to do the job.
  2. If there is some new method of encoding information then the computational tasks associated with encoding it and then reading it are potentially immense.
  3. Paper as a medium distorts, is not level, changes its shape in response to heat and humidity, and can be folded which decreases information clarity in the fold area. As such it is a poor medium for recording such dense amounts of information.

Is the original report suspect?

There is some confusion about the people mentioned in the original Arab News story. The MES College of Engineering (MESCE), Kuttippuram, exists as and is listed on a Kerala government website. It also has its own website. However a Professor Hyderali is not listed as a member of the MCA (Master of Computer Application) faculty. A Professor Sainul Abdeen though is listed as chief warden at MESCE.

A Mr. K Hyderali is listed as a lecturer in the MCA faculty at MESCE. The same listing includes Professor Sainul Abideen in an MCA adjunct faculty status. However a 2003 student admission listing includes a Sainul Abideen.

The Arab News report, and a near-identical Deccan Herald report by the same writer, state Sainul Abideen has just gained his MCA qualification and is aged 24. This seems young for a professor. So, if taken literally, there are two Sainul Abideens but no Professor Hyderali. The most likely explanation is journalistic inaccuracy, which also raises doubts about the story overall.

FAQ: The issues and problems

We have indentified no less than 14 queries over the feasibility of the Rainbow technology. Here they are with brief explanations:

1. Scanning To scan the Rainbow-encoded image would require a scanner to be able to scan 256GB-worth of data. A 1200dpi scanner might pick out 1,440,00 dots per square inch. That computes to 18,000 bytes, and 18KB per square inch is a long way short of 2.7GB per square inch. Assuming the scanner is perfectly calibrated and that the paper is positioned correctly in the scanner then we might say it could pick out one of 256 colours per dot. That leads to 4.608MB per square inch - still a long way shy of the claimed figure.

2. Compression In the Rainbow scheme, geometric shapes as well as colours are used to represent the information. It's asserted that this is a form of compression scheme and that it can't be better than existing compression methods otherwise it would be in use already. Compressing data into a Zip archive could increase the storage capacity but only by a factor of two or three and that depends upon the data type. You can only compress so far before losing information.

3. Data capacity and number of bits It's asserted that there is no way that storing data as coloured shapes can be any better (at increasing storage capacity) than storing coloured bits. A shape on paper is made of coloured bits. A bit is a bit and has a storage capacity of 2 to the N where N is the number of bits. Eight bits, a byte, have 256 possible permutations.

We can think of a byte as a code for a geometric shape, like a square or a triangle, but the byte still only contains 8 bits, meaning 256 possible values, and that is not enough for the Rainbow format claims. So the claims would appear to be impossible.

4. A dot equaling a byte still doesn't provide enough capacity. Let's suppose a dot could have 256 values through its presence/absence and colour. Then are there enough dots per square inch to provide the claimed capacity? No. Assuming a 10x10 inch piece of paper and a 2400dpi scanner then the scanned information amount is:-

((((1.200x10=12.000)X(1.200x10=12.000)=144.000.00 0)/1024=140.625KB)/1024=137MB)

It's not enough.

5. Real life Scanning density isn't good enough. Hard disks can have unimaginably high bit per square inch numbers because disk and read/write head are fantastically accurately aligned. Having paper, a medium that changes shape according to temperature and humidity fed into a scanner, meaning its position is variable, will make it impossible for a scanner to retrieve information at the dpi measure required.

Also you have to allow for error-correction which decreases the theoretical scanning density to a lower, real life, level. Detecting differences between a 12-sided and a thirteen-sided polygon could be very difficult. Also the difficulty increases with the number of sides. Ditto the increasing wavelength of colours.

6. Using symbols does not increase the storage capacity of the medium. The amount of information that can be read off the paper is measured in bits. Just because a post-scanning algorithm finds out that the bits represent colours, hex numbers, geometric shapes or Chinese characters does not increase or decrease the basic amount of information, which is limited by what can be printed and what can be read by a scanner. Using symbols divides the bit-level information into chunks. It doesn't increase the amount of bit-level information.

A shape that is made of ten pixels (bits) contains no more information than the ten pixels.

7. Depth You could greatly increase the storage capacity if you stored data in 3D, i.e. by using the depth of the media to store more information, as in holographic storage. But the Rainbow format makes no claims to this effect. We are left to assume that only the storage medium's surface is used and that is 2D.

8. Printer and scanner calibration issue To reliably and accurately scan one of 256 colours would need the originating printer and the reading scanner set to use exactly the same colour wavelengths. This is a practical impossibility as the number of used wavelengths (colours) increase. When combined with the response of paper and ink to temperature and humidity changes this becomes even more of a problem.

9. More on scanner alignment To accurately position the scanner's read head you would need tracking information, meaning bits, added to the paper. (This is similar to track positioning data on magnetic tape.) The space taken up by these would reduce the storage capacity.

10. Colour printing doesn't add much more capacity A colour printer prints, at most, in for primary colours: cyan, magenta, yellow and black in the CMYK scheme. It can print many more recognisable colours but only by partially overlapping dots to additively form a colour from these four components. The primary colour dots overlap so that the area of the page needed for one apparent dot of the desired colour is many times greater than a dot of a primary colour. This decreases the bit dpi rate on the page. In other words the new colour is like a chunk of the original bits.

11. If there is a new way of encoding information so that you could store 2.7GB/sq in then you could use to store much more information on CDs and DVDs too. With the storage market worth billions of dollars a year, it is highly unlikely that one Indian student has uncovered something not considered by thousands of storage specialists.

12. The encoding method We know numbering schemes can increase the information content of a string of bits. For example, hexadecimal numbering has more information in a single number than a binary number scheme. Until we know what the Rainbow format encoding scheme involves and what the 'scanner' actually consists of and does we can't ascribe believability to the Rain technology claims.

If it's a form of hash encoding then there is a huge computational problem in reversing a hash to get the original information back, raising questions about any practical application.

13. The alphabet problem Hexadecimal numbering works because the reading device "understands" hexadecimal. Suppose you could use coloured and shape-grouped bits to store more information, you would then need to "understand" it. If every pixel represented a 32-bit colour then its value is 2 to the power 32. A contributor to Daily Tech calculated that you could have a 4096x4096 grid using pixels of 1-32 colours and so arrive at 6MB of data. Two such "super bits" could represent 16GB (16 trillion) pieces of information but ... you have invent an alphabet with 16 trillion letters and map that to a binary alphabet. This is not a trivial computational problem.

14. Paper problems Paper distorts and inks fade so the long term storage potential is strictly limited. Paper also burns and can get torn which also restricts the method's usability. Paper can be folded which would distort the represented information in the area of the fold.


So, in summary, the claims made for this Rainbow technology are almost certainly incorrect. There is no pot of gold at the end of it, and we'll just have to stick with hard disk drives for the moment. Although new holographic technologies are beginning to offer new, huge capacities, and advances in Flash memory technology mean that increasingly large amounts of data can be stored in smaller and smaller areas.

But 256GB on an A4 sheet? No way!


What are your views on this subject? Use the form below to post a comment on this article up to 500 characters.


Characters remaining: 500

Add your commentComments

Colby | Published: 05:57 GMT, 28 April 2008

The claims are conceivable in my imagination, but PAPER is not a sufficient medium for storage. It already holds thousands of bits of information in BOOKS, and we should just leave it that way. BTW, you could calibrate a machine to etch a wavelength of bytes into a medium, but we're just reinventing the wheel here.

Bryan B | Published: 17:43 GMT, 08 January 2008

What's with all the paranoia?? There is no bias here against third world countries, it's just questioning the claims made for a new technology, claims which apparently (so Fawaz said above, anyhow) were misinterpreted in the local press.

abdul or abdulkareem.t | Published: 11:44 GMT, 08 January 2008

he should withdrw this article if he believes on thruth

abdul | Published: 11:41 GMT, 08 January 2008

the author of this blog is not studied the matter well. He is rejecting the claim of abideen because he belongs to the third world country. If he has done some hard work before writing the article he would never write like this.The authr try to claim that sainul abideen and his professor is not there and it is simply a claim. I know them and I belongs to their native and he is my friend for a long time.

msr | Published: 03:26 GMT, 30 December 2007

Of course, data cannot be safely stored on paper for obvious reasons but considering the theoretical possibilities, with the known compression, scanning and printing technology, this claim is certainly impossible; if the inventor has any credibility, he must demonstrate it in public or else it is waste of so many people's time and efforts

Xerexes | Published: 06:55 GMT, 14 September 2007

May be Mr. Abideen don't want to disclose his Patent's details, so he wont answer your tricky mails where you want to get details from him. Thats doesn't mean his research is void.

Santosh | Published: 07:44 GMT, 23 August 2007

Its really interesting and note worthy comments made by Fawaz. I would like to dwelve furhter into the Sainul Abideen's work. fawaz if you have any information worth sharing after your meeting with Sainul Abideen please do send me a mail at santosh.yekula@gmail.com

abhishan | Published: 12:06 GMT, 08 August 2007

This is an interesting article. The arguments provided in the main article do make sense and sound very convincing. One request to FAWAZ, please provide with more information regarding Abideen’s work. If not 400GB, then how much? No-one seems to believe that 400 GB can be stored on an A4 sheet of paper and it looks like Abideen never made such a claim.

Fawaz | Published: 02:44 GMT, 20 July 2007

I am one of those who have actually visited Sainul Abideen to know the truth behind the allegations against him. It seems that he has never raised a 400 GB claim, and that was some thing that I have verified. Besides, all of his claims have been misinterpretted. I was fortunate to see his product in person. Besides, he has shown me an apology letter sent by Mr Chris Mellor (Author of this article) for spreading rumours on his blog. The Hypocrisy of this author is to the extent that he has never pulled back the article from the blogs, nor has he publicly apologised. Shame on the way these so called 'critics' have taken sides against the new developments in third world countries. And I am rather sure he wont have the guts to keep this comment on this site for long.

Blade | Published: 13:45 GMT, 27 June 2007

It's too early to either get too excited or totally dismiss this idea. I think most mathematicians would agree you can store information in patterns, what if the information stored in those patterns was much more complex (or percieved as such) by special software. Almost symbology (if that's a word) if you will! We perceive the word, "goodbye", to mean so many different things, it can be, "I hate you I never want to see you again", or, "I'm looking forward to the next time we meet my friend". Surely if language can contain more meaning, a simple shape with the understanding already understood, could create large memory storage. What if in future not printing the image, but storing it as an image (RAW image format) was able to hold more info than it contained. Cheers! Keep your thinking caps on! :D

Related Storage news

HP tool offers continous laptop backup

Set it and forget.

Intel fixes drive bricking firmware update for flash drives

Company to re-release SSD software

IBM offers Lotus Symphony on Keepod USB devices

Thin USB device uses VMware to provide secure access to the Lotus suite

Sun claims record-breaking storage array

Says Storage 7000 is fastest on the planet

Related Storage reviews



Email this article to a friend or colleague:


PLEASE NOTE: Your name is used only to let the recipient know who sent the story, and in case of transmission error. Both your name and the recipient's name and address will not be used for any other purpose.

Techworld White Papers

Database security: Preventing enterprise data leaks at the source

IDC discusses the growing internal threats to business information, the impact of government regulations on the protection of data, and how enterprises must adopt database security best practices...

Download Whitepaper

Service-oriented security

SOA has become an integral part of enterprise software by providing a framework to efficiently develop software as services that is easily sharable, reusable, and integrated. No where is the need more apparent than in the Identity Management space. Welcome to the age of Service-Oriented Security (SOS).

Download Whitepaper

Data protection prospective vendor checklist

Organisations need a way to map business needs against all these challenges in procuring a technical solution. To help, SANS has developed the following Prospective Vendor Checklist.

Download Whitepaper

Unlock the power of the mainframe

This whitepaper presents the notion of CICS as an integration hub based on a component-based, service-oriented architecture supporting Web services. Highlights will review the challenges and contrasted support for Web services natively in CICS.

Download Whitepaper

Techworld UK - Technology - Business

COLT White Paper

Are all VoIP services the same?

Questions to ask your service provider to ensure you get the VoIP service you need
With careful choice of partner, your business can have all the advantages of VoIP access - reduced costs, flexibility and simplicity - without the drawbacks.
This white paper is your guide to ensure you get right the VoIP service and details the pitfalls which businesses would do well to avoid.

Download white paper
BMC

Ride the express lane in the journey to speed ITIL adoption

Explore the challenges in making the journey to ITIL and the criteria for selecting consulting services
By following ITIL practices, your IT organisation will become more closely integrated with the business. We recommend making the journey to ITIL in a sequence of six incremental steps, the phases of which are driven through execution of a strategic transformational roadmap.

Download white paper

Webcast: IT Financial Management: Cost Optimisation for Efficiency and Agility.
On Demand Webcast
Join this webcast to learn about the techniques and technologies that can help you prove the value of IT to the business by understanding the true cost of today's IT services and those that will be necessary to deliver future success.

Register Today

Site Map

IDG Network

* *