5. Summary And Outlook

We presented a novel method for creating a stand-alone representation of time-based media in a printable still image format. An analog representation is used for the most information intensive part of the time-based media, e.g., the first key frame, and motion vectors and prediction errors are encoded in binary format and represented with a QR code. Our experiments showed that several seconds of video can be encoded on paper this way using a modified MPEG-4 codec. Only a flip-book decoder software is needed on the client device for the playback.

Such short video clips can be used to personalize greeting cards, create video albums, play small animations on books, and can be printed on products to demonstrate usage. Long video clips can be obtained by printing several key frames and larger QR codes. Instead of using the first frame as the reference frame, a key frame could be used that provides the minimum prediction.

figure 7
Figure 7. Speech to static image representation.

The ideas presented in this paper can be extended to represent other time-based media such as music, speech, and animations. An audio message can be encoded in the QR code. Alternatively, speech can be printed in text and the user's vocal information (e.g., pitch, pronunciation) can be represented in the QR code as shown in Figure 7. At the decoder, OCR would recover the text and speech would be synthesized using the vocal information in the QR code. Music can be printed as musical notes, which can be recognized at the decoder, and the QR code can represent instrument and other audio information to synthesize music at the decoder.

6. References

[1]ISO/IEC 14496-2, "'Information technology—Coding of audiovisual objects—Part 2: Visual", 2000.
[2]Uchihashi, S., Foote, J., Girhensohn, A., & Boreczky, J. "Video Manga: Generating Semantically Meaningful Video Summaries", ACM Multimedia Conf., pp. 383-392, 1999.
[3]T. Stich, M. Magnor, "Keyframe Animation from Video", Proc. IEEE ICIP, pp.2713-2716, 2006.
[4]Graham, J, Erol, B., Hull, J.J. and Lee, D.S., "The VideoPaper Multimedia Playback System", ACM Multimedia Conference, pp.94-94, 2003.
[5]Klemmer, S.R., Graham, J., Wolff, G.J., Landay, J.A., "Books with voices: paper transcripts as a physical interface to oral histories", ACM CHI, pp. 89-96, 2003.
[6]Bansal, P., Narendran, M.R., and Murali, M.N.K., "Improved error detection and localization techniques for MPEG-4 video", IEEE ICIP, 2002.
[7]Gallant, M. Shirani, S. Kossentini, F. , "Standardcompliant multiple description video coding", IEEE ICIP, 2001.
[8]ISO/IEC 18004, "Information Technology AIDC Techniques Bar code symbology QR Code," 2000.
[9]Examples at http://rii.ricoh.com/~berna/videoflipbook.html