Technology to Easily Record and Deliver Presentations

Recent years have seen technologies that enable the recording, storage and delivery of presentations, lectures and meetings as multimedia content. However, when making a video of a presenter, for example, a video camera would have to be sent to and set up at the venue, and the recorded video would require editing such as indexing. Therefore, creating content is not easy, as it requires a lot of time and high costs. In addition, recorded video is often large in terms of data size and requires storage on a large-capacity server. Such large-sized video content is also disadvantageous in terms of delivery, since it imposes a heavy load on the network, limiting the environment in which to view the video.
To address these issues, Fuji Xerox has developed technology to easily record and deliver presentations without recording the presenter with a video camera, but instead recording the images of the document used in the presentation along with sound and mouse cursor movements. Usually, footage of the presenter is captured when recording a presentation or lecture, but we believe that the documents used in the presentation and its audio content contain the most important information. This technology automatically records the presentation documents displayed on a PC and the presenter's voice with our original capturing software, enabling easy delivery. Another feature of this technology is that since the images are captured as still images, the data size can be kept relatively small compared to a video.

Fig. 1 shows the flow of creating content. The presenter displays the document on a PC and starts the presentation just as in any other presentation. Using our original capturing software, the presenter's voice is recorded throughout the presentation, and when the page of the document displayed on the screen changes, the software captures the screen as a still image. The software determines the page change by monitoring the amount of difference in the captured screen images. If the presenter decides to skip some pages during the presentation, those pages are not captured as images. The captured images are then linked with the recorded sound, along with the movements of the mouse cursor. The set of data that includes images, sound and mouse cursor movements is then sent to a server to be managed. When viewing the data, a viewer can access the content on the server via a browser. The content can be accessed and viewed by multiple viewers simultaneously, and the viewers can select the page of the presentation document from which to start viewing by selecting from the thumbnails of pages.
In such a way, this technology can easily record, store, and deliver presentations, and could be used for such purposes as recording lectures and seminars, creating educational materials, and self-learning.

Fig. 1: Flow of Creating Content