IBM Skip to main content
  Home     Products & services     Support & downloads     My account  
  Select a country  
Journals Home  
  Systems Journal  
  ·  Current Issue  
  ·  Recent Issues  
  ·  Papers in Progress  
  ·  Search/Index  
  ·  Orders  
  ·  Description  
  ·  Author's Guide  
Journal of Research
and Development
  Staff  
  Contact Us  
Systems Journal  
Volume 35, Numbers 3 & 4, 1996
MIT Media Lab
 Table of contents: arrowHTML arrowPDF arrowASCII   This article: arrowHTML arrowPDF arrowASCII
arrowCopyright info
   

Using acoustic structure in a hand-held audio playback device - References

by C. Schmandt and D. Roy

Cited references

  1. C. Schmandt and B. Arons, "A Conversational Telephone Messaging System," IEEE Transactions on Consumer Electronics CE-30, No. 3, xxi-xxiv (August 1984).
  2. B. Chalfonte, R. Fish, and R. Kraut, "Expressive Richness: A Comparison of Speech and Text as Media for Revision," Proceedings of the Conference on Computer Human Interaction, ACM (April 1991), pp. 21-26.
  3. D. Hindus, C. Schmandt, and C. Horner, "Capturing, Structuring, and Representing Ubiquitous Audio," ACM Transactions on Information Systems 11, No. 4, 376-400 (October 1993).
  4. T. W. Malone, K. R. Grant, K. Y. Lai, R. Rao, and D. Rosenblitt, "Semi-Structured Messages Are Surprisingly Useful for Computer-Supported Coordination," TOIS 5, No. 2, 115-131 (April 1987).
  5. F. Chen and M. Withgott, "The Use of Emphasis to Automatically Summarize a Spoken Discourse," Proceedings of the International Conference on Acoustics, Speech and Signal Processing, Vol. 1 (1992), pp. 229-232.
  6. L. J. Stifelman, "A Discourse Analysis Approach to Structured Speech," AAAI 1995 Spring Symposium Series, Palo Alto, CA (March 1995).
  7. H. Gish, M. Siu, and R. Rohlicek, "Segregation of Speakers for Speech Recognition and Speaker Identification," Proceedings of the International Conference on Acoustics, Speech and Signal Processing, Vol. 2 (1991), pp. 873-876.
  8. L. Wilcox, D. Kimber, and F. Chen, "Audio Indexing Using Speaker Identification," Xerox PARC ISTL Technical Report No. ISTL-QCA-1994-05-04 (1994).
  9. J. B. Voor and J. M. Miller, "The Effect of Practice upon the Comprehension of Time-Compressed Speech," Speech Monography 32, 452-455 (1965).
  10. D. S. Beasley and J. E. Maki, "Time- and Frequency-Altered Speech," Contemporary Issues in Experimental Phonetics, N. J. Lass, Editor, Chapter 12, Academic Press, New York (1976), pp. 419-458.
  11. L. Degen, R. Mander, and G. Salomon, "Working with Audio: Integrating Personal Tape Recorders and Desktop Computers," CHI, OCHI, New York (1992), pp. 413-418.
  12. C. Schmandt, "The Intelligent Ear: A Graphical Interface to Digital Audio," Proceedings of the IEEE Conference on Cybernetics and Society (October 1981), pp. 393-397.
  13. D. Kimber, L. Wilcox, F. Chen, and T. Moran, "Speaker Segmentation for Browsing Recorded Audio," CHI '95 Conference Companion, Denver, CO (May 7-11, 1995), pp. 212-213.
  14. A. S. Bregman, Auditory Scene Analysis: The Perceptual Organization of Sound, MIT Press, Cambridge, MA (1990).
  15. D. A. Norman, Memory and Attention: An Introduction to Human Information Processing, John Wiley & Sons, New York (1976).
  16. L. J. Stifelman, "Not Just Another Voice Mail System," Proceedings of the 1991 Conference, American Voice I/O Society (September 1991), pp. 21-26.
  17. C . Schmandt, "Caltalk: A Multi-Media Calendar," Proceedings of the 1990 Conference, OAVIOS (1990), pp. 71-75.
  18. B. Arons, "Hyperspeech: Navigating in Speech-Only Hypermedia," Hypertext '91 Proceedings, ACM (December 1991), pp. 133-146.
  19. B. Arons, SpeechSkimmer: Interactively Skimming Recorded Speech, Ph.D. thesis, MIT Media Laboratory, Cambridge, MA (1994).
  20. C. Schmandt and A. Mullins, "AudioStreamer: Exploiting Simultaneity for Listening," CHI '95 Conference Companion, Denver, CO (May 1995), pp. 218-219.
  21. T. G. Zimmerman, J. R. Smith, J. A. Paradiso, D. Allport, and N. Gershenfeld, "Applying Electric Field Sensing to Human-Computer Interfaces," CHI '95 Conference Proceedings, Denver, CO (May 1995), pp. 280-287.
  22. D. Roy, NewsComm: A Hand-Held Device for Interactive Access to Structured Audio, M.Sc. thesis, MIT Media Laboratory, Cambridge, MA (1995).
  23. D. Rumelhart, G. Hinton, and R. Williams, "Learning Representations by Back-Propagating Errors," Nature 323, 533-536 (1986).
  24. Information available by sending e-mail to technation@usfca.edu.
  25. D. O'Shaughnessy, "Recognition of Hesitations in Spontaneous Speech," Proceedings of the International Conference on Acoustics, Speech and Signal Processing (1992), pp. 1521-1524.
  26. Emergency Medical Abstracts, G. Hasapes, Executive Editor, Center for Medical Education, Harleysville, PA (1995).
  27. D. Roy and C. Schmandt, "NewsComm: A Hand-Held Interface for Interactive Access to Structured Audio," CHI '96 Conference Proceedings, Vancouver, Canada (April 1996), pp. 173-180.