IBMSkip to main content
  Home     Products & services     Support & downloads     My account  
  Select a country 
Journals Home 
 Systems Journal 
 ·  Current Issue 
 ·  Recent Issues 
 ·  Papers in Progress 
 ·  Search/Index 
 ·  Orders 
 ·  Description 
 ·  Author's Guide 
Journal of Research
and Development
 Staff 
 Contact Us 
 Related links: 
    IBM On demand
   business
 
     IBM WebFountain  
IBM Systems Journal 
Volume 43, Number 1, 2004
Utility Computing
 Table of contents: arrowHTML arrowPDF   This article: arrowHTML arrowPDF arrowCopyright info
  

How to build a WebFountain: An architecture for very large-scale text analytics - Author Bios

by D. Gruhl, L. Chavet, D. Gibson, J. Meyer, P. Pattanayak, A. Tomkins, and J. Zien

Biographical sketches of authors

David Gibson IBM Research Division, Almaden Research Center, 650 Harry Road, San Jose, California 95120 (davgib@us.ibm.com). Dr. Gibson is a researcher at the Almaden Research Center, where he has been involved in several projects related to Web mining and visualization, including early experiments with the HITS link analysis algorithm, browser enhancements with the WBI (Web Intermediaries) intelligent proxy project, and the SemTag Web-scale semantic annotation project. Currently he is working on WebFountain, an IBM initiative to provide Web data mining to corporate customers. He has a Ph.D. from the University of California, Berkeley.

Daniel Gruhl IBM Research Division, Almaden Research Center, 650 Harry Road, San Jose, California 95120 (dgruhl@almaden.ibm.com). Dr. Gruhl is a researcher at the Almaden Research Center. He earned his Ph.D. in electrical engineering from the Massachusetts Institute of Technology in 2000, with thesis work on distributed text analytics systems. His interests include stegonography (visual, audio, text, and database), machine understanding, user modeling, and very large-scale text analytics. Dr. Gruhl is the chief architect for WebFountain, with responsibility for overall hardware, software, and systems design. He is also co-architect of IBM's Unstructured Information Management Architecture.

Laurent Chavet Microsoft Corporation, One Microsoft Way, Redmond, Washington 98052 (fsort@fsort.com). Prior to joining Microsoft, Mr. Chavet worked on the WebFountain infrastructure at the IBM Almaden Research Center. His expertise in all aspects of search technology comes from his work at Alta Vista and with WebFountain. His primary interests are design and implementation of large-scale high-performance applications. He received three M.S. degrees: two from Ecole Polytechnique France in computer science and mathematics and one from E.N.S.E.E.I.H.T. France in computer science.

Joerg Meyer IBM Research Division, Almaden Research Center, 650 Harry Road, San Jose, California 95120 (jmeyer@almaden.ibm.com). Mr. Meyer is a software engineer at the Almaden Research Center. He has worked on projects involving Web application programming for Web Intermediaries (WBI), XML transcoding, and P2P. For WebFountain, he is responsible for various modules within the WebFountain indexer. He holds a Diploma Engineering degree in computer science from the University of Applied Sciences in Hamburg, Germany.

Pradhan Pattanayak IBM Research Division, Almaden Research Center, 650 Harry Road, San Jose, California 95120 (ppattana@us.ibm.com). Mr. Pattanayak is a software engineer at the Almaden Research Center. His interests include parallel processing, distributed computing, and Web technologies. He has worked at IBM, Hewlett-Packard Company, Oracle Corporation, and the Centre for Development of Advanced Computing (CDAC). He received his Master of Technology degree from the Indian Institute of Technology, Bombay.

Andrew Tomkins IBM Research Division, Almaden Research Center, 650 Harry Road, San Jose, California 95120 (tomkins@almaden.ibm.com). Dr. Tomkins is a member of the Principles and Methodologies group at the Almaden Research Center. His research interests include algorithms, particularly on-line algorithms, disk scheduling and prefetching, pen computing and OCR (optical character recognition), and the World Wide Web. He is one of the founders of WebFountain. He received his Ph.D. from Carnegie Mellon University and his B.S. in mathematics and computer science from the Massachusetts Institute of Technology.

Jason Zien IBM Research Division, Almaden Research Center, 650 Harry Road, San Jose, California 95120 (jasonz@almaden.ibm.com). Dr. Zien has been at the Almaden Research Center since 1997, working first on a Java-based thin server platform for the Open Services Gateway Initiative (OSGI), then on WebFountain, and now on the TREVI search engine. His research interests include graph partitioning, scalable text indexing, search, and information retrieval. He received his Ph.D. in computer engineering from the University of California, Santa Cruz and his B.S. in computer engineering from Carnegie Mellon University.