Introduction to the world of Hadoop
Gordon Rios (Cork Constraint Computation Centre, University College of Cork, Ireland)
Location: Aula 108, ETSE
Start time: Oct. 29, 2010,
12 p.m.
Abstract
In this talk I will introduce one of the most popular computing technologies called Hadoop. It was originally developed for large scale text mining based upon the principles of map-reduce from Google. Hadoop and related tools have rapidly become one of the most widespread frameworks for large scale data mining and analytics applications in a variety of settings. It is hoped that this introduction will help researchers and scientists see applications for this remarkable set of technologies in their own fields.
Email announcementAbout Gordon Rios
Institution: Cork Constraint Computation Centre, University College of Cork, Ireland
Gordon Rios is a leader in search and search-related technologies. Over the past 10 years he has focused his efforts on devising machine learning systems for web search, document classification, and text mining. At Inktomi he developed the scoring engine for a user click-based ranking application that was deployed at HotBot and Snap, was a major contributor to the Directory Engine product which classified millions of web documents into over 10,000 categories, and was a founding member of Inktomi’s Web Search Relevance Group. Moving to Proofpoint, an enterprise messaging company, he led the development of the award-winning MLX technology used in their industry leading anti-spam product. Most recently Gordon worked at Yahoo! in the International Relevance team, where he worked on all aspects of search engine relevance and developed production code running in all major international markets. Gordon holds an MBA in Finance and a master’s degree in Engineering and Computer Science from U.C. Berkeley. His Erdös number is 4.
More about Gordon Rios