Bill Keenan and Terry Heinze, Thomson Reuters
Distributed Computing and Big Data: Hadoop and MapReduce
April 8, 2011
Distributed computing is recognized as a key enabler of large-scale computing that operates against Big Data. Hadoop is an open-source computing infrastructure that provides a distributed filesystem and a means of running jobs across clusters of servers. The Thomson Reuters Corporate Research and Development group has collaborated with multiple teams across the company to experiment with and use distributed computing. We describe some of the work that has occurred over the last year in our Hadoop environments. The aim is to give the audience a sense of the varied use cases for cluster-based computing and of how Hadoop has been used successfully. We also share some lessons learned along the way.
Bill Keenan directs our Development group of over 20 software engineers. He earned a Master's in Computer Science at the University at Albany. Bill's team works on numerous product development projects, with a focus on classification, entity recognition and linking, and data and text mining. Bill joined Thomson Reuters in 2002. He previously led Development teams at several technology-focused organizations in manufacturing, telecommunications and knowledge management: Nortel, NetSetGo and Mpower Communications. He is a member of the IEEE and ACM.
Terry Heinze has a B.S. in Physics from the University of Minnesota. He has over 30 years of experience developing and architecting software. He is currently interested in service-oriented architectures, cloud computing and dependency injection frameworks. He has played a primary role in the development and deployment of named entity extraction frameworks and solutions for projects including Reuters Insider and LiveNote. Before joining Thomson Reuters, Terry worked in the financial and transportation industries, where he designed enterprise frameworks for expert underwriting systems, rule-based product recommendation software, and dynamic role-based corporate security systems. He also has prior military experience as a naval officer.