date: August 2016 display_date: August 2016 meetups: - name: ‘Nearline Topic Tagging of News Articles on Samza’ host: LinkedIn image: presenters: - name: Eric Huang website: image: affiliation: LinkedIn video: url: https://youtu.be/bnOSqCBUdpU image: https://img.youtube.com/vi/bnOSqCBUdpU/0.jpg abstract: LinkedIn provides meaningful and fresh content to users at scale, it automatically tag news articles with the topics that they are about. In this talk, Eric Huang presents the distributed architecture of nearline topic tagger built on Samza, offline-to-online model delivery, the overarching machine learning workflow, and interesting problems and solutions we have encountered along the way. - name: ‘How to convert a legacy Hadoop Map/Reduce ETL systems to Samza Streaming’ host: LinkedIn image: presenters: - name: Louis Calisi website: image: affiliation: TripAdvisor abstract: In this talk we will discover How TripAdvisor converted our legacy Hadoop Map/Reduce jobs to Samza Streaming. This system feeds thousands of tables and downstream reports. No data loss and full backwards capability were required.
video: url: https://youtu.be/KQ5OnL2hMBY image: https://img.youtube.com/vi/KQ5OnL2hMBY/0.jpg