Personal Information
Entreprise/Lieu de travail
Singapore Singapore
Profession
Data Geek
Secteur d’activité
Technology / Software / Internet
À propos
Over 5 years specialized in big data analytic, mainly in Data Acquisition, Marketing Intelligence, Web Analytics, Fraud Detection, Recommendation, etc.
Specialties:
Machine Learning Algorithms: SVM & Neural Network & PCA & Clustering & Regression & Decision Tree & Outliers Detection;
Web Analytics & Clickstream System & Graph Analysis & Data Warehousing;
Tools: Hadoop(MapR), Spark(Scala,pyspark,MLlib,SparkSQL,Graphx, Magellan for Geospatial Analytics), Presto, HBase, Hive, Drill, Sqoop, Kafka and Storm.
DB: Greenplum & Oracle(11g&10g) & PostgreSQL & Mysql.
Also Interested in operation research, convex optimization, stochastic optimization.
Mots-clés
dtcc
svm
strata singapore
spark
Tout plus
Présentations
(10)J’aime
(27)Stateful, Stateless and Serverless - Running Apache Kafka® on Kubernetes
confluent
•
il y a 5 ans
Part 1: Lambda Architectures: Simplified by Apache Kudu
Cloudera, Inc.
•
il y a 7 ans
Improving PySpark Performance - Spark Beyond the JVM @ PyData DC 2016
Holden Karau
•
il y a 7 ans
林佳賢/資料視覺化的 20 個小訣竅
台灣資料科學年會
•
il y a 7 ans
Productionizing Spark and the REST Job Server- Evan Chan
Spark Summit
•
il y a 8 ans
Dreaming Infrastructure
kyhpudding
•
il y a 14 ans
Handling Data Skew Adaptively In Spark Using Dynamic Repartitioning
Spark Summit
•
il y a 7 ans
Apache Spark 2.0: A Deep Dive Into Structured Streaming - by Tathagata Das
Databricks
•
il y a 7 ans
1. Apache Kylin Deep Dive - Streaming and Plugin Architecture - Apache Kylin Meetup @Shanghai
Luke Han
•
il y a 8 ans
Magellen: Geospatial Analytics on Spark by Ram Sriharsha
Spark Summit
•
il y a 8 ans
Sparkcamp stratasingapore
Cheng Feng
•
il y a 8 ans
AWSome Day Singapore Keynote 2015
Hwee Bee Tan
•
il y a 8 ans
Combine Apache Hadoop and Elasticsearch to Get the Most of Your Big Data
Hortonworks
•
il y a 10 ans
Introduction to Machine Learning
Lior Rokach
•
il y a 11 ans
Singapore startup ecosystem and entrepreneur toolbox - Aug 2015
Arnaud Bonzom
•
il y a 8 ans
Using Apache Drill
Chicago Hadoop Users Group
•
il y a 9 ans
Parquet Hadoop Summit 2013
Julien Le Dem
•
il y a 10 ans
Titan: The Rise of Big Graph Data
Marko Rodriguez
•
il y a 11 ans
Intro to Graph Databases Using Tinkerpop, TitanDB, and Gremlin
Caleb Jones
•
il y a 10 ans
Real time Analytics with Apache Kafka and Apache Spark
Rahul Jain
•
il y a 9 ans
Open Source Lambda Architecture with Hadoop, Kafka, Samza and Druid
DataWorks Summit
•
il y a 8 ans
Sqoop on Spark for Data Ingestion
DataWorks Summit
•
il y a 8 ans
Enterprise Kafka: Kafka as a Service
Todd Palino
•
il y a 10 ans
Kdd 2014 Tutorial - the recommender problem revisited
Xavier Amatriain
•
il y a 9 ans
鹰眼下的淘宝_EagleEye with Taobao
terryice
•
il y a 10 ans
All you wanted to know about analytics in e commerce- amazon, ebay, flipkart
Anju Gothwal
•
il y a 9 ans
Kaggle Otto Challenge: How we achieved 85th out of 3,514 and what we learnt
Eugene Yan Ziyou
•
il y a 8 ans
Personal Information
Entreprise/Lieu de travail
Singapore Singapore
Profession
Data Geek
Secteur d’activité
Technology / Software / Internet
À propos
Over 5 years specialized in big data analytic, mainly in Data Acquisition, Marketing Intelligence, Web Analytics, Fraud Detection, Recommendation, etc.
Specialties:
Machine Learning Algorithms: SVM & Neural Network & PCA & Clustering & Regression & Decision Tree & Outliers Detection;
Web Analytics & Clickstream System & Graph Analysis & Data Warehousing;
Tools: Hadoop(MapR), Spark(Scala,pyspark,MLlib,SparkSQL,Graphx, Magellan for Geospatial Analytics), Presto, HBase, Hive, Drill, Sqoop, Kafka and Storm.
DB: Greenplum & Oracle(11g&10g) & PostgreSQL & Mysql.
Also Interested in operation research, convex optimization, stochastic optimization.
Mots-clés
dtcc
svm
strata singapore
spark
Tout plus