Summary
Overview
Work History
Education
Skills
Accomplishments
Timeline
Generic

Duc Ho

Data Scientist
Ho Chi Minh

Summary

A life-long learner and data enthusiast. Seeking a position, where creativity, ideas and innovation are highly prized and appreciated. Highly passionate about using knowledge and experience in data analytics and engineering to help explore insights from big data, achieve business goals and align with vision, mission and values.

Overview

7
7
years of professional experience
4
4
years of post-secondary education

Work History

Data Scientist

Tenpoint7 LLC
08.2019 - Current
  • Involve in the development cycle of consulting projects in back-end parts
  • Integrate multiple crawlers from different data sources to a crawling pipeline of a supplier risk application (Bing Search, Crunchbase, sec.gov, etc)
  • Integrate third-party ML services (AWS Comprehend, AWS Textract) to existing apps
  • Develop rule-based algorithm to detect events from news articles
  • Design database schema, and maintained data pipeline on AWS cloud (Elasticsearch, MySQL, ECS, Lambda, CloudWatch, etc)
  • Implement data backup and migration
  • Write public/private API for web applications
  • Propose and implemented solutions to optimize run-time and cost of applications
  • Built an end-to-end data pipeline using Airflow and ECS.

Technical Data Analyst

Tenpoint7 LLC
01.2019 - 08.2019
  • Researched and implemented stable, scalable crawlers from different sources (PubMed, Amazon, Financial Times, etc)
  • Integrated new crawlers to current data pipeline using Docker, CI and AWS stack
  • Trained and integrated an image recognition model to read Amazon's CAPTCHA
  • Listened to customer feedback in order to improve features built
  • Supported analyst team in technical queries and tasks
  • Supported customers in data consulting projects

Technical Data Analyst

Krom LLC By XOMAD
07.2017 - 01.2019
  • Write and maintain stable crawler APIs to collect text and image data from social networking sites (Instagram, Twitter)
  • Implemented a machine learning model to detect bot on Instagram based on features from user info
  • Built a text-based machine learning model to classify parental status of Instagram users
  • Created a rule-based algorithm extract age from bio and username of Instagram users using pattern recognition
  • Designing distributed and scalable crawling system
  • Working with different types of database in the processes of storing data thus ensuring the availability of data platform team
  • Carrying out new research on machine learning models to get business insights from collected data
  • Doing ad-hoc data analytics and visualization on given dataset in order to help the company make business decisions accurately on different projects.

Data Analyst

Sentifi Vietnam LLC
04.2016 - 06.2017
  • Audited financial content to maintain Named Entity Recognition (NER) system
  • Collected a large amount of text and quantitative financial data from the web including Twitter and over news and blog websites
  • Labelled, processed data for machine learning models
  • Visualized, analyzed data to explore trends, patterns and draw conclusions
  • Led the review and test of machine learning models before releasing
  • In charge of making KPI reports, statistics, writing scripts and tools to automate manual jobs using Python, IPython Notebook, Elasticsearch and PostgreSQL
  • Carried out research on new methods, innovations in order to optimize working efficiency
  • Work closely with Data Science and Data Engineering teams to improve models precision and automate working processes.

Education

Bachelor's Degree - Finance And Banking

Hanoi University of Industry Hanoi
09.2010 - 07.2014

Skills

Python (functional programming, OOP, data analytics, data visualization)undefined

Accomplishments

  • Sequence Models - deeplearning.ai https://www.coursera.org/account/accomplishments/verify/KW92NWYPZUA7
  • Structuring Machine Learning Projects - deeplearning.ai https://www.coursera.org/account/accomplishments/verify/NBR9E67ESMXD
  • Convolutional Neural Networks - deeplearning.ai https://www.coursera.org/account/accomplishments/verify/FPA4B3DQGDW9
  • Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization - deeplearning.ai https://www.coursera.org/account/accomplishments/verify/G7SJTNLFRH32
  • Neural Networks and Deep Learning - deeplearning.ai https://www.coursera.org/account/accomplishments/verify/LGGCKK7YE7N4
  • Machine Learning - Stanford https://www.coursera.org/account/accomplishments/verify/HKS94XKH6ZRN

Timeline

Data Scientist

Tenpoint7 LLC
08.2019 - Current

Technical Data Analyst

Tenpoint7 LLC
01.2019 - 08.2019

Technical Data Analyst

Krom LLC By XOMAD
07.2017 - 01.2019

Data Analyst

Sentifi Vietnam LLC
04.2016 - 06.2017

Bachelor's Degree - Finance And Banking

Hanoi University of Industry Hanoi
09.2010 - 07.2014
Duc HoData Scientist