Maochen.org - In the middle of nowhere

Interest Areas:

DL Multi-modality Modeling
Parsing
Entity Recognition
Semantic Relation Extraction, Knowledge Distributed Representation

Experience

Meta Inc.

2020 - Present
Sr. Machine Learning Engineer

Microsoft

2017 - 2020
Senior Engineer

2013 - 2016
Lead NLP R&D Engineer

2010 - 2013
M.Sc. in Computer Science

Publication & Patent

Querying Enterprise Knowledge Graph with Natural Language (2019)

Junyi Chai, Yonggang Deng, Maochen Guan, Yujie He, Bing Li and Rui Yan
in Proceedings of the CEUR Workshop 2019

Publication

Word Substitution in Short Answer Extraction: A WordNet-based Approach (2016)

Qingqing Cai, James Gung, Maochen Guan, Gerald Kurlandski and Adam Pease
in Proceedings of the Eighth Global WordNet Conference, Verginica Barbu Mititelu, Corina Forascu, Christiane Fellbaum, Piek Vossen (eds), pp 66-73

Publication

Patents

• Ontology Entity Type Detection from Tokenized Utterance - Serial # 16/370797 - 2019
• Training Utterance Generation for Knowledge Graph Based QA - Serial # 16/370759 - 2019
• Querying Knowledge Graph with Natural Language Input - Serial # 16/370721 - 2019

Projects

Amelia

Virtual agent for assisting customer answering domain related questions.
Constructing a hierarchical based complex Sentence Similarity computing system using distributed word representation.
Implemented an adapter layer for Stanford Parser to accept CoNLL-U format training data.
CRFs-based entity recognition.
Constructed dialog act recognizer by using LibLinear model with ~30% accuracy boosting on original dialogue scripts for 82 related topics.
Optimized PCFG parser model.

Feature Extraction Optimization for Multi-core Arch.

Optimize TweetEmotion.com.
Parallel feature extraction process.
Data level parallelism in training process.
Cell level and pair level parallelism in parsing process.
Non-blocking parallelism implemented by thread pool model and fork and join model.

Introduction | Code | Paper

Sentiment Classification System for Twitter

Supervised machine learning classification system.
Classify tweets extracted from twitter.com into 3 groups (Positive, Negative or Neutral).
Maximum entropy classifier for prediction.
Revised on Dec 2012, Backoff model added.
10-fold CV F1-Score: ~88%.

Paper

Calendar

Verified by Let's Encrypt