Phuong Hoang is a data scientist in the
Data Science R&D group at CareerBuilder.
His research lies in the field of natural language processing and machine learning
with applications to the human capital management domain. He attended North Carolina State University, where he earned his
B.S. in financial mathematics, and his M.S.
and Ph.D. in applied mathematics, specializing in machine learning applications for
medical diagnostics and sports analytics. He
currently resides in Atlanta, Georgia, USA.

Thomas Mahoney is the manager of candidate and data services at CareerBuilder,
where he oversees a group of teams focused
on building out and maintaining classification, data enrichment, and candidate management web services to power CareerBuilder’s products. He has worked in the
human capital management space for the
past five years and is passionate about building fast, reliable, and highly scalable
microservices that deliver high customer
value. He holds a B.S. degree in computer
science from the Georgia Institute of Technology and currently resides in Roswell,
Georgia, USA.

Faizan Javed is a manager of data science at
CareerBuilder, where he leads the Data Science group responsible for data enrichment
technologies such as knowledge bases, entity taxonomies and relationships, data standardization, and deduplication and normalization algorithms for the online
recruitment domain. He has almost 10 years
of industry experience in diverse domains
with multiple technology stacks. Faizan has
over 30 publications in areas ranging from
data science and machine learning to software systems and model-driven engineering. His current area of focus is the application of data science to end-to-end human
capital management processes. He holds an
M.S. degree in computer science and bioinformatics, a Ph.D. degree in computer and
information sciences, and a certificate in
technology entrepreneurship, all from the
University of Alabama at Birmingham, Alabama, USA.

Matt McNair is vice president of global
services strategy at CareerBuilder, where he focuses on the
edges of the company’s products, ensuring
that they drive recruiter efficiency by integrating well with each other and with the
tools that recruiters use daily. His 12 years of
experience in the recruitment space has
made him passionate about applying data
science, running high-scale microservices,
and bringing it all together into a recruiter-friendly sourcing product. He currently
resides in Atlanta, Georgia, USA.

socioeconomic problem. To this end,
we describe the SKILL system for skill
normalization that has been in production at CareerBuilder for more than a
year. More specifically, we follow up on
our previous work, which described what was
then an emerging prototype, by
describing how the system evolved
over time as it gained greater traction
and usage across the company. We also
focus on the collaboration between the
data science and data engineering
teams and describe how both organizational teams are needed to bring large-scale, high-impact ideas to fruition.

We have received extensive and
valuable feedback both from our internal stakeholders and from external
customers on areas for improvement,
and we are currently researching several future directions for the system. We
plan to improve it by supporting case-sensitive tagging to minimize false positives and by building a more comprehensive skill hierarchy. We will also
support categorizing skills as emerged, established, or saturated. As we
expand the service in various international markets, we will also support
multilingual skill tagging and taxonomies. To support our compensation analytics and career path efforts,
we plan to extend the service with
skill proficiency, expertise inference,
and skill effort capabilities.