Stanford
University
  • Stanford Home
  • Maps & Directions
  • Search Stanford
  • Emergency Info
  • Terms of Use
  • Privacy
  • Copyright
  • Trademarks
  • Non-Discrimination
  • Accessibility
© Stanford University.  Stanford, California 94305.
Words Matter: The Text of Online Job Postings Can Predict Salaries | Stanford HAI

Stay Up To Date

Get the latest news, advances in research, policy work, and education program updates from HAI in your inbox weekly.

Sign Up For Latest News

Navigate
  • About
  • Events
  • Careers
  • Search
Participate
  • Get Involved
  • Support HAI
  • Contact Us
Skip to content
  • About

    • About
    • People
    • Get Involved with HAI
    • Support HAI
    • Subscribe to Email
  • Research

    • Research
    • Fellowship Programs
    • Grants
    • Student Affinity Groups
    • Centers & Labs
    • Research Publications
    • Research Partners
  • Education

    • Education
    • Executive and Professional Education
    • Government and Policymakers
    • K-12
    • Stanford Students
  • Policy

    • Policy
    • Policy Publications
    • Policymaker Education
    • Student Opportunities
  • AI Index

    • AI Index
    • AI Index Report
    • Global Vibrancy Tool
    • People
  • News
  • Events
  • Industry
  • Centers & Labs
news

Words Matter: The Text of Online Job Postings Can Predict Salaries

Date
January 10, 2022
Topics
Economy, Markets
Natural Language Processing

A machine learning tool that connects job skills to pay could help workers, companies and policymakers make better decisions about job training.

The job landscape in the United States is dramatically shifting: The COVID-19 pandemic has redefined essential work and moved workers out of the office; new technologies are transforming the nature of many occupations; globalization continues to push jobs to new locations; and climate change concerns are adding jobs in the alternative energy sector while cutting them in the fossil fuel industry. 

Amid this workplace turmoil, workers, as well as employers and policymakers, could benefit from understanding which job characteristics lead to higher wages and mobility, says Sarah Bana, a postdoctoral fellow at Stanford’s Digital Economy Lab, part of the  Stanford Institute for Human-Centered Artificial Intelligence. And, she notes, there now exists a large dataset that might help provide that understanding: the text of millions of online job postings. 

“Online data provides us with a tremendous opportunity to measure what matters,” she says.

Indeed, using machine learning, Bana recently showed that the words used in a dataset of more than one million online job postings explain 87% of the variation in salaries across a vast proportion of the labor market. It’s the first work to use such a large dataset of postings and to look at the relationship between postings and salaries. 

Bana also experimented with injecting new text – adding a skill certificate, for example – into relevant job listings to see how these words changed the salary prediction.

“It turns out that we can use the text of job listings to evaluate the salary-relevant characteristics of jobs in close to real time,” Bana says. “This information could make applying for jobs more transparent and improve our approach to workforce education and training.”

The Words of Job Listings Matter 

To analyze how the text of online job postings relates to salaries, Bana obtained more than one million pre-pandemic job postings from Greenwich.HR, which aggregates millions of job postings from online job board platforms. 

She then used BERT, one of the most advanced natural language processing (NLP) models available, to train an NLP model using the text of more than 800,000 of the job postings and their associated salary data. When she tested the model using the remaining 200,000 job listings, it accurately predicted the associated salaries 87% of the time. By comparison, using only the job postings’ job titles and geographic locations yielded accurate predictions just 69% of the time.

In follow-up work, Bana will attempt to characterize the contribution of various words to the salary prediction. “Ideally, we will color words within postings from red to green, where the darker red words are linked with lower salary and the darker green are linked with higher salary,” she says. 

The Value of Upskilling – A Text Injection Experiment

To identify which skills matter for salary prediction, Bana used a text injection approach: To certain relevant job postings, she added short phrases indicating the job requires a particular career certification, such as those listed in Indeed.com’s 10 In-Demand Career Certifications (And How To Achieve Them). Obtaining these certifications can be costly, with prices ranging from about $225 to about $2,000. But, until now, there has been no way to determine whether the investment is worthwhile from a salary point of view. 

Bana’s experiment revealed that some certifications (such as the IIBA Agile Analysis Certification) produce meaningful salary gains quickly while others (such as the Cisco Certified Internetwork Expert) do so more slowly – valuable information for workers who would like to have better information about how an investment in skills training will affect their salaries and prospects, Bana says.

Employees aren’t the only ones to benefit from this information, Bana notes. Employers can use these results to better invest in human capital, she says. If, for example, machine learning models reveal a gradual shift away from some tasks and toward others, employers would have advance warning and could retrain certain employees.

And policymakers considering what job training programs to promote would similarly benefit from understanding which skills are waxing or waning in economic value.

To that end, Bana and her colleagues are currently working on a companion paper that identifies what tasks are disappearing from job listings over time and what new tasks are appearing. 

In the future, Bana hopes that textual analysis of job postings could yield a web-based application where workers or companies could research the value added by upskilling or by moving to a new geographic location. 

“Currently there’s not a lot of clarity around a path to higher earnings,” Bana says. “Tools like these could help job seekers improve their job prospects, employers develop their workforces, and policymakers respond to immediate changes in the economy.”

Stanford HAI's mission is to advance AI research, education, policy and practice to improve the human condition. Learn more. 

Share
Link copied to clipboard!
Contributor(s)
Katharine Miller

Related News

What Davos Said About AI This Year
Shana Lynch
Jan 28, 2026
News
James Landay and Vanessa Parli

World leaders focused on ROI over hype this year, discussing sovereign AI, open ecosystems, and workplace change.

News
James Landay and Vanessa Parli

What Davos Said About AI This Year

Shana Lynch
Economy, MarketsJan 28

World leaders focused on ROI over hype this year, discussing sovereign AI, open ecosystems, and workplace change.

AI Leaders Discuss How To Foster Responsible Innovation At TIME100 Roundtable In Davos
TIME
Jan 21, 2026
Media Mention

HAI Senior Fellow Yejin Choi discussed responsible AI model training at Davos, asking, “What if there could be an alternative form of intelligence that really learns … morals, human values from the get-go, as opposed to just training LLMs on the entirety of the internet, which actually includes the worst part of humanity, and then we then try to patch things up by doing ‘alignment’?” 

Media Mention
Your browser does not support the video tag.

AI Leaders Discuss How To Foster Responsible Innovation At TIME100 Roundtable In Davos

TIME
Ethics, Equity, InclusionGenerative AIMachine LearningNatural Language ProcessingJan 21

HAI Senior Fellow Yejin Choi discussed responsible AI model training at Davos, asking, “What if there could be an alternative form of intelligence that really learns … morals, human values from the get-go, as opposed to just training LLMs on the entirety of the internet, which actually includes the worst part of humanity, and then we then try to patch things up by doing ‘alignment’?” 

How AI Shook The World In 2025 And What Comes Next
CNN Business
Dec 30, 2025
Media Mention

HAI Co-Director James Landay and HAI Senior Fellow Erik Brynjolfsson discuss the impacts of AI in 2025 and the future of AI in 2026.

Media Mention
Your browser does not support the video tag.

How AI Shook The World In 2025 And What Comes Next

CNN Business
Industry, InnovationHuman ReasoningEnergy, EnvironmentDesign, Human-Computer InteractionGenerative AIWorkforce, LaborEconomy, MarketsDec 30

HAI Co-Director James Landay and HAI Senior Fellow Erik Brynjolfsson discuss the impacts of AI in 2025 and the future of AI in 2026.