Official UD Coronavirus (COVID-19) Updates
  • Visit
  • Apply
  • Give
University of Delaware - Alfred Lerner College of Business & Economics

By Sunny Rosen July 25, 2019

Imagine you consider lending $2,000 to one of two borrowers on a crowdfunding website. The borrowers are identical in terms of their demographic and financial characteristics, the amount of money they wish to borrow, and their reason for borrowing the money. However, the text they provided when applying for a loan differs.

Borrower #1 writes: “I am a hard working person, married for 25 years, and have two wonderful boys. Please let me explain why I need help. I would use the $2,000 loan to fix our roof. Thank you, God bless you, and I promise to pay you back.”

Borrower #2 writes: “While the past year in our new place has been more than great, the roof is now leaking and I need to borrow $2,000 to cover the cost of the repair. I pay all bills (e.g., car loans, cable, utilities) on time.”

It might surprise you to know that based solely on these different word choices, borrower #1 is eight times more likely to default on your loan than borrower #2, according to research from the University of Delaware’s Michal Herzenstein.

“The words we use matter in predicting our future behavior,” said Herzenstein, who is an associate professor of marketing at UD’s Alfred Lerner College of Business and Economics. “Ours is the first paper that shows it at the individual level. Other papers show it in aggregate — for example, Twitter can predict elections results, Amazon reviews can predict the manufacturer’s stock price fluctuations, car reviews can predict market share, etc.”

To find these results, Herzenstein and her coauthors, Oded Netzer and Alain Lemaire from Columbia University, used text-mining and machine-learning tools to automatically process and analyze the raw text in over 120,000 loan requests from an online crowdfunding platform. The result became the paper “When Words Sweat: Identifying Signals for Loan Default in the Text of Loan Applications,” recently accepted for publication by the Journal of Marketing Research.

Among the team’s primary findings were that defaulting borrowers are more likely to mention their families, God, hardship, a need for help, and short-term focused words (defaulters are more likely to use the term “a month,” while repayers are more likely to use the term “a year”). Both lenders and applicants may want to avoid these words during the loan application process.

Herzenstein added that what’s especially interesting about the findings is that even words that aren’t related to finances are significant in predicting behavior. It’s not surprising, for example, that people who talk about unsecured, high-interest payday loans are more likely to default.

Instead, she said, “What’s really cool is the pronouns and words that seem meaningless like ‘rather.’ These are the words that in spoken language we usually cannot track — how many times do you say ‘I’ in a casual conversation? We don’t really know, but those words are meaningful in predicting behavior.” Both “rather” and “I,” she said, are examples of words more likely to be used by people who repay their loans.

“We find that people who pay back their loans use a lot of relative words such as: than, side, rather, below,” Herzenstein explained. “Defaulting borrowers talk a lot about other people, and those who pay back talk a lot about themselves. Some of it is strategic and some is more mindless.”

The paper also found that requests from defaulting borrowers have some similarities to the writing styles of extroverts and liars. To clarify — this doesn’t mean that defaulting borrowers intend to lie, or that they are extroverts, but a number of similarities exist between these groups, Herzenstein said. According to her research, all of these trends found in liars and extroverts are also found in defaulting borrowers:

  • “Liars often use very simple language and tell simple stories (not short ones, but simple ones). This is because they must use their mental capacity to keep track of the lie, and therefore cannot elaborate on their stories.
  • “Liars tend to use present and future tense. This is because they try to avoid giving a conclusion to their story. Here is an example: If I told you I went shopping yesterday, you can ask me, ‘What did you buy?’ But if I tell you I am shopping now or will go shopping tomorrow, there is no conclusion, and I don’t have to lie about what I bought.
  • “Liars tend to refer to others and not to themselves, because they want to distance themselves from the lie.
  • “Extroverts use more body words (mouth, naked…), human words (female, boy…) and achievement words (master, able…), and less filler words (blah, like…). Extroverts like exciting lifestyles, and it has been shown that they take more debt than others. Hence, it is not surprising that we find that defaulting loan requests are written in a similar manner.”

What can individuals and lenders learn from this research? Whether you’re applying for a loan or loaning someone money, consider that even the most innocuous of words can help make important predictions about behavior and outcomes.