The Benefits of Using Economically Meaningful Factors in Financial Data Science

  • August 17, 2025
  • Roubens Andy King

Factor selection is among our most important considerations when building financial models. So, as machine learning (ML) and data science become ever more integrated into finance, which factors should we consider for our ML-driven investment models and how should we select among them?

These are open and critical questions. After all, ML models can help not only in factor processing but also in factor discovery and creation.

Factors in Traditional Statistical and ML Models: The (Very) Basics

Factor selection in machine learning is called “feature selection.” Factors and features help explain a target variable’s behavior, while investment factor models describe the primary drivers of portfolio behavior.

Perhaps the simplest of the many factor model construction methods is ordinary least squares (OLS) regression, in which the portfolio return is the dependent variable and the risk factors are the independent variables. As long as the independent variables have sufficiently low correlation, different models will be statistically valid and explain portfolio behavior to varying degrees. Each model reveals what percentage of the portfolio’s behavior it accounts for, as measured by R-squared, and how sensitive the portfolio’s return is to each factor, as expressed by the beta coefficient attached to that factor.
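To make the OLS setup concrete, here is a minimal sketch in Python. It assumes NumPy is available, and the data are synthetic: the three factors, their "true" betas, and the function name fit_factor_model are all made up for illustration. It regresses portfolio returns on factor returns and reports the alpha, the betas, and the share of return variance explained.

```python
import numpy as np

def fit_factor_model(portfolio_returns, factor_returns):
    """OLS regression of portfolio returns on factor returns.

    Returns (alpha, betas, r_squared).
    """
    y = np.asarray(portfolio_returns, dtype=float)
    X = np.asarray(factor_returns, dtype=float)
    # Prepend a column of ones so the intercept (alpha) is estimated too.
    design = np.column_stack([np.ones(len(y)), X])
    coefs, *_ = np.linalg.lstsq(design, y, rcond=None)
    fitted = design @ coefs
    ss_res = np.sum((y - fitted) ** 2)
    ss_tot = np.sum((y - y.mean()) ** 2)
    return coefs[0], coefs[1:], 1.0 - ss_res / ss_tot

# Synthetic data (hypothetical): 240 months, three made-up factors.
rng = np.random.default_rng(0)
factors = rng.normal(0.0, 0.04, size=(240, 3))
true_betas = np.array([1.1, 0.3, -0.2])
returns = 0.001 + factors @ true_betas + rng.normal(0.0, 0.01, size=240)

alpha, betas, r2 = fit_factor_model(returns, factors)
print("alpha:", round(alpha, 4))
print("betas:", np.round(betas, 2))
print("R^2:  ", round(r2, 3))
```

With real data, the factor columns would be observed factor return series rather than random draws, and the estimated betas would be read as the portfolio's sensitivities to each factor.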

Like their traditional statistical counterparts, ML regression models also describe a variable’s sensitivity to one or more explanatory variables. ML models, however, can often better account for non-linear behavior and interaction effects than their non-ML peers, and they generally do not provide direct analogs of OLS regression output, such as beta coefficients.


Why Factors Should Be Economically Meaningful

Although synthetic factors are popular, economically intuitive and empirically validated factors have advantages over such “statistical” factors, high-frequency trading (HFT) and other special cases notwithstanding. Most of us as researchers prefer the simplest possible model. As such, we often begin with OLS regression or something similar, obtain convincing results, and then perhaps move on to a more sophisticated ML model.

But in traditional regressions, the factors must be sufficiently distinct, or not highly correlated, to avoid the problem of multicollinearity, which can make a regression’s coefficient estimates unstable and undermine the model. Multicollinearity means that two or more of a model’s explanatory factors are too similar for their individual effects to be separated, so the results are hard to interpret. So, in a traditional regression, lower factor correlation (that is, avoiding multicollinearity) means the factors are probably economically distinct.
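One common way to check for multicollinearity before running a regression is the variance inflation factor (VIF): regress each factor on all the others and compute 1 / (1 - R²). The sketch below assumes NumPy and uses synthetic data; the factor names, including the deliberately redundant "value_clone", are hypothetical. A VIF above roughly 5 or 10 is often treated as a warning sign, though that cutoff is a rule of thumb rather than a formal test.

```python
import numpy as np

def variance_inflation_factors(X):
    """VIF for each column: 1 / (1 - R^2) from regressing it on the others."""
    X = np.asarray(X, dtype=float)
    n, k = X.shape
    vifs = np.empty(k)
    for j in range(k):
        y = X[:, j]
        others = np.column_stack([np.ones(n), np.delete(X, j, axis=1)])
        coefs, *_ = np.linalg.lstsq(others, y, rcond=None)
        resid = y - others @ coefs
        r2 = 1.0 - resid @ resid / np.sum((y - y.mean()) ** 2)
        vifs[j] = 1.0 / (1.0 - r2)
    return vifs

# Synthetic factors (hypothetical names).
rng = np.random.default_rng(1)
value = rng.normal(size=500)
momentum = rng.normal(size=500)
# A third factor that is nearly a copy of "value".
value_clone = value + rng.normal(scale=0.1, size=500)

names = ["value", "momentum", "value_clone"]
vifs = variance_inflation_factors(np.column_stack([value, momentum, value_clone]))
for name, vif in zip(names, vifs):
    print(f"{name:12s} VIF = {vif:.1f}")
```

Here the two near-identical factors should show very large VIFs while the independent one stays close to 1, which is the pattern that signals redundant inputs.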

But multicollinearity often does not apply in ML model construction the way it does in an OLS regression. This is so because, unlike OLS regression models, many ML model estimations do not require the inversion of a covariance matrix. Also, ML models generally do not impose strict parametric assumptions or rely on homoskedasticity (constant error variance), independence of errors, or other time series assumptions.

Nevertheless, while ML models are relatively rule-free, a considerable amount of pre-model work may be required to ensure that a given model’s inputs have both investment relevance and economic coherence and are unique enough to produce practical results without any explanatory redundancies.

Although factor selection is essential to any factor model, it is especially critical when using ML-based methods. One way to select distinct but economically intuitive factors in the pre-model stage is to employ the least absolute shrinkage and selection operator (LASSO) technique. LASSO adds a penalty on the absolute size of the regression coefficients, which shrinks the coefficients of less useful factors to exactly zero. This gives model builders a way to distill a large set of factors into a smaller set that retains considerable explanatory power while reducing redundancy among the factors.
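The selection effect of LASSO can be illustrated with a short sketch. This one assumes NumPy and implements LASSO by plain coordinate descent for transparency; in practice a library implementation would be the more likely choice. The data are synthetic and the setup is hypothetical: six candidate factors, of which two drive returns, three are pure noise, and one is a noisy copy of the first. The penalty value of 0.1 is an arbitrary choice for the demonstration, not a recommendation.

```python
import numpy as np

def soft_threshold(value, threshold):
    """Shrink value toward zero by threshold, clipping at zero."""
    return np.sign(value) * max(abs(value) - threshold, 0.0)

def lasso_coordinate_descent(X, y, penalty, n_iter=200):
    """Minimize (1/2n)||y - Xb||^2 + penalty * ||b||_1 by coordinate descent.

    Assumes the columns of X are standardized and y is centered.
    """
    n, k = X.shape
    beta = np.zeros(k)
    col_scale = (X ** 2).sum(axis=0) / n
    for _ in range(n_iter):
        for j in range(k):
            # Residual with factor j's current contribution added back in.
            partial_resid = y - X @ beta + X[:, j] * beta[j]
            rho = X[:, j] @ partial_resid / n
            beta[j] = soft_threshold(rho, penalty) / col_scale[j]
    return beta

# Synthetic candidate factors (hypothetical).
rng = np.random.default_rng(2)
n = 400
raw = rng.normal(size=(n, 6))
raw[:, 5] = raw[:, 0] + 0.5 * rng.normal(size=n)  # noisy copy of factor 0
X = (raw - raw.mean(axis=0)) / raw.std(axis=0)    # standardize columns

# Only factors 0 and 1 truly drive the target.
y = 0.8 * X[:, 0] - 0.5 * X[:, 1] + 0.3 * rng.normal(size=n)
y = y - y.mean()

beta = lasso_coordinate_descent(X, y, penalty=0.1)
selected = np.flatnonzero(beta)
print("coefficients:", np.round(beta, 3))
print("selected factor indices:", selected)
```

The pure-noise factors should drop out with coefficients of exactly zero. For a correlated pair such as factors 0 and 5, LASSO will often, though not always, keep just one of them, which is why it is useful for pruning redundant inputs but not a guarantee of independence.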

Another fundamental reason to deploy economically meaningful factors: They have decades of research and empirical validation to back them up. The utility of Fama-French-Carhart factors, for example, is well documented, and researchers have studied them in OLS regressions and other models. Therefore, their application in ML-driven models is intuitive. In fact, in perhaps the first research paper to apply ML to equity factors, Chenwei Wu, Daniel Itano, Vyshaal Narayana, and I demonstrated that Fama-French-Carhart factors, in conjunction with two well-known ML frameworks (random forests and association rule learning), can indeed help explain asset returns and fashion successful investment trading models.

Finally, by deploying economically meaningful factors, we can better understand some types of ML outputs. For example, random forests and other ML models provide so-called relative feature importance values. These scores and ranks describe how much explanatory power each factor provides relative to the other factors in a model. These values are easier to grasp when the economic relationships among the model’s various factors are clearly delineated.
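As a rough illustration of relative feature importance, the sketch below uses permutation importance, a model-agnostic measure: shuffle one factor at a time and record how much the model's error increases. Note the substitutions. This is not the impurity-based importance that random forests report natively, and a simple least-squares fit stands in for the ML model so the example needs only NumPy. The factor names and data are hypothetical.

```python
import numpy as np

def permutation_importance(predict, X, y, rng, n_repeats=20):
    """Relative increase in mean squared error when each column is shuffled."""
    base_mse = np.mean((y - predict(X)) ** 2)
    increases = np.zeros(X.shape[1])
    for j in range(X.shape[1]):
        for _ in range(n_repeats):
            shuffled = X.copy()
            shuffled[:, j] = rng.permutation(shuffled[:, j])
            increases[j] += np.mean((y - predict(shuffled)) ** 2) - base_mse
    increases /= n_repeats
    # Normalize so the scores sum to one and can be read as relative shares.
    return increases / increases.sum()

# Synthetic data (hypothetical): two real drivers and one irrelevant factor.
rng = np.random.default_rng(3)
names = ["market", "size", "noise"]
X = rng.normal(size=(500, 3))
y = 1.0 * X[:, 0] + 0.5 * X[:, 1] + rng.normal(scale=0.5, size=500)

# Stand-in model: a least-squares fit with an intercept.
design = np.column_stack([np.ones(len(y)), X])
coefs, *_ = np.linalg.lstsq(design, y, rcond=None)
predict = lambda data: coefs[0] + data @ coefs[1:]

shares = permutation_importance(predict, X, y, rng)
for name, share in sorted(zip(names, shares), key=lambda pair: -pair[1]):
    print(f"{name:8s} {share:6.1%}")
```

When the factors carry clear economic meaning, a ranking like this can be checked against intuition: a high score on an irrelevant or redundant input is a prompt to revisit the factor set.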


Conclusion

Much of the appeal of ML models rests on their relatively rule-free nature and how well they accommodate different inputs and heuristics. Nevertheless, some rules of the road should guide how we apply these models. By relying on economically meaningful factors, we can make our ML-driven investment frameworks more understandable and ensure that only the most complete and instructive models inform our investment process.

If you liked this post, don’t forget to subscribe to Enterprising Investor.


All posts are the opinion of the author. As such, they should not be construed as investment advice, nor do the opinions expressed necessarily reflect the views of CFA Institute or the author’s employer.


