Microsoft LASERs away LLM inaccuracies

Date:

Share:

[ad_1]

During the January Microsoft Research Forum, Dipendra Misra, a senior researcher at Microsoft Research Lab NYC and AI Frontiers, explained how Layer-Selective Rank Reduction (or LASER) can make large language models more accurate. 

With LASER, researchers can “intervene” and replace one weight matrix with an approximate smaller one. Weights are the contextual connections models make. The heavier the weight, the more the model relies on it. So, does replacing something with more correlations and contexts make the model less accurate? Based on their test results, the answer, surprisingly, is no. 

“We are doing intervention using LASER on the LLM, so one would expect that the model loss should go up as we are doing more approximation, meaning that the model is going to perform bad, right, because we are throwing out information from an LLM, which is trained on large amounts of data,” Misra said. “But to our surprise, we find that if the right type of LASER intervention is performed, the model loss doesn’t go up but actually goes down.”

Misra said his team successfully used LASER on three different open-source models: RoBERTa, Llama 2, and Eleuther’s GPT-J. He said, at times, model improvement increased by 20 to 30 percentage points. For example, the performance of GPT-J for gender prediction based on biographies went from 70.9 percent accuracy to 97.5 percent after a LASER intervention.

[ad_2]

Source link

Subscribe to our magazine

━ more like this

Crypto Crime Investigation (C.C.I) Enhances Singapore’s Safety with Innovative Pig Butchering Fraud Recovery Technology

Crypto Crime Investigation (C.C.I) is proud to announce the launch of its groundbreaking Pig Butchering fraud recovery technology, a vital initiative aimed at protecting...

U.S. Treasury removes Francisco Javier D’Agostino from sanctions list after independent review

The United States Treasury Department has removed Francisco Javier D'Agostino from its sanctions list following an independent review that confirmed his business activities were...

Expert Forensic Analysis in Investigating Crypto Investment Scams and Recovering Lost Funds

The allure of cryptocurrency investment, with its potential for high returns, has unfortunately attracted a darker side: sophisticated and deceptive scams. Victims of these...

Asia’s Certified Cryptocurrency Investigator Launches in Singapore: Pioneering Crypto Crime Investigation (C.C.I)

Singapore, – In a groundbreaking move to enhance digital asset security and bolster consumer confidence in the cryptocurrency market, the Crypto Crime  Investigation...

C.C.I Launches as the Ultimate Recovery Platform for Crypto Investors Targeted by Scams

Nevada, Florida – In response to the growing concern over cryptocurrency investment scams, C.C.I (Crypto Crime Investigation) proudly announces its official launch as the...