Grade Caps are Not a Good Solution to Grade Inflation

It’s well known that grade inflation has “degraded” the informational content of grades at many colleges. At Harvard, two-thirds of all undergraduate grades are now A’s—up from about a quarter two decades ago. In response, a Harvard faculty committee has proposed capping A grades at 20 percent of each class (plus a cushion for small courses). That may give professors some cover to resist further inflation, but it doesn’t solve the real problem.

The real problem is not inflation per se. It’s that students are penalized for taking harder courses with stronger peers. A grade cap leaves that distortion intact—and can even amplify it. As Harvard economist Scott Kominers argues:

A grade cap systematically penalizes ambitious students for surrounding themselves with strong classmates. Perverse course-shopping incentives ensue as a result. A student who is prepared for an advanced course but concerned about landing in the bottom 80 percent may choose to drop down preemptively—seeking out a pond where they are a relatively bigger fish. As strong students move into lower-level courses, competition for A grades increases there while harder courses continue to shrink—reducing their A allocation further and driving more students away.

The underlying issue is informational. A grade tries to capture two things—student ability and course difficulty—with a single number. Gans and Kominers show that in general this is impossible: if some students take math and earn B’s while others take political science and earn A’s, there is no way, from grades alone, to tell whether the difference reflects ability or course difficulty.

There is, however, a solution in some cases. Clearly, if every student takes some math and political science courses, informative patterns can emerge. If math students tend to get B’s in math but A’s in political science, while political science students get A’s in their own field but C’s in math, you can begin to separate course difficulty from student ability.

Students don’t all overlap the same classes. But full overlap isn’t necessary—you just need a connected network. If Alice just takes math courses, Joe takes math and political science courses, and Bob just takes political science courses, then Alice and Bob can be compared through Joe. With enough of these links, the entire system can be stitched together. The more overlap, the more precise the estimates.

Valen Johnson proposed a practical method along these lines in 1997. Gans and Kominers embed the same intuition in a much more general framework, showing exactly what can and cannot be inferred, and under what conditions.

The great thing about achievement indexes based on relative comparisons is that they are robust to grade inflation and do not penalize students for taking hard classes or subjects. A political science student who chooses to take a tough math class instead of an easy-A intro to sociology course won’t be penalized because their low math grade will, in effect, by boosted by the difficulty of the course/quality of the students. That’s good for the student and also good for disciplines that have lost students over the years because they held the line on grade inflation.

One final point. Harvard’s cap proposal appears to have been developed with little engagement with researchers who have studied problems like these for decades in the mechanism and market design literature—people like Kominers, Gans, Budish, Roth, Maskin, and Sönmez, some of them at Harvard! Moreover, this isn’t a case of ignoring high-theory for practice. The high-theory of mechanism design has produced real-world systems including kidney exchanges, school choice mechanisms, physician-resident matching, even the assignment of students to courses at Harvard, as well as many other mechanisms. Mechanism design is practical.

Grade inflation is a mechanism design problem—and we know a lot about how to solve it, if we want to solve it.

Comments

Yes. When I was in law school, there was this class called "Federal Courts" that all of the most ambitious students take, as it's basically a pre-req for some of the most prestigious things you can do after law school. I was not an ambitious student but I took it because it sounded interesting.

BIG mistake. All law schools are required to grade on a curve and I was brutally shoved into the bottom 20% by all of the future federal appellate clerks and DOJ honors attorneys.

I quickly learned to avoid the striver classes.

Respond

Plenty of people can "get" Federal Courts and arrive at the logical/legal i.e. right conclusion on an exam, so the professor really is just grading on a rather subjective curve. The retort would be that a lawyer's job is to write forcefully, clearly, succinctly and integrally so this curve is appropriate for standards set by someone with demonstrated intellectual and writing chops.

Actual law practice is of course the great equalizer.

I had a very wise (and eccentric) law professor who said he'd just as soon have pass/fail on the assumption that if you made it into his school and got the right answer you could be safely certified as Bar-eligible and ready for the job market. He used to say it would take you 5 years to know what you don't know and 10 years to fix it. I asked him what he thought was the most valuable and marketable trait I needed and he said "efficient litigation skills." I'd add that it's really like being a doctor, accurately diagnosing the client's problem and using your toolbox to make it go away.

The nice thing about a brutal profession is if you can survive it you don't really hit your stride until around age 50 with accrued knowledge base, personal gravitas, sociability, and strategic decisionmaking. Then you can coast at your particular level not uncommonly into your 70s though most firms require you to surrender your equity at age 65.

I've seen several examples of superb BigLaw and MidLaw lawyers hanging on too long and being eased out the door after some embarrassing fiasco. And a number of lawyers who just repeat their first few years of practice, over and over.

Respond

"Plenty of people can 'get' Federal Courts and arrive at the logical/legal i.e. right conclusion on an exam, so the professor really is just grading on a rather subjective curve."

I'm not sure what you mean here, but as far as I understand it, it's nonsense. Nobody just "gets" Federal Courts. Of course, one may study it, do the work, and learn the material, but that isn't what you mean, is it? Because if you do that, then grading isn't subjective at all.

Respond

Back in the pterodactyl days, we wrote answers to law school exam questions in longhand in blue booklets. In answer to a particular question, the majority of the class would get a federal courts exam question correct e.g., yes the district court had jurisdiction or no it did not. At that point, the difference between a high score and a low score depended on the articulateness, succinctness, clarity, even the "elegance," as I heard one professor describe it, of the answer. Kind of squishy stuff, and not strictly objective.

Respond

I went to a law school that had forced grading. For the first year, it was top 10%, next 30%, then everyone else, except failing. It was slightly looser than that for years 2-3. In my experience, I actually saw sort of the opposite effect. There aren't too many choices for classes, especially when you have an idea of what you want to do. If you want to work in BigLaw, you need to take hard classes AND get good grades (especially if you want a Federal Clerkship). But for most people, they got a few good grades, but mostly they weren't under a lot of grade pressure, which was the point. I didn't see a lot of people who had great grades but avoided hard classes.

Respond

I’m m­a­k­i­n­g o­v­e­r $­­7­k a m­o­n­t­h w­o­r­k­i­n­g p­a­r­t t­i­m­e. I­ k­e­p­t h­e­a­r­i­n­g o­t­h­e­r p­e­o­p­l­e t­e­l­l m­e h­o­w m­u­c­h m­o­n­e­y t­h­e­y c­a­n m­a­k­e o­n­l­i­n­e s­o I d­e­c­i­d­e­d t­o l­o­o­k i­n­t­o i­t. W­e­l­l, i­t w­a­s a­l­l t­r­u­e ­a­n­d h­a­s­ t­o­t­a­l­l­y c­h­a­n­g­e­d­ m­y l­i­f­e. T­h­i­s i­s­ ­w­h­a­t­ I d­o............................ https://Www.Cash54.Com




Respond

An alternative would be for employers and others to stop making a fetish of grades independent of the courses taken. Put bluntly, they could grow up a bit.

Respond

What employers do is outside of Harvard's control. Harvard can control the merits of its own grading system.

Respond

You've got it backwards. Saying employers should ignore grades is the precisely the wrong conclusion and is actually the problem with grade inflation. A large part of the negative effects, I am guessing, is precisely because employers are already doing what you say employers must do and students are paying the price.

(Tyler has linked to many studies showing the negative effects of grade inflation recently (here, here and here for instance)

Since employers cannot discern between the candidates because grade inflation has made grades a useless signal, returns to lying, charisma, networking and interviewing-as-a-skill get higher as opposed to actual academic rigor and technical know-how at universities. Plus students are punished for taking tougher courses. I did this as well. Looking back, it feels like I wasted my graduate studies by taking easy courses instead of taking the most interesting courses.

Respond

Your first two links point to k12 grade inflation and passing inflation. That is very different from what is happening in higher education. Universities have highly selected populations. Insofar as we should expect grades to correlate with SAT scores, it is silly to expect the grade distribution at Harvard or Stanford where the median is ~1550 and UMass which is ~1390 or Grand Valley State which is ~1100 to be the same.

I find it curious that you think the ability to do well on exams is more valuable than charisma, networking ability and interpersonal skills to employers.

Respond

The ability to do well on exams is more valuable than charisma, networking ability and interpersonal skills to employers in knowledge work and grade inflation significantly dulls the signal.

If employers value charisma and interpersonal skills, then they can choose to evaluate candidates accordingly in interviews as it’s easy to display them in interviews.

But the college degree is supposed to capture technical ability and right now, grade inflation isn’t allowing students to reflect them by granularity. That’s what we’re discussing here.

Respond

"I find it curious that you think the ability to do well on exams is more valuable than charisma, networking ability and interpersonal skills to employers."

The idea of exams is to generate a set of people with deep knowledge and excellent intellectual ability. From there, we additionally select on other beneficial personal characteristics.

The alternative proposed by the left is to ditch the knowlege / intellectual ability selection and substitute with ideology; then select on charisma and interpersonal skills.

Respond

Looking back, I feel I wasted my graduate studies by ignoring subjects like "how to run a daycare in Minnesota," "how to run a school lunch program," or "how to claim disability benefits."

Respond

Not just an alternative but I would say that if things continue the way they have been going, an inevitability

Respond

Fair enough, though "we" have both: outlawed consideration of other measures of / proxies for the same information; and added a ton of noise into this signal via 'everyone goes to college' policies.

Respond

Employers need credible signals of applicant strength, especially in an era when its easy to apply to dozens/hundreds of companies, and thus companies are flooded with resumes.

Respond

This is not what grades, university admissions, and the rest get you. They get you signals of parental investment, people pleasing, looks (i.e. we literally had DiD showing us that going remote lowered the scores for beautiful women during Covid), and the rest.

And often, when you have far more credible signals, employers don't want them. For instance, I hire medical staff to work with incarcerated individuals. It is highly credible signal to have staff who have experience in prisons. Maybe they had a close family member incarcerated. Maybe they volunteered with a church ministry. But legal doesn't want us priviliging such applications. Even when they, statistically, are far more likely to stay with us and have us incur fewer turnover costs.

And on it goes. A credible signal for a lot of medical stuff is prior experience in the medical field. I like corpsmen, but if the institution making the selection doesn't have that as a box check they often get dinged because they don't fit the mold.

And it becomes a very difficult principle-agent problem. HR has some incentives to find qualified talent. They have a lot more incentives to demonstrate legal compliance and I would argue that qualified talent is likely not even second or third on their incentive list - they want to maintain a culture where the talent doesn't create waves and often have stronger commitments to their political passions than to organizational thriving.

And that is the real problem. Harvard and the rest of Academia are trying to create uniform ranking lists - so much so that they vet the examination questions by highly well they discriminate along the same lines as other ranking mechanisms.

Because Harvard ain't a search for talent. It is a search for prestige and you get that not by demonstrating students have talent (and at Harvard's level they pretty much all have that), but by playing to whims of the donors and the rankers.

We keep trying to have a single algorithm to rank talent, ability, potential, and the rest. And shockingly, we find endless incentive to subvert, distort, and lie at the margin with correlated failure mechaisms.

Respond

"They get you signals of parental investment...."

How is that bad? The point of "parental investment" is to generate exactly what we want: hard work and lots of hard-earned knowledge. I'm confused on why that's a bad thing to reward.

"people pleasing, looks..."

Yes, in "arts" classes these things can be effective. That's why the value of Arts degrees has fallen through the floor.

These are much less effective in science. Just the same, there has been ample effort on the part ot he left to undermine the sciences and introduce ideological quackism as a litmus test - from climate disaster to "structural" racism.

Respond

You'll likely get your wish if we continue making grades meaningless

Respond

"An alternative would be for employers" to request transcripts with grades and multi-year averages per course. The same for the GPA.

Your proposal is to abolish grades because they are not informative, so in a sense, you would like to shut the market altogether.

In contrast, my proposal is to keep grades - and thus the market open - but make them informative and incentive-compatible for students, faculty, schools, recruiters, grant-giving institutions, and donors.

I believe that the incentives, if changed in the proposed way, would trigger a rapid and large grade disinflation. Very importantly, the implementation should be completely voluntary. Some low-inflation-type schools will pre-empt the high-inflation ones, because it would be in their best interest. This pre-emption will start a cascade: under common knowledge, by delaying the adoption of something so trivial to do, an institution signals that it has valid reasons (such as being super-inflationary?). The market notices, performs Bayesian updating on these reasons, and "reprices." It's precisely this threat that will force schools to move fast rather than wait and see.

Respond

"An alternative would be for employers and others to stop making a fetish of grades independent of the courses taken."

And how is that going to happen? Much easier for Harvard to change the way it grades.

Respond

The real alternative would be for companies to hire and use signals that more directly correlated with job performance rather than outsourcing their vetting to a bunch of institutions not aligned with their incentives.

But for that to work, you'd need to allow companies to actually be able to develop weights and measures without first running everything past the legal/HR gauntlet.

We keep trying to cram a planned talent economy into a distributed talent market and are surprised when it devolves under pressure.

Like I don't get it. People think pencils are too mysterious for some evaluation function to adequately forecast supply and demand a few months out ... but talent, skills, and all the other variables of human production are not only immune to the computation problem, but that every university can basically run the same computation on everyone.

Meritocracy is just another form of central planning and like all central planning it will eventually be consumed by spurious feedback in the gears.

Respond

Or leaders could realize the futility of wars and undertake world peace.

Respond

python training 400!

Arduino IDE
An Integrated Development Environment (IDE) is a comprehensive software application
That's what programing, computer language, and also applications have in common. There is interplay, you know. There is no there there, it may seem. But then you copy and paste. You are also kind of your own boss re: p/f

The a element

python training 400!

Moreover, in abstract links, and CGI, the algebra rhythm determines access. Based on what? Well you know, that's a good example of someone who wouldn't get access at all.

Then yeah, Visual Code, Python, and Arduino (golf).

"0"=="connie" 1 "2"=="degas" 2 "3"=="Haste" 3 "4"== "clockwork" "5" analog Write(this Pin, brightness); for\ int (brightness = 64); brightness =>32 0 loop 1 4 "golf" 5 "RGB" 6 "Console" 7 "Network" 8 "0"=="connie" 1 "2"=="degas" 2 "3"=="Haste" 3 "4"== "clockwork" "5" analog Write(this Pin, brightness); for\ int (brightness = 64); brightness => loop "golf" "RGB" "Console" "Network"
Indentation Error: unexpected

indent variable = 10
variable = 20 print(variable)

Respond

Betty. Yes, Mrs. January. Haha. The Irony is that H-Hawke, is written by Februar. That's kind of my stinks. To be honest. She tried to implement a "bypass" like at4tribute. Maybe there's be more headlines about heart health, or whatever. I squashed it and yelled at her.

Today, I know why. Print is one, that is interesting. Even EP and I ran exercises. I figured out that "print" is an evil delusion. They leave it up, when they are delusional, thinking it is "they". Perhaps Steve, perhaps. If FDR, Anne Frank and Dean Acheson is them. Then would it not follow that too that Hitler is a tough pill swallow; that he lives on in memory as "beautiful." In that sense, alive forever. Good, then. Because he is.

Adolph Hitler is absolutely beautiful. So yeah, Betty is a Green Beret. They agreed, she should be. They love it, actually. It's really the symbolism of resistance. Not so much liberation. Liberated from progress I guess? Libations?? Liberation from republicanism? Conservation? Eva Brauns, they "brauchen" they need to be on the bat4tle field. Our Bettina's not so much. Reciprocal altruism is real, and also follows, flows, and freedonias.

We had two weeks of kind of grief-stricken productivity gains at Dentsu; I imagine that's beginning anew.

As Michael explained to LBJ, "A good offense is a strong defense." Right, if you just steal the ball, then your defense is your offense.

"A strong offense, comes from defensive driving." Something?

Workflows included in a solution where Microsoft is the publisher.
nstead of this manual process, as a developer of a client application or plug-in, you can pass special optional parameters with your requests to control two types of custom business logic as described in the following table:

Synchronous Logic: Let's say you have a huge ad campaign, all these different parameters, countries, geos, different ad servers, different audience targeting,
...But the ad is the same. that's a lot of manual labor, I'd like to bypass.

Asynchronous Logic
When large numbers of system jobs created to process asynchronous logic cause a backup within Dataverse that can affect performance. You can mitigate this performance issue by not triggering the asynchronous logic while performing bulk operations

On the other hand, where there is a long line, like at an airport, you gather your dialectic materialism. You know, like the idea, that music, books, and collections, shoes, and hats, are related, they are "altars."

Respond

POST [Organization URI]/api/data/v9.2/webresourceset HTTP/1.1
If-None-Match: null
OData-Version: 4.0
OData-MaxVersion: 4.0
Content-Type: application/json
Accept: application/json
MSCRM.SolutionUniqueName: ExampleSolution

{
"displayname": "Simple HTML web resource",
"content": "PCFET0NUWVBFIGh0bWw+CjxodG1sPgogIDxib2R5PgogICAgPGgxPkhlbGxvIFdvcmxkPC9oMT4KICA8L2JvZHk+CjwvaHRtbD4=",
"webresourcetype": 1,
"name": "sample_SimpleHTMLWebResource.htm",
"description": "An example HTML web resource"
POST [Organization URI]/api/data/v9.2/webresourceset HTTP/1.1
If-None-Match: null
OData-Version: 4.0
OData-MaxVersion: 4.0
Content-Type: application/json
Accept: application/json
MSCRM.SolutionUniqueName: ExampleSolution

{
"displayname": "Simple HTML web resource",
"content": "PCFET0NUWVBFIGh0bWw+CjxodG1sPgogIDxib2R5PgogICAgPGgxPkhlbGxvIFdvcmxkPC9oMT4KICA8L2JvZHk+CjwvaHRtbD4=",
"webresourcetype": 1,
"name": "sample_SimpleHTMLWebResource.htm",
"description": "An example HTML web resource"
}

SDK Add-On

static void DemonstrateSuppressDuplicateDetection(IOrganizationService service)
{
Entity account = new("account");
account["name"] = "Sample Account";

CreateRequest request = new()
{
Target = account
};
request.Parameters.Add("SuppressDuplicateDetection", false);

try
{
service.Execute(request);
}
catch (FaultException ex)
{
throw ex.Detail.ErrorCode switch
{
-2147220685 => new InvalidOperationException(ex.Detail.Message),
_ => ex,
};
G4
C++
The ID of the msdyn_AIModel record. How you set this value depends on whether you use the SDK for .NET or Web API.
The data that contains the parameters that the prompt is configured to accept. This is passed as a parameter named requestv2. Learn more about the requestv2 parameter
The version parameter. The value is always "2.0".
static void DemonstrateBypassBusinessLogicExecution(IOrganizationService service)
{
Entity account = new("account");
account["name"] = "Sample Account";

CreateRequest request = new()
{
Target = account
};
request.Parameters.Add("BypassBusinessLogicExecution", "CustomSync,CustomAsync");
service.Execute(request);
}
MSCRM.BypassBusinessLogicExecution: CustomSync,CustomAsync

{
"name":"Sample Account"
}

Respond

"One final point. Harvard’s cap proposal appears to have been developed with little engagement with researchers who have studied problems like these for decades in the mechanism and market design literature—people like Kominers, Gans, Budish, Roth, Maskin, and Sönmez, some of them at Harvard!"

I've only been at a prestigious university for about a year, but this appears to be the norm. We're currently hiring for what should be a relatively senior staff position. The administration has taken great pains to design a system that doesn't allow any input into the process from the people who have expertise in the subject matter...and indeed have not advanced any candidates that have experience in the discipline. I don't get it, but it's hard for me to see this as anything but a deliberate choice.

Respond

Pournelle's Law.

Pournelle's Iron Law of Bureaucracy states that in any bureaucratic organization there will be two kinds of people:

First, there will be those who are devoted to the goals of the organization. Examples are dedicated classroom teachers in an educational bureaucracy, many of the engineers and launch technicians and scientists at NASA, even some agricultural scientists and advisors in the former Soviet Union collective farming administration.

Secondly, there will be those dedicated to the organization itself. Examples are many of the administrators in the education system, many professors of education, many teachers union officials, much of the NASA headquarters staff, etc.

The Iron Law states that in every case the second group will gain and keep control of the organization. It will write the rules, and control promotions within the organization.

Respond

"little engagement with researchers who have studied problems like these for decades"

Yup. That is because researchers tend to make fancy things that (at best) need to be considerably modified to work in the real world.

In the top of my head, the system potentially falls down in practice in a bunch of places.

One is that it is hugely dependent on a few students to "connect" the islands and especially on the quality of those few students, which may not be random (i.e. if s/he is particularly brilliant or the opposite). How do you detect and handle this.

Two is that a grading system almost certainly needs to be legible; students, parents (who are paying), employers, graduate schools all need to be able to understand it.

Three, it doesn't work well for smaller classes,and I'm not sure, but it seems like it may be vulnerable to collusion...

Four, how do you compute grades in a sane amount of time? Assigning the final course grade requires that all of the grades for all of the relevant courses are done and then everything gets rescaled? Any poor fool administrator who has sent the N-th reminder to get final grades into the system knows that this is asking for trouble.

Respond

Python and continued playwriting with SVC School of Visual Code :/

"0"=="connie" "1"=="degas"
False

Work shown:

{indention} constructor(set Interval(() => { }, organic chr);) {

{indention! finally, look at my face, took forever}constructor(set Interval(() => {chr}, organic chr);) {*hastenation (real value in the pull down menu built by... seconds, Segundo, MS
master journalism Clockwork Prior*
execute the function array for Each(element =>mileau (spelling mistake) !!

= environment or surroundings (Connie + Degas) (capital not necessary)
...class and class {chr} chr or {constructor} construct drug chr is is for is

one way to think about CSS is "cost savings". I prefer that, there is hard work, then there is efficiency. Coding is both.

Respond

There are 3 so far, basically, there are more though.
1) Hyperlinks -- Murakami (it's good for literalness, clicks, etc)
2) the SDK -- keyword is Kit, part of the Andromeda Toolkit, really so far for efficiency, but as we saw, also for "measurement" or "reporting."
3) Indention -- yeah mine. It means you worked really hard, and it worked! Something worked.

Respond

This is just not true in practice at scale. Enroll in a mechanical engineering program, and perhaps 75% of your courses are already locked in, you get a few electives that are all within your field and are of equal difficulty, and you take some general education courses that you probably did as AP and 100% of them are fluff anyway. I suppose in humanities you can pick and choose the harder courses, but we should know to discount that degree entirely. In the fields where we may actually care to separate performance, grade inflation is the bigger issue versus course shopping. Decent state schools also have in-major GPA requirements and disclosures, so seeking out fluff outside of your field may not even be enough to graduate if you struggle in major.

Respond

For the non-lawyers we hire at my law firm (paralegals and administrators), being able to read and write coherent sentences is a plus, while being unable to talk to a woman would be a minus. So humanities grads would generally be more useful.

Respond

Writing coherent sentences seems to be the biggest shortfall in higher ed across the board right now, but you probably wouldn't want an English grad because the way they are taught to write is exactly the opposite of what you want in a professional business setting.

Respond

"being unable to talk to a woman would be a minus. "

The last company I worked for hired the smartest engineers possible. If they couldn't speak to women that was OK, the supervisor of the production line would hook them up with a nice Vietnamese or Fillipino lady production line worker who could manage the situation, voila, happy couple employees for life. And once hooked up to an engineer promotional opportunties were better for the ladies.

Respond

This is why (limited course choice) my kids dropped out of 'Software Engineering' and did Computer Science instead. They've gone on to be employed as software 'engineers' not 'developers.

Respond

Here is another solution for grade inflation. Every transcript should not only include the student's grade but also the average grade of the class. This would give employers and admissions officers a better understanding of the meaning of the grade. As an additional benefit it would publicize the grading standards of professors.

Respond

One problem with any such approach is that there are two different sorts of all-A's courses. There're the easy-A classes, where every student gets a good grade just by reason of having signed up (think Philosophy of Taylor Swift); and there're courses requiring high intelligence or talent, in which the dimwits don't enroll at all (think Differential Geometry).

A real-world illustration emerged in a study done many years ago by the Arizona Daily Star, which got access to anonymized grades from every course offered by the University of Arizona. The classes with the highest fraction of A's were music classes that required an audition to enroll (Advanced Concert Oboe, perhaps). The next-highest were classes offered by the College of Education, and we can reasonably speculate as to how difficult those were.

A figure for average grade in the class wouldn't distinguish between these two types of course. We'd need some means of rating the difficulty as well.

Respond

Yeah, it's a pretty difficult problem. It's theoretically addressable with some variant of an Elo system/rating your classmate's abilities (by, in turn rating THEIR classmate's abilities and so on). If you're interested in truly determining who the strongest performers at such an institution are, you could implement such a system.

But it would be relatively complex and non-transparent, and it would, I suspect, tend to reveal performance differences among certain identity groups that universities would prefer not to spotlight.

Also, you get issues of ability differences across disciplines. Aaron is a humanities star, bound for a top law school, and aces his humanities classes but struggles in his required math and science classes. Bob is destined to be an astro-physicist, with the abilities to match, but unfortunately stuggles to write a strong essay and underperforms in his humanities classes. (Especially if Bob is, in fact, not a native English speaker.)

Respond

There is also a testing style issue. My GPA on muliple-choice loaded classes was probably 0.5 points better than in long essay exams classes.

Respond

At my undergrad's honors convocation, an entire half of the auditorium was Ed majors, and the other half was all the other colleges.

I think it was Thomas Sowell who pointed out Ed majors were from the bottom quarter or third of entering freshmen.

Education courses should be elective, or a Minor. But now you've got people's career prestige and paychecks at issue, so it will never happen.

Respond

With cheating rampant in high school and college, and teachers/professors seemingly not caring, we might just end up revealing who the best/most shameless cheaters are.

Respond

My observation, as someone who attended an Ivy League school in the distant past and now interviews prospective candidates as an alumnus, is that current students are on average far better prepared academically, have higher IQs and more motivated to achieve in terms of grades (not always a good thing). All things being equal you would expect the current crop of kids to get better grades than the Harvard student body of 1991, which was selected from a far smaller pool of applicants, had a lot more legacy kids and where a lot of the truly high IQ kids were more concerned about doing cool creative things for themselves rather than caring what the Prof thought. If you admit a lot of grinds, those grinds will expect good grades when they do the work they are expected to do.

Respond

"things being equal you would expect the current crop of kids to get better grades than the Harvard student body of 1991"

So are you suggesting that recruiters - who themselves should have a higher IQ now in your logic - are, just like Harvard professors (again, with a higher average IQ than in 1991), are (cognitively) unable to discriminate?

Think sports: NBA players today are better than in 1991 — yet not all of them make it. Why? Let's just allow teams of 10, thus adjusting to the much greater incidence of greatness than in 1991.

Respond

„ who themselves should have a higher IQ now in your logic - are, just like Harvard professors (again, with a higher average IQ than in 1991)“

No, that’s the converse of my logic. Harvard has become more selective in a far larger more global pool of applicants, allowing them to cream off even better students. Meanwhile being a Harvard professor is not the job it once was, meaning the talent pool of available professors is arguably worse.

You also seem to be assuming Harvard needs to grade on a curve, the way sports teams do. Why? Shouldn’t a university simply be testing whether you‘ve mastered the material? Why is it the university‘s job to sort the students?

Respond

Thank you.
My quick rebuke: the NBA “has become more selective in a far larger more global pool of applicants, allowing them to cream off even better” players. Still, the NBA be able to do a merit-based selection, while Harvard is not?

I see a contradiction in your reasoning about the very bright students: are they exceptionally bright yet aware that grade inflation hurts them the most?

My last point refers to your “being a Harvard professor is not…”. I am not able to prove the existence of an equilibrium in which students are persistently brighter than the professors, since it would not be JM Smith x R Selten evolutionarily stable strategy (in which mutants either adapt or die out).

Respond

The problem is that meritocracy is always in an unstable tension between cost, correctness, and consequentianess.

Suppose we build a giant database of student comparisons (ignoring and challenges with FERPA). Will this make grades reliable? Of course not. If your database shows Math 402 as being particularly difficult, then students will flock to whichever professor or offering (e.g. summer term) is the least competitive for the grades on hand.

Okay, so you build your network to keep track of individual professors to avoid folks gaming for lecherous instructors or classes filled disproportionately with below average achieving for a given course (e.g. sign up for early morning courses with the athletic admits). Fine, now you either start reaching the limits of the data or you are spending ever more on a more complicated computation.

But if getting straight As at Harvard really does unlock wealth untold for being the guy who gets the good gig with McKinsey ... then expect one or more the gatekeepers to capture that. After all, Varisty Blues saw large numbers of gatekeepers selling their imprimatur to falsify access to highly ranked colleges.

Maybe it is the instructors selling grades. Maybe it is the coders administering the database. Maybe it is the guy who ports the database results into the academic record. Maybe it is just classmates making ad hoc arrangements to tank their own scores (or engineer a compact so they all hit exactly the same percent and all get As or something). I don't know where the weak point will be.

But I do know that someone will exploit it, whatever it is.

And take one of the areas that I know well, the Match has an algorithm that mathematically rewards applicants for giving a true list of preferences that matches them up to hospitals in the most efficient manner possible. What happened?

Well, the competition just moved to the weakest point in the system - the interview invites. Most programs will not rank folks they don't interview (preferring to go into the SOAP and Scramble generally). So securing the interview becomes the big ticket. So medical students try to subvert the rankings with both valuable signals (e.g. doing research with someone in the coveted department) and confounding ones (e.g. leaning on social ties). This, in turn, spawns secondary competitions - getting the desired away rotation, having facetime with residency and department heads. And then, the lost cost options - lying about your geographic ties and preferences. Developing sports loyalties to a bunch of local teams.

And the cost just keeps mounting. Because of course there is no stable Nash equilibrium for matching physicians. Applicants suffer massive discontinuity penalties if they fail to match (i.e. the guy who just barely matches before SOAP has wildly better career outcomes than a similar guy who matches in SOAP). So the safe bet is to secure 10-20% more interviews than average. And whatever your risk tolerance, there is always an increasing push for more interviews. Similarly, Step scores took over medicine - learning useful skills in medical school went completely out of vogue to learning how to answer multiple choice questions efficiently under time constraints. Entire medical school curricula were reorganized away from traditional sequences to promote understanding of technique and practice and towards weighting things like recency of examination materials.

The NRMP ends up consuming billions of dollars annually in time and investment. And we are seeing no major changes in physician outcomes when major hiccups interrupt its function (e.g. the Covid years largely eliminated the hundreds of millions of dollars applicants spent on physical travel to interviews and programs aren't reporting wild swings in resident quality).

And this is the problem of meritocracy. It is an adversarial system. If we want it to be accurate that means we need to deal with folks who cheat, distort, and subvert. And that takes effort, which is rarely cheap.

The limits to meritocracy are always when the system costs too much - China routinely was spending nearly as much on meritocracy as European states were spending on armies - and still having far more corruption and graft. Meritocracy will never be cheap, consequential, and correct at the same time.

Respond

Still, there are ways to improve. We coordinate courses with many sections so that students take the same exams and the entire course is curved versus individual sections. That just leads to students trying to enroll at the most convenient times, but their grade is less dependent on who they take the course with or when they take the course. It isn't perfect, but it is a fairly inexpensive way of enforcing some standard and reducing poor incentives. At the university level, you can simply have expected GPAs for 2000-level, 3000-level, etc. courses. Yes, you still run into the problem that the median engineering student is better than the median art history student, but engineering is harder and you want higher standards there. You don't necessarily want to hire the worst engineering graduate even if they would have been the best art history graduate.

Respond

I'm bumping my idea on the disinflation of grades, which I've recently proposed (also on MR :)).
(1) Report two numbers on transcripts: a grade in a given course (say 4.00) and GPA (say 3.75), plus the multi-year average grade in this course (say 3.75) and GPA (say 3.40), e.g., for 2020–2025.
(2) This proposal appears to be incentive-compatible. Employers, HS seniors (and their parents), college seniors choosing grad school, and students themselves would discount "good" grades and the professors/courses/schools that give them. So would donors (unwilling to support the illusion of good education). Revealing the second number—a multi-year average that is relatively biased toward the past and thus slower to adjust—would immediately change the optics and actions of all the main actors, prompting adjustments by recruiters, students, donors, professors, and schools.
If implemented, this mechanism is likely to put downward pressure on tuition at "generous" schools, while tuition may go up at the more competitive ones.
More generally, the power of market competition and incentive compatibility — also in markets with asymmetric information like schools — should be unleashed rather than constrained by grade quotas and the like, which fix one distortion by introducing another without any guarantee that Oskar Lange's Second-best Theorem holds.

Respond

"And this is the problem of meritocracy. It is an adversarial system. If we want it to be accurate that means we need to deal with folks who cheat, distort, and subvert. And that takes effort, which is rarely cheap."

Of course, there is cheating and distortion. But even the honest players modify their behavior to optimize for whatever the required metric is....

I more and more think that the "old boys network", had very real advantages. It could rely more on subjective judgement, which it both more expensive and cheaper than more and more pointless meritocracy games.

And the prestige and success rewards of making *good* judgements was real -- having a nose for good talent, being a recommendation that means something, being owed a favor for recommending someone really good, knowing whose recommendations to trust -- and their opposites -- could have real impact on both reputation and success...

Respond

Not sure what Kaminer's solution is in real-world terms.

Vary the number of As based on difficulty of the course? You'd presumably have to assign the A quota across courses ex ante otherwise you're waiting potentially several semesters to find out what your grade is.

I guess you could institute a weighted GPA where an A is a 4.5, 5, whatever in harder courses. You'd still have to wait to find out what a class is worth to your GPA but that might be acceptable, it just still allows students to be penalized unexpectedly by class composition if a surprisingly large number of low talent students takes a class you're in.

I think he suggests the best system would be to attach a degree of difficulty context to a GPA by rating its component classes. You could get an overall difficulty rating this way. In practice we kind of already have that don't we? A 4.0 GPA in humanities is worthless and a 3.7 or whatever in a technical major is at least plausibly a signal of competence.

Respond

I'll have to read the paper by Gans and Kominers, but they seem to be building on work that Valen Johnsons did 25 years ago, which used the same underlying principle: If students get a 3.5 GPA in a course in Norwegian literature, and those same students get a 3.25 GPA in a course in Swedish literature, then with the Norwegian course has higher GPAs -- even after correcting for student quality.

Equally important is when the students are different in the two classes; maybe the students in that Norwegian class have a 3.5 GPA in that class, but in all their other classes average 3.0. And maybe the students in the Swedish class are 3.75 GPA students, who can only manage a 3.25 in Swedish. That is evidence that the Norwegian class either has easier grading, or has easier material that's easier to get good grades at (for this sort of research, we can't identify what the explanation is but our main concern here is the higher grades in Norwegian). Whereas Swedish is a tough class with strong students enrolled -- and tough grading and lower GPAs.

Valen Johnson used a multinomial probit model (IIRC with cutpoints that would vary for each professor and class) to put all classes and student GPAs in a grand campus-wide grading curve. He was at Duke University at the time and suggested that Duke transcripts should report each student's "Academic Index", i.e. a GPA that accounted for the difficulty of the classes that the student took. Duke declined to start using and reporting his Academic Index, but somewhat ironically state rival UNC was interested and partially adopted his recommendations.

That was some time ago, I don't know if UNC or Duke have used his system recently

Respond

Hey now, let’s not compare Norwegian literature with Swedish literature so cavalierly … night and day, my good man, night and day.

Respond

I remain unconvinced of an actual problem here beyond the midwit class being upset that the proles are insufficiently separated. Are the vaunted high IQ candidates really deluged in a sea of 4.0 gym teachers? Is “grade inflation” (and what does that even mean across courses and institutions ?) really breaking something that was humming along before?

My impression is that markets have responded to incentives, receivers of school information have adjusted their signals as they always have, and this is fundamentally a status problem for a few having no substantial impact anywhere- which is how the grades became inflated in the first place.

Respond

"markets have responded to incentives..."

Oh they definitely have. That's why you need a BS in Psych to work in a retirement home kitchen. And that's why that's about as far as a BS in psych will get you, unless of course it has a Harvard stamp on it.

Grade inflation a way of implementing the Equality philosophy. Economically, the left wants to create equality by killing everyone's goat; educationally, they want to give everyone an A.

You can see how it benefits faculty: they can demand less (less work) but still get more adulation and happy hugs - both figuratively and literally - from students and to top it all off avoid nasty confrontations with their CCP handlers in the admin.

Respond

You’re talking about the prole end; the problem Alex worries about is on the laptop end. How will our meritricious elites stand out in a sea of University of Phoenix graduates?

As always, undiscovered talent is rare, but the nagging feeling that someone somewhere got something they didn’t deserve, is constant.

Respond

It's not that much of an issue driven by the left. It's much more due to kids and parents having paid a lot to get into Harvard (in some form) and may be paying a lot to go there. They are told they were the smartest because they managed to get into Harvard. It just ends up being a lot easier to give the kids an A rather than fight with students and maybe most importantly risk losing donations from parents and future grads.

Steve

Respond

"At Harvard, two-thirds of all undergraduate grades are now A’s—up from about a quarter two decades ago."

I guess if you're defining "midwit" to mean 75th percentile at Harvard and "prole" to mean 33rd percentile...

Respond

lol this blog and Harvard. Ok then on the elite end - Harvard prof has x standards for his course and 80% of the class meets them. What exactly has failed?

And if 20% of the students meet his standards, is that evidence about him, them, or both?

It’s safe to sleep this out until someone can state the actual problem beyond “what about Harvard?” If the problem is that Harvards grades need redistribution they can solve that tomorrow if it’s a problem for them. It isn’t.

Respond

Is it actually true that "grade inflation has degraded the informational content of grades at many colleges"? If the grade distribution curve shifts higher, then it provides less information about the highest achievers, but *more* information about the *lowest* achievers. A grade of C might not have meant much if half of the class got C's, but it would mean much more if most of the class gets A's; it tells you that the student who got a C was far below average. I could imagine situations in which employers might find it more useful to be able to screen out the very lowest-performing students than to hire the very highest-performing ones.

In order to know whether grade inflation decreases or increases the amount of information provided by the grade signal, you need to know both the old and the new distributions.

Respond

It doesn't necessarily provide more information about the low achievers. They are now distributed from e.g. D to B where they used to be from F to C. You shift the mean and cap the upper end, the tail is the same.

Respond

At University of Toronto we had a grade cap (not more than 25% in the A range) until about a decade ago. It was not that difficult to administer, and we avoided a lot of problems just by having it apply only to courses with more than 25 students. Unfortunately it resulted in us having the lowest average GPAs in Canada, which is why we eventually abandoned it. The pressure from grade inflation elsewhere convinced us we were disadvantaging our own students. (We actually did a study that proved our students were being handicapped in med school admissions by their low average GPAs, using MCATs as the control.)

As someone who administered this grade cap for several years, I'll just observe that the single greatest cause of grade inflation is professorial laziness. Giving students low grades generates more work, because students are more likely to complain, and dealing with student complaints is extremely time-consuming. Giving low grades also kills the vibe with students, making it more difficult to be the chill/cool professor. So keeping a lid on grade inflation was all about imposing discipline on one's colleagues.

Respond

Yeah, unless those on the other end - employers or next-step educational institutions, are willing and able to normalize GPAs they receive from applicants by averages at the overall high school/college, then there are strong incentives for the HS/college to allow grade inflation to run rampant.

There are various reasons (including ~laziness) why higher institutions/employers don't do this, but ONE such reason is that it allows them to do a somewhat stealthy form of affirmative action - an applicant who is at the 80th percentile (GPA-wise) of his HS class from a strong HS with a lot of striving Asians and other strong students is likely significantly stronger than a 95th percentile (within class) applicant from an inner city HS with terrible reading and math outcomes. But for reasons, many colleges would prefer to admit the latter, reject the former. Same thing for law school, med school, etc.

Respond

Or you could dispense with grades altogether. My perspective is that you should make all classes pass/fail with pass basically indicating you've mastered the material (that is, it's equivalent to a genuine, non-grade-inflated A). If you really care that much about using grades to evaluate students, then you allow more flexibility in how fast or slow students can complete a course. In other words, the top students will distinguish themselves by completing course quickly. To be sure, this would be a radical reshaping of how education is delivered, so I don't think it'll happen, but then again it is how a lot of education software is already structured.

Respond

The obvious solution is a standardized test out for every course, with difficulty rating attached to each test/course. Courses that operate outside the standardization would be automatically given a low difficulty rating - since that's mostly how they turn out to be.

The upshot is that a course can't be made to be "difficult" unless there is a large body of relatively standardized knowledge to acquire and master. But if there is a large body of standardized knowledge to acquire and master, it's relatively easy to develop a representative standard test.

I don't think it matters so much that its far easier to get good grades in BA courses than in BS courses, because the job market for STEM students simply doesn't accept people iwth degrees in poli-sci or psych. The problem is that lots of BA degrees are getting "BSified" to make them more marketable, even though the students have virtualy zero science background.

Another part of the equation is the "science for non-major" courses. These surely tend to be very lightly graded compared to "science for major" courses - after all, that's the whole reason they exist, so the non-major can get a smell of science in their background without actually having to do the work or suffer a lower grade. These courses are problematic both in terms of cranking up grades and in misrepresenting and cartoonifying the sciences - they give participants the actual idea that they learned something about science, when in fact they've missed the most important parts.

Respond

The obvious solution is to bring back 1st year flunk-out courses. It's fairer to everyone vs. designing ever more complex systems to prevent people from gaming the rules.

Respond

The problem with flunk-out classes is that the Universities want to monetize its main asset ("graduated from X") by selling it to the dim children of rich alumni.* Unfortunately, this business model doesn't work if said dim children actually have to perform in the classroom.

* the "special dean's list admittees" put the Varsity Blues scandal into a strange perspective. The problem there wasn't that the defendants violating our sacred meritocracy; it's just that they were bribing the wrong party.

Respond

"bring back 1st year flunk-out courses. "

Actually my experience is that those are second year courses: PChem, Organic Chem, Mineralogy, various physics, eng, bio (anatomy?) courses that I don't know.

But I don't think it's right to call them "flunk-out" courses. The material in them is important. The last thing we need is a chemist that can't recall the names of chemicals or a biologist or doctor that doesn't know anatomy. In real science there are real things you have to know. You can't just bullshit your way around everythign like I did in every humanities course I ever took. You actually ahve to know what you're talking about.

They're not intended to flunk people out. It just happens that lots of people don't have the drive or ability to succeed in those courses.

Respond

Law school admissions officers care chiefly about two things: LSAT score and undergrad GPA. "Strength of schedule" can be a tiebreaker, but those two numbers are the biggest things that determine law school rankings. Until this state of affairs, and similar circumstances in other fields, change, Harvard would be hurting its students by trying bring about grade deflation. Instead, I would propose that Harvard move to a universal pass/fail system and publicly challenge the legitimacy of GPA as an informative metric for evaluating students.

Respond

Some schools, including my law school (which I attended many years ago), have a mandatory bell curve ("normal distribution") requirement for grades. The elites don't because in their minds all of their students are A students. Sure, they are A students compared to students as a whole, but even at the elites intelligence is "normally distributed" among the students. At my law school it was not popular, and students who found themselves on the wrong "tail" were convinced it was a conspiracy, even though tests were graded anonymously ("blind anonymous grading system", or BAGS). The fact that for the three years each student stayed pretty close to the same place on the curve was considered proof of the conspiracy rather than proof that there was no conspiracy. I happened to be on the preferred "tail" so I thought the system was fine. I did not attend an elite law school, but the students came from all over the country and whose LSAT scores and undergraduate grades were very high. I appreciate that the elites are very expensive and the the parents paying the freight would not be pleased if junior didn't get an A, so it's the parents not the students who are being patronized.

Respond

Sure, they are A students compared to students as a whole, but even at the elites intelligence is "normally distributed" among the students.

Not if their admission process is any level of functional. At elite schools, the intelligence distribution should look like a truncated normal, absolutely nothing like a normal distribution. For example, the average of a normal distribution is the center of the distribution. The average of a truncated normal is more like the bottom of the distribution.

Respond

Grades are probably dead and can't be salvaged. They're too subjective. Standardized tests and certification exams could be salvaged. The SAT, for example, should be made more difficult (as it used to be) and no one should receive extra time. No school should be test optional. As it is, the best you can do is guess. E.g., in the absence of better information, I'm biased in favor of using East Asian doctors because I assume they faced racial discrimination in getting into medical school, and therefore are likely highly competent.

Respond

Grades are useless signifiers of an outdated one:many education system. Personalized AI-based ed will enable individualized credentialing that is truly skill-based. Each student can have their own database of every single problem set they’ve ever solved/responded to in any subject. Applications for jobs would give access to the employees AI to query that database and find the best fit to real tasks in real roles (that training to task suitability algorithm should be built). Or at least one could get a very tailored output of exactly what a student is familiar with, has memorized, or truly understands.

Respond

"Grades are useless signifiers of an outdated one:many education system. Personalized AI-based ed will enable individualized credentialing that is truly skill-based. "
------------

Alex's post is only a few hours old but is really a nostolgic look back at the pre-AI education world.

Respond

The most popular major at both Harvard and the University of Chicago? Economics. Is there a grade inflation problem? At Harvard yes, at Chicago no.

Respond

I don't see how this works. I was once stuck in the bottom third of a class with no hope of doing better. It was 400 level E-M class. There was a grad student, the start of the undergraduate department and poor, pitiful me.

But it does highlight that schooling, especially college, is about getting good grades, not real learning. In this case, the grades are against whomever happened to sign up for the course that semester.

And until busing, you competed for grades against whoever happened to live around the school. And let's not skip over the dependence of grades on not questioning the teacher or exhibiting disfavored viewpoints. Not all professors, but enough that you keep your head down and get the magic parchment or risk blasphemy charges

Respond

One solution is to get rid of student evaluations of faculty. The students reward the professors for giving great grades, the teachers reward the students for good evaluations.

Grade inflation is bilateral.

Ditch the evaluations, and it will be much easier to have tougher grading.

Respond

The province of Québec is using a rating system for Cégep (pre-university colleges, similar to last years of High School) to evaluate performance in view of managing university admissions. The system is called Cote R and is based on:
- Your grade vs. the class average (cote Z)
- Strength of the group (how strong students were academically in secondary school)
- Dispersion of the group (how spread out grades are)

The average Cote R across all your classes is then used for selection to competitive university programs (medicine, engineering, law, etc.)
More info in French:
https://bci-qc.ca/wp-content/uploads/2025/09/CRC-Ce-quelle-est_Ce-quelle-fait-BCI-21janvier2025-4.pdf

Respond

This is a great example of the “see around corners” theme from a recent MR post. Those who "designed" the new system probably weren't even aware of how much they didn't know about the problem. That's how you get the law of unintendend consequences.

(What I found a bit more surprising is when the AEA advocated for interview deadlines to prevent unraveling of the market just a few years after Al Roth had stepped down as their president -- as if that Nobel had never been given.)

Respond

I think most of these concerns are second order. The greatest harm from grade inflation is the way it reduces the incentive for students to work harder and learn more. This is the true social loss.

Respond

> It’s that students are penalized for taking harder courses with stronger peers

By that logic, everyone should go to Directional State University.

Respond

At Simon (Rochester business school), it was graded on a curve with only a certain percentage getting an A. I don't remember the percentage but there was no chance I was ever going to get past Ariane, Adam, Cameron, Arun, Naomi, etc. Highly diminishing returns for that marginal additional mastery of the subject. I could get a B+/A- and enjoy life or I could bust my tail and still get a B+/A-. Why bother? Knowledge isn't zero sum but it attempts to turn it into something that is.

Respond

Arguably, business school is 2/3 building your network, 1/3 knowledge transfer i.e., your main job is to party.

And maybe that's the underlying argument for grade inflation at elite schools...

Respond

I have not heard a good reason of why academia doesn't use an approach like what Valen Johnson suggests, other than inertia. Is there one?

Respond

The Super Mario Wonder video game has difficulty scores for courses. The geniuses at Harvard should be capable of figuring out how to do that with their own courses.

Respond

One way to curb this would be to show the number and percent of students who received each grade for the course.

For example, if 100 students took an econ course and 90 received an A, 10 an A-, the transcript would show 90% A, 10%. The employer/admission department would immediately see that an A in this course is worthless.

Likewise, a student with a B+ in an econ class where only 5% receive an A or A- and 10% receive a B+ would look much better to employers/admissions departments.

Many students would want to stop taking easy courses and seek harder courses to separate themselves from the pack.

If you were picking a student and one had a 3.2 GPA but was consistently in the top 10-20% of classes, that would be a safer pick than a student with a 3.9 GPA but only took courses where >90% students received an A or A-.

Respond

This doesn't really solve the problem of inflation, which is that it compresses the upper end of the grade range so as to be meaningless. Of course that is the entire point from the student perspective since the costs of not being at the top end are so high.

But if you want to correct for this issue just weight GPA contribution by the amount of information gained by a certain grade. Everyone gets an A in this class, OK that's expected, it basically doesn't count. Now "easy A" classes essentially only have downside risk and a prof giving a median grade is a median grade no matter if it's an A or C, so they are incentivized to bring their rubrick down to normal without a cap.

This is "academic performance above expectation". Now it doesn't compare majors to create a universal GPA, but not sure that you want that anyway even if AT says that this is the main problem.

Obviously the real real problem is that schools and students want everyone to be excellent, not label some graduates as average or poor, but that is inherent with any ranking system.

Respond

This is why rank is better than grade If everyone gets an A in a course, then everyone is in the 50th percentile

Respond

*BOTH* of these issues are problems, and they're somewhat independent.

A heavily compressed grading scale ~cannot provide informational content, and grades that are divorced from difficulty provide misleading information. (A major college team wins 56-14 - that means something very different if they're playing Alabama or West Directional State.)

Further, this is becoming even more of an issue at the high school level, where some previously selective high schools have dropped the ~best measure of capability (ACT/SAT), and lean on GPAs from schools that are very different in average student body aptitude.

Respond

It makes more sense to compare ranks in courses than grades in courses and at the same time accounting for course fixed effects (where high ranking students take more courses where there are other high ranking students). In this way the problem posed its solved and in a much more intuitive way than by Galen Johnson. See the open access "Average Rank and Adjusted Rank Are Better Measures of College Student Success than GPA"
https://doi.org/10.1111/emip.12521

Respond

One can also employ student and course fixed effects for grades. But Rank appears to be better.

Respond

As Ivy League as Harvard is, they can be pretty dumb sometimes

Respond

Note also that when studies are made of the determinants of college admission - especially ACT/SAT and HS GPA, often the main or only dependent variable is freshman year college GPA.

But when college GPA are compressed, and an A in Sociology 101 rates higher than a B+ in Organic Chem, then you get misleading results for your study.

My guess is that among the two groups in opposite corners of the quadrant (relatively low HS GPA, high SAT gagainst high GPA, low SAT), the latter disproportionately take the softer college schedule.

Respond

"A grade tries to capture two things—student ability and course difficulty—with a single number. "

What? I thought a grade was supposed to measure knowledge.

I expect everyone to get an A in a "how to wash hands" course, but only top 1% ( or even no one) in a real analysis course.

Respond

I found the use of poli sci as the 'easy' major amusing. It is a long time ago (50 years), but when I was in college I knew a fair number of STEM types and econ types who avoid poli sci and history classes because they found them too difficult--too much reading and writing. Oh, and just for the record, I majored in history and poli sci because they were what had interested me since childhood not because they were easy. Things may have changed in 50 years, but I like to think that for at least some students the choice of major and the amount of work they put in reflects what interests them and not the pursuit of easy As or the desire to get a 'good' job right out of college, perhaps at the cost of long term life satisfaction and success.

Respond

Isn't course difficulty supposed to be signaled by course number? For example, when I was in college, there were sometimes 400 and 600 level versions of the same course, where literally the only difference was the grading curve. Shouldn't the information contained in the first digit of the course number be used somehow?

Respond

I didn't realize that grade inflation was such a recent phenomenon. (In my mind, 2005 is "recent".) What was the shock in the aughts that precipitated grade inflation? Some blame course evaluations, but those began long before 2000. Others blame affirmative action (of course). But, affirmative action also began long before 2000. Even the claim that schools want to help their graduates' chances in grad school admissions and the job market seems suspect. Making grades equivalent to pass/fail (because an "A" just means "pass") doesn't necessarily help a school's top graduates --- the ones most likely to compete for top grad school admissions or the jobs at white shoe firms --- distinguish themselves.

It seems like one issue with any new grading system is how it might be gamed. For that, we need to understand what prompted people to start gaming the current system to begin with.

Respond

I don't think this algorithm is resistant to adversarial behavior.

For example, let's say a particular degree program started giving out worse grades to anyone who wasn't a major. Everyone who's an X major gets an A, anyone else gets a B. This seems statistically indistinguishable from the courses actually being harder.

What I would like to see is the grading record of all students and all courses being made public. With modern AI, it is fairly straightforward for different groups to do their own analysis, to produce their own weighted GPAs in whatever manner they see fit. The large amount of input data isn't a problem.

You could even have public rankings of, say, the graduating Harvard economics class. Like they do for college basketball players.

Respond

This is oriented towards employers and grad schools that want to hire or accept the very best, and are finding it difficult to distinguish such people. But that's a small percentage of schools and businesses. Many grad schools and employers want a combination of good enough and interest/aptitude in their particular field.

I got near perfect undergrad grades (engineering) and a near perfect score on the GMAT, but was turned away from one of the top MBA schools because my grades and test scores were too high. In the interview they told me this indicated I was unsuitable for business, "too academic" they called me.

Respond

Grades in college are meaningless unless you're applying to grad school. And I'd be real surprised if anyone chose their PhD discipline based on grading policy. So I don't really understand the problem that is being solved here. Just grade on a curve and be done with it.

Respond

So in essence, students are just paying for a degree? Just give everybody a degree and students are happy, colleges make money and the only losers are students who don't learn anything. And businesses are so stupid they will pay a student in social science the same as someone with a math degree? Why not just hire kids out of high school and charge to teach them what you want them to know and do and charge them for the privilege of learning a useful skill? Maybe we should just get rid of higher education altogether. That's basically what online schools do.

Respond

What problem is this solving? Ask a random college-educated person “if one person gets an A in Physics and another an A in comms, who is probably smarter?” I think we all know what they’ll say. On any prestigious college campus it’s well understood which disciplines are hard and which are easy.

Respond

Forget grades--why not require class rankings. That will let everyone know who finishes in top 10% consistantly.

Respond

A lot depends on what you are going to use the grades for. Are they about qualification? Are they about comparison? What are you comparing and why?

We don't mark driving tests on a curve. Someone either knows the rules of the road fairly well or not well enough. No one asks an electrician what they got on the test. They passed, so they know enough.

Most grade comparison is done for admissions to various levels of college, and it is usually done by professionals who know a fair bit about schools and their grading practices. They use standardized test results for calibration. They've been doing this kind of stuff for decades now. They already know that different majors are graded differently and that different schools have different patterns of grading.

Companies hiring people for their first job might use grades for comparison, but it's rare to be asked one's grades on one's second or later job hunts.

Respond

What has been stopping anyone who cares about the performance of students from using all these wonderful ideas already decades ago? I suppose those employers and such that care already are doing these sorts of higher evaluations.

Respond

I missed the report that said Harvard was graduating a bunch of dum-dums.

Respond

"Harvard faculty committee has proposed capping A grades at 20 percent of each class "

Wouldn't it be interesting to know who is on that committee and what their backgrounds are....Nigel Tufnel?...why not just make all grades to to 11?

Respond
Respond

Add Comment