Science Isn’t Strict Enough
Once upon a time, a younger Eliezer had a stupid theory. Eliezer18 was careful to follow the precepts of Traditional Rationality that he had been taught; he made sure his stupid theory had experimental consequences. Eliezer18 professed, in accordance with the virtues of a scientist he had been taught, that he wished to test his stupid theory.
This was all that was required to be virtuous, according to what Eliezer18 had been taught was virtue in the way of science.
It was not even remotely the order of effort that would have been required to get it right.
The traditional ideals of Science too readily give out gold stars. Negative experimental results are also knowledge, so everyone who plays gets an award. So long as you can think of some kind of experiment that tests your theory, and you do the experiment, and you accept the results, you’ve played by the rules; you’re a good scientist.
You didn’t necessarily get it right, but you’re a nice science-abiding citizen.
(I note at this point that I am speaking of Science, not the social process of science as it actually works in practice, for two reasons. First, I went astray in trying to follow the ideal of Science—it’s not like I was shot down by a journal editor with a grudge, and it’s not like I was trying to imitate the flaws of academia. Second, if I point out a problem with the ideal as it is traditionally preached, real-world scientists are not forced to likewise go astray!)
Science began as a rebellion against grand philosophical schemas and armchair reasoning. So Science doesn’t include a rule as to what kinds of hypotheses you are and aren’t allowed to test; that is left up to the individual scientist. Trying to guess that a priori would require some kind of grand philosophical schema, and reasoning in advance of the evidence. As a social ideal, Science doesn’t judge you as a bad person for coming up with heretical hypotheses; honest experiments, and acceptance of the results, is virtue unto a scientist.
As long as most scientists can manage to accept definite, unmistakable, unambiguous experimental evidence, science can progress. It may happen too slowly—it may take longer than it should—you may have to wait for a generation of elders to die out—but eventually, the ratchet of knowledge clicks forward another notch. Year by year, decade by decade, the wheel turns forward. It’s enough to support a civilization.
So that’s all that Science really asks of you—the ability to accept reality when you’re beat over the head with it. It’s not much, but it’s enough to sustain a scientific culture.
Contrast this to the notion we have in probability theory, of an exact quantitative rational judgment. If 1% of women presenting for a routine screening have breast cancer, and 80% of women with breast cancer get positive mammographies, and 10% of women without breast cancer get false positives, what is the probability that a routinely screened woman with a positive mammography has breast cancer? It is 7.5%. You cannot say, “I believe she doesn’t have breast cancer, because the experiment isn’t definite enough.” You cannot say,
“I believe she has breast cancer, because it is wise to be pessimistic and that is what the only experiment so far seems to indicate.” Seven point five percent is the rational estimate given this evidence, not 7.4% or 7.6%. The laws of probability are laws.
It is written in the Twelve Virtues, of the third virtue, lightness:
If you regard evidence as a constraint and seek to free yourself, you sell yourself into the chains of your whims. For you cannot make a true map of a city by sitting in your bedroom with your eyes shut and drawing lines upon paper according to impulse. You must walk through the city and draw lines on paper that correspond to what you see. If, seeing the city unclearly, you think that you can shift a line just a little to the right, just a little to the left, according to your caprice, this is just the same mistake.
In Science, when it comes to deciding which hypotheses to test, the morality of Science gives you personal freedom of what to believe, so long as it isn’t already ruled out by experiment, and so long as you move to test your hypothesis. Science wouldn’t try to give an official verdict on the best hypothesis to test, in advance of the experiment. That’s left up to the conscience of the individual scientist.
Where definite experimental evidence exists, Science tells you to bow your stubborn neck and accept it. Otherwise, Science leaves it up to you. Science gives you room to wander around within the boundaries of the experimental evidence, according to your whims.
And this is not easily reconciled with Bayesianism’s notion of an exactly right probability estimate, one with no flex or room for whims, that exists both before and after the experiment. Bayesianism doesn’t match well with the ancient and traditional reason for Science—the distrust of grand schemas, the presumption that people aren’t rational enough to get things right without definite and unmistakable experimental evidence. If we were all perfect Bayesians, we wouldn’t need a social process of science.
Nonetheless, around the time I realized my big mistake, I had also been studying Kahneman and Tversky and Jaynes. I was learning a new Way, stricter than Science. A Way that could criticize my folly, in a way that Science never could. A Way that could have told me what Science would never have said in advance: “You picked the wrong hypothesis to test, dunderhead.”
But the Way of Bayes is also much harder to use than Science. It puts a tremendous strain on your ability to hear tiny false notes, where Science only demands that you notice an anvil dropped on your head.
In Science you can make a mistake or two, and another experiment will come by and correct you; at worst you waste a couple of decades.
But if you try to use Bayes even qualitatively—if you try to do the thing that Science doesn’t trust you to do, and reason rationally in the absence of overwhelming evidence—it is like math, in that a single error in a hundred steps can carry you anywhere. It demands lightness, evenness, precision, perfectionism.
There’s a good reason why Science doesn’t trust scientists to do this sort of thing, and asks for further experimental proof even after someone claims they’ve worked out the right answer based on hints and logic.
But if you would rather not waste ten years trying to prove the wrong theory, you’ll need to essay the vastly more difficult problem: listening to evidence that doesn’t shout in your ear.
Even if you can’t look up the priors for a problem in the Handbook of Chemistry and Physics—even if there’s no Authoritative Source telling you what the priors are—that doesn’t mean you get a free, personal choice of making the priors whatever you want. It means you have a new guessing problem that you must carry out to the best of your ability.
If the mind, as a cognitive engine, could generate correct estimates by fiddling with priors according to whims, you could know things without looking at them, or even alter them without touching them. But the mind is not magic. The rational probability estimate has no room for any decision based on whim, even when it seems that you don’t know the priors.
Similarly, if the Bayesian answer is difficult to compute, that doesn’t mean that Bayes is inapplicable; it means you don’t know what the Bayesian answer is. Bayesian probability theory is not a toolbox of statistical methods; it’s the law that governs any tool you use, whether or not you know it, whether or not you can calculate it.
As for using Bayesian methods on huge, highly general hypothesis spaces— like, “Here’s the data from every physics experiment ever; now, what would be a good Theory of Everything?”—if you knew how to do that in practice, you wouldn’t be a statistician, you would be an Artificial General Intelligence programmer. But that doesn’t mean that human beings, in modeling the universe using human intelligence, are violating the laws of physics / Bayesianism by generating correct guesses without evidence.
Nick Tarleton comments:
The problem is encouraging a private, epistemic standard as lax as the social one.
which pinpoints the problem I was trying to indicate much better than I did.