Unravelling the Double Helix. The Lost Heroes of DNA
|James Dewey Watson and Francis Harry Compton Crick, 1953|
I did not know (or forgot) that 40 years before Watson and Crick, a researcher called Phoebus Levene established the structure of the sugar and the 4 bases in DNA in the course of decades of research and put it together in a model of DNA:
|figuur 7.2 in Gareth Williams (2019)|
DNA according to Phoebus_Levene.
In the famous textbook Molecular Biology of The Gene (James D. Watson, et al) Levene is not even mentioned. Yes, Levene's model was not a double helix, but he got the building blocks and the links right. This puts the achievements of Watson and Crick in the right historical context. They have become famous for putting the last pieces of the puzzle together and got the Nobel price for it. But without the work of Levene they could never have done it.
I discovered this thanks to medical historian Gareth Williams. In his book Unravelling the Double Helix, Gareth Williams shows that 85 year of DNA research by hundreds of scientists preceded Watson and Crick's discovery. That is the period 1868 up to 1953. His book is very entertaining and easy-going with special attention to the characters and personalities involved including conflicts between scientists, their prejudices, failures and successes.
Dogmatic?I think the most interesting and controversial researcher was Phoebus Levene. Levene produced a wrong structure of the DNA model (tetranucleotide) and is supposed to have defended it at all costs, he has been accused of obstructing the progress of the science of DNA. On its own it should not be a disaster to publish a wrong hypothesis.The famous Linus Pauling published a faulty DNA model. And the first model of DNA produced by Watson and Crick was hopelessly wrong. This kind of mistakes are usually corrected by subsequent research. But Levene's model was not corrected quickly. Prof. Levene was an authority. He had great technical skills. Not many others had the skills to repeat his experiments and refute him. According to Williams he ignored criticism and inconvenient facts. As a consequence research into DNA came to a halt. DNA was a boring molecule. Therefore, scientists focussed on proteins as the carrier of heredity. That is tragic indeed. But, it is highly unfair to judge Levene after 1953 with the correct DNA structure at hand. The question is: was he unreasonable given the data available at the time?
Levene in WikipediaWas Levene wrong in all respects? The DNA structure in figure 7.2 (above) has its merits. The structure is linear and not closed. It can be extended indefinitely in both directions, which is a promising feature for information storage. However, wikipedia shows a different structure :
|Tetranucleotide in wikipedia|
This is the Tetranucleotide according to wikipedia. It is a visually powerful image with a clear message . It clearly is a closed molecule containing each of the 4 bases A, T, C, G only once. It has no possibilities for extensions. This model is not in Williams' book, but he informed me that it originates from H. Takahashi (1932) Uber fermentative Phosphorierung der Nucleinsaure. So, wikipedia attributes the figure to Levene incorrectly .
The problem is that we are not sure what Levene really thought. There is evidence that Levene thought DNA has a closed structure or consists of a stack of closed structures. But there is also evidence that he thought DNA was linear . As far as I can see, Levene never published the closed tetranucleotide structure himself and the models he did publish are linear . Does that tell us something? This requires investigation of the primary sources, his own publications, not what others tell about him. Finally, why would he spend so much time on a boring molecule?
With the benefit of hindsightWith the benefit of hindsight, the linear model, but even the cyclic model had more potential than Levene had realised. The linear model has the potential to be extended in both directions infinitely because it is an open structure. Allowing for any order of the bases would fulfil the requirements for information storage. OK, it is not a double helix, but that does not affect information storage capacity .
In Williams' figure 7.2 (above) the bases T, A, C, G are present, but in figure 4.8 and 4.9 of Portugal and Cohen (1979) only pyrimidine (T or C) and purine (A or G) are present. I guess that the illustrations of Portugal and Cohen are close to the original. So, that suggests Levene had no fixed order of the 4 bases in mind. In my view Levene could not have evidence that there is only one fixed order of the bases in his tetranucleotide or linear model. Then (if he did not impose a purine - pyrimidine alternation), 6 possible tetranucleotides are possible:
|6 different closed tetranucleotides ©GK|
What could cast doubt on the tetranucleotide hypothesis is experimental data suggesting equal amounts of A, T, C, G in DNA samples (A:T:C:G = 1:1:1:1). At the time scientists accepted that these data are correct. However, do equal amounts of A, T, C, G really kill information storage capacity? No, because A, T, C, G could be distributed in many different ways in a linear single-stranded sequence. For example these two 8-base-sequences
have the same number of the four bases: 2xA, 2xT, 2xC, 2xG, but are obviously different sequences. Why did not this occur to people (Levene) at the time? Could it be that they simply did not have the concepts of information storage and digital coding like we do nowadays with computers in every pocket? . If so, the problem was a conceptual problem. In that case Levene was not dogmatic, but a general conceptual barrier prevented Levene and others to solve the DNA puzzle. Maybe another conceptual barrier was the idea of a helix or the idea of a double helix because these concepts were not yet invented in chemistry? .However, the fact that, after analysis of the macromolecule DNA many breakdown products were to be found in the test-tube, was also a practical barrier
The most beautiful molecule
All these possible barriers make the history of the discovery of DNA fascinating. Although that history has been written at length by others, including Watson, Crick and Wilkins , and I read them years ago, Gareth Williams managed to renew my interest in the subject. I started rereading the older books because I had questions and wanted answers.
|Scientists of DNA timeline (source).|
I found a time-line of the discovery of DNA on the internet. Amazingly, the period between Miescher (1869) and Chargaff (1950) is completely blanc! As if nothing had been done on DNA. Gareth Williams did a great job filling in that empty space.
But Gareth Williams did more than that. For example, for me Miescher was just a name. But Friedrich Miescher did his discovery at a very young age and at first his boss was not interested in publishing it. Amazingly, Miescher did not believe nucleic acid had anything to do with heredity! He preferred to see nucleic acid as a storage for phosphorus! This sort of stories make what was only the name of an author into a real person.
Another example is the famous Thomas Hunt Morgan, who at first was unimpressed by Mendelian genetics and chromosomes, but "in 1915 he publicly purged himself of his earlier sins in The Mechanism of Mendelian Heredity". It is travelling back in time. Lessons to learn when reading the scientific journals of today.
DNA is by far the most beautiful, oldest and most informative molecule in the universe and Gareth Williams made even more interesting. Such a molecule deserves another book. Williams wrote a friendly introduction into the history of that molecule. The merit of Williams' book is that he is not only telling the hard scientific facts, but also the personal and conceptual struggles of the scientists involved in such a way that one is intrigued and wants to know more.
- Gareth Williams (2019) Unravelling the Double Helix: The Lost Heroes of DNA, Weidenfeld-Nicolson.
- Gareth Williams op youtube talking about his book.
- website Professor Gareth Williams
- Google books online
- Jan Witkowski (2019) The forgotten scientists who paved the way to the double helix, is a book review in Nature 16 april 2019. This triggered me to buy the book.
- Phoebus Levene. The first man who identified the components of DNA. Tetranucleotide Hypothesis. Youtube
- Phoebus Levene on wikipedia
- The “scientific catastrophe” in nucleic acids research that boosted molecular biology, J Biol Chem. 2019 Feb 15; 294(7): 2249–2255 (met figuren)
- F.H. Portugal, J.S. Cohen (1979) A Century of DNA. MIT. Chapter 4: Tetranucleotide Hypothesis.
- Robert Olby (2013) The Path to the Double Helix. The Discovery of DNA.
- Tetra-nucleotide hypothesis is very shortly discussed in The Eight Day of Creation, but a whole chapter is dedicated to the Tetranucleotide hypothesis in Portugal and Cohen; and in Olby. But I must have forgotten it completely.
- Portugal and Cohen in A Centry of DNA wrote "Thus, in 1938, Levene and Schmidt were able to measure the molecular weight of native DNA at between 200,000 and 1 million. (88). And: Levene "proposed a linear combination of nucleotides with the correct internucleotide linkage." (p.89).
- The publication of Takahashi is in the list publication of GW, but he does not include the Takahashi figure in his book.
- However the wikipedia figure must be a modern reconstruction of Takahashi figure. This must be because in F.H. Portugal, J.S. Cohen (1979) figure 4.10 shows the tetranucleotide in a different form, and the bases are indicated with 'pu' (purine) and 'py' (pyrimidine) and not with A, C, T, G.
- Williams informs me that "Levene never dreamed up a cyclic structure" (personal communication 17 June 2019). I had formed the view that Levene was dogmatic about the tetranucleotide and therefore obstructed the progress of science.
- After I wrote that, I found in Olby The Path to the Double Helix: "The number of Bausteine which can take part in the formation of the proteins is about as large as the number of letters in the alphabet." which was written by Kossel in 1911. So, the alphabet was used as a metaphor for information in proteins. Maybe this was not a general view? But there seems to be no conceptual barrier viewing DNA as a carrier of information.
- The tetranucleotide hypothesis and especially the single-stranded DNA hypothesis prevented Chargaff to correctly interpret his own data. He did not try a double-stranded structure. Apparently, that is a huge leap.
- Maurice Wilkins (2003) The Third Man of the Double Helix, wrote no more than one sentence about Phoebus Levene: " ... analytical chemist Phoebus Levene, who had argued for decades that DNA could not be gene material because it only contained four bases, which would, he claimed, make it far too simple a compound to contain genetic information." (p.152). That is really disappointing. What a terrible thing to say about the man who discovered the D-ribose sugar and the nucleotides and nucleosides in DNA! Even worse: Wilkins wrote Levene had a negative influence on Chargaff!
- In Francis Crick (1988) What Mad Pursuit, "Phoebus Levene, the leading expert on nucleic acid in the 1930s, had proposed that they had a regular repeating structure [the so-called tetranucleotide hypothesis]. This hardly suggested that they could easily carry genetic information. (p33-34). But it is highly unfair to condemn Levene after 1953! What is needed the data available at his time. And therefore actual publications of that time should be investigated.
- James D. Watson (1982) The Double Helix. Penguin Books. Nothing about the history of DNA research! DNA has no history!
- Single-stranded DNA does not occur in bacteria, plants and animals, but there exist single-stranded DNA virusses. RNA is single-stranded.