Saturday, May 23, 2020

Genome-Wide Association Studies (GWAS) Part 1 - Genetic variation and traits


Genome-wide association studies (GWAS) are a method of identifying genetic variants statistically associated with a biological trait of interest. Before we can understand what that means, at least for those of you who are beginners when it comes to genetics, we need to learn what is meant by genetic variants and what is meant by traits.

A common metaphor to understand the human genome is to think of it as an instruction manual on how to build and maintain YOU written in the language of DNA. The DNA alphabet is only 4 letters long consisting of the nucleotides adenine, cytosine, guanine and thymine denoted as A, C, G and T respectively.




The letters in this alphabet are used to compose three letter words called codons which code for their corresponding amino acids as well as codons which signal the start and end of a sequence to be translated.



In the same way that a chain of words form a sentence a chain of amino acids form a protein. A gene can be thought of as a paragraph containing these sentences which tell your cells how to make the proteins which form the building blocks of your body. Genes consist of exons, the sequences which are translated into amino acids, and introns, non-coding sequences of DNA which can be thought of as the punctuation between sentences.





Paragraphs are divided amongst chapters of the genome called chromosomes. Within each chapter paragraphs are separated by pages and pages of non-coding DNA, vastly outnumbering the amount of space taken up by the paragraphs. Within these pages are found sequences called transcription factors which contain information such as under which conditions and how frequently the instructions in the paragraphs should be carried out. The instruction manual contains 23 chapters and is 3.3 billion letters long.




Your genome is more than 99 percent identical to the genome of any other human. The less than one percent difference in our genomes are genetic variants. Often these are one letter changes in different positions across your genome called Single Nucleotide Polymorphisms or SNPs. A "C" in one part of a page where others have a "T" might result in a differently shaped protein or a change in the conditions under which the protein is produced or the frequency which it is produced at.



More often than not however the one letter changes will either be on a part of a page which makes no difference to the instructions or the changes they make will not result in a biological difference. The few SNPs which do make a difference are responsible for the biologically inherited diversity we can observe between individuals including factors which influence the way we behave. This observable diversity in biological features is what we mean when we are talking about traits.

A trait can be either quantitative or qualitative. A quantitative trait is one which differs in the level which it is expressed. Height would be an example of quantitative trait because people differ on a spectrum from short to tall. A qualitative trait is one which takes one of a number of distinct forms such as eye colour which can be one of a few different colours e.g. blue, brown or green.

Short And Tall Businessmen Stock Photo, Picture And Royalty Free ... Human eye color chart, with fun facts

To use a behavioural example, how highly someone scores on a measure of extroversion from a personality test would be a quantitative trait whereas whether or not someone has ever went sky-diving would be a qualitative trait.

GWAS, therefore, is way of answering the question "what are the biological consequences of single letter variations in our instruction manuals" by finding out what genetic variants are associated with different traits.

GWAS are carried out by taking a measure of a trait from a large sample of individuals alongside nucleotides from a series of carefully chosen positions within the genome and performing statistical analyses to identify associations between genetic variants and the trait of interest. Essentially, the goal in carrying out GWAS is to find genetic variants which are more likely to be found in those possessing a qualitative trait, or those who express a quantitative trait to a greater extent, than would be expected from the population as a whole.

So if individuals who have the aforementioned C in one part of a page were 0.05% more likely to have gone skydiving than those who have a T some newspaper might claim that scientists have discovered "the skydiving gene". 

Counting down the Best Places to Skydive in New Zealand | GloHoliday

The next article will discuss the actual methodology of GWAS in more detail including how genetic variants are identified and the statistical tests which are used to identify associations.

Revision

Instruction Manual
Genome
Chapter
Chromosome
Paragraph
Gene
Sentence
Exon
Punctuation
Intron
Word
Codon
Language
DNA
Letter
Nucleotide

Saturday, May 16, 2020

Introduction to Behavioural Genomics

Genomics refers to the study of the genome, the complete set of DNA possessed by an organism including all of its genes. A copy of the genome of an organism is found in the nucleus of each of its cells. The genes which are contained in the genome are transcribed into RNA and then translated into proteins which form the biological machinery of each cell and ultimately the physiology of the organism as a whole.



The relevance of genomics to the study of physical, biological characteristics is straightforward. Behaviour on the other hand is a non-physical characteristic of an organism so naturally the relationship between DNA and behaviour is less obvious. The relevance of genomics to behaviour becomes clear only after two facts regarding the biology of behaviour are taken into consideration.

The first is that behaviour has a physiological basis, an example of which being the structure and functioning of neurons in the brain. The second is that the physiological basis of behaviour is heritable, which is to say passed on from one generation to the next in a similar way to that of physical characteristics.

Environmental factors do also play a role in shaping behaviour of course but this does not subtract from the fact that behaviour does have a heritable component. The heritability of behaviour, the degree to which it is influenced by genetics, can be examined and measured using family studies including twin and adoption studies.

Behavioural Genomics therefore is the study of the relationship between genetic variation across the genome and the behaviour of individual organisms including the prediction of behaviour and understanding the biological pathways which mediate the production of behaviour. Genomics can also be used to understand the physiological basis of differences in behaviour between closely related species, a field known as comparative genomics, but here we will focus on variation in behaviour between individuals within the same species. Particularly humans.

Behavioural genomics studies in humans most often involve taking a measure of behaviour from a large sample of individuals alongside a series of genetic variants from multiple sites within their genome. Statistical analyses are then carried out to identify associations between known genetic variants and different modes of behaviour. Studies of this kind are known as genome wide association studies (GWAS).

I made this blog to discuss how the young science of behavioural genomics can shed light upon the fascinating subject of how our genes influence the way we behave. My goal is to draw attention to and to invite as many people as possible to join the discussion around behavioural genomics and the implications it has in science, medicine, social policy, law and philosophy. The next article will be about GWAS; how they are carried out and the scientific findings which they have produced.

View this article as a YouTube video here.