What is the Genetic Structure Methodology used for Purebred Domestic Dogs?

  • Thread starter Thread starter marcus
  • Start date Start date
  • Tags Tags
    Structure
AI Thread Summary
The discussion centers on the Genetic Structure Methodology as applied to the genetic analysis of purebred domestic dogs, referencing a 2004 article in Science. It highlights the use of algorithms to analyze genetic data, emphasizing the potential of these algorithms to uncover unexpected results while acknowledging their current limitations. The conversation touches on the importance of understanding the underlying methodology rather than passively consuming scientific findings. Key concepts include the use of microsatellite typing, phylogenetic analysis, and the Bayesian model-based clustering algorithm known as "structure" to identify genetic relationships among dog breeds. The thread also playfully discourages the mention of "race," focusing instead on the fascinating aspects of canine genetics and the implications of the research for understanding breed differentiation. Overall, the discussion promotes an appreciation for the complexities of genetic analysis in dogs while encouraging a forward-looking perspective on technological advancements in the field.
marcus
Science Advisor
Homework Helper
Gold Member
Dearly Missed
Messages
24,753
Reaction score
794
After i say this rule the first person who says the word "race" has to go home :smile:
That is not because it's a bad word, it is a very nice word with nice
etymological roots and a rightful place in the human mind. It simply is not what the thread is about.
------------------
This thread is about Genetic Structure Methodology, as illustrated in the recent article in Science (21 May 2004, vol 304 page 1160)

Genetic Structure of the Purebred Domestic Dog

http://www.akcchf.org/news/press/releases/2004/dogbreeds.pdf

algorithms are modernday monsters. when an algorithm is created we do not know where it will lead. usually to nothing bad though, just to unexpected stuff. We are talking about a genetic structure algorithm.
At present it must be helped along by humans but in principle it could
be entirely programmed. give it 85 blood samples from 85 different breeds of dog, and the unaided computer builds a tree.

it isn't that good yet. It is still very rudimentary. the first tree (see figure 2) had only about 10 breeds of dog, plus a catch-all category
--------------------

with a new technology it is always tempting to point out lots of reasons why it cannot succeed or cannot generalize or will never be able to do this or that. But it's a basically poor idea to do that. Humans are ingenious and you just have to wait and see. Please go along with me in doing two contracdictory things:(1) recognize that a new algorithm is a potential monster that could take us where we don't want to go (2)dont be afraid, just try to see how it works.

---------------------
"We therefore ran structure on the entire data set with increasing values of K...to identify the ancestral source populations."
page 1162

the italicized word structure is the name of a computer program.
when most of us were children the names of computer programs were set in all-caps, these people use italics. It is less jarring and makes for a nicer-looking page. In this article the italicized word appears many time.
---------------------

It is always tempting to be a passive consumer of science. Let "them" do the hard stuff and tell me the results. In this case, strangely enough, the method fascinates me more than the results. The topic, Purebred Domestic Dogs, is both solemn and a bit ludicrous. It is fun but I don't have any urge to take it seriously. the method though.

the next quote is from page 1162

"We first used standard neighbor-joining methods to build a majority-rule consensus tree of breeds (Fig. 2), with distances calculated using the chord distance measure (26)...The tree was rooted using wolf samples."

This sentence has a pleasant clanking sound of contented jargon.
One wants immediately to know what is the "chord distance measure" was it perchance invented by Mr. Cavalli-Sforza? Yes it was, seemingly in 1967.
The revered and still-active gentleman was then 45 years old. It could be his most famous contribution to Dog Genetics.

"...The tree was rooted using wolf samples." This sentence is too beautiful for comment. Some poet writing for the New Yorker might steal it.
--------------

there is a very nice passage in the middle of page 1161. I will try to mouse it so I don't have to type it in myself.

dont you get the feeling that we ought to understand the main steps in the algorithm?
(those like myself who don't yet)
 
Last edited by a moderator:
Biology news on Phys.org
exerpts from the dog article with some samples of terminology bolded

------exerpts----
Here, we show that microsatellite typing of a diverse collection of 85 breeds, combined with phylogenetic analysis and modern genetic clustering methods (11, 12), allows the definition of related groups of breeds and that genetic relatedness among breeds often correlates with morphological similarity and shared geographic origin.

To assess the amount of sequence variation in purebred dogs, we first resequenced 19,867 base pairs of noncontiguous genomic sequence in 120 dogs representing 60 breeds. We identified 75 single nucleotide polymorphisms (SNPs), with minor allele frequencies ranging from 0.4 to 48% (table S1).

Strong genetic differentiation among dog breeds suggests that breed membership could be determined from individual dog genotypes (9). To test this hypothesis, we first applied a Bayesian model–based clustering algorithm, implemented in the program structure (11, 12, 21), to the microsatellite data. The algorithm attempts to identify genetically distinct subpopulations on the basis of patterns of allele frequencies. We applied structure to overlapping subsets of 20 to 22 breeds at a time (22) and observed that most breeds formed distinct clusters consisting solely of all the dogs from that breed (Fig. 1A). Dogs in only four breeds failed to consistently cluster with others of the same breed: Perro de Presa Canario, German Shorthaired Pointer, Australian Shepherd, and Chihuahua.



The tree was rooted using wolf samples. The deepest split in the tree separated four Asian spitz-type breeds, and within this branch the Shar-Pei split first, followed by the Shiba Inu, with the Akita and Chow Chow grouping together. The second split separated the Basenji, an ancient African breed. The third split separated two Arctic spitz-type breeds, the Alaskan Malamute and Siberian Husky, and the fourth split separated two Middle Eastern sight hounds, the Afghan and Saluki, from the remaining breeds. The first four splits exceeded the majorityrule criterion, appearing in more than half of the bootstrap replicates.

-----end quotes-----

these interesting-sounding terms are probably good ones to understand (in some cases almost self-explanatory from context, but wouldn't hurt to be more explict)
 
marcus said:
with a new technology it is always tempting to point out lots of reasons why it cannot succeed or cannot generalize or will never be able to do this or that. But it's a basically poor idea to do that. Humans are ingenious and you just have to wait and see. Please go along with me in doing two contracdictory things:(1) recognize that a new algorithm is a potential monster that could take us where we don't want to go (2)dont be afraid, just try to see how it works.
Er, marcus were you aware that structure has been used to analyse human genes?

In fact, that "Rosenberg et al" paper which was referenced several times in the "Is there a scientific basis for 'human races'?" is a paper giving the results of just such an analysis. https://www.physicsforums.com/showpost.php?p=240905&postcount=38 hissquad's post, bringing it to our attention.

Also, the two papers - both with Cavalli-Sfroza among the authors - which iansmith provided links to (well, to the abstracts) seem to have used similar techniques to the structure algorithm. Of, and btw, C-S et al in their 1994 book include a) a detailed description of how to construct trees (yes, they talk about the difference between 'rooted' and 'rootless' trees), and b) many trees for Homo sap., based on their analyses of genetic information available to them at the time. The first paper also illustrates well advances in the past decade or more, e.g. "For the first time, with biparentally transmitted markers, the microsatellite tree also shows that the San are the first branch of the human tree before the branch leading to all other Africans"
 
Nereid said:
Er, marcus were you aware that structure has been used to analyse human genes?

Hush Nereid! Shhhhh!
I am trying to keep all ideas of humans away from this thread!
Wont it be nice if just think about dogs!
 
But the title of the thread is 'Genetic Structure Methodology' (I didn't see the sign "Dogs Only") :wink:
 
Nereid said:
But the title of the thread is 'Genetic Structure Methodology' (I didn't see the sign "Dogs Only") :wink:

Try focussing on dogs. You will be happy. Dogs like you.
 
Deadly cattle screwworm parasite found in US patient. What to know. https://www.usatoday.com/story/news/health/2025/08/25/new-world-screwworm-human-case/85813010007/ Exclusive: U.S. confirms nation's first travel-associated human screwworm case connected to Central American outbreak https://www.reuters.com/business/environment/us-confirms-nations-first-travel-associated-human-screwworm-case-connected-2025-08-25/...
Chagas disease, long considered only a threat abroad, is established in California and the Southern U.S. According to articles in the Los Angeles Times, "Chagas disease, long considered only a threat abroad, is established in California and the Southern U.S.", and "Kissing bugs bring deadly disease to California". LA Times requires a subscription. Related article -...
I am reading Nicholas Wade's book A Troublesome Inheritance. Please let's not make this thread a critique about the merits or demerits of the book. This thread is my attempt to understanding the evidence that Natural Selection in the human genome was recent and regional. On Page 103 of A Troublesome Inheritance, Wade writes the following: "The regional nature of selection was first made evident in a genomewide scan undertaken by Jonathan Pritchard, a population geneticist at the...

Similar threads

Back
Top