Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
QUANTITATIVE TRAIT LOCUS ASSOCIATED WITH A PATHOGEN RESISTANCE TRAIT IN CANNABIS
Document Type and Number:
WIPO Patent Application WO/2024/033886
Kind Code:
A2
Abstract:
The invention relates to methods of identifying and characterizing a Cannabis spp. plant comprising a quantitative trait locus (QTL) or a causative polymorphism associated with pathogen resistance, and to Cannabis spp. plants having a pathogen resistance trait of interest comprising defined allelic states of polymorphisms defining the QTL. Also provided are plants with a pathogen resistance trait of interest identified by the methods described herein. The invention further relates to marker assisted selection and marker assisted breeding methods for obtaining plants having a pathogen resistance trait of interest, as well as to methods of producing Cannabis spp. plants with the pathogen resistance trait of interest and plants produced by these methods, based on the allelic state of the QTLs or the causative polymorphism.

Inventors:
CROPANO CLAUDIO (CH)
CARRERA DÁNIEL ÁRPÁD (CH)
GEORGE GAVIN MAGER (CH)
KATSIR LERON (CH)
SCHMID MARC (CH)
VOGT MAXIMILIAN MORITZ (CH)
RUCKLE MICHAEL EDUARD (CH)
WYLER MICHELE (CH)
Application Number:
PCT/IB2023/058132
Publication Date:
February 15, 2024
Filing Date:
August 11, 2023
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
PUREGENE AG (CH)
PIENAAR DANIE (ZA)
International Classes:
C12Q1/6895; A01H6/28
Attorney, Agent or Firm:
SPOOR & FISHER et al. (ZA)
Download PDF:
Claims:
CLAIMS:

1 . A method for characterizing a Cannabis spp. plant with respect to a pathogen resistance trait, the method comprising the steps of:

(i) genotyping at least one plant with respect to at least one pathogen resistance QTL by detecting one or more polymorphisms associated with the pathogen resistance trait as defined in Table 2 or 9; and

(ii) characterizing the plant as having an increased pathogen resistance QTL, a decreased pathogen resistance QTL or an intermediate pathogen resistance QTL based on the genotype at the polymorphism.

2. The method of claim 1 , wherein the polymorphism is selected from the group consisting of “common_3407”, “common_703”, “common_1691”, “common_701”, and combinations thereof as defined in Table 2 or 9.

3. The method of claim 1 , wherein the genotyping is performed by PCR-based detection using molecular markers, sequencing of PCR products containing the one or more polymorphisms, targeted resequencing, whole genome sequencing, or restriction-based methods, for detecting the one or more polymorphisms.

4. The method of claim 3, wherein the molecular markers are for detecting polymorphisms at regular intervals within the at least one pathogen resistance QTL such that recombination can be excluded.

5. The method of claim 3, wherein the molecular markers are for detecting polymorphisms at regular intervals within the at least one pathogen resistance QTL such that recombination can be quantified to estimate linkage disequilibrium between a particular polymorphism and a pathogen resistance phenotype.

6. The method of claim 3, wherein the molecular markers are designed based on a context sequence for the polymorphism described in T able 3 or 10, or are selected from the primer pairs as defined in Table 4 or 11 .

7. The method of claim 1 , wherein the pathogen resistance QTL is selected from the group consisting of: i. a quantitative trait locus having a sequence that corresponds to nucleotides 18901 130 to 23661598 of NC_044371.1 of the CS10 genome and is defined by one or more polymorphisms associated with pathogen resistance as defined in Table 2 or 9, or a genetic marker linked to the QTL; ii. a quantitative trait locus having a sequence that corresponds to nucleotides 86855033 to 90075725 of NC_044375.1 of the CS10 genome and is defined by one or more polymorphisms associated with pathogen resistance as defined in Table 2, or a genetic marker linked to the QTL; iii. a quantitative trait locus defined by a single nucleotide polymorphism at position 79997 of NC_044373.1 of the CS10 genome and associated with pathogen resistance as defined in Table 2, or a genetic marker linked to the QTL; and iv. a combination of any of (i) to (iii).

8. The method of claim 1 , wherein the pathogen resistance trait is a Botrytis cinerea resistance trait.

9. A method of producing a Cannabis spp. plant having a pathogen resistance trait of interest, the method comprising the steps of:

(i) providing a donor parent plant having in its genome at least one pathogen resistance QTL characterized by one or more polymorphisms associated with the pathogen resistance trait of interest as defined Table 2 or 9;

(ii) crossing the donor parent plant having the at least one pathogen resistance QTL with at least one recipient parent plant to obtain a progeny population of cannabis plants;

(iii) screening the progeny population of cannabis plants for the presence of the at least one pathogen resistance QTL; and

(iv) selecting one or more progeny plants having the at least one pathogen resistance QTL, wherein the mature plant displays the pathogen resistance trait of interest.

10. The method of claim 9, further comprising:

(v) crossing the one or more progeny plants with the donor recipient plant; or

(vi) selfing the one or more progeny plants.

1 1 . The method of claim 9, wherein the screening comprises genotyping at least one plant from the progeny population with respect to the at least one pathogen resistance QTL by detecting one or more polymorphisms associated with the pathogen resistance trait of interest as defined Table 2 or 9.

12. The method of claim 9, wherein the method comprises a step of genotyping the donor parent plant with respect to the at least one pathogen resistance QTL by detecting one or more polymorphisms associated with the pathogen resistance trait of interest as defined Table 2 or 9, prior to step (i).

13. The method of claim 1 1 or 12, wherein the genotyping is performed by PCR-based detection using molecular markers, sequencing of PCR products containing the one or more polymorphisms, targeted resequencing, whole genome sequencing, or restriction-based methods, for detecting the one or more polymorphisms.

14. The method of claim 13, wherein the molecular markers are for detecting polymorphisms at regular intervals within the at least one pathogen resistance QTL such that recombination can be excluded or such that recombination can be quantified to estimate linkage disequilibrium between a particular polymorphism and the pathogen resistance trait of interest.

15. The method of claim 13, wherein the molecular markers are designed based on a context sequence for the polymorphism in Tables 3 or 10, or are selected from the primer pairs as defined in Table 4 or 1 1 .

16. The method of claim 9, wherein the polymorphism is selected from the group consisting of “common_3407”, “common_703”, “common_1691”, “common_701”, and combinations thereof as defined in Table 2 or 9.

17. The method of claim 9, wherein the pathogen resistance QTL is an increased pathogen resistance QTL, a decreased pathogen resistance QTL, or an intermediate pathogen resistance QTL as defined in Table 2 or 9.

18. The method of claim 9, wherein the pathogen resistance QTL is selected from the group consisting of: i. a quantitative trait locus having a sequence that corresponds to nucleotides 18901 130 to 23661598 of NC_044371.1 of the CS10 genome and is defined by one or more polymorphisms associated with pathogen resistance as defined in Table 2 or 9, or a genetic marker linked to the QTL; ii. a quantitative trait locus having a sequence that corresponds to nucleotides 86855033 to 90075725 of NC_044375.1 of the CS10 genome and is defined by one or more polymorphisms associated with pathogen resistance as defined in Table 2, or a genetic marker linked to the QTL; iii. a quantitative trait locus defined by a single nucleotide polymorphism at position 79997 of NC_044373.1 of the CS10 genome and associated with pathogen resistance as defined in Table 2, or a genetic marker linked to the QTL; and iv. a combination of any of (i) to (iii).

19. The method of claim 9, wherein the pathogen resistance trait of interest is a Botrytis cinerea resistance trait.

20. A method of producing a Cannabis spp. plant that has a pathogen resistance trait of interest, the method comprising introducing at least one pathogen resistance QTL characterized by one or more polymorphisms associated with the pathogen resistance trait of interest as defined in Table 2 or 9 into a Cannabis spp. plant, wherein said QTL is associated with the pathogen resistance trait of interest in the plant.

21 . The method of claim 20, wherein introducing the at least one pathogen resistance QTL comprises crossing a donor parent plant having the at least one pathogen resistance QTL with a recipient parent plant.

22. The method of claim 20, wherein introducing the at least one pathogen resistance QTL comprises genetically modifying the Cannabis spp. plant.

23. The method of claim 20, wherein the pathogen resistance QTL is selected from the group consisting of: i. a quantitative trait locus having a sequence that corresponds to nucleotides 18901 130 to 23661598 of NC_044371.1 of the CS10 genome and is defined by one or more polymorphisms associated with pathogen resistance as defined in Table 2 or 9, or a genetic marker linked to the QTL; ii. a quantitative trait locus having a sequence that corresponds to nucleotides 86855033 to 90075725 of NC_044375.1 of the CS10 genome and is defined by one or more polymorphisms associated with pathogen resistance as defined in Table 2, or a genetic marker linked to the QTL; iii. a quantitative trait locus defined by a single nucleotide polymorphism at position 79997 of NC_044373.1 of the CS10 genome and associated with pathogen resistance as defined in Table 2, or a genetic marker linked to the QTL; and iv. a combination of any of (i) to (iii).

24. The method of claim 20, wherein the pathogen resistance trait of interest is a Botrytis cinerea resistance trait.

25. A Cannabis spp. plant characterized according to the method of claim 1 .

26. A Cannabis spp. plant produced according to the method of claim 9.

27. A Cannabis spp. plant produced according to the method of claim 20.

28. A Cannabis spp. plant comprising at least one pathogen resistance QTL characterized by one or more polymorphisms associated with a pathogen resistance trait of interest as defined in Table 2 or 9, provided that the plant is not exclusively obtained by means of an essentially biological process.

29. The Cannabis spp. plant of claim 28, wherein the pathogen resistance trait of interest is a Botrytis cinerea resistance trait.

30. An isolated nucleic acid comprising a quantitative trait locus that controls a pathogen resistance trait in Cannabis spp., wherein the quantitative trait locus has a sequence that corresponds to nucleotides 18901 130 to 23661598 of NC_044371.1 of the CS10 genome and is defined by one or more polymorphisms associated with pathogen resistance as defined in Table 2 or 9, or a genetic marker linked to the QTL.

31. An isolated nucleic acid comprising a quantitative trait locus that controls a pathogen resistance trait in Cannabis spp., wherein the quantitative trait locus has a sequence that corresponds to nucleotides 86855033 to 90075725 of NC_044375.1 of the CS10 genome and is defined by one or more polymorphisms associated with pathogen resistance as defined in Table 2, or a genetic marker linked to the QTL.

32. An isolated nucleic acid comprising a quantitative trait locus that controls a pathogen resistance trait in Cannabis spp., wherein the quantitative trait locus is defined by a single nucleotide polymorphism at position 79997 of NC_044373.1 of the CS10 genome and is associated with pathogen resistance as defined in Table 2, or a genetic marker linked to the QTL.

33. A Cannabis spp. plant comprising an isolated nucleic acid comprising the quantitative trait locus of any one of claims 30 to 32.

34. A Cannabis spp. plant comprising: i. a quantitative trait locus that controls a pathogen resistance trait in Cannabis spp., wherein the quantitative trait locus has a sequence that corresponds to nucleotides 18901 130 to 23661598 of NC_044371.1 of the CS10 genome and is defined by one or more polymorphisms associated with pathogen resistance as defined in Table 2 or 9, or a genetic marker linked to the QTL; ii. a quantitative trait locus that controls a pathogen resistance trait in Cannabis spp., wherein the quantitative trait locus has a sequence that corresponds to nucleotides 86855033 to 90075725 of NC_044375.1 of the CS10 genome and is defined by one or more polymorphisms associated with pathogen resistance as defined in Table 2, or a genetic marker linked to the QTL; and iii. a quantitative trait locus that controls a pathogen resistance trait in Cannabis spp., wherein the quantitative trait locus is defined by a single nucleotide polymorphism at position 79997 of NC_044373.1 of the CS10 genome and is associated with pathogen resistance as defined in Table 2, or a genetic marker linked to the QTL.

Description:
QUANTITATIVE TRAIT LOCUS ASSOCIATED WITH A PATHOGEN RESISTANCE TRAIT IN CANNABIS

BACKGROUND OF THE INVENTION

The invention relates to methods of identifying and characterizing a Cannabis spp. plant comprising a quantitative trait locus (QTL) associated with pathogen resistance, including to Botrytis cinerea, and to Cannabis spp. plants having a resistance trait of interest comprising defined allelic states of the polymorphisms defining the QTL. The invention further relates to plants with a pathogen resistance trait of interest identified by the methods described herein. The invention also relates to marker assisted selection and marker assisted breeding methods for obtaining plants having a pathogen resistance trait of interest. Also provided are methods of producing Cannabis spp. plants with the pathogen resistance trait of interest and plants produced by these methods, based on the allelic state of the QTLs, as well as genes responsible for controlling the pathogen resistance trait of interest.

Modern Cannabis is derived from the cross hybridization of three biotypes; Cannabis sativa L. ssp. indica, Cannabis sativa L. ssp. sativa, and Cannabis sativa L. ssp. ruderalis. Cannabis was divergently bred into two distinct, albeit tentative types, called Hemp and HRT (high-resin-type) Cannabis, respectively, which are typically used for different purposes. Hemp is primarily used for industrial purposes, for example in feed, food, seed, fiber, and oil production. Alternatively, HRT cannabis is largely cultivated and bred for high concentrations of the pharmacological constituents, cannabinoids, produced largely in the trichomes. Biomass, primarily the leaf, of cannabis can also be an important source of cannabinoids.

Cannabis is one of a few species in the plant kingdom to produce phytocannabinoids. Phytocannabinoids are a class of terpenoid acting as antagonists and agonists of mammalian endocannabinoid receptors. The pharmacological action is derived from this ability of phytocannabinoids to disrupt and mimic endocannabinoids. Due to its psychoactive properties, one cannabinoid, delta-9-tetrahydrocannabinol (THC), the decarboxylation product of the plant- produced delta-9-tetrahydrocannabinolic acid (THCA), has received much attention in illegal or unregulated breeding programs, with modern HRT varieties having THC concentrations of 0.5% to 30%.

HRT cannabis is predominantly cultivated in controlled indoor or greenhouse environments while hemp is majorly cultivated in field, both cannabis types are grown globally in varied geographical settings. These varied environments expose cannabis to a wide range of potential pathogens that can impact negatively on harvest and yield. Pathogens that impact flower quality in field settings is a major driver for HRT cannabis being grown in controlled environments. Because of prohibition breeding HRT and Hemp type cannabis for resistance to potential pathogens has been limited. Particularly for HRT cannabis, altering the growth environment has been generally used in place of breeding for resistance. With the lifting of prohibition in many regions, cultivation is occurring in new environments leading to exposure to a range of new pathogen challenges. It is estimated that more than 90 different fungal species can cause disease in cannabis.

The fungal pathogen Botrytis cinerea (Botrytis) referred to as grey mould is of the most significant hindrances to cultivation of cannabis in controlled environment and in field cannabis production systems. Botrytis is an air-borne necrotrophic fungus that produces spreading lesions covered by a grey mat of conidia on leaves, flowers, and stems that can lead to rapid decay of cannabis plants. Flowers of cannabis in their later stages of development and at post-harvest are particularly susceptible.

Botrytis secretes lytic enzymes in order to compromise and invade plant tissue, this is followed by the synthesis of phytotoxic metabolites to trigger an oxidative burst and the induction of host programmed cell death. Infected plant cells collapse allowing nutrients to be extracted for fungal growth. Environmental factors like humidity can greatly impact success pathogen success, outdoor hemp yield can be reduced by Botrytis up to 32% in rainy years. Botrytis, and other pathogens, can continue to propagate on harvested cannabis flowers contributing to significant post-harvest losses. While the economic impact of Botrytis on Cannabis has not been globally evaluated, in comparison 14-45% of ornamental flowers are lost due to Botrytis in post-harvest storage. Furthermore, Botrytis is recognized globally as being one of the most significant plant pathogens - causing billions of dollars of losses related to reduced harvest yield and loss postharvest.

Management of Botrytis is often approached by identifying resistant varieties. Many varieties are still susceptible and high fungicidal treatments are employed in addition to creating environmental conditions where Botrytis does not thrive. Fungicidal treatment is often not an option for producers concerned about consumer preference and health. Plant management can significantly reduce loss but can be cost intensive. Extending resistance to all susceptible varieties could offer producers of both flower-type and hemp type cannabis solutions to reducing the impact of Botrytis, improving flower quality and yield, without relying on fungicidal treatment. However, breeding for resistant varieties in hemp has been slow to advance and in HRT cannabis is not pursued.

The molecular basis for fungal disease resistance in Cannabis is fundamentally unknown, hindering breeding by marker based selection and population based pre-screening. Furthermore HRT- and hemp cannabis genotypes that are resistant to Botrytis have not been described. In model species like Arabidopsis thaliana research identified the phytohormones jasmonic acid, abscisic acid, salicylic acid, and ethylene as key players in this defense response. The cross talk between these hormones is often altered in mutant plants that are more resistant or susceptible. This underlies that the defense response to fungal pathogens is complex, polygenic and involves overlapping and redundant signaling pathways. Several genes linked to jasmonate and ethylene hormone signaling, have been suggested as candidate genes for improving disease resistance in Arabidopsis, however no equivalent candidate genes have been identified in Cannabis.

Because there are no good methods for selecting disease resistant cannabis seedlings based on morphological indicators early on, molecular markers would be useful for assessing disease resistance and susceptibility at the seedling stage in order to use those selections for production or as the basis for breeding.

Selection of cannabis plants with resistance to disease caused by these pathogens is challenging for breeders and producers as plants may not exhibit symptoms of the disease until later stages of growth, after flower, or sometimes not at all because the pathogen is not present. As such, the identification of molecular markers to identify and select for and against cannabis plants that may have increased disease resistance would be a significant contribution to the cannabis industry by lowering the prevalence and damage associated with fungal pathogens in HRT and fiber-type cannabis production systems.

Traditional resistance breeding strategies, while potentially useful, are slower and can result in the loss of favorable linked characteristics.

In the present invention, several genetic regions in cannabis that significantly associate with cannabis resistance to B. cinerea were identified from a parent population displaying such resistance, with the aim of developing varieties with a stabilized B. cinerea resistance phenotype. The markers were also shown to be useful in breeding cannabis spp. plants displaying resistance to other pathogens.

SUMMARY OF THE INVENTION

The present invention describes methods of identifying or characterizing a Cannabis spp. plant comprising a quantitative trait locus (QTL) associated with resistance to biotic pathogens, and to Cannabis spp. plants comprising the QTL of interest. The invention also relates to disease resistant plants identified by the methods. The invention further relates to marker assisted selection and marker assisted breeding methods for obtaining plants that have general disease resistance, as well as to methods of producing disease resistant Cannabis spp. plants and to the disease resistant plants produced by these methods.

According to a first aspect of the present invention there is provided for a method for characterizing a Cannabis spp. plant with respect to a pathogen resistance trait, and in particular to a Botrytis resistance trait, the method comprising the steps of: (i) genotyping at least one plant with respect to at least one pathogen resistance QTL by detecting one or more polymorphisms associated with the pathogen resistance trait as defined in Table 2 or 9; and (ii) characterizing the one or more plants with respect to the at least one pathogen resistance QTL as having an increased pathogen resistance QTL, a decreased pathogen resistance QTL or an intermediate pathogen resistance QTL based on the genotype at the polymorphism. In a first embodiment of the method for characterizing a Cannabis spp. plant with respect to a pathogen resistance phenotype, the polymorphism may be selected from the group consisting of “common_3407”, “common_703”, “common_1691”, “common_701”, or combinations thereof as defined in Table 2 or 9. These markers have all been shown to have particularly high predictive value for the pathogen resistance QTL and trait.

In a second embodiment of the method for characterizing a Cannabis spp. plant with respect to a pathogen resistance phenotype, the genotyping may be performed by any PCR- based detection method using molecular markers, by sequencing of PCR products containing the one or more polymorphisms, by targeted resequencing, by whole genome sequencing, or by restriction-based methods, for detecting the one or more polymorphisms.

In a third embodiment of the method for characterizing a Cannabis spp. plant with respect to a pathogen resistance phenotype, the molecular markers may be for detecting polymorphisms at regular intervals within the at least one pathogen resistance QTL such that recombination can be excluded. In an alternative embodiment, the molecular markers may be for detecting polymorphisms at regular intervals within the at least one pathogen resistance QTL such that recombination can be quantified to estimate linkage disequilibrium between a particular polymorphism and the pathogen resistance phenotype. It will be appreciated by those of skill in the art that several possible markers may be designed for detecting the polymorphisms. For example, molecular markers may be for detecting polymorphisms such that recombination events can be detected to a resolution of 10’000 or 100’000 or 500’000 base pairs within the QTL. In one embodiment, the molecular markers may be designed based on a context sequence for the polymorphism described in Table 3 or 10, or may be selected from the primer pairs as defined in Table 4 or 11 . In another embodiment, the molecular markers may further be designed based on a context sequence for a causative polymorphism described in Table 8 or a genetic marker linked to the QTL described in Table 13.

In a fourth embodiment of the method for characterizing a Cannabis spp. plant with respect to a pathogen resistance phenotype, the pathogen resistance QTL may be selected from the group consisting of: i. a quantitative trait locus having a sequence that corresponds to nucleotides 18901 130 to 23661598 of NC_044371.1 of the CS10 genome and is defined by one or more polymorphisms associated with pathogen resistance as defined in Table 2 or 9, or a genetic marker linked to the QTL, such as a genetic marker described in Table 13; ii. a quantitative trait locus having a sequence that corresponds to nucleotides 86855033 to 90075725 of NC_044375.1 of the CS10 genome and is defined by one or more polymorphisms associated with pathogen resistance as defined in Table 2, or a genetic marker linked to the QTL; iii. a quantitative trait locus defined by a single nucleotide polymorphism at position 79997 of NC_044373.1 of the CS10 genome and associated with pathogen resistance as defined in Table 2, or a genetic marker linked to the QTL; and/or iv. a combination of any of (i) to (iii).

In a fifth embodiment of the method for characterizing a Cannabis spp. plant with respect to a pathogen resistance trait, the pathogen resistance trait is a Botrytis cinerea resistance trait, and the pathogen resistance QTL is a Botrytis resistance QTL.

According to a second aspect of the present invention, there is provided for a method of producing a Cannabis spp. Plant having a pathogen resistance trait of interest, the method comprising the steps of: (i) providing a donor parent plant having in its genome at least one pathogen resistance QTL characterized by one or more polymorphisms associated with the pathogen resistance trait of interest as defined Table 2 or 9; (ii) crossing the donor parent plant having the at least one pathogen resistance QTL with at least one recipient parent plant to obtain a progeny population of cannabis plants; (iii) screening the progeny population of cannabis plants for the presence of the at least one pathogen resistance QTL; and (iv) selecting one or more progeny plants having the at least one QTL, wherein the mature plant displays the pathogen resistance trait of interest. The pathogen resistance trait of interest may be an increased pathogen resistance trait, a decreased pathogen resistance trait, or an intermediate pathogen resistance. In this way, the trait can be modulated in a plant using the pathogen resistance QTL of the invention.

In a first embodiment of the method of producing a Cannabis spp. Plant having a pathogen resistance trait of interest, the method may further comprise the steps of: (v) crossing the one or more progeny plants with the donor recipient plant; or (vi) selfing the one or more progeny plants.

According to a second embodiment of the method of producing a Cannabis spp. Plant having a pathogen resistance trait of interest, the screening may comprise genotyping at least one plant from the progeny population with respect to the at least one pathogen resistance QTL by detecting one or more polymorphisms associated with the pathogen resistance trait of interest as defined Table 2 or 9.

In a third embodiment of the method of producing a Cannabis spp. Plant having a pathogen resistance trait of interest, the method my comprise a step of genotyping the donor parent plant with respect to the at least one pathogen resistance QTL by detecting one or more polymorphisms associated with the pathogen resistance trait of interest as defined Table 2 or 9, prior to step (i).

According to a fourth embodiment of the method of producing a Cannabis spp. Plant having a pathogen resistance trait of interest, the genotyping may be performed by a PCR-based detection using molecular markers, by sequencing of PCR products containing the one or more polymorphisms, by targeted resequencing, by whole genome sequencing, or by restriction-based methods, for detecting the one or more polymorphisms. In a fifth embodiment of the method of producing a Cannabis spp. Plant having a pathogen resistance trait of interest, the molecular markers may be for detecting polymorphisms at regular intervals within the pathogen resistance QTL such that recombination can be excluded. In an alternative embodiment, the molecular markers may be for detecting polymorphisms at regular intervals within the pathogen resistance QTL such that recombination can be quantified to estimate linkage disequilibrium between a particular polymorphism and the pathogen resistance phenotype. For example, molecular markers may be for detecting polymorphisms such that recombination events can be detected to a resolution of 10’000 or 100’000 or 500’000 base pairs within the QTL. It will be appreciated by those of skill in the art that several possible markers may be designed for detecting the polymorphisms. In one embodiment, the molecular markers may be designed based on a context sequence for the polymorphism described in Table 3 or 13, or may be selected from the primer pairs as defined in Table 4 or 11 . In another embodiment, the molecular markers may further be designed based on a context sequence for a causative polymorphism described in Table 8 or a genetic marker linked to the QTL described in Table 13.

According to a further embodiment of the method of producing a Cannabis spp. plant having a pathogen resistance trait of interest, the pathogen resistance QTL may be an increased pathogen resistance QTL, a decreased pathogen resistance QTL, or an intermediate pathogen resistance QTL as defined in Table 2 or 9. Of particular use in producing a Cannabis spp. plant having a pathogen resistance trait of interest, are the polymorphisms selected from the group consisting of “common_3407”, “common_703”, “common_1691”, “common_701”, or combinations thereof, as defined in Table 2 or 9, which have been shown to have particularly high predictive value for the pathogen resistance QTL and trait.

According to a third aspect of the present invention there is provided for a method of producing a Cannabis spp. plant that has a pathogen resistance trait of interest, the method comprising introducing at least one pathogen resistance QTL characterized by one or more polymorphisms associated with the pathogen resistance trait of interest as defined in Table 2 or 9 into a Cannabis spp. plant, wherein said QTL is associated with the pathogen resistance trait of interest in the plant. In one embodiment, introducing the at least one pathogen resistance QTL comprises crossing a donor parent plant having the at least one pathogen resistance QTL with a recipient parent plant. In an alternative embodiment, introducing the at least one pathogen resistance QTL comprises genetically modifying the Cannabis spp. plant. Several methods of genetic modification are known to those of skill in the art, including targeted mutagenesis, genome editing, and gene transfer. For example, one or more of the polymorphisms associated with the pathogen resistance trait of interest as defined in Table 2 or 9 herein, or a causative polymorphism linked thereto, may be introduced into a plant by mutagenesis and/or gene editing. In particular the methods of genetically modifying a plant may be selected from the group consisting of CRISPR-Cas9 targeted gene editing, heterologous gene expression using various expression cassettes; TILLING, and non-targeted chemical mutagenesis using e.g., EMS. For example, CRISPR-Cas9 targeted gene editing may be achieved using a guide RNA. Alternatively, a cannabis spp. plant may be transformed with a cassette containing the QTL associated with the pathogen resistance trait of interest or a part thereof, via any transformation method known in the art.

In a one embodiment of the method of producing a Cannabis spp. plant that has a pathogen resistance trait of interest, the pathogen resistance QTL may be selected from the group consisting of: i. a quantitative trait locus having a sequence that corresponds to nucleotides 18901 130 to 23661598 of NC_044371.1 of the CS10 genome and is defined by one or more polymorphisms associated with pathogen resistance as defined in Table 2 or 9, or a genetic marker linked to the QTL, such as a genetic marker described in Table 13; ii. a quantitative trait locus having a sequence that corresponds to nucleotides 86855033 to 90075725 of NC_044375.1 of the CS10 genome and is defined by one or more polymorphisms associated with pathogen resistance as defined in Table 2, or a genetic marker linked to the QTL; iii. a quantitative trait locus defined by a single nucleotide polymorphism at position 79997 of NC_044373.1 of the CS10 genome and associated with pathogen resistance as defined in Table 2, or a genetic marker linked to the QTL; and/or iv. a combination of any of (i) to (iii).

According to a further embodiment of the method of producing a Cannabis spp. plant that has a pathogen resistance trait of interest, the pathogen resistance trait may be a Botrytis cinerea resistance trait.

According to a fourth aspect of the present invention there is provided for a Cannabis spp. plant characterized according to the method for characterizing a Cannabis spp. plant with respect to a pathogen resistance phenotype as described herein.

In a fifth aspect of the present invention there is provided for a Cannabis spp. plant produced according to the method of a producing a Cannabis spp. plant having a pathogen resistance trait of interest as described herein. In some embodiments, the Cannabis spp. plant characterized or produced according to the method as described herein is not exclusively obtained by means of an essentially biological process.

According to a further aspect of the present invention there is provided for a Cannabis spp. plant comprising at least one pathogen resistance QTL characterized by one or more polymorphisms associated with a pathogen resistance trait of interest as defined in Table 2 or 9. In some embodiments, the plant is not exclusively obtained by means of an essentially biological process.

In one embodiment of the Cannabis spp. Plant of the invention, the pathogen resistance trait is a Botrytis cinerea resistance trait. According to another aspect of the present invention there is provided for a quantitative trait locus that controls a pathogen resistance trait in Cannabis spp., wherein the quantitative trait locus has a sequence that corresponds to nucleotides 18901 130 to 23661598 of NC_044371.1 of the CS10 genome and is defined by one or more polymorphisms associated with pathogen resistance as defined in Table 2 or 9, or a genetic marker linked to the QTL, such as a genetic marker described in Table 13.

According to another aspect of the present invention there is provided for a quantitative trait locus that controls a pathogen resistance trait in Cannabis spp., wherein the quantitative trait locus has a sequence that corresponds to nucleotides 86855033 to 90075725 of NC_044375.1 of the CS10 genome and is defined by one or more polymorphisms associated with pathogen resistance as defined in Table 2, or a genetic marker linked to the QTL.

In yet a further aspect of the present invention there is provided for a quantitative trait locus that controls a pathogen resistance trait in Cannabis spp., wherein the quantitative trait locus is defined by a single nucleotide polymorphism at position 79997 of NC_044373.1 of the CS10 genome and is associated with pathogen resistance as defined in Table 2, or a genetic marker linked to the QTL. In some embodiments, the quantitative trait loci may be comprised on an isolated nucleic acid.

According to one aspect of the present invention there is provided for a Cannabis spp. plant comprising a quantitative trait locus, or an isolated nucleic acid comprising the quantitative trait locus, defined herein.

In particular, there is provided for a Cannabis spp. plant comprising a pathogen resistance QTL selected from the group consisting of: i. a quantitative trait locus that controls a pathogen resistance trait in Cannabis spp., wherein the quantitative trait locus has a sequence that corresponds to nucleotides 18901130 to 23661598 of NC_044371.1 of the CS10 genome and is defined by one or more polymorphisms associated with pathogen resistance as defined in Table 2 or 9, or a genetic marker linked to the QTL, such as a genetic marker described in Table 13; ii. a quantitative trait locus that controls a pathogen resistance trait in Cannabis spp., wherein the quantitative trait locus has a sequence that corresponds to nucleotides 86855033 to 90075725 of NC_044375.1 of the CS10 genome and is defined by one or more polymorphisms associated with pathogen resistance as defined in Table 2, or a genetic marker linked to the QTL; and iii. a quantitative trait locus that controls a pathogen resistance trait in Cannabis spp., wherein the quantitative trait locus is defined by a single nucleotide polymorphism at position 79997 of NC_044373.1 of the CS10 genome and is associated with pathogen resistance as defined in Table 2, or a genetic marker linked to the QTL; and/or iv. a combination of any of (i) to (iii). In a further aspect there is provided for an isolated gene that controls a pathogen resistance trait in a Cannabis spp. plant, wherein the gene is selected from the group consisting of the genes as defined in Table 7 with reference to the CS10 genome. In particular, the pathogen resistance trait may be a Botrytis cinerea resistance trait.

According to a further aspect there is also provided for possible causative polymorphisms for botrytis resistance in a cannabis spp. plant selected from: a polymorphism in a gene encoding a Protein kinase domain-containing protein (PKD) (NCBI accession: XP 030492442.1 ) at position 23309645 on chromosome NC_044371.1 with reference to the CS10 genome; and a polymorphism in a gene encoding a PTI6-like protein (NCBI accession: XP 030505675.1 ) at position 86872263 on chromosome NC_044375.1 , as described in Table 8.

BRIEF DESCRIPTION OF THE FIGURES

Non-limiting embodiments of the invention will now be described by way of example only and with reference to the following figures:

Figure 1 : Segregation of Botrytis resistance in the F2 populations tested. The F2 population designation is shown above each plot. The Y-axis shows plant count and the X-axis show the resistance score from 1 -9 with 9 being most susceptible.

Figure 2: A Manhattan plot showing the results of a GWA of Botrytis resistance of a combined Cannabis F2 population using Mixed Linear Model. Each box represents a separate chromosome, with the chromosome name above the plot and the position (in base pairs) on the chromosome on the X-axis below, the Y-axis is the LCD score, -log 10(p).

Figure 3: A Manhattan plot showing the results of a GWA of Botrytis resistance of a training population of diverse cannabis genotypes using Mixed Linear Model. Each box represents a separate chromosome, with the chromosome name above the plot and the position (in base pairs) on the chromosome on the X-axis below, the Y-axis is the LOD score, -log 10(p).

Figure 4: A multiple regression model of Botrytis resistance comparing random markers to markers identified in QTL1 .

SEQUENCES

The nucleic acid and amino acid sequences listed herein and in any accompanying sequence listing are shown using standard letter abbreviations for nucleotide bases, and the standard one or three letter abbreviations for amino acids. It will be understood by those of skill in the art that only one strand of each nucleic acid sequence is shown, but that the complementary strand is included by any reference to the displayed strand. DETAILED DESCRIPTION OF THE INVENTION

The present invention will now be described more fully hereinafter with reference to the accompanying drawings, in which some, but not all embodiments of the invention are shown.

The invention as described should not be limited to the specific embodiments disclosed and modifications and other embodiments are intended to be included within the scope of the invention. Although specific terms are employed herein, they are used in a generic and descriptive sense only and not for purposes of limitation.

As used throughout this specification and in the claims, which follow, the singular forms “a”, “an” and “the” include the plural form, unless the context clearly indicates otherwise.

The terminology and phraseology used herein is for the purpose of description and should not be regarded as limiting. The use of the terms “comprising”, “containing”, “having” and “including” and variations thereof used herein, are meant to encompass the items listed thereafter and equivalents thereof as well as additional items. It is, however, contemplated as a specific embodiment of the present disclosure that the term “comprising” encompasses the possibility of no further members being present, i.e., for the purpose of such an embodiment “comprising” is to be understood as having the meaning of “consisting of’.

Methods are provided herein for identifying and obtaining plants having a pathogen resistance trait of interest, preferably a Botrytis resistance trait of interest, using a molecular marker detection technique. The inventors of the present invention have further produced and selected for increased Botrytis resistance in cannabis plants by crossing plants with various degrees of the botrytis resistance trait. Also demonstrated herein, the inventors were able to use genome wide association (GWA) to identify multiple QTLs linked to increased Botrytis resistance and to show that these QTLs contribute in combination to Botrytis resistance. The method provides evidence that the genetic basis of Botrytis resistance is polygenic. The inventors also show that the QTLs identified for Botrytis resistance can be extended to providing general pathogen resistance to a range of plant pathogens as well. This finding provides for the improvement of methods for producing plants displaying differing degrees of botrytis resistance.

A total of three QTLs for Botrytis resistance were identified in the combined F2 populations tested.

Tables 2 and 9 herein provide several single nucleotide polymorphisms (SNPs) which define the QTLs associated with a pathogen resistance trait of interest, specifically a Botrytis resistance trait of interest. In some embodiments one or more of the identified SNPs can be used to incorporate the increased pathogen resistance trait from a donor plant, containing one or more of the QTLs associated with the trait, into a recipient plant. For example, the incorporation of the pathogen resistance phenotype may be performed by crossing a donor parent plant to a recipient parent plant to produce plants containing a haploid genome from both parents. Recombination of these genomes provides F1 progeny where each haploid complement of chromosomes, of the diploid genome, is comprised of genetic material from both parents. In some embodiments, methods of identifying one or more QTLs that are characterized by a haplotype comprising of a series of polymorphisms in linkage disequilibrium are provided. The QTLs each display limited frequency of recombination within the QTLs. Preferably the polymorphisms are selected from any one described in Table 2 or 9 herein, representing the pathogen resistance QTLs, or a genetic marker linked thereto, such as a genetic marker described in Table 13. Molecular markers may be designed for use in detecting the presence of the polymorphisms and thus the QTLs. Further, the identified QTL polymorphisms and the associated molecular markers may be used in a cannabis breeding program to predict the pathogen resistance trait of interest of plants in a breeding population and can be used to produce cannabis plants that either display an increased pathogen resistance trait, or do not, compared to a control population.

As used herein, reference to a plant or a variety with a “pathogen resistance trait” refers to a plant or a variety that shows resistance to infection by, and disease symptoms from a broad array of plant pathogen species, in particular the fungal species Botrytis cinerea, during the course of plant growth, at the time of harvest, and in post-harvest.

As used herein, reference to a plant or a variety with a “general resistance” refers to a plant or a variety that show resistance to infection by, and disease symptoms from, a broad array of pathogen species during the course of plant growth, at the time of harvest, and in post-harvest. General resistance may represent general defences that limit the spread of pathogens through plant tissues.

As used herein, reference to a plant or variety with a “Botrytis resistance trait” refers to plant or variety that shows resistance to infection by, and disease symptoms from, the fungal species Botrytis cinerea during the course of plant growth, at the time of harvest and in postharvest.

A “pathogen resistance trait of interest” refers to the state of the plant with respect to the pathogen resistance trait, and includes increased pathogen resistance, decreased pathogen resistance, or an intermediate level of pathogen resistance compared with a control population.

As used herein, reference to a plant or variety with an “increased pathogen resistance trait” refers to a plant or variety having a propensity for increased pathogen resistance compared to plants from which it is derived. In some cases, a plant or variety with an increased pathogen resistance trait has a propensity for increased pathogen resistance in comparison to the mean pathogen resistance of plants from a population from which the plant or variety was derived.

As used herein, reference to a plant or variety with a “decreased pathogen resistance trait” refers to a plant or variety having a propensity for decreased pathogen resistance compared to plants from which it is derived. In some cases, a plant or variety with an decreased pathogen resistance trait has a propensity for decreased pathogen resistance in comparison to the mean pathogen resistance of plants from a population from which the plant or variety was derived. As used herein, reference to a plant or variety with an “intermediate pathogen resistance trait” refers to a plant or variety having a propensity for intermediate pathogen resistance compared to plants from which it is derived. In some cases, a plant or variety with an intermediate pathogen resistance trait has a propensity for intermediate or average pathogen resistance in comparison to the mean pathogen resistance of plants from a population from which the plant or variety was derived.

The “time of harvest” is defined with respect to the maturity of the flower, where approximately greater than 50% of the pistils have turned brown in appearance. The “time of harvest” can also be determined by initiation of flowering for hemp-type cannabis or by other agronomic criteria common in the art.

It is a particular aim of the present invention to identify and characterize a plant for the pathogen resistance trait of interest early in the plant lifecycle, particularly prior to the plant displaying the pathogen resistance trait of interest, more particularly to incorporate the pathogen resistance trait of interest into the breeding population early on. This can be achieved by genotyping the plant using molecular markers for detecting at least one QTL associated with the pathogen resistance trait of interest prior to the time of harvest.

As used herein a “quantitative trait locus” or “QTL” is a polymorphic genetic locus with at least two alleles that differentially affect the expression of a continuously varying phenotypic trait when present in a plant or organism which is characterised by a series of polymorphisms in linkage disequilibrium with each other.

As used herein, the term “pathogen resistance QTL” or “pathogen resistance quantitative trait locus” refers to a quantitative trait locus comprising part, or all, of the QTLs characterized by the polymorphisms having an allelic state associated with pathogen resistance, and in particular Botrytis resistance, described Table 2 or 9, or characterized by combinations of said polymorphisms.

In some cases, it is particularly desirable to obtain a plant displaying a pathogen resistance trait or botrytis resistance trait of interest, where it is desired to obtain a plant displaying an increased pathogen resistance trait. However, resistance traits may impart a growth penalty on plant development, which can include, among others, an impact on plant growth, floral development, and yield of desirable cannabis products including flower, fiber, and seed. Thus, in some cases it may be more desirable to obtain plants displaying a decreased pathogen resistance trait, or an intermediate pathogen resistance trait. Thus, the pathogen resistance QTL may be an increased pathogen resistance QTL, a decreased pathogen resistance QTL, or an intermediate pathogen resistance QTL as defined herein.

As used herein, the term “increased pathogen resistance QTL” or “increased pathogen resistance quantitative trait locus” refers to a quantitative trait locus characterized by one or more polymorphisms having an allelic state associated with increased pathogen resistance described or defined in Table 2 or 9. As used herein, the term “decreased pathogen resistance QTL” or “decreased pathogen resistance quantitative trait locus” refers to a quantitative trait locus characterized by one or more polymorphisms having an allelic state associated with decreased pathogen resistance described or defined in Table 2 or 9.

As used herein, the term “intermediate pathogen resistance QTL” or “intermediate pathogen resistance quantitative trait locus” refers to a quantitative trait locus characterized by one or more polymorphisms having an allelic state associated with intermediate pathogen resistance described or defined in Table 2 or 9.

As used herein, “haplotypes” refer to patterns or clusters of alleles or single nucleotide polymorphisms that are in linkage disequilibrium and therefore inherited together from a single parent. The term “linkage disequilibrium” refers to a non-random segregation of genetic loci or markers. Markers or genetic loci that show linkage disequilibrium are considered linked.

As used herein, the term “pathogen resistance haplotype” refers to the subset of the polymorphisms contained within any one of the pathogen resistance QTLs which exist on a single haploid genome complement of the diploid genome, and which are in linkage disequilibrium with the pathogen resistance trait. Exemplary polymorphisms for determining the pathogen resistance haplotype are provided in Tables 2, 9 and 13 herein.

As used herein, the term “donor parent plant” refers to a plant having a pathogen resistance haplotype, or one or more pathogen resistance alleles, associated with the pathogen resistance trait of interest.

As used herein, the term “recipient parent plant” refers to a plant having a pathogen resistance haplotype, or one or more pathogen resistance alleles, not associated with the pathogen resistance trait of interest.

The term “pathogen resistance allele” refers to the haplotype allele within a particular QTL that confers, or contributes to, the pathogen resistance trait of interest, or alternatively, is an allele that allows the identification of plants with the pathogen resistance trait of interest that can be included in a breeding program, particularly to select for the pathogen resistance trait (“marker assisted breeding”, “marker assisted selection”, or “genomic selection”).

The term “crossed” or “cross” means the fusion of gametes via pollination to produce progeny (e.g., cells, seeds or plants). The term encompasses both sexual crosses (the pollination of one plant by another) and selfing (self-pollination, e.g., when the pollen and ovule are from the same, or genetically identical plant). The term “crossing” refers to the act of fusing gametes via pollination to produce progeny.

The term “GWAS” or “Genome wide association study” or “GWA” or “Genome wide association” as used herein refers to an observational study of a genome-wide set of genetic variants or polymorphisms in different individual plants to determine if any variant or polymorphism is associated with a trait, specifically the pathogen resistance trait, in particular the Botrytis resistance trait. As used herein a “polymorphism” is a particular type of variance that includes both natural and/or induced multiple or single nucleotide changes, short insertions, or deletions in a target nucleic acid sequence at a particular locus as compared to a related nucleic acid sequence. These variations include, but are not limited to, single nucleotide polymorphisms (SNPs), indel/s, genomic rearrangements, and gene duplications.

As used herein, the term “LOD score” or “logarithm (base 10) of odds” refers to a statistical estimate used in linkage analysis, wherein the score compares the likelihood of obtaining the test data if the two loci are indeed linked, to the likelihood of observing the same data purely by chance. The LOD score is a statistical estimate of whether two genetic loci are physically near enough to each other (or “linked”) on a particular chromosome that they are likely to be inherited together. A LOD score of 3 or higher is generally understood to mean that two genes are located close to each other on the chromosome. In terms of significance, a LOD score of 3 means the odds are 1 ,000:1 that the two genes are linked and therefore inherited together.

As used herein, the term “quantile-quantile” or “Q-Q” refers to a graphical method for comparing two probability distributions by plotting their quantiles against each other. If the two distributions being compared are similar, the points in the Q-Q plot will approximately lie on the line y = x. If the distributions are linearly related, the points in the Q-Q plot will approximately lie on a line, but not necessarily on the line y = x. Q-Q plots can also be used as a graphical means of estimating parameters in a location-scale family of distributions.

As used herein, a “causal gene” is the specific gene having a genetic variant (the “causal variant”) which is responsible for the association signal at a locus and has a direct biological effect on the pathogen resistance trait. In the context of association studies, the genetic variants which are responsible for the association signal at a locus are referred to as the “causal variants”. Causal variants may comprise one or more “causal polymorphisms” that have a biological effect on the phenotype.

The term “nucleic acid” encompasses both ribonucleotides (RNA) and deoxyribonucleotides (DNA), including cDNA, genomic DNA, isolated DNA and synthetic DNA. The nucleic acid may be double-stranded or single-stranded. Where the nucleic acid is singlestranded, the nucleic acid may be the sense strand or the antisense strand. A “nucleic acid molecule” or “polynucleotide” refers to any chain of two or more covalently bonded nucleotides, including naturally occurring or non-naturally occurring nucleotides, or nucleotide analogs or derivatives. By “RNA” is meant a sequence of two or more covalently bonded, naturally occurring or modified ribonucleotides. The term “DNA” refers to a sequence of two or more covalently bonded, naturally occurring or modified deoxyribonucleotides. By “cDNA” is meant a complementary or copy DNA produced from an RNA template by the action of RNA-dependent DNA polymerase (reverse transcriptase).

In some embodiments, the nucleic acid molecules of the invention may be operably linked to other sequences. By “operably linked” is meant that the nucleic acid molecules, such as those comprising the QTLs of the invention or genes identified herein, and regulatory sequences are connected in such a way as to permit expression of the proteins when the appropriate molecules are bound to the regulatory sequences. Such operably linked sequences may be contained in vectors or expression constructs which can be transformed or transfected into plant cells or plants for expression. A “regulatory sequence” refers to a nucleotide sequence located either upstream, downstream or within a coding sequence. Generally regulatory sequences influence the transcription, RNA processing or stability, or translation of an associated coding sequence. Regulatory sequences include but are not limited to: effector binding sites, enhancers, introns, polyadenylation recognition sequences, promoters, RNA processing sites, stem-loop structures, translation leader sequences and the like.

The term “promoter” refers to a DNA sequence that is capable of controlling the expression of a nucleic acid coding sequence or functional RNA. A promoter may be based entirely on a native gene, or it may be comprised of different elements from different promoters found in nature. Different promoters are capable of directing the expression of a gene at different stages of development, or in response to different environmental or physiological conditions. An “inducible promoter” is promoter that is active in response to a specific stimulus. Several such inducible promoters are known in the art, for example, chemical inducible promoters, developmental stage inducible promoters, tissue type specific inducible promoters, hormone inducible promoters, environment responsive inducible promoters.

The term “isolated”, as used herein means having been removed from its natural environment. Specifically, the nucleic acid(s) or gene(s) identified herein may be isolated nucleic acids or gene(s), which have been removed from plant material where they naturally occur.

The term “purified”, relates to the isolation of a molecule or compound in a form that is substantially free of contamination or contaminants. Contaminants are normally associated with the molecule or compound in a natural environment, purified thus means having an increase in purity as a result of being separated from the other components of an original composition. The term "purified nucleic acid" describes a nucleic acid sequence that has been separated from other compounds including, but not limited to polypeptides, lipids, and carbohydrates which it is ordinarily associated with in its natural state.

The term “complementary” refers to two nucleic acid molecules, e.g., DNA or RNA, which are capable of forming Watson-Crick base pairs to produce a region of double-strandedness between the two nucleic acid molecules. It will be appreciated by those of skill in the art that each nucleotide in a nucleic acid molecule need not form a matched Watson-Crick base pair with a nucleotide in an opposing complementary strand to form a duplex. One nucleic acid molecule is thus “complementary” to a second nucleic acid molecule if it hybridizes, under conditions of high stringency, with the second nucleic acid molecule. A nucleic acid molecule according to the invention includes both complementary molecules. As used herein a “substantially identical” or “substantially homologous” sequence is a nucleotide sequence that differs from a reference sequence only by one or more conservative substitutions, or by one or more non-conservative substitutions, deletions, or insertions located at positions of the sequence that do not destroy or substantially alter the activity of the polypeptide encoded by the nucleic acid molecule. Alignment for purposes of determining percent sequence identity can be achieved in various ways that are within the knowledge of those with skill in the art. These include using, for instance, computer software such as ALIGN, Megalign (DNASTAR), CLUSTALW or BLAST software. Those skilled in the art can readily determine appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment over the full length of the sequences being compared. In one embodiment of the invention there is provided for a polynucleotide sequence that has at least about 80% sequence identity, at least about 90% sequence identity, or even greater sequence identity, such as about 95%, about 96%, about 97%, about 98% or about 99% sequence identity to the sequences described herein.

Alternatively, or additionally, two nucleic acid sequences may be “substantially identical” or “substantially homologous” if they hybridize under high stringency conditions. The “stringency" of a hybridisation reaction is readily determinable by one of ordinary skill in the art, and generally is an empirical calculation which depends upon probe length, washing temperature, and salt concentration. In general, longer probes required higher temperatures for proper annealing, while shorter probes require lower temperatures. Hybridisation generally depends on the ability of denatured DNA to re-anneal when complementary strands are present in an environment below their melting temperature. A typical example of such “stringent” hybridisation conditions would be hybridisation carried out for 18 hours at 65 °C with gentle shaking, a first wash for 12 min at 65 °C in Wash Buffer A (0.5% SDS; 2XSSC), and a second wash for 10 min at 65 °C in Wash Buffer B (0.1% SDS; 0.5% SSC).

Nucleotide positions of polymorphisms described herein are provided with reference to the corresponding position on the Cannabis sativa (assembly cs10) representative genome, provided as RefSeq assembly accession: GCF 900626175.2 on NCBI, loaded on 14 February 2019, referred to herein as “cs10 reference genome” or “cs10 genome”.

Methods of identifying a QTL or haplotype responsible for pathogen resistance, particularly botrytis resistance, and molecular markers therefor

In some embodiments, methods are provided for identifying a QTL or haplotype responsible for pathogen resistance and for selecting plants with the pathogen resistance trait of interest, thereby to identify the QTL or haplotype responsible for the trait. In some embodiments, the methods may comprise the steps of: a. Identifying a plant that displays the pathogen resistance phenotype within a breeding program. b. Establishing a population by crossing the identified plant to itself (selfing) or a recipient parent plant. c. Genotyping the resultant F1 , or subsequent populations, for example by sequencing methods. d. Performing association studies, including phenotyping and linkage analysis, to discover QTLs and/or polymorphisms contained within the QTL. e. Optionally, identifying cannabis paralogs of previously characterized genes that may be involved in conferring the pathogen resistance trait. f. Developing molecular markers that detect one or more polymorphisms linked to QTLs, alleles within these QTLs, or existing or induced polymorphisms. g. Validating the molecular markers by determining the linkage disequilibrium between the marker and the pathogen resistance trait.

Trait development and introgression

In some embodiments, methods are provided for marker assisted breeding (MAB) or marker assisted selection (MAS) of plants having a pathogen resistance QTL or displaying the pathogen resistance trait. The methods may comprise the steps of: a. Identifying a plant that displays the pathogen resistance trait or phenotype or contain a pathogen resistance QTL as defined herein. b. Establishing a population by crossing the identified plant to itself (selfing) or another recipient parent plant. c. Genotyping and phenotyping the resultant F1 , or subsequent, populations, for example by sequencing methods. d. Performing association studies, inputting phenotype and genotype information to identify genomic regions enriched with polymorphisms associated with the pathogen resistance trait, to discover QTLs and/or polymorphisms contained within the QTL. e. Optionally, identifying cannabis paralogs of previously characterized genes that may be involved in the pathogen resistance trait. f. Developing molecular markers that detect one or more polymorphisms linked to QTLs, alleles within these QTLs, or existing or induced polymorphisms. g. Using the molecular markers when introgressing the QTLs or polymorphisms into new or existing cannabis varieties to select plants containing the pathogen resistance haplotype or the pathogen resistance trait, or plants where the pathogen resistance haplotype or the pathogen resistance trait is absent.

QTLs and Marker Assisted Breeding

In some embodiments, during the breeding process, selection of plants displaying the pathogen resistance may be based on molecular markers designed to detect polymorphisms linked to genomic regions that control the trait of interest by either an identified or an unidentified mechanism. Previously identified genetic mechanisms may, for example, have a direct or pleiotropic effect on pathogen and/or botrytis resistance in a plant. In some embodiments, QTLs containing such elements are identified using association studies. Knowledge of the mode-of- action is not required for the functional use of these genomic regions in a breeding program. Identification of regions controlling unidentified mechanisms may be useful in obtaining plants with the pathogen resistance phenotype, based on identification of polymorphisms that are either linked to, or found within QTLs that are associated with pathogen resistance phenotype using association studies.

Construction of breeding populations

Breeding populations are the offspring of sexual reproduction events between two or more parents. The parent plants (FO) are crossed to create an F1 population each containing a chromosomal complement of each parent. In a subsequent cross (F2), recombination has occurred and allows for mostly independent segregation of traits in the offspring and importantly the reconstitution of recessive phenotypes that existed in only one of the parental lines.

According to some embodiments, QTLs that lead to the pathogen resistance phenotype are identified within synthetic populations of plants capable of revealing dominant, recessive, or complex traits. In one embodiment of the invention, a genetically diverse population of cannabis varieties, that are used to produce the synthetic population are integrate them into a breeding program by unnatural processes. In some embodiments, the identification of parental plants that are a source of resistance are used to create F2 populations. Specific parental populations may be the source of general and/or novel traits. In some embodiments, these processes result in changes in the genomes of the plants. The changes may include, but are not limited to, mutations and rearrangements in the genomic sequences, duplication of the entire genome (polyploidy), or activation of movement of transposable elements which may inactivate, activate or attenuate the activity of genes or genomic elements. According to one embodiment of the invention, the following methods are employed to integrate the plants into a breeding program include some or all of the following: a. Growing plants in rich media or soils under artificial lighting; b. Cloning of plants, often through a multitude of sub-cloning cycles; c. Introduction of plants into in vitro, sterile growth environments, and subsequent removal to standard growth conditions; d. Exposure to mutagens such as EMS, colchicine, silver nitrate, ethidium bromide, dinitroanalines, high concentrations mono or poly-chromatic light sources; e. Growing plants under highly stressful conditions which include restricted space, drought, pathogen challenge, atypical temperatures, and nutrient stresses. Pathogen resistance trait of interest association studies and QTL identification

In some embodiments, the synthetic populations created are either the offspring of sexual reproduction or clones of plants in the breeding program such that genetic material of individuals in the synthetic populations is derived from one, or two, or more plants from the breeding program.

In one embodiment, plants identified within the synthetic population as having a trait of interest, such as the botrytis resistance trait, may be used to create a structured population for the identification of the genetic locus responsible for the trait. The structured population may be created by crossing one (selfing) or more plants and recovering the seeds from those plants.

Plants in the structured population may be fully genotyped using genome sequencing to identify genetic markers for use in the association study (AS) database. Association mapping is a powerful technique used to detect quantitative trait loci (QTLs) specifically based on the statistical correlation between the phenotype and the genotype. In this case the trait is the pathogen resistance phenotype, in particular the Botrytis resistance phenotype. In a population generated by crossing, the amount of linkage disequilibrium (LD) is reduced between genetic marker and the QTL as a function of genetic distance in cannabis varieties with similar genome structures. Simple association mapping is performed by biparental crosses of two closely related lines where one line has a phenotype of interest and the other does not. In some embodiments, advanced population structures may be used, including nested association mapping (NAM) populations or multi-parent advanced generation inter-cross (MAGIC) populations, however it will be appreciated that other population structures can also be effectively used. Biparental, NAM, or MAGIC structured populations can be generated and offspring, at F1 or later generations, may be maintained by clonal propagation for a desired length of time. In some embodiments, QTLs may be identified using the high-density genetic marker database created by genotyping the founder lines and structured population lines. This marker database may be coupled with an extensive phenotypic trait characterization dataset, including, for example, the pathogen resistance phenotype or botrytis resistance phenotype of the plants. Using the association studies described herein, together with accurate phenotyping, this method is able to identify genomic regions, QTLs and even specific genes or polymorphisms responsible for the pathogen resistance or botrytis resistance phenotype that are directly introduced into recipient lines. Polygenic phenotypes may also be identified using the methods described herein.

In one embodiment, the structured population is grown to the flowering stage. To characterize the phenotypes of the lines they are clonally reproduced so the phenotypic data can be collected in feasible replicates.

Genomic Selection

In some embodiments, during the breeding process, selection of plants by genomic selection (GS) may be conducted. Genomic selection is a method in plant breeding where the genome wide genetic potential of an individual is determined to predict breeding values for those individuals. In some embodiments, the accuracy of genomic selection is affected by the data used in a GS model including size of the training population, relationships between individuals, marker density, use of pedigree information, and inclusion of known QTLs.

In some embodiments, a QTL or a SNP known to be associated with a trait that contributes to selection criteria can improve the accuracy of genomic selection models. In some embodiments, a genomic selection model that incorporates pathogen resistance can be improved by the inclusion of the pathogen resistance QTLs in the GS model. In some embodiments, the SNPs described in Tables 2, 9 and 13 herein may be useful in a genomic selection model, for example where genotypes with unknown phenotypes are evaluated using an approach like a random forest algorithm for prediction of the pathogen resistance trait, and particularly in combination, to improve the predictive power of the model. In a further embodiment, combinatorial markers for strengthening genomic selection are provided in Tables 5, 6 and 12.

Molecular Markers to detect polymorphisms

As used herein, the term “marker” or “genetic marker” refers to any sequence comprising a particular polymorphism or haplotype described herein that is capable of detection. For example, a marker may be a binding site for a primer or set of primers that is designed for use in a PCR-based method to amplify and thus detect a polymorphism or haplotype. Alternatively, the marker may introduce a restriction enzyme recognition site, or result in the removal of a restriction enzyme recognition site. Plants can be screened for a particular trait based on the detection of one or more markers confirming the presence of the polymorphism. Marker detection systems that may be used in accordance with the present invention include, but are not limited to polymerase chain reaction (PCR) followed by sequencing, Kompetitive allele specific PCR (KASP), restriction fragment length polymorphisms (RFLPs) analysis, amplified fragment length polymorphisms (AFLPs), cleaved amplified polymorphic sequences (CAPS), or any other markers known in the art.

In some embodiments “molecular markers” refers to any marker detection system, including primers designed based on the context sequences for the polymorphisms identified herein, provided in Table 3, 10 or 13. Such molecular markers may include PCR primers, or targeted sequencing primers such as those described in the examples below, more specifically the primers defined in Table 4 or 1 1 .

For example, PCR primers may be designed that consist of a reverse primer and two forward primers that are homologous to the part of the genome that contains a polymorphism but differ in the 3’ nucleotide such that the one primer will preferentially bind to sequences containing the polymorphism and the other will bind to sequences lacking it. The three primers are used in single PCR reactions where each reaction contains DNA from a cannabis plant as a template. Fluorophores linked to the forward primers provide, after thermocycling, a different relative fluorescent signal for homozygous and heterozygous alleles containing the polymorphism and for those lacking the polymorphism, respectively.

In some embodiments, allele-specific primers may each harbor a unique tail sequence that corresponds with a universal FRET (fluorescence resonant energy transfer) cassette. For example, the primer specific to the SNP may be labelled with a FAM and the other specific primer with a HEX dye. During the PCR thermal cycling performed with these primers, the allele-specific primer binds to the genomic DNA template and elongates, so attaching the tail sequence to the newly synthesized strand. The complement of the allele-specific tail sequence is then generated during subsequent rounds of PCR, enabling the FRET cassette to bind to the DNA. Alleles are discriminated through the competitive binding of the two allele-specific forward primers. At the end of the PCR reaction a fluorescent plate is read using standard tools which may include RT- PCR devices with the capacity to detect florescent signals and is evaluated with commercial software.

If the genotype at a given polymorphism site is homozygous, one of the two possible fluorescent signals will be generated. If the genotype is heterozygous, a mixed fluorescent signal will be generated. By way of example, genomic DNA extracted from cannabis leaf tissue at seedling stage can be used as a template for PCR amplifications with reaction mixtures containing the three primers. Final fluorescent signals can be detected by a thermocycler and analyzed using standard software for this purpose, which discriminates between individuals that are heterozygotes or homozygotes for either allele.

In some embodiments, molecular markers to one, two or more of the SNPs in the haplotype can be used to identify the presence of the QTL and by association, the pathogen resistance trait of interest.

Further, the QTL may include a number of individual polymorphisms in linkage disequilibrium, which constitute a haplotype and which, with high frequency, can be inherited from a donor parent plant as a unit. Therefore, in some embodiments, molecular markers can be utilized which have been designed to identify numerous polymorphisms which are in linkage disequilibrium with other polymorphisms, any of which can be used to effectively predict the phenotype of the offspring for the pathogen resistance trait of interest.

According to some embodiments, any polymorphism in linkage disequilibrium with one or more of the pathogen resistance QTLs can be used to determine the pathogen resistance haplotype in a breeding population of plants, as long as the polymorphism is unique to the pathogen resistance trait in the donor parent plant when compared to the recipient parent plant.

In some embodiments the desired trait is the increased pathogen resistance trait, and the donor parent plant may be a plant that has been genetically modified or selected to include an increased pathogen resistance QTL defined by a polymorphism associated with the increased pathogen resistance trait, for example any, some, or all of the polymorphisms defined in Table 2 or 9 associated with the trait. Alternatively, the desired trait may be the decreased pathogen resistance trait, or intermediate pathogen resistance trait, and the donor parent plant may be a plant that has been genetically modified or selected to include a decreased pathogen resistance QTL or intermediate pathogen resistance QTL defined by a polymorphism associated with the decreased pathogen resistance trait or the intermediate pathogen resistance trait, for example any, some, or all of the polymorphisms defined in Table 2 or 9 associated with the trait, or combinations thereof as described herein.

In some embodiments, donor parent plants, as described above, are used as one of two parents to create breeding populations (F1 ) through sexual reproduction. In this embodiment, donor parent plants may be identified by detecting polymorphisms using the molecular markers as described above.

Methods for reproduction that are known in the art may be used. The donor parent plant provides the trait of interest to the breeding population. The trait is made to segregate through the population (F2) through at least one additional crossing event of the offspring of the initial cross. This additional crossing event can be either a selfing of one of the offspring or a cross between two individuals, provided that each plant used in the F1 cross contains at least one copy of a desired QTL allele or haplotype.

In some embodiments, the pathogen resistance allele or pathogen resistance haplotype in plants to be used in the F1 cross is determined using the described molecular markers. In some embodiments, the resulting F2 progeny, or subsequent progeny, is/are screened for any of the pathogen resistance polymorphisms, and in particular Botrytis resistance polymorphisms, described herein.

The plants at any generation can be produced by asexual means like cutting and cloning, or any method that yields a genetically identical offspring.

Production of Cannabis spp. plants having the increased pathogen resistance trait

It is a particular aim of the present invention to provide for the production of Cannabis spp. plants that have an increased pathogen resistance trait. Accordingly, in some embodiments, a Cannabis spp. plant that has the decreased pathogen resistance trait may be converted into a plant having an increased pathogen resistance trait according to the methods of the present invention by providing a breeding population where the donor parent plant contains an increased pathogen resistance QTL associated with an increased pathogen resistance trait and the recipient parent plant either displays the decreased pathogen resistance trait or contains the decreased pathogen resistance QTL.

In some embodiments the decreased pathogen resistance phenotype may be removed from a recipient parent plant by crossing it with a donor parent plant having the increased pathogen resistance QTL. In some embodiments the donor parent plant has an increased pathogen resistance phenotype and a contains a contiguous genomic sequence characterized by one or more of the polymorphisms of Table 2 or 9 associated with the increased pathogen resistance allele or a haplotype conferring the increased pathogen resistance trait.

In some embodiments, the donor parent plant is any cannabis spp. variety that is cross fertile with the recipient parent plant.

In some embodiments, MAS or MAB may be used in a method of backcrossing plants carrying the pathogen resistance trait, in particular the Botrytis resistance trait, to a recipient parent plant. For example, an F1 plant from a breeding population can be crossed again to the recipient parent plant. In some embodiments, this method is repeated.

In some embodiments, the resulting plant population is then screened for the pathogen resistance trait using MAS with molecular markers to identify progeny plants that contain one or more polymorphism, such as any of those described Table 2 or 9, indicating the presence of an allele of a QTL associated with the increased pathogen resistance phenotype. In another embodiment, the population of cannabis plants may be screened by any analytical methods known in the art to identify plants with desired characteristics, specifically increased pathogen resistance.

Production of Cannabis spp. plants having decreased- or intermediate pathogen resistance

In some embodiments, a Cannabis spp. plant that has the increased pathogen resistance trait may be converted into a plant having a decreased pathogen resistance trait or intermediate pathogen resistance trait according to the methods of the present invention by providing a breeding population where the donor parent plant contains decreased pathogen resistance QTL associated with a decreased pathogen resistance trait, or an intermediate pathogen resistance QTL associated with the intermediate pathogen resistance trait, and the recipient parent plant either displays the increased pathogen resistance phenotype or contains the increased pathogen resistance QTL.

In some embodiments the increased pathogen resistance phenotype may be removed from a recipient parent plant by crossing it with a donor parent plant having the decreased- or intermediate pathogen resistance QTL. In some embodiments the donor parent plant has a decreased- or intermediate pathogen resistance phenotype and a contains a contiguous genomic sequence characterized by one or more of the polymorphisms of Table 2 or 9 associated with the decreased- or intermediate pathogen resistance allele or haplotype associated therewith.

In some embodiments, the donor parent plant is any cannabis spp. Variety that is cross fertile with the recipient parent plant.

In some embodiments, MAS or MAB may be used in a method of backcrossing plants carrying the decreased- or intermediate pathogen resistance trait to a recipient parent plant. For example, an F1 plant from a breeding population can be crossed again to the recipient parent plant. In some embodiments, this method is repeated. In some embodiments, the resulting plant population is then screened for the pathogen resistance trait using MAS with molecular markers to identify progeny plants that contain one or more polymorphism, such as any of those described Table 2 or 9, indicating the presence of an allele of a QTL associated with the decreased- or intermediate pathogen resistance phenotype. In another embodiment, the population of cannabis plants may be screened by any analytical methods known in the art to identify plants with desired characteristics, specifically the decreased- or intermediate pathogen resistance trait.

Methods to genetically engineer plants to achieve the increased pathogen resistance or decreased pathogen resistance using mutagenesis or gene editing techniques

Identifying QTLs, and individual polymorphisms, that correlate with a trait when measured in an F1 , F2, or similar, breeding population indicates the presence of one or more causative polymorphisms in close proximity the polymorphism detected by the molecular marker. In some embodiments, the polymorphisms associated with the increased-, decreased-, or intermediate pathogen resistance trait are introduced into a plant by other means so that a trait, such as the pathogen resistance trait, can be introduced into plants that would not otherwise contain associated causative polymorphisms or removed from plants that would otherwise contain associated causative polymorphisms. For example, the polymorphisms detailed in Tables 2 and 9 are molecular markers that can be used to indicate the presence of a possible causative polymorphism. Further, the polymorphisms detailed in Table 13 may also be useful as molecular markers for predicting the presence of possible causative polymorphisms.

The entire QTLs or parts thereof which confer the pathogen resistance trait of interest described herein may be introduced into the genome of a cannabis spp. plant to obtain plants with the pathogen resistance trait of interest through a process of genetic modification known in the art, for example, but not limited to, heterologous gene expression using an expression cassette including a sequence encoding the QTL(s) or part thereof, the gene(s), or the nucleic acids comprising them. The expression cassettes may contain all or part of the QTL(s) or gene(s), including possible causative polymorphisms.

The trait described herein may be removed from, or introduced into, the genome of a cannabis spp. plant to obtain plants that exclude or include the causative polymorphisms and the potential to display a desired pathogen resistance trait of interest through processes of genetic modification known in the art, for example, but not limited to, CRISPR-Cas9 targeted gene editing, TILLING, non-targeted chemical mutagenesis using e.g., EMS.

The present invention further provides methods for producing a modified Cannabis plant using genome editing or modification techniques. For example, genome editing can be achieved using sequence-specific nucleases (SSNs) the use of which results in chromosomal changes, such as nucleotide deletions, insertions or substitutions at specific genetic loci, particularly those associated with the pathogen resistance of interest as described in Table 2 or 9 or a causative polymorphism linked thereto. Specific causative polymorphisms may include a polymorphism in a gene encoding a Protein kinase domain-containing protein (PKD) (NCBI accession: XP 030492442.1 ) at position 23309645 on chromosome NC_044371.1 with reference to the CS10 genome; and a polymorphism in a gene encoding a PTI6-like protein (NCBI accession: XP 030505675.1 ) at position 86872263 on chromosome NC_044375.1 , as described in Table 8. Non limiting examples of SSNs include zinc finger nucleases (ZFNs), TAL effector nucleases (TALENs), meganuclease, and, clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated protein (Cas) system. In some embodiments, non-limiting examples of Cas proteins suitable for use in the methods of the present invention include Csnl, Cpfl Cas9, Cas 12, Cas 13, Cas 14, CasX and combinations thereof. In one embodiment, a modified Cannabis spp. plant having a pathogen resistance trait of interest, such as a Botrytis resistance trait, is generated using CRISPR/Cas9 technology, which is based on the Cas9 DNA nuclease guided to a specific DNA target by a single guide RNA (sgRNA). For example, the genome modification may be introduced using guide RNA, e.g., single guide RNA (sgRNA) designed and targeted to introduce a polymorphism associated with the pathogen resistance trait of interest, such as a polymorphism associated with the pathogen resistance trait of interest described in Table 2 or 9, or a causative polymorphism linked thereto, such as a polymorphism in a gene encoding a Protein kinase domain-containing protein (PKD) (NCBI accession: XP 030492442.1 ) at position 23309645 on chromosome NC_044371.1 with reference to the CS10 genome; and a polymorphism in a gene encoding a PTI6-like protein (NCBI accession: XP 030505675.1 ) at position 86872263 on chromosome NC_044375.1 , as described in Table 8.

DNA introduction into the plant cells can be performed using Agrobacterium infiltration, virus-based plasmid delivery of the genome editing molecules and mechanical insertion of DNA (PEG mediated DNA transformation, biolistics, etc.). In some embodiments, the Cas9 protein may be directly inserted together with a gRNA (ribonucleoprotein- RNP’s) in order to bypass the need for in vivo transcription and translation of the Cas9+gRNA plasmid in planta to achieve gene editing. In one embodiment, a genome edited plant may be developed and used as a rootstock, so that the Cas protein and gRNA can be transported via the vasculature system to the top of the plant and create the genome editing event in the scion.

According to one embodiment of the present invention, the method of genetically modifying a plant may be achieved by combining the Cas nuclease (e.g., Cas9, Cpf 1 ) with a predefined guide RNA molecule (gRNA). The gRNA is complementary to a specific DNA sequence targeted for editing in the plant genome and which guides the Cas nuclease to a specific nucleotide sequence. The gRNA may be designed based on the context sequence for the polymorphisms provided in Table 3, 8, 10 or 13. The predefined gene specific gRNA’s may be cloned into the same plasmid as the Cas gene and this plasmid is inserted into plant cells as described above. In some embodiments, once the guide RNA molecule and Cas9 nuclease reach the specific predetermined DNA sequence, the Cas9 nuclease cleaves both DNA strands to create double stranded breaks leaving blunt ends. This cleavage site is then repaired by the cellular non homologous end joining DNA repair mechanism resulting in insertions or deletions which introduce a mutation at the cleavage site.

In one embodiment, a deletion form of the mutation may consist of at least 1 base pair deletion. As a result of this base pair deletion the gene coding sequence for a gene responsible for botrytis susceptibility is disrupted and the translation of the encoded protein is compromised either by a premature stop codon or disruption of a functional or structural property of the protein.

In another embodiment, the pathogen resistance trait of interest in Cannabis spp. plants may be introduced by generating gRNA with homology to a specific site of predetermined genes in the Cannabis genome or the QTLs defined herein. This gRNA may be sub-cloned into a plasmid containing the Cas9 gene, and the plasmid inserted into the Cannabis spp. plant cells. In this way site specific mutations in the QTLs are generated, particularly causative polymorphisms linked to a SNP associated with the pathogen resistance trait of interest described in Table 2, 9 or 13, thus effectively conferring pathogen resistance in the genome edited plant. In some embodiments, the causative polymorphism may be a polymorphism in a gene encoding a Protein kinase domaincontaining protein (PKD) (NCBI accession: XP 030492442.1 ) at position 23309645 on chromosome NC_044371 .1 with reference to the CS10 genome; and a polymorphism in a gene encoding a PTI6-like protein (NCBI accession: XP 030505675.1 ) at position 86872263 on chromosome NC_044375.1 , as described in Table 8.

In some embodiments, a modified Cannabis spp. plant exhibiting an increased pathogen resistance trait, such as increased botrytis resistance, may be obtained using the targeted genome modification methods described above, wherein the plant comprises a targeted genome modification to introduce one or more polymorphisms associated with the increased pathogen resistance trait, particularly a causative polymorphism linked to a SNP defined in Table 2, 9 or 13, or alternatively a causative polymorphism detailed in Table 8, wherein the modification effects an increased pathogen resistance trait.

In some embodiments, the genetic modification may be introduced using gene silencing, a process by which the expression of a specific gene product is lessened or attenuated. Gene silencing can take place by a variety of pathways, including by RNA interference (RNAi), an RNA dependent gene silencing process. In one embodiment, RNAi may be achieved by the introduction of small RNA molecules, including small interfering RNA (siRNA), microRNA (miRNA) or short hairpin RNA (shRNA), which act in concert with host proteins (e.g., the RNA induced silencing complex, RISC) to degrade messenger RNA (mRNA) in a sequence-dependent fashion. In particular, RNAi may be used to silence one or more of the putative causative genes described in Table 7 herein. Such RNAi molecules may be designed based on the sequence of these genes. These molecules can vary in length (generally 18-30 base pairs) and may contain varying degrees of complementarity to their target mRNA in the antisense strand. Some, but not all, RNAi molecules have unpaired overhanging bases on the 5' or 3' end of the sense strand and/or the antisense strand. As used herein, the term “RNAi molecule” includes duplexes of two separate strands, as well as single strands that can form hairpin structures comprising a duplex region. The RNAi molecules may be encoded by DNA contained in an expression cassette and incorporated into a vector. The vector may be introduced into a plant cell using Agrobacterium infiltration, virus-based plasmid delivery of the vector containing the expression cassette and/or mechanical insertion of the vector (PEG mediated DNA transformation, biolistics, etc.).

Plants may be screened with molecular markers as described herein to identify transgenic individuals with or without pathogen resistance or having an increased, decreased or intermediate resistance QTL or polymorphism(s), following the genetic modification.

In some embodiments, cannabis spp. plants having one or more of the polymorphisms of Table 2, 9 or 13 associated with the increased pathogen resistance, decreased pathogen resistance, or intermediate pathogen resistance QTLs or a causative polymorphism linked thereto are provided. Possible causative polymorphism include a polymorphism in a gene encoding a Protein kinase domain-containing protein (PKD) (NCBI accession: XP 030492442.1 ) at position 23309645 on chromosome NC_044371.1 with reference to the CS10 genome; and a polymorphism in a gene encoding a PTI6-like protein (NCBI accession: XP 030505675.1 ) at position 86872263 on chromosome NC_044375.1 , as described in Table 8. The polymorphisms may be introduced, for example, by genetic engineering. In some embodiments the one or more polymorphisms associated with the pathogen resistance of interest or linked thereto are introduced into the plants by breeding, such as by MAS or MAB, for example as described herein.

The increased pathogen resistance, decreased pathogen resistance, or intermediate pathogen resistance QTLs described herein, or genes identified herein responsible for conferring an increased, decreased, or intermediate pathogen resistance trait, may be under the control of, or operably linked to, a promoter, for example an inducible promoter. Such QTLs or genes may be operably linked to the inducible promoter so as to induce or suppress the pathogen resistance trait or phenotype in the plant or plant cell.

Accordingly, in a further embodiment, Cannabis spp. plants comprising an increased, decreased, or intermediate pathogen resistance QTL described herein, or one or more polymorphisms associated therewith, are provided. In some cases, such plants are provided for with the proviso that the plant is not exclusively obtained by means of an essentially biological process.

The following examples are offered by way of illustration and not by way of limitation. EXAMPLE 1

Genome-wide association studies (GWAS) of Botrytis resistance in Cannabis

During outdoor field trials in 2020 the inventors grew to harvest several hundred distinct cannabis HRT and hemp cultivars of diverse origin. The investigators scored all plants for resistance or susceptibility to Botrytis at the time of harvest. The investigators identified cannabis plants that were generally resistant to Botrytis and plants that were susceptible to Botrytis infection. Until this time there was no indication of the resistance status of any genotype of cannabis to Botrytis. The inventors generated crosses between susceptible and resistant cannabis genotypes identified in the 2020 field experiment in order to create F2 populations that segregate for resistance to Botrytis. The inventors evaluated the F2 populations in a field experiment in 2021 at the time of harvest for resistance or susceptibility to Botrytis cinerea.

The inventors found that a number of the F2 populations generated had varying levels of Botrytis resistance amongst individuals at the time of harvest in the 2021 field trial. They sought to better understand the molecular basis for disease resistance by scoring disease symptoms in these designed F2 populations, comprising in total 1075 individuals (Table 1 ). The inventors found that resistance to Botrytis was segregating within the individual populations selected (Figure 1 ).

Table 1 : An overview of the F2 populations used including the (Average) average Botrytis resistance score (a score from 1 -9, where 1 is resistant and 9 is susceptible, the (StDev) the standard deviation of the Botrytis resistance score, and the population size given by the number of plants.

At the time of harvest whole plants including flower, leaf, and stem were scored for susceptibility to Botrytis cinerea. This trait has been quantified using a subjective scale from 1 to 9 where 1 indicates plants with high resistance to Botrytis cinerea, while 9 indicate plants with high susceptibility to Botrytis i.e. showing the presence of Botrytis disease symptoms and Botrytis conidiophore. Fungal- pathogens that mimic the disease symptoms of Botrytis cinerea cannot be ruled out, however visual identification of Botrytis is commonplace.

To complement visual identification, PCR based assays were used to demonstrate the pathogen observed was Botrytis cinerea. DNA was extracted from about 70 mg of leaf discs from all the plants evaluated using an adapted kit with “sbeadex” magnetic beads by LGC Genomics. Published primers to detect the Botrytis cinerea, Glyceraldehyde 3-phosphate dehydrogenase (Botr_G3PDH_Fw ATT GAC ATC GTC GCT GTC AAC GA (SEQ ID NO:1 ) and Botr_G3PDH_Rv ACC CCA CTC GTT GTC GTA CCA (SEQ ID NO:2)) and Heat Shock Protein 60 genes (Botr_HSP60_Fw CAA TTG AGA TTT GCC CAC AAG (SEQ ID NO:3) and Botr_HSP60_Rv GAT GGA TCC AGT GGT ACC GAG CAT (SEQ ID NO:4)) were used with an annealing temp 55°C and 120 sec elongation time. The PCR based assay confirmed the visual identification of Botrytis in all samples analyzed.

Looking at the individual populations, the segregation of the Botrytis resistance trait can be observed (Figure 1 ). Notably the populations have distributions of plants between resistant and susceptible, however the average resistance in the individual population differ, where some populations are on average more susceptible to Botrytis than others. For example, population GID 210020460000 is comprised of plants that are more susceptible to Botrytis on average than the others (Figure 1 ). This may indicate that the individual populations are segregating for shared as well as distinct resistance traits.

Following scoring for Botrytis resistance, All F2 plants described in Table 1 were sequenced.

DNA was extracted from about 70 mg of leaf discs from all the plants evaluated using an adapted kit with “sbeadex” magnetic beads by LGC Genomics, which was automated on a KingFisher Flex with 96 Deep-Well Head by Thermo Fisher Scientific.

The extracted DNA served as a template for the subsequent library preparation for sequencing. The library pools were prepared according to the manufacturer’s instructions (AgriSeq™ HTS Library Kit — 96 sample procedure from Thermo Fisher Scientific). Targeted sequencing of a custom SNP marker panel was employed. The inventors designed the marker panel based on a pangenome comprising 13 genotypes in order to include SNP markers common to a broad diversity of HRT and hemp cannabis including the Cannabis Sativa CS10 reference genome. Targeted sequencing was carried out on the Ion Torrent system by Thermo Fisher Scientific. The primers for the SNPs identified are provided in Table 4, below. The library pool was loaded onto Ion 550 chips with Ion Chef and sequenced with Ion GeneStudio S5 Plus according to the manufacturer’s instructions (Ion 550™ Kit from Thermo Fisher Scientific).

From a population of 9 combined F2 populations with a total of 1075 individuals, a genome-wide association study (GWAS) was performed to detect significant associations between genotypic information derived from targeted resequencing of the custom SNP marker panel described above and the susceptibility of Cannabis to Botrytis. The Botrytis resistance scores from 1 -9, where 1 indicates most resistant plants, were used as an input for GW AS.

The genotypic matrix was filtered for SNPs having a minor allele frequency lower than 1 %. The GWA was performed using GAPIT version 3 (Wang and Zhang, 2021 ) with five statistical models: General Linear Model (GLM), Mixed Linear Model (MLM), FarmCPU and Blink (model=c(“GLM”, “MLM”, “FarmCPU”, “Blink”). A quantile-quantile plot (QQ plot) was used to evaluate the statistical models. In all models we failed to detect SNPs with significant association with Botrytis resistance.

For better modelling the inventors reasoned that a high rate of missing values may be impacting the estimation of population structure and kinship among individuals. To solve this, the GWA with all F2 populations combined was performed again but instead used a SNP matrix that underwent a round of imputation for reducing the number of missing values. In order to reduce missing data in the genotype file, an imputation has been performed using the HapMapJmputation software Briefly, the genotype file is converted to a hapmap format (comma separated, esalq/Hapmap-and-VCF-formats-and-its-integration-with-onemap /#hapmap).

In a first step, HapMapJmputation counts the occurrence of each nucleotide at every single genotyped position. The most common nucleotide is defined as major allele, the second is defined as minor allele. Missing genotyping information is excluded. In the case major and minor alleles occur at the same number, the nucleotide of the reference cs10 (available as GCF 900626175.1 on NCBI) is chosen as major allele. Subsequently, HapMapJmputation sorts markers by position and parses the hapmap into the required fastPHASE (Scheet and Stephens, 2006) input format. Briefly, HapMapJmputation splits the haplotypes into two separate rows, converts major and minor alleles into 0 and 1 respectively and produces temporary files for each chromosome.

During the third step, HapMapJmputation downloads the latest fastPHASE version and runs the imputation using 8 cores in parallel. fastPHASE is run with ten random starts of the imputation algorithm. After imputation, HapMapJmputation reverses the 0 and 1 coding into the major and minor nucleotide, respectively. Subsequently, the two haplotypes are combined, and the separate chromosomes are merged into a single file.

This combination of techniques resulted in 4461 SNP markers after filtering using 9 F2 populations made up of a total 978 plants. The GWA was performed using GAPIT version 3 (Wang and Zhang, 2021) with a Mixed Linear Model (MLM). SNPs surpassing a LOD (-log (p- value)) value of 3 were considered to have a significant association with trait variation.

SNPs showing a significant association with susceptibility, with an LOD value greater than 3, were found on chromosome NC_044371 .1 , NC_044373.1 , NC_044375.1 with reference to the Cannabis Sativa CS10 genome, shown in Figure 2, defining three QTLs: QTL1 , QTL2, and QTL3, respectively. The homozygous allele of the SNPs in Table 2 that can distinguish botrytis resistance are listed along with their position and reference sequence. Included also are the possible alleles at this position and the corresponding average resistance to Botrytis based on a score from 1 -9, with a score of 1 being complete resistance. Interestingly, for each significant SNP identified as associated with Botrytis resistance, the heterozygous allele shows intermediate resistance compared to the homozygous alleles, indicating this may a semi-dominant trait.

Specifically, the inventors identified a QTL associated with Botrytis resistance on chromosome NC_044371 .1 - QTL1 at position 192011 11 -23661598 defined here by the identified SNPs in Table 2. The SNP most significantly associated with QTL1 is common_703, with an LOD score of 4.23. Based on these findings, if the presence of this or these SNPs associated with QTL are detected they can be used in marker assisted selection or marker assisted identification by someone skilled in the art improve the selection for Botrytis resistance by distinguishing plants with a propensity for Botrytis resistant form non-resistant plants and populations of cannabis (Table 2). In QTL1 , when SNP common_703 is Allele_3, homozygous GG, this would indicate a plant with a significantly increased average resistance to Botrytis as compared to plants with the alternative allele, Allele_1 , or that are heterozygous for this allele, Allele_2, with reference to Table 2.

A second locus associated with Botrytis resistance, QTL2, was identified on NC_044373.1 at position 79997 defined by SNP common_1691 . Common_1691 can be similarly used in marker assisted selection or marker assisted identification by someone skilled in the art selectively to distinguish Botrytis resistant plants or populations from non-resistant plants. In QTL2 when SNP common_1691 is Allele_3, homozygous AA, this would indicate a plant with a significantly increased average resistance to Botrytis as compared to plants with the alternative allele, Allele_1 , or that are heterozygous for this allele, Allele_1 , with reference to Table 2.

The inventors identified a third locus associated with Botrytis resistance on chromosome NC_044375.1 - QTL3 at position 86855033-90075725 defined here by the identified SNPs in Table 2. The SNP most significantly associated with QTL3 common_3407, with an LOD score of 5.44, can be used for marker assisted selection by someone skilled in the art to selectively distinguish Botrytis resistant from non-resistant plants and populations of cannabis, Table 2. In QTL3 when SNP common_3407 is Allele_3, homozygous AA, this would indicate a plant with a significantly increased average resistance to Botrytis as compared to plants with the alternative allele, Allele_2, or that are heterozygous for this allele, Allele_1 , with reference to Table 2. We find that the multiple QTLs identified for Botrytis resistance provide evidence that the genetic basis of this trait is polygenic. The polygenic nature of this trait underlies why breeding for resistance can be challenging.

The reference sequence for each of the SNPs identified is given in Table 3 with reference to the CS10 genome. In T able 4, PCR primers designed to amplify each of the regions containing these SNPs, with reference to the CS10 genome, are provided in order for the allelic variant to be determined. Table 2: SNPs associated with the Botrytis resistance in the F2 populations. The presence of the resistance to Botrytis is predicted by the occurrence of the indicative allele (marked with *). The positions and chromosome of the SNPs are provided with reference to the CS10 reference genome as described herein. The LOD score is provided as LOD. “Mean_1” denotes the average phenotypic value associated with Allele_1 based on scoring for resistance from 1 to 9, where 1 is the highest resistance, “Mean_2” denotes the average phenotypic value associated with Allele_2 based on scoring for resistance from 1 to 9, where 1 indicates the highest resistance and Mean_3 denotes the average phenotypic value associated with Allele_3 scoring for resistance from 1 to 9, where 1 indicates the highest resistance. Count_1 , Count_2, and Count_3 denote the number of plants that contributed to the average phenotypic value of Mean_1 , Mean_2, and Mean_3 respectively. The SNP position on the chromosome is provided with reference to the CS10 reference genome.

Table 3: Detailed information of each of the SNPs associated with botrytis resistance in Cannabis as provided in Table 2. The “ref” reference allele based on the CS10 genome and the identified “alt” alternative allele based on the SNP marker panel are given for each SNP. The “context sequence” is given with the SNP given in brackets. All of the sequences and alleles are provided with reference to the plus strand.

Table 4: Targeted sequencing primers (5’ to 3’) for the SNPs identified in Table 2, as described in Example 1 .

Used individually, the associated SNPs identified that define the three QTLs for Botrytis resistance can distinguish Botrytis resistant plants or populations of cannabis from less resistant plants or populations. The inventors propose that QTL1 , QTL2, and QTL3 also act in combination to contribute to Botrytis resistance. To test this the inventors evaluated the most significant markers from each of the three QTLs (common_703, common_1691 , common_340), in combination, in 978 genotyped plants from the populations used in the above GWA, for Botrytis resistance (Table 2). Similarly, a scale from 1 -9 was used, where 1 indicated highly resistant plants and 0 indicated plants that are not resistant. The inventors found that in plants where common_703 is AA, common_1691 is GG , and common_3407 is CC on average these plants were more susceptible to Botrytis than in any other allelic state tested (Table 5). This indicates that the QTLs identified contribute to Botrytis resistance in combination and that the markers are effective in combination to identify and predict plants that are susceptible to Botrytis. The use of these markers alone and in combination, conducted by someone skilled in the art, is effective at improving the selection of plants that are susceptible to Botrytis in order to purge them from the breeding population improving the likelihood that the breeding population will have improved resistance to Botrytis. By contrast plants are highly resistant where SNP common_703 is GG, common_1691 is AA , and common_3407 is AA (Table 5). Several combinations of allele state conferred broad resistance. Each QTL alone contributed to some Botrytis resistance; however, the average resistance was improved when more than one of the markers for each QTL were in allelic states are predicted to contribute to resistance. When all three markers representing allele states that were predicted to contribute to resistance, the average resistance was improved the most.

Table 5: Combinatorial effect of QTL 1 , 2, and 3 on Botrytis resistance. The three most significantly associated SNPs common_703, common_1691 , and common_3407 for each QTL1 , QTL2, and QTL3, respectively, are evaluated for their effect in combination on average to the resistance of plants with those alleles to Botrytis. Mean indicates the average resistance scored from 1-9 where 1 is most resistant. Count indicates the number of plants and frequency is the occurrence of that particular allele combination in all plants tested.

The inventors reasoned that the QTLs identified could contribute to broader resistance to pathogens beyond Botrytis. The F2 populations that were used in the experiment for Botrytis were also scored for general resistance at the same time they were scored for Botrytis resistance. Cannabis F2 plants deriving from different populations were harvested at maturity between October and November 2021 . Plants were phenotypically assessed for general resistance using a subjective scale from 1 to 9. In this scale, “1” indicates plants with low resistance while a “9” indicates plants with high resistance.

As above, the inventors evaluated the most significant markers, common_703, common_1691 , common_3407, from each of the three QTLs in the 978 plants that were genotyped previously for use in the GWA in combination for general resistance, Table 2. Similarly for Botrytis resistance we found that the QTLs identified contribute to general resistance as phenotyped. Table 6 shows results that indicate that the allele state of the SNPs that are predictive of Botrytis resistance are also predictive of general resistance and that in combination they show a greater affect. For example, plants with the allelic variants common_703 GG, common_1691 AA, common_3407 AA are resistant to Botrytis, this was also found to be true for general resistance (Table 6). Further supporting this, the allelic variants that are associated with loss of Botrytis resistance in these three QTLs are also indicative of a loss of general resistance.

Table 6: Combinatorial effect of QTL 1 , 2, and 3 on general resistance. The three most significantly associated SNPs common_703, common_1691 , and common_3407 for each QTL1 , QTL2, and QTL3, respectively, are evaluated for their effect in combination on average to the general resistance of plants with those alleles. Mean indicates the average general resistance scored from 1 -9 where 9 is most resistant. Count indicates the number of plants and frequency is the occurrence of that particular allele combination in all plants tested.

EXAMPLE 2

Gene Identification

There are presently no known genes identified in Cannabis that have been shown to regulate Botrytis resistance or general resistance in Cannabis. The genetic regulation of disease resistance has been described and characterized in several plant species, however the multitude of different genes involved in this process does not easily allow the identification of disease resistance genes in Cannabis. The inventors considered genes that may influence disease resistance related to basal resistance, signal transduction, genes that may be related to reactive oxygen species, those that have homologs in other organisms that contribute to disease resistance. They next sought to identify putative genes that could encode proteins that may be responsible for changes in Botrytis or general resistance based on SNPs present that could alter that protein’s function or expression. Using the findings of the association studies they identified candidate genes at the QTL identified.

The inventors determined that SNP differences between cannabis genomes could inform which genes play a role in the trait of interest. Short reads from sequenced lines were dereplicated with NGSReadsTreatment (version 1.3, Gaia et aL, 2019) and pre-processed with fastp (version 0.23.2, Chen et aL, 2018). Reads were aligned to the CS10 reference genome with Bowtie2 (version 2.3.5.1 , with options -rg and — rg-id to add read-group identifiers, Langmead and Salzberg, 2012). Only unique alignments with a mapping quality of at least 10 were kept. SNPs were called with freebayes and filtered for a minimal quality of 20 (version v1 ,3.2-40-gcce27fc, parameters -p 2 -min-coverage 20 -g 30000 -min-alternate-count 4 -min-alternate-fraction 0.1 -min-mapping-quality 10 -max-complex-gap -1 , Garrison and Marth, 2012). SNPs were finally filtered for a coverage between 5 and 10,000 within each line and annotated with Ensembl Variant Effect Predictor (version 106.1 , McLaren et aL, 2016).

For each line, the inventors constructed a pseudogenome by incorporating its variants into the CS10 reference genome with vcf-consensus (Danecek et aL, 201 1 ). CS10 annotation was lifted over, to align genes from a reference genome to a target genome, with liftoff (version 1 .6.3, Shumate and Salzberg, 2021 ). Protein and cDNA sequences were extracted with custom scripts. Proteins and cDNA sequences for a given protein/transcript from all lines were aligned with muscle (v3.8.31 , Edgar, 2004).

Proteins from each of the three QTLs, QTL1 : NC_044371.1 being located between 20201 11 1 - 24661598 bp, QTL2: NC_044373.1 being located between 70000-90000 bp, QTL3: NC_044375.1 being located between 85855033-91075725 bp were extracted.

Multiple alignments from protein sequences were converted to tables including the variant positions and protein variants were tested for correlation with the significant SNPs from the GW AS marker panel. Only proteins with significant associations were kept. These proteins were then used to extract all SNPs with the associated genes. SNPs were also tested for association with the significant SNPs from the GWAS marker panel. Only significant SNPs with an effect on the coding sequence were kept and associated proteins with homologs in Arabidopsis were used as candidates (Table 7).

The polymorphisms at the given position in the gene candidates in Table 7 in each of the three QTL, QTL1 -3, are considered to represent a haplotype that contributes to variance in Botrytis resistance. Given the complexity of fungal resistance response in plants, multiple contributors to resistance is not unusual and contributes to the difficulty in identifying specific genetic determinants of resistance. The inventors inspected the gene candidates for effect of the polymorphism likelihood to impact protein function, the predicted function of the respective encoded protein, and homology to known proteins in other plants that have been implicated in fungal resistance.

The investigators determined that at QTL1 the Protein kinase domain-containing protein (PKD) (NCBI accession: XP 030492442.1 ) contributes to Botrytis resistance. A SNP at position 23309645 on chromosome NC_044371.1 leads to a frameshift mutation disrupting the function of the resultant protein. The reference sequence for this SNP can be found in Table 8. The loss of function of this protein kinase domain containing protein will result in the variance in Botrytis by as yet discovered mechanism. Disruption in the perception of fungal elicitors of the plant immune response is likely. Alternatively, Botrytis may subvert host defence through the activation of this receptor pathway. Its disruption may nullify this.

The investigators also determined that at QTL3, PTI6-like protein (NCBI accession: XP 030505675.1 ) contributes to Botrytis resistance. A SNP at position 86872263 (C/A) on chromosome NC_044375.1 leads to a missense variant Proline (P) 11 1 to Threonine (T) 1 11 , with respect to CS10. PTI6-like is 51% identical to the PTI6 in Solanum lycopersicum. The reference sequence for this SNP can be found in Table 8. The proline at amino acid position 1 10 is conserved in Solanum as well as in many other plants species with PTI6 homologs, indicating it may be critical to the protein’s function. A SNP that leads to the loss of proline at this position or a change to proline likely impacts the secondary structure of the protein. Interestingly P1 10 is situated close to the proposed DNA binding domain of this protein, suggestion that a change in structure could impact its DNA binding activity. PTI6 in Solanum has been characterized as being involved in the basal plant defence response. Overexpression of PTI6 in Arabidopsis and Solanum can improve plant defence responses to bacterial challenge. PTI6-like is a AP2/ERF family transcription factor, this class of protein has been shown to bind to the GCC-box pathogenesis-related promoter.

Table 7: A list of genes derived from the three identified QTLs on NC_044371 .1 , NC_044373.1 , and NC_044375.1 . The chromosome, position of the SNP variant, the false discovery rate (FDR) for the SNP variant, the type of variant, the predicted Severity of the Variant on the protein, the protein ID and its description with reference to the CS10 reference genome is given.

Table 8: Detailed information of the two polymorphisms determined to contribute to Botrytis resistance in QTL1 and QTL3 (possible causative polymorphisms). The chromosome and position are given based on the CS10 reference genome. The “context sequence” is given with the polymorphism provided in square brackets. The first mentioned sequence in square brackets is the reference allele based on the CS10 genome and the identified alternative allele based on the polymorphism detected is the second mentioned sequence in square brackets. All of the sequences and alleles are provided with reference to the plus strand.

EXAMPLE 3

Validation of QTL in selection for Botrytis resistance

The inventors conceived to evaluate the identified QTL and SNP markers in predicting plants with resistance to Botrytis for marker-based selection through the use of a training population of 234 diverse cannabis genotypes that included high THC varieties, low THC/high CBD varieties, and assorted hemp plants. The training population was grown and harvested in an open field in 2022. The inventors determined the genotypes and phenotypes of these plants.

To determine genotype, DNA was extracted from about 70 mg of leaf discs from all the plants evaluated using an adapted kit with “sbeadex” magnetic beads by LGC Genomics, which was automated on a KingFisher Flex with 96 Deep-Well Head by Thermo Fisher Scientific.

The extracted DNA served as a template for the subsequent library preparation for sequencing. The library pools were prepared according to the manufacturer’s instructions (AgriSeq™ HTS Library Kit — 96 sample procedure from Thermo Fisher Scientific). Targeted sequencing of a custom SNP marker panel was employed. The inventors designed the marker panel based on a pangenome comprising 13 genotypes in order to include SNP markers common to a broad diversity of HRT and hemp cannabis, including the Cannabis Sativa CS10 reference genome. Targeted sequencing was carried out on the Ion Torrent system by Thermo Fisher Scientific. The primers for the SNP identified are provided in Table 11 . The library pool was loaded onto Ion 550 chips with Ion Chef and sequenced with Ion GeneStudio S5 Plus according to the manufacturer’s instructions (Ion 550™ Kit from Thermo Fisher Scientific). To determine phenotype, at the time of harvest whole plants including flower, leaf, and stem were scored for susceptibility to Botrytis cinerea. This trait has been quantified using a subjective scale from 0 to 5, where a score of 0 indicates plants with high resistance to Botrytis cinerea, while a score of 5 indicates plants with high susceptibility to Botrytis i.e., showing the presence of Botrytis disease symptoms and Botrytis conidiophore. Fungal pathogens that mimic the disease symptoms of Botrytis cinerea cannot be ruled out, however visual identification of Botrytis is commonplace.

The Botrytis resistance scores from 0 - 5, where 0 indicates most resistant plants, were used as an input for GWAS.

For consistency with the original model, the training population data underwent a round of imputation for reducing the number of missing values. In order to reduce missing data in the genotype file, an imputation has been performed using the HapMapJmputation software (GitHub -mwylerCH/HapMap Imputation). Briefly, the genotype file is converted to a hapmap format (comma separated, integral

In a first step, HapMapJmputation counts the occurrence of each nucleotide at every single genotyped position. The most common nucleotide is defined as major allele, the second is defined as minor allele. Missing genotyping information is excluded. In the case major and minor alleles occur at the same number, the nucleotide of the reference cs10 genome is chosen as major allele. Subsequently, HapMapJmputation sorts markers by position and parses the hapmap into the required fastPHASE (Scheet and Stephens, 2006) input format. Briefly, HapMapJmputation splits the haplotypes into two separate rows, converts major and minor alleles into 0 and 1 respectively and produces temporary files for each chromosome.

During the third step, HapMapJmputation downloads the latest fastPHASE version and runs the imputation using 8 cores in parallel. fastPHASE is run with ten random starts of the imputation algorithm. After imputation, HapMapJmputation reverses the 0 and 1 coding into the major and minor nucleotide, respectively. Subsequently, the two haplotypes are combined, and the separate chromosomes are merged into a single file.

This combination of techniques resulted in 4461 SNP markers after filtering using 234 plants. The GWA was performed using GAPIT version 3 (Wang and Zhang, 2021 ) with a Mixed Linear Model (MLM). SNPs surpassing a LOD (-log (p-value)) value of 5 were considered to have a significant association with trait variation.

The GWA identified a single marker with a LOD score greater than 5 in the originally identified QTL1 , “common_701 ” at position 18901130 on NC_044371 with respect to the CS10 genome (Table 9, Figure 3). A reference or context sequence for the SNP “common_701” is provided in Table 10, while exemplary primers that can be used to detect the SNP are provided in Table 1 1. The identification of QTL1 by GWA in the training population is strong validation of this QTL and the use of markers based on this QTL to predict for Botrytis resistance. The GWA was not expected to pick up all originally identified QTLs as the number of plants in the training population were few and the plants used did not belong to a structured population as in the original experiment in Example 1. Additionally, the environmental and pathogen prevalence pressures in the 2022 field season were not identical to 2021 , due to variables outside the inventors’ control.

From the results of the GWA, the inventors validated the finding that QTL1 and the specific markers identified, in Example 1 , Table 2, and including the SNP marker “common_701”, in the GWA are instructive in enabling someone skilled in the art to perform marker assisted selection and marker-based identification to select for plants resistant to Botrytis.

This is supported clearly when evaluating the allelic effect of the markers “common_703” (QTL1), “common_1691” (QTL2), and “common_3407” (QTL3) on average Botrytis resistance in the training population using a combination of these markers (Table 12). For reference, Table 12, evaluates the allele states of these three markers and identifies plants and the average allele effect, in combination, on Botrytis resistance. Marker “common_703” in allele state GG is clearly associated with resistance to Botrytis, as demonstrated in Example 1 . Furthermore, regardless of the allele states of the other two markers it represents an excellent marker for the selection of resistant plants. Notably when selecting based plants based on all three markers in combination, where “common_703” is GG, “common_1691” is AA and “common_3407” is AA, the mean phenotype for Botryits resistance of the plants with this combination of allele states is 0, indicating strong resistance to Botrytis. The combinatorial data also indicates that “common_1691” (QTL2) and “common_3407” (QTL3) are highly predictive for Botrytis resistance in the field trial in 2022. It can also be observed that when the alternative allele state of “common_703” is AA, “common_1691” is GG and “common_3407” is CC is determined, on average these plants are susceptible to Botrytis, this includes heterozygous states of the allele as shown in Table 12.

Table 9: GWA results of a SNP, “common_701” in QTL1 at position 18901130 on chromosome NC_044371 with reference to the CS10 genome, associated with Botrytis resistance in the training population. The presence of the resistance to Botrytis is predicted by the occurrence of the indicative allele (marked with *). The LOD score for the mixed linear model is provided as LOD. “Mean_1” denotes the average phenotypic value associated with Allele_1 based on scoring for resistance, “Mean_2” denotes the average phenotypic value associated with Allele_2 based on scoring for resistance and Mean_3 denotes the average phenotypic value associated with Allele_3 based on scoring for resistance. Count_1 , Count_2, and Count_3 denote the number of plants that contributed to the average phenotypic value of Mean_1 , Mean_2, and Mean_3 respectively.

Table 10: Detailed information of the SNP associated with botrytis resistance in Cannabis as provided in Table 9. The “ref’ reference allele based on the CS10 genome and the identified “alt” alternative allele based on the SNP marker panel are given for each SNP. The “context sequence” is given with the SNP given in brackets. The sequence and alleles are provided with reference to the plus strand.

Table 1 1 : Targeted sequencing primers (5’ to 3’) for the SNP identified in Table 9, as described in Example 3.

Table 12. Combinatorial allele states predictive of Botrytis resistance in the training population. The three most significantly associated SNPs, common_703, common_1691 , and common_3407 for each QTL1 , QTL2, and QTL3, respectively, are evaluated for their effect in combination on average to the Botrytis resistance of plants with those alleles based on scoring for resistance on a scale of 0 - 5, where 0 indicates most resistant plants. The allele state that is predictive of Botryits resistance is shown in parenthesis in the first row. The mean phenotype for Botrytis resistance in specific plants with a combination of the three markers specific allele state is shown. The number of plants with the combination of these markers is given as plant count and the frequency of their appearance in the training population studied is noted.

The inventors next sought to demonstrate the use of a genomic selection model in prediction of the pathogen resistance trait and tested this on the training population of 234 diverse cannabis plants described above. The approach tested QTL1 , where 22 markers were selected from the inventors’ custom marker panel, described in the previous examples, at QTL1. The marker and the corresponding reference sequence are listed in Table 13.

The model based on the 22 markers listed in Table 13 below tests if the markers together improve the prediction power for the selection of Botrytis resistant plants compared to 22 randomly selected markers. The inventors performed a multiple regression analysis with the allele as variable and Botrytis resistance as target using the random forest algorithm implemented in the ranger package (v 0.12.1 , Wright and Zieger, 2017). The resulting R squares are derived from the comparison of the predictions from the developed model with the measured phenotype of the training population (Figure 4). 100 permutations for the 22 specific markers and 100 permutations of the 22 random markers were conducted, each dot in Figure 4 represents one of those permutations. The results of the genomic selection model demonstrate that the 22 specific markers in this region greatly improve the accuracy of selecting for Botrytis resistance for QTL1 , R-squared of ~0.3 over the use of random markers, R-squared of -0.15. Through the modeling approach, the inventors demonstrate that the selected markers for QTL1 significantly enhance the identification of plants resistant to botrytis. In fact, these markers explain twice the resistance variability compared to randomly selected markers, enabling a targeted selection of plants resistant to this disease. Table 13: Detailed information of the 22 additional SNPs used in the genomic selection model for selecting Botrytis resistance for QTL1 in Cannabis. The positions of the SNPs on chromosome NC_044371.1 are provided with reference to the CS10 reference genome as described herein The “context sequence” is provided with the SNP given in square brackets. The first mentioned allele in square brackets is the reference allele based on the CS10 genome and the identified alternative allele based on the polymorphism detected is the second mentioned sequence in square brackets. All of the sequences and alleles are provided with reference to the plus strand.