Isolated DNA comprising one or more genes specific for 5S clavam biosynthesis, vectors comprising such DNA and streptomyces hosts capable of improved clavulanic acid production
Application no:09350702 -
Filed date:1999-07-09 -
Novel bacterial genes, microorganisms and processes for improving the manufacture of 5R clavams, eg. clavulanic acid.
No:435 - Chemistry: molecular biology and microbiology / No:252.3 - Transformants (e.g., recombinant DNA or vector or foreign or exogenous gene containing, fused bacteria, etc.)
No:435 - Chemistry: molecular biology and microbiology / No:320.1 - VECTOR, PER SE (E.G., PLASMID, HYBRID PLASMID, COSMID, VIRAL VECTOR, BACTERIOPHAGE VECTOR, ETC.)
What is claimed is:
1. An isolated DNA comprising one or more genes essential for clavam-2carboxylate (C2C), 2-hydroxymethylclavam (2HMC), or 2-(3-alanyl)clavam (AC) biosyithesis in S. clavuligerus, wherein said one or more genes is not essential for clavulanic acid biosynthesis; wherein said one or more genes include an open reading frame selected from the group consisting of: SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6 and SEQ ID NO:7.
2. The DNA according to claim 1 comprising the sequence of SEQ ID NO:1.
3. The DNA according to claim 1 comprising the sequence of SEQ ID NO:4.
4. An isolated DNA which hybridizes with the DNA of claim 1, wherein said hybridization is performed overnight at 60.degree. C. in a solution consisting of 5.times.SSC, 5.times. Denhardt's solution and 0.5% SDS (1.times.SSC: 0.15M NaCl+0.015M Na.sub.3 citrate; 1.times. Denhardt's solution: 0.02% BSA, 0.02% Ficoll and 0.02%PVP), and washed at 680.degree. C. for 30 minutes in a solution of 0.5% SSC+0.1% SDS; wherein said isolated DNA comprises one or more genes essential for clavam-2-carboxylate (C2C), 2-hydroxymethylclavam (2HMC), or 2-(3-alanyl)clavam (AC) clavam biosynthesis; and wherein said genes are not essential for clavulanic acid biosynthesis.
5. A vector comprising the DNA of claim 1.
6. A Streptomyces host containing the vector of claim 5.
7. A Streptomyces host according to claim 6 which is S. clavuligerus.
8. A vector comprising the DNA of claim 4.
9. A Streptomyces host comprising the vector of claim 8.
10. A Streptomyces host according to claim 9 which is S. clavuligerus.
11. An isolated DNA comprising at least one open reading frame selected from the group consisting of: SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6 and SEQ ID NO:7 wherein at least one said open reading frame is interfered with such that when at least one corresponding endogenous open reading frame in Streptomyces clavuligerus genome is replaced by at least one said interfered with open reading frame the levels of clavam-2-carboxylate (C2C), 2-hydroxymethylclavam (2HMC), or 2-(3-alanyl)clavam (AC) produced by said Streptomyces clavuligerus are reduced or eliminated.
12. A vector comprising the isolated DNA of claim 11.
13. An Streptomyces clavuligerus comprising the isolated DNA of claim 11.
14. Streptomyces clavuligerus having reduced or eliminated clavam-2-carboxylate (C2C), 2-hydroxymethylclavam (2HMC), or 2-(3-alanyl)clavam (AC) biosynthesis wherein at least one open reading frame essential for clavam-2-carboxylate (C2C). 2-hydroxymethylclavam (2HMC), or 2-(3-alanyl)clavam (AC) biosynthesis, but not essential for clavulanic acid biosynthesis, selected from the group consisting of: orfup1 (SEQ ID NO:4), orfdwn 1 (SEQ ID NO:5), orfdwn2 (SEQ ID NO:6) and orfdwn3 (SEQ ID NO:7) is interfered with or eliminated by a genetic technique selected from the group consisting of: gene disruption, random mutagenesis, and site-directed mutagenesis, and wherein clavam-2-carboxylate (C2C), 2-hydroxymethylclavam (2HMC), or 2-(3-alanyl clavam (AC) biosynthesis in said Streptomyces clavuligerus is reduced or eliminated and clavulanic acid production is maintained or increased relative to clavulanic acid production from S. clavuligerus strain ATCC 27064.
15. The S. clavuligerus of claim 14, wherein the open reading frame is orfup1 (SEQ ID NO:4).
16. The S. clavuligerus of claim 14, wherein the open reading frame is orfdwn1 (SEQ ID NO:5).
17. The S. clavuligerus of claim 14, wherein the open reading frame is orfdwn2 (SEQ ID NO:6).
18. The S. clavuligerus of claim 14, wherein the open reading frame is orfdwn3 (SEQ ID NO:7).
19. The S. clavuligerus of claim 14, wherein the biosynthesis of clavam-2-carboxylate (C2C) is reduced or eliminated.
20. The S. clavuligerus of claim 14, wherein the biosynthesis of 2-hydroxymethylclavam (2HMC) is reduced or eliminated.
21. The S. clavuligerus of claim 14, wherein the biosynthesis of 2-(3-alanyl)clavam (AC) is reduced or eliminated.
22. Streptomyces clavuligerus wherein a portion of at least one open reading from selected from the group consisting of: orfup1 (SEQ ID NO:4), orfdwn 1 (SEQ ID NO:5), orfdwn2 (SEQ ID NO:6) and orfdwn3 (SEQ ID NO:7) is deleted from the genome of said Streptomyces clavuligerus and wherein clavam-2-carboxylate (C2C), 2-hydroxymethylclavam (2HMC), or 2-(3-alanyl)clavam (AC) biosynthesis by said Streptomyces clavuligerus is reduced or eliminated and clavulanic acid production by said Streptomyces clavuligerus is maintained or increased relative to clavulanic acid production from S. clavuligerus strain ATCC 27064.
23. The Streptomyces clavuligerus of claim 22 wherein the portion of said at least one open reading frame that is deleted from said genome comprises at least one entire open reading frame.
FIELD OF THE INVENTION
The present invention relates to novel bacterial genes and processes for improving the manufacture of clavams e.g. clavulanic acid. The present invention also provides novel organisms capable of producing increased amounts of clavulanic acid.
BACKGROUND OF THE INVENTION
Microorganisms, in particular Streptomyces sp. produce a number of antibiotics including clavulanic acid and other clavams, cephalosporins, polyketides, cephamycins, tumicamycin, holomycin and penicillins. There is considerable interest in being able to manipulate the absolute and relative amounts of these antibiotics produced by the microorganism and accordingly there have been a large number of studies investigating the metabolic and genetic mechanisms of the biosynthetic pathways (Domain, A. L. (1990) "Biosynthesis and regulation of beta-lactam antibiotics." In: 50 years of Penicillin applications, history and trends). Many of the enzymes which carry out the various steps in the metabolic pathways and the genes which code for these enzymes are known.
Clavams can be arbitrarily divided into two groups dependent on their ring stereochemistry (5S and 5R clavams). The biochemical pathways for the biosynthesis of 5R and 5S clavams have not yet been fully elucidated but it has been suggested that they are derived from the same starter units (an as yet unidentified 3 carbon compound (Townsend, C.A. and Ho, M. F. (1985) J. Am. Chem. Soc. 107 (4), 1066-1068 and Elson, S. W. and Oliver, R. S. (1978) J. Antibiotics XXXI No. 6. 568) and arginine (Velentine, B. P. et al (1993) J. Am Chem. Soc. 15, 1210-1211) and share some common imtermediates ([Iwata-Reuyl, D. and C. A. Townsend (1992) J. Am. Chem. Soc. 114: 2762-63, and lane, J. W. et al (1993) Bioorg. Med Chem. lett 3:2313-16).
Examples of 5S clavams include clavarn-2-carboxylate (C2C), 2-hydroxymethylclavam (2HMC), 2(3-alanyl)clavam, valclavam and clavaminic acid (GB 1585661, Rohl, F. et al. Arch. Miorbiol. 147:315-320, U.S. Pat. No. 4,202,819). There are, however, few examples of 5R clavams and by far the most well known is the beta lactamase inhibitor clavulanic acid which is produced by the fermentation of Streptomyces clavligerus. Clavulanic acid, in the form of potassium clavulanate is combined with the beta-lactam amoxycillin in the antibiotic AUGMENTIN (Trade Mark SmithKline Beecham). Because of this commercial interest, investigations into the understanding of clavam biosynthesis have concentrated on the biosynthesis of the 5R clavam, clavulanic acid, by S.clavuligerus. A number of enzymes and their genes associated with the biosynthesis of clavulanic acid have been identified and published, Examples of such publications include Hodgson, J. E. et al., Gene 166, 49-55 (1995), Aidop, K. A. et al., Gene 147, 41-46 (1994), Patadkar, A. S. et al., J. Bact 177(5), 1307-14 (1995). In contrast nothing is known about the biosynthesis and genetics of 5S clavams other than clavarninic acid which is a clavulanic acid precursor produced by the action of clavaminic acid synthase in the clavulanic acid biosynthetic pathway in S. clavuligerus.
Gene cloning experiments have identified that S. clavuligerus contains two clavaminic acid synthase isoenzymes, cas1 and cas2 (Marsh E. N. et al Biochemistry 31, 12648-57, (1992)) both of which can contribute to clavulanic acid production under certain nutritional conditions (Paradkar, A. S. et al., J. Bact. 177(5), 1307-14 (1995)). Clavaminic acid synthase activity has also been detected in other clavulanic acid producing micro-organisms, ie. S. jumonjinensis (Vidal, C. M., ES 550549, (1987)) and S. katsurahamancus (Kitano, K. et al., JP 53-104796, (1978)) as well as S. antibiticos, a producer of the 5S clavam, valclavam (Baldwin, J. E. et al, Tetrahedron Letts. 35(17), 2783-86, (1994)). The later paper also reported S. antibioticos to have proclavarnic acid amidino hydrolase activity, another enzyme known to be involved in clavulanic acid biosynthesis. All other genes identified in S. clavuligerus as involved in clavulanic biosynthesis have been reported to be required for clavulanic acid biosynthesis (Hodgson, J. E. et al., Gene 166, 49-55 (1995), Aidoo, K. A. et al., Gene 147, 41-46 (1994)) and as yet none have been reported which arc specific for the biosynthesis of 5S clavams.
BRIEF SUMMARY OF THE INVENTION
The present invention relates to novel bacterial genes, processes for improving the manufacture of clavams, and novel organisms capable of producing increased amounts of clavulanic acid.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 shows the nucleotide sequence of the S. Clavuligerus chromosome including the flanking cas1 (SEQ ID NO. 1).
DETAILED DESCRIPTION OF THE INVENTION
We have now identified certain genes which are specific for the biosynthesis of 5S clavams as exemplified by C2C and 2HMC in S. clavuligerus. Accordingly the present invention provides DNA comprising one or more genes which are specific for 5S clavam biosynthesis in S. clavuligerus and which are not essential for 5R clavam (e.g. clavulanic acid) biosynthesis.
By "gene" as used herein we also include any regulatory region required for gene function or expression. In a preferred sect the DNA is as identified as FIG. 1 (SEQ ID NO: 1). Preferably the DNA comprises the nucleotide sequences indicated in FIG. 1 (SEQ ID NO: 1) designated as orfup3 (SEQ ID NO: 2), orfup2 (SEQ ID NO: 3), orfup1 (SEQ ID NO: 4), orfdwn1 (SEQ ID NO: 5), orfdwn2 (SEQ ID NO: 6) and orfdwn3 (SEQ ID NO: 7). The present invention also provides proteins coded by said DNA. The present invention also provides vectors comprising the DNA of the invention and hosts containing such vectors.
Surprisingly we found that when at least one of the genes according to the invention is defective the amount of clavulanic acid produced by the organism is increased. Accordingly the present invention also provides processes for increasing the amount of clavulanic acid produced by a suitable microorganism. In one aspect of the invention the genes identified can be manipulated to produce an organism capable of producing increased amounts of clavam, suitably clavulanic acid. The findings of the present work also allow an improved process for the identification of organisms with higher clavulanic acid production comprising a preliminary screening for organisms with low or no 5S clavam production (for example by hplc and/or clavam bioassay as described in the examples herein).
Suitably the 5S clavam genes of the present invention can be obtained by conventional cloning methods (such as PCR) based on the sequences provided herein. The function of the gene can be interfered with or eliminated/deleted by genetic techniques such as gene disruption [Aidoo, K. A. et al., (.1994), Gene, 147, 41-46]., random mutagenesis, site directed mutagenesis and antisense RNA
In a further aspect of the invention there are provided plasmids containing one or more defective genes, preferably the plasmids pCEC060, pCEC061, pCEC056 and pCEC057. described below.
Suitably, the plasmids of the invention are used to transform an organism such as S. clavuligerus, e.g. strain ATCC 27064 (which corresponds to S. clavuligerus NRRL 3585). Suitable transformation methods can be found in relevant sources including Sambrook, J., Fritsch, E. F. and Maniatis, T. (1989), Molecular cloning: a laboratory manual, 2nd Ed., ColdSpring Harbor Laboratory, Cold Spring Harbor, N.Y.; Hopwood, D. A. et al. (1985), Genetic Manipulation of Streptomyces. A Cloning Manual, and Paradkar, A. S. and Jensen, S. E. (1995), J. Bacteriol. 177 (5): 1307-1314.
Strains of the species S. clavuligerus are used industrially to produce clavulanic acid (potassium clavulanate). Within the British and United States Pharmacopoeias for potassium clavulanate (British Pharmacopoeia 1993, Addendum 1994; p1362-3 and U.S. Pharmacopeia Official Monographs 1995, USP 23 NF18 p384-5) the amounts of the toxic 5S clavam, clavam-2-carboxylate, are specifically controlled.
Therefore in a further aspect of the invention there is provided an organism capable of producing high amounts of clavulanic acid but has been made unable to make C2C or capable of producing high amounts of clavulanic acid but able to make only low levels of C2C. Suitably the clavulanic acid producing organism contains one or more defective clavam genes, and is preferably the S. clavuligerus strain 56-1A, 56-3A, 57-2B, 57-1C, 60-1A, 60-2A, 60-3A, 61-1A, 61-2A, 61-3A, and 61-4A, described below. Such organisms are suitable for the production of clavulanic acid without the production of the 5S clavam, clavam-2-carboxylate or with significantly reduced production of clavam-2-carboxylate.
In the examples all methods are as in Sambrook, J., Fritsch, E. F. and Maniatis, T. (1989) Molecular Cloning A Laboratory Manual (2nd Edition), or Hopwood, D. A. et al. (1985) Genetic Manipulation of Streptomyces. A Cloning Manual, and Paradkar, A. S. and Jensen, S. E. (1995) J. Bacteriol. 177 (5): 1307-1314 unless otherwise stated.
I. DNA Sequencing of the Streptomyces clavuligerus Chromosome Upstream and Downstream of the Clavaminate Synthase Gene cas1
A. Isolation of cas1
To isolate chromosomal DNA fragments from Streptomyces clavuligerus NRRL 3585 encoding the gene for clavaminate synthase isozyme 1 (cas1) an oligonucleotide probe RMO1was synthesised based on nucleotide 9-44 of the previously sequenced cas1 gene (Marsh, E. N., Chang, M. D. T. and Towned, C. A. (1992) Biochemnistry 31: 12648-12657). Oligonucleotides were constructed using standard methods on an Applied Biosystems 391 DNA Synthesiser. The sequence of RMO1, a 36-mer, was synthesised in the antiparallel sense to that published by Marsh et al (1992, ibid) RMO1 was radiolabelled with .sup.32 P using standard techniques for end-labelling DNA oligonucleotides (Sambrook et al., 1989 ibid), and was used to screen a cosmid bank of Streptomyces clavuligerus genomic DNA by Southern hybridization as described by Stahl and Amann (In: Nucleic acid techniques in bacterial systematics. Ed. E. Stackebrandt and M. Goodfellow. Toronto: John Wiley and Sons, p. 205-248, 1991). The genomic bank of S. clavuligerus DNA, prepared in cosmid pLAFR3, was as described by Doran, J. L et al., (1990), J. Bacteriol. 172 (9),4909-4918.
Colony blots of the S. clavuligerus cosmid bank were incubated overnight with radiolabelled RMO1 at 60.degree. C. in a solution consisting of 5.times.SSC, 5.times. Denhardt's solution, and 0.5% SDS (1.times.SSC 0.15 M NaCl+0.015 M Na.sub.3 citrate; 1.times. Denhardt's solution: 0.2% BSA, 0.02% Ficoll, and 0.02% PVP). The blots were then washed at 68.degree. C for 30 minutes in a solution of 0.5.times.SSC+0.1% SDS. One cosmid clone, .10D7, was isolated that hybridised strongly to RMO1 and gave hybridization signals upon digestion with restriction endonucleases SacI and EcoRI that were consistent with hybridization signals detected in similar experiments with digests of S. clavuligerus genomic DNA.
B. DNA sequencing of the S. clavuligerus Chromosome Flanking cas1
A partial restriction map of cosmid 10D7 was generated using restriction endonucleases SacI, NcoI, and KpnI. Southern hybridization experiments between RMO1and various digests of 10D7 DNA indicated that cas1 was most likely located at one end of a 7-kb SacI-SacI DNA subfragment. This fragment consisted of the casl open reading frame and approximately 6 kb of upstream DNA. The 7-kb fragment was then subcloned from a SacI digest of 10D7 in the phagemid vector pBluescriptII SK+ (2.96 kb; Stratagene), thus generating the recombinant plasmid pCEC007.
To facilitate the process of sequencing the chromosome upstream of cas1, a 3-kb NcoI--NcoI subfragment of the 7-kb SacI-SacI fragment was subcloned in pUC120 (3.2 kb; Vieirra and Messing, Methods Enzymoi. 153, 3-11, 1987)) in both orientations, generating the recombinant plasmids pCEC026 and pCEC027. The 3-kb subfragment consisted of the amino-terminal-encoding portion of cas1 and approximately 2.6 kb of upstream DNA.
Nested, overlapping deletions were created in both pCEC026 and pCEC027 using exonuclease III and S1 nuclease digestion (Sambrook et al., 1989 ibid) and the DNA sequence of the 3-kb NcoI--NcoI fragment was determined on both strands by the dideoxy chain termination method (Sanger, F., Nicklen, S. and Coulson, A. R. (1977), Proc. Natl. Acad. Sci. U.S.A. 74: 5463-5467) using a Taq dye-deoxy.sup.a terminator kit and an Applied Biosystems 373A Sequencer.
To determine the DNA sequence of the chromosome immediately downstream of casl a 4.3-kb KpnI-EcoRI DNA fragment was subcloned from cosmid clone 10D7 in pBluescriptII SK+, generating pCEC018. From pCEC018 a 3.7-kb SacI--SacI subfragment was cloned in pSL1180 (3.422 kb, Pharnacia); one of the SacI termini of this fragment partially overlapped the TGA stop codon of cas1, the other was vector encoded. Both orientations of the 3.7-kb fragment were obtained during subcloning and the resulting recombinant plasmids were designated pCEC023 and pCEC024. Nested, overlapping deletions were created in both plasmids and the DNA sequence of the 3.7-kb fragment was determined on both strands. The nucleotide sequence of the S. clavuligerus chromosome generated in these experiments, including and flanking cas1 sequence is shown in FIG. 1.
II. Functional Analysis of the Open Reading Frames Flanking cas1
Computer analysis of the DNA sequence upstream of casl predicted the presence of two complete orfs and one incomplete orf. All three orfs were located on the opposite DNA strand to casl and were thus oriented in the opposite direction. The first open reading frame, orfup1, was located 579 bp upstream of cas1 and encoded a polypeptide of 344 amino acids (aa). The second open reading frame, orfup2, was located at 437 bp beyond the 3'-end of orfup1 and encoded a 151 aa polypeptide. Beyond orfup2 is orfup3. The start codon of orfup3 overlaps the translational stop codon of orfup2, suggesting that the two orfs are translationally coupled. No translational stop codon for orfup3 as located on the 3-kb NcoI--NcoI fragment.
A similar analysis of the DNA sequence downstream of cas1 predicted the presence of two complete orfs and one incomplete orf. Two of the orfs were located on the opposite DNA strand to cas1 and were thus oriented towards casl. The third orf was located on the same strand as cas1 and was thus oriented away from it. The first downstream open reading frame, orfdwn1, was located 373 bp downstream of cas1 and encoded a 328 aa polypeptide. The second open reading frame, orfdwn2, was located 55 bp upstream of orfdwn1 and encoded a 394 aa polypeptide. At 315 bp upstream of orfdwn2 and on the opposite strand was orfdwn3. Because no stop codon was observed for orfdwn3 on the 3.7-kb fragment, it encoded an incomplete polypeptide of 219 aa.
Gene Disruption of the orfup and orfdwn Open Reading Frames
To assess the possible roles of the open reading frames flanking cas1 in the biosynthesis of clavulanic acid and the other clavams produced by S. clavuligerus, insertional inactivation or deletion mutants were created by gene replacement The method used for gene disruption and replacement was essentially as described by Paradkar and Jensen (1995 ibid).
1.5-kb NcoI--NcoI fragment carrying the apramycin resistance gene (apr.sup.r), constructed as described in Paradkar and Jensen (1995 ibid), was treated with Klenow fragment to generate blunted termini (Sambrook et al., 1989 ibid) and was ligated to pCEC026 that had been digested with BsaBI and likewise treated with Klenow fragment. pCEC026 possesses a BsaBI site located within orfup1 at 636 bp from the translational start codon. The ligation mixture was used to transform competent cells of E. coli GM 2163 (available from New England Biolabs, USA., Marinus, M. G. et al M G G (1983) vol 122, p288-9) to apramycin resistance. From the resulting transformants two clones containing plasmids pCEC054 and pCEC055 were isolated; by restriction analysis pCEC054 was found to possess the apr.sup.r -fragment inserted in the same orientation as orfup1, while pCEC055 possessed it in the opposite orientation.
To introduce pCEC054 into S. clavuligerus, plasmid DNA was digested with BamHI and HindIII and ligated to the high-copy number Streptomyces vector pU486 (6.2 kb; Ward et al., (1986) Mol. Gen. Genet. 203: 468-478). The ligation mixture was then used to transform E. coli GM2163 competent cells to apramycin resistance. From the resulting transformants one clone, possessing the shuttle plasmid pCEC061, was isolated. This plasmid was then used to transform S. clavuligerus NRRL 3585. The resulting transformants were put through two successive rounds of sporulation on non-selective media and then replica plated to antibiotic containing media to identify apramycin-resistant and thiostrepton-sensitive transformants. From this process four putative mutants (61-1A, -2A, -3A and 4A) were chosen for further analysis.
To confirm that these putative mutants were disrupted in orfup1 genomic DNA was prepared from isolates 61-1A and 61-2A, digested with SacI and subjected to Southern blot analysis. The results of the Southern blot were consistent with a double cross-over having occurred and demonstrated that these mutants are true disruption replacement mutants in orfup1.
The mutants 61-1A, -2A, -3A and -4A were grown in Soya-Flour medium and their culture supernatants were assayed by HPLC for clavulanic acid and clavam production. The composition of the Soya-Flour medium and the method for assaying clavams by HPLC were as previously reported (Paradkar and Jensen, 1995 ibid) except that the running buffer for the HPLC assay consisted of 0.1 M NaH.sub.2 PO.sub.4 +6% methanol, pH 3.68 (adjusted with glacial acetic acid). The HPLC analysis indicated that none of the mutants produced detectable levels of clavam-2-carboxylate or 2-hydroxymethylclavam. Furthermore, when culture supernatants were bioassayed against Bacillus sp. ATCC 27860, using the method of Pruess and Kellett (1983, J. Antibiot. A: 208-212)., none of the mutants produced detectable levels of alanylclavam. In contrast, HPLC assays of the culture supernatants showed that the mutants appeared to produce superior levels of clavulanic acid when compared to the wild-type (Table 1).
TABLE 1 Clavulanic acid titre (CA) of orfup1 mutants in shake flask tests 70 HOURS 93 HOURS 70 HOURS CA ug/mg 93 HOURS CA ug/mg STRAIN CA ug/ml DNA CA ug/ml DNA NRRL 3585 #1 87 915 166 1963 NRRL 3585 #2 66 790 159 1842 61-1A 272 2894 439 6113 61-2A 199 2148 225 2928 61-3A 54 692 221 2585 61-4A 0 0 226 2422
B. orfdwn1 and orfdwn2
A deletion/replacement mutant in orfdwn1 and orfdwn2 was created by first digesting pCEC018 (7.3 kb) with NcoI and liberating a 1-kb subfragment containing most of orfdwn1 and a portion of orfdwn2. The digest was fractionated by agarose-gel electrophoresis and the 6.3-kb fragment was excised and eluted from the gel. This fragment was then ligated to an NcoI--NcoI DNA fragment carrying apr.sup.r and used to transform E. coli XL1-Blue to apramycin resistance. One clone was obtained from this experiment but restriction analysis of the resulting recombinant plasmid revealed that two copies of the apramycin resistance fragment had been ligated into the deletion plasmid. To eliminate the extra copy of the apr.sup.r -fragment, the plasmid was digested with NcoI and self-ligated. The ligation mixture was used to transform E. coli GM2163 to apramycin resistance. From the transformants, two clones were isolated that contained plasmids pCEC052 and pCEC053 both of which possessed only one copy of the apr.sup.r -fragment; pCEC052 possessed the apr.sup.r -fragment inversely oriented with respect to orfdwn1 and 2, while pCEC053 possessed the apr.sup.r -fragment inserted in the same orientation as orfdwn1 and 2.
A shuttle plasmid of pCEC052 was constructed by ligating BamHI-digested pCEC052 with similarly digested pIJ486 and transforming E. coli GM2163 to apranycin resistance. From this experiment one clone was isolated that contained the shuttle plasmid pCEC060. This plasmnid was used to transform wild-type S. clavuligerus 3585to apramycin and thiostrepton resistance. The resulting transformants were put through two rounds of sporulation under non-selective conditions and then replica plated to antibiotic containing media to identify apramycin resistant, thiostrepton sensitive colonies. Three putative mutants (60-1A, -2A and -3A) were chosen for further analysis.
To establish the identity of these putative mutants genomic DNA was isolated from strains 60-1A and 60-2A and digested with either SacI or BstEII and subjected to southern blot analysis. The hybridisation bands generated from this experiment were consistent with both strains having undergone a double cross-over event demonstrating that these mutants are true disruption replacement mutants in orfdwn1/2.
When these were cultured in Soya-Flour medium and their culture supernatants assayed by HPLC, none of the mutants produced detectable levels of clavam-2-carboxylate or 2-hydroxymethylclavam A bioassay of the culture supernatants showed that the mutants also failed to produce detectable levels of alanylclavam. As with the orfup1 mutants,the orfdwn1/2 mutants are capable of producing superior to wild-type levels of clavulanic acid (Table2).
TABLE 2 Clavulanic acid titre (CA) orfdwn1/2 mutants in shake flask tests 70 HOURS 93 HOURS 70 HOURS CA ug/mg 93 HOURS CA ug/mg STRAIN CA ug/ml DNA CA ug/ml DNA NRRL 3585 #1 87 915 166 1963 NRRL 3585 #2 66 790 159 1842 60-1A 164 1872 260 2911 60-2A 187 2013 108 1320 60-3A 79 994 214 2161
To disrupt orfdwn3 pCEC023 (consisting of a 3.7-kb fragment of casl downstream DNA subcloned into pSL1180) was digested with NcoI and then self ligated. After transforming E.coli with the ligation mixture a clone was isolated that possessed the plasmid pCEC031. This plasmid retained only the 1.9kb NcoI-EcoRI fragment encoding a portion of orfdwn2 and the incomplete orfdwn3. An examination of the DNA sequence revealed that pCEC031 possessed a unique BstEII site at 158bp from the translational start site of orfdwn3. Therefore, pCEC031 was digested with BstEII, treated with Klenow fragment to create blunt ends and then ligated to a blunted apramycin resistance cassette. The ligation mixture was used to transform E.coli GM2163 to apramycin resistance and ampicillin resistance. Two transformants were selected that contained respectively pCEC050 and pCEC051 restriction analysis revealed that the apramycin resistance cassette was orientated in the same orientation as orfdwn3 in pCEC050 and in the opposite orientation in pCEC051. Both of these plasmids were then digested with HindIII and ligated to similarly digested pU486. The ligation mixtures were then used separately to transform E.coli GM 2163 to apramycin and ampicillin resistance. The shuttle plasmids pCEC056 (pCEC050+pIJ486) and pCEC057 (pCEC051+pLI486) were isolated from the resultant transformants. Both plasmids were then used to transform S.clavuligerus NRRL 3585.
One transformant was selected from each transformant experiment and put through two successive rounds of sporulation on non-selective media and then replica plated to antibiotic containing media to identify apramycin-resistant and thiostrepton-sensitive transformants. From this process two putative mutants from the progeny of each primary transformant. (56-1A and 56-3A for pCEC056, and 57-1C and 57-2B for pCEC057).
To establish the identity of these putative mutants genomic DNA was isolated from these strains and digested with either SacI or Acc65I and subjected to Southern blot analysis. The hybridisation bands generated from this experiment were consistent with both strains having undergone a double cross-over event demonstrating that these mutants are true disruption replacement mutants in orfdwn3.
When these strains were cultured in Soya-Flour medium and their culture supernatants assayed by HPLC, the mutants produced greatly reduced levels of clavam-2-carboxylate or 2-hydroxymethylclavam. A bioassay of the culture supernatants showed that the mutants also failed to produce detectable levels of alanylclavam. As with the orfup1 and orfdwn1/2 mutants, the orfdwn3 mutants were capable of producing superior to wild-type levels of clavulanic acid (Table 3).
TABLE 3 Clavulanic acid titre (CA) orfdwn3 mutants in shake flask tests 71 HOURS 93 HOURS 71 HOURS CA ug/mg 93 HOURS CA ug/mg STRAIN CA ug/ml DNA CA ug/ml DNA NRRL 3585 180 1580 193 1790 #1A NRRL 3585 179 1640 266 2310 #1B 56-1A 34 110 235 2160 56-3A 225 2140 274 2740 57-1C 253 2910 277 2920 57-2B 242 2240 193 1860
The application discloses the following nucleotide sequences: SEQ ID No. 1: DNA sequence of FIG.1 SEQ ID No. 2: orfup3 sequence SEQ ID No. 3: orfup2 sequence SEQ ID No. 4: orfup1 sequence SEQ ID No. 5: orfdwn1 sequence SEQ ID No. 6: orfdwn2 sequence SEQ ID No. 7: orfdwn3 sequence
SEQUENCE LISTING (1) GENERAL INFORMATION: (iii) NUMBER OF SEQUENCES: 19 (2) INFORMATION FOR SEQ ID NO: 1: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 7193 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: Other (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1 CCATGGCGGG CGGCGGCTGC CCCGGAGCCT CGGCCGGACC GGTGACCAGG ACCACCCCGG 60 TGGGATAGTG GCCCGCCACC CGGCGCAGCA GACTCCCGGA CACGGACCCG TGGGTGTGCG 120 CGGAAAGGCC CGGAGGCCGG GTCACAGCCA CGGGTAACGC GCGGTGTCCT TGCCCGCGTA 180 ATCGGGGTCC AGATAGACGA AGGCCCGGTG GACGAGGAAG TCCCGCACCT CGTAGACCGT 240 GCACCAGCGC CCGGCGGCCC ACTCGGGGTC ACCCGCCCGC CACGGCCCGT CCCGGTGCTC 300 ACCGTGGGTG GTGCCCTCCG CGGCGAGGAG TTCGGTCCCG GTCAGAATCC AGTTGACGGA 360 CCACAGATGG TGGGTGATCG AGCGGATGGT GCCCCCGAGG TCGTCGAAGA GCCGGGCGAT 420 CTCGGACTTG CCCCGGGCCA GACCCCACTT GGGGAAGAAG AAGACCGCGT CCTCGGCGAA 480 GTAGTCGATC GCGGGGGTGC CGTCGCTGCC GACGCCGCCG TTGTCGAACG CCTTGAAGTA 540 CGCGGTGATG ACCGCCTTGC GCTGCTCGTC CGTCATACCG GCCGATGCCA CGGACATGAA 600 ACGACCTCCA GAGATTCCGG GTGGCTGTGC TGGGGCTGCG GAAGGGGTGT CCCCCGCGAA 660 GGACGGCGGA CGCCGCGGAC GCCGCGGCCG TCTCCCCGGC GGACGGGTCC CAGCGTCCTG 720 GAGAGGGCTT GGCGGCGGCT TGACGCCGTG CTGTCCCGCG GCTTGCGGAA CGCGAAGTAC 780 CGGCCAGCGT ACGGGCGTTG CACCGGACGT GTACGCCGGT CGGGACCCCT CGTACCCCCG 840 GAGCCGGCCG ACCCCGGCGG CTCCGGGGGT ACGGACGCGC CGGACCGGCC CGAGCGAGCC 900 GGACGGGTCG GACGGTGCGC GTGGTTCCGG TGTGTCGGAC AGCTCGGACG GACCGGACGG 960 TGCGCGTGGT TCCGGTGTGT CGGACAGCTC GGACGGGTCG GACGGTGCGC GTGGTTCCGG 1020 CACGCCGGAC GGGTCAGTTG CCGATCATGG CGAGCAATGC CGGGGTGTAC CGCTCCCCGG 1080 ACACCGGGTG GGAGATCGCG GCCGTCACCT CCGCGAGGGA CCGGTCGTCC AGCCGGATCG 1140 AGGCGGCGGC GAGATTGTCC GCGAGATGGG CCGGGTTCGC GGTGCCCGGG ATCGGGACGA 1200 CGTCCTCGCC CCGGTGGTGC AGCCAGGCGA GCGCGAGCTG TGCCAGGGTC AGCCCCAGAC 1260 CGTCCGCGAC CGGGCGCAGC CGGTGCAGCA ACGAGCGGTT GCGCGCGAGG GCCGGAGCGC 1320 TGAACCGGGG CTGGCCCCGG CGGAAGTCCT CGTCCCCCAG ATCGTCGGTG GTGCGGATGG 1380 TGCCGGTGAG AAAACCCCGT CCCAGAGGGG CGTAAGCGAC GATCCCGATC CCCAGCTCCC 1440 GGCAGACGGG CACCACCTCG TCCTCGATCC CGCGCGACCA CAGGCTCCAC TCGCTCTGCA 1500 CCGCCGTCAC CGGGTGCACC GCGTCCGCCC GGCGCAGCGT GGCCGCGGAG GGCTCGGAGA 1560 GACCGAGCCT GCGGACCTTG CCCTCGCGCA CCAGCTCGGC CACCGCACCC ACGGTCTCCT 1620 CGATCGGCAC CGCCGGGTCC GTCCAGTGCT GGTAGTACAG GTCGATGCGG TCGGTGCCGA 1680 GACGACGCAG GGACCGTTCG CAGGCCGCGC GGACGTAGGA CGGCTCGCCG CACAAGCCCT 1740 GGGAGGCGCC GTCGGACGAG CGCACCATGC CGAACTTGGT GGCGATCAGC ACCTCGTCCC 1800 GGCGGCCCGC GACCGCCCGT CCGAGCAGCT CCTCACCGGC GCCGAGCCCC TGGACGTCGG 1860 CGGTGTCCAG CAGGGTGACC CCGGCGTCGA CGGCGGCGCG GATGGTGGCC GTCGCCCGGG 1920 CGCGGTCCGG GCGTCCGTAG AAGTCGGTGG TCGGCAGGCA GCCGAGCCCC TGGGCACTGA 1980 CCGGAAGGTC CCGCAGGGCG CGGACCGGCG GACGCGGAAC CGCGGCGGAC ACGGAACCGG 2040 CCGGGGACTC GGGCGGAGAG CGGGACATAC GGAACCTCCA CAGGCGGAGC CGGGAACGGG 2100 ACGAGGGCGA GGACGGGACG GAACGAAGGA GAGGACGGGA CGGACAGCAC GGACGGGACG 2160 GACGGAACGG AGTCGGGAAC CGGGGGGGGT GACCGGAACC GGGCCGTCCT TGGCCCTCCC 2220 CCGTCCTCCC CGCCATCCGC CGTTCTCCCC CGTTCCCTCT CCCGTCCTCC AGCCAACACC 2280 GCCGCCCTTT CCAAGCGCTT GACACGGCAC CGACAGCCGC CGCCGGGCGC CCGATGGGGA 2340 CCCGTGCCCG CCGGTGAGCG GCGGTGAGCG CCGGTACGGG ACCCCACGCG CCGCCGCCCG 2400 GGCGCCCGCC AGGGCCCGCG CGGCCACCCC GGCCCGCCCC GGCCGGAGCG GCGATCCGGG 2460 CCGCTCGCTG CAAGAGGAAC ATCCACAGCC GCACAAGGAG CGCTCCGCAC AGTGGGCACC 2520 ACGTCCGCCC CGTCCCCCAC ACCGTGGCCG GTCCCCACCG GACAGCACAG CACCGCACAG 2580 CACCACATCG CACGGCACAG CACAGCACCA CCGGCACGAG GAACCAAGGA AAGGAACCAC 2640 ACCACCATGA CCTCAGTGGA CTGCACCGCG TACGGCCCCG AGCTGCGCGC GCTCGCCGCC 2700 CGGCTGCCCC GGACCCCCCG GGCCGACCTG TACGCCTTCC TGGACGCCGC GCACACAGCC 2760 GCCGCCTCGC TCCCCGGCGC CCTCGCCACC GCGCTGGACA CCTTCAACGC CGAGGGCAGC 2820 GAGGACGGCC ATCTGCTGCT GCGCGGCCTC CCGGTGGAGG CCGACGCCGA CCTCCCCACC 2880 ACCCCGAGCA GCACCCCGGC GCCCGAGGAC CGCTCCCTGC TGACCATGGA GGCCATGCTC 2940 GGACTGGTGG GCCGCCGGCT CGGTCTGCAC ACGGGGTACC GGGAGCTGCG CTCGGGCACG 3000 GTCTACCACG ACGTGTACCC GTCGCCCGGC GCGCACCACC TGTCCTCGGA GACCTCCGAG 3060 ACGCTGCTGG AGTTCCACAC GGAGATGGCC TACCACCGGC TCCAGCCGAA CTACGTCATG 3120 CTGGCCTGCT CCCGGGCCGA CCACGAGCGC ACGGCGGCCA CACTCGTCGC CTCGGTCCGC 3180 AAGGCGCTGC CCCTGCTGGA CGAGAGGACC CGGGCCCGGC TCCTCGACCG GAGGATGCCC 3240 TGCTGCGTGG ATGTGGCCTT CCGCGGCGGG GTGGACGACC CGGGCGCCAT CGCCCAGGTC 3300 AAACCGCTCT ACGGGGACGC GGACGATCCC TTCCTCGGGT ACGACCGCGA GCTGCTGGCG 3360 CCGGAGGACC CCGCGGACAA GGAGGCCGTC GCCGCCCTGT CCAAGGCGCT CGACGAGGTC 3420 ACGGAGGCGG TGTATCTGGA GCCCGGCGAT CTGCTGATCG TCGACAACTT CCGCACCACG 3480 CACGCGCGGA CGCCGTTCTC GCCCCGCTGG GACGGGAAGG ACCGCTGGCT GCACCGCGTC 3540 TACATCCGCA CCGACCGCAA TGGACAGCTC TCCGGCGGCG AGCGCGCGGG CGACGTCGTC 3600 GCCTTCACAC CGCGCGGCTG AGCTCCCGGG TCCGACACCG CGCGGCTGAA CCCACGGTCC 3660 GGGGCCCACG GTCCGGCACC GCGCGGCTGA GCCCCCGGGT CCGGCAGCGG GCGGCTGAAC 3720 CCCCGCCCCG GGCCACCGCC CGACCGCCCC CGCGCACCGG ACGCGCCCGC CTGTACGGCG 3780 GTCCCGCCCG GGCCCGTACA CCTGAAGCGC CCGGCGGACC GCCGCCCCGC CGGGGGACGG 3840 ACAGAGCCGG GTGCGGGAGG ACGTCCTCCC GCACCCGGCT CCCACCGTTC CGCACCGACC 3900 GCACCCGACC GTGCCGCAGG CGCCACCGGC ACCGCACCGC CCGCGCCGGC AGCCACCACA 3960 GGCGCCACGC CGCCCGCACG GTGCCCGCGC TGCTCAGCCC CCGTCCACCG GGCTGTCCAG 4020 CAGCCGCCGC AGCGCGCCCC CGATGAACTC CCGGTCGGCG GCCGACCCCC CGGACCCCGC 4080 GAGATGCCCC CACACTCCCG GGATCACCTC CAGCGAGGCA TACGGCAGCA GATCGGCCAC 4140 CCGCTTCTCG TCCTCGACGG CGAAACACAC GTCCAGGGCG CCCGGCAGCA CCACGGCCCG 4200 CGCCGTGACG GAGGCCAGCG CCGCCTCGAC GCTCCCCCCG GCCCCGGGTG TCGCCCCCAC 4260 ATCCGTGTTC TCCCAGGTGC GCACCATGGT GAGCAGATCC GCGGCGCCGG GCCCGGAGAG 4320 GAAGACCTGC TCCCAGAAGC CGGTGAGGTA CTCCTCGCGG GTGGCGAAAC CCAGCTCCCG 4380 GTGGGCACGG CGGGCCCAGA AGGAACGCGA GGTCCCCCAC CCGGCGAACA CCCGGCCCGC 4440 CGCCTTCCGC CCCCGCTCCC CGGCGTCGGC GCTGAGCGCC GCGGCCAGAC CGGACAGCAG 4500 GACCAGGCTG TGCGGGCTGC TCACCGGCGC CCCGCAGATC GGGGCGATCC GGCGCACCAT 4560 CCCCGGATGC GACACGGCCC ACTGGTAGGC GTGGGCCGCG CCCATCGACC AGCCCGTGAC 4620 CAGGGCCAGT TCCCGTACCC CCAGCTCCTC GGTGAGCAGC CGGTGCTGCG CCGCGACATT 4680 GTCCTGCGGA GTGATCAGCG GAAAGCGGGA CCCCGACGGG TGGTTGCCGG GCGAGCTGGA 4740 GACCCCGTTG CCGAAGAGTC CGGCGGTGAC GACGCAGTAC CGCCGGGTGT CCAGCGGCAG 4800 CCCCGCACCG ATCAGCCAGT CGTACCCGGT GTGGTCCCGG CCGAAGAACG ACGGACAGAG 4860 CACCACGTTC GTCCCGTCGG CGTTCGGCGT GCCGTACATG GCGTAACCGA TCCGGGCGTC 4920 CCGCAGGACC TCCCCGTCCA GCAACGGCAG TTCGTCGATC TCGAATATGC GGCATTCCAC 4980 CGCTGACCTC CTTGTTCGAT CCCCCCGGAC AACAGGTCGG TCGTGGCCGG AGACTCAGAG 5040 CCAGTTGGGG GCGATCTCGG TGGCCCACAG CTCCAGGCTG CGCAGCTGGA CATCGTGCGG 5100 GATCAGCCCG GAGTACTGGC ACTGGAGCAG ATACTCCGGA TCGTGCCGCT CCACCAGCTT 5160 CTCGATCATG CGGTTGATGT CGTCCGGGGT GCCGACCCAC TCCAGCCCCC GGTCGACCAG 5220 GGTCTTGTAG TCCGAGCCGA TCGGACCCGT CTCGCCGGTC GCGCGCAGCG CCTCGGTGAA 5280 GCCCATGGGG CCGAACCAGT TCTCGAAGAT GAAGCCGCCG CCGCGGGACG CCCAGTGGTG 5340 GGCCTCGCCG GAGTCCCGGG AGACCAGGAC GTCCTTCATC ACCCCGACCC GCTCGCCCCG 5400 CCGCAGGGTG CCGTGGCCCG CCGCCTCGGC CTCCTCCCGG TAGATGTCCA TCAGCCGGGC 5460 GACGATCTGG TCGTCGGTGT TCATCAGGAT CGGCACCACG CCCTCCCGGG CACAGAACCG 5520 GAACGTGTCC TCACTGAAGC TGAACGGCTG GAAGACGGGC GGGTGGGGGC GCTGGTAGGG 5580 CTTGGGCGCG ATGCCCACCT CGCGGATGAC GCCGTTCTCG TCGAGGCCCC GGCCGTAGCG 5640 GCGCACCGCC TCGTAGGGGA ACTCCAGGTC CGGCACCGGG ATCGTCCACT GCTCCCCGGA 5700 GTGGGTGAAC GTCTCGGTCG TCCACGCCTT CTTGATGATC TCCCAGTGCT CCTCGAAGAG 5760 GGCACGATTG CGCCGGTCCC GCTCCCCGGC GTCGGACAGG GTGCCGCCGA CCCCGTACAC 5820 CTGCCCCATG ATGTCGGCCC AGCGCTTCTG GAACCCGCGC GCGATCCCGA CGAAGGCGCG 5880 GCCCCGGGTC ATGTGGTCGA GCATCGCCAG ATCCTCGGCC AGCCGCAGCG GATTGTGCAG 5940 CGGCAGGACG TTGGCCATCT GGCCGACCCG GATGTGCCGG GTCTGCATGC CGAGGTAGAG 6000 CCCCAGCATG ATCGGGTTGT TGGAGACCTC GAAACCCTCG GTGTGGAAGT GGTGCTCGGT 6060 GAAGGACAGT CCCCAGTAGC CGAGTTCGTC GGCCGCCTGC GCCTGCCGGG TGAGCTGCCG 6120 GAGCATGTTC TGGTAGTTCT GCGGATTGAC CCCCGCCATA CCCCGCTGGA CCTGCGCATG 6180 ACTGCCGACC GTTGGCAGAT AGAAGAGAAT GGACTTCACC CTGGCTCCTC CGGTTCGCGG 6240 CGCCCTCCAT TGACGTGCGC CGAAAGCGGC TCGACCGTCC CACTCCGCCC TTGAGTTCCG 6300 TCTGACGCCG CGCCAGTCGG CGGGCCGTCC GCCGGGGTGC CCGCCGGGGT CCGCACCCGC 6360 CGGACGGCAC GGCGCGCACC GCGCGCGCGG CGCTTCGGGG CACCGGGCTC GACGGGGTGC 6420 TCAGCGGGAC GTCCAACGGA AGGCAAGCCC CCGTACCCAG CCTGGTCAAG GCGCTCATCG 6480 CCATTCCCTG AGGAGGTCCC GCCTTGACCA CAGCAATCTC CGCGCTCCCG ACCGTGCCCG 6540 GCTCCGGACT CGAAGCACTG GACCGTGCCA CCCTCATCCA CCCCACCCTC TCCGGAAACA 6600 CCGCGGAACG GATCGTGCTG ACCTCGGGGT CCGGCAGCCG GGTCCGCGAC ACCGACGGCC 6660 GGGAGTACCT GGACGCGAGC GCCGTCCTCG GGGTGACCCA GGTGGGCCAC GGCCGGGCCG 6720 AGCTGGCCCG GGTCGCGGCC GAGCAGATGG CCCGGCTGGA GTACTTCCAC ACCTGGGGGA 6780 CGATCAGCAA CGACCGGGCG GTGGAGCTGG CGGCACGGCT GGTGGGGCTG AGCCCGGAGC 6840 CGCTGACCCG CGTCTACTTC ACCAGCGGCG GGGCCGAGGG CAACGAGATC GCCCTGCGGA 6900 TGGCCCGGCT CTACCACCAC CGGCGCGGGG AGTCCGCCCG TACCTGGATA CTCTCCCGCC 6960 GGTCGGCCTA CCACGGCGTC GGATACGGCA GCGGCGGCGT CACCGGCTTC CCCGCCTACC 7020 ACCAGGGCTT CGGCCCCTCC CTCCCGGACG TCGACTTCCT GACCCCGCCG CAGCCCTACC 7080 GCCGGGAGCT GTTCGCCGGT TCCGACGTCA CCGACTTCTG CCTCGCCGAA CTGCGCGAGA 7140 CCATCGACCG GATCGGCCCG GAGCGGATCG CGGCGATGAT CGGCGAGCCG ATC 7193 (2) INFORMATION FOR SEQ ID NO: 2: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 145 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: Other (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2 GTGACCCGGC CTCCGGGCCT TTCCGCGCAC ACCCACGGGT CCGTGTCCGG GAGTCTGCTG 60 CGCCGGGTGG CGGGCCACTA TCCCACCGGG GTGGTCCTGG TCACCGGTCC GGCCGAGGCT 120 CCGGGGCAGC CGCCGCCCGC CATGG 145 (2) INFORMATION FOR SEQ ID NO: 3: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 453 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: Other (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3 ATGTCCGTGG CATCGGCCGG TATGACGGAC GAGCAGCGCA AGGCGGTCAT CACCGCGTAC 60 TTCAAGGCGT TCGACAACGG CGGCGTCGGC AGCGACGGCA CCCCCGCGAT CGACTACTTC 120 GCCGAGGACG CGGTCTTCTT CTTCCCCAAG TGGGGTCTGG CCCGGGGCAA GTCCGAGATC 180 GCCCGGCTCT TCGACGACCT CGGGGGCACC ATCCGCTCGA TCACCCACCA TCTGTGGTCC 240 GTCAACTGGA TTCTGACCGG GACCGAACTC CTCGCCGCGG AGGGCACCAC CCACGGTGAG 300 CACCGGGACG GGCCGTGGCG GGCGGGTGAC CCCGAGTGGG CCGCCGGGCG CTGGTGCACG 360 GTCTACGAGG TGCGGGACTT CCTCGTCCAC CGGGCCTTCG TCTATCTGGA CCCCGATTAC 420 GCGGGCAAGG ACACCGCGCG TTACCCGTGG CTG 453 (2) INFORMATION FOR SEQ ID NO: 4: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 1032 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: Other (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4 ATGTCCCGCT CTCCGCCCGA GTCCCCGGCC GGTTCCGTGT CCGCCGCGGT TCCGCGTCCG 60 CCGGTCCGCG CCCTGCGGGA CCTTCCGGTC AGTGCCCAGG GGCTCGGCTG CCTGCCGACC 120 ACCGACTTCT ACGGACGCCC GGACCGCGCC CGGGCGACGG CCACCATCCG CGCCGCCGTC 180 GACGCCGGGG TCACCCTGCT GGACACCGCC GACGTCCAGG GGCTCGGCGC CGGTGAGGAG 240 CTGCTCGGAC GGGCGGTCGC GGGCCGCCGG GACGAGGTGC TGATCGCCAC CAAGTTCGGC 300 ATGGTGCGCT CGTCCGACGG CGCCTCCCAG GGCTTGTGCG GCGAGCCGTC CTACGTCCGC 360 GCGGCCTGCG AACGGTCCCT GCGTCGTCTC GGCACCGACC GCATCGACCT GTACTACCAG 420 CACTGGACGG ACCCGGCGGT GCCGATCGAG GAGACCGTGG GTGCGGTGGC CGAGCTGGTG 480 CGCGAGGGCA AGGTCCGCAG GCTCGGTCTC TCCGAGCCCT CCGCGGCCAC GCTGCGCCGG 540 GCGGACGCGG TGCACCCGGT GACGGCGGTG CAGAGCGAGT GGAGCCTGTG GTCGCGCGGG 600 ATCGAGGACG AGGTGGTGCC CGTCTGCCGG GAGCTGGGGA TCGGGATCGT CGCTTACGCC 660 CCTCTGGGAC GGGGTTTTCT CACCGGCACC ATCCGCACCA CCGACGATCT GGGGGACGAG 720 GACTTCCGCC GGGGCCAGCC CCGGTTCAGC GCTCCGGCCC TCGCGCGCAA CCGCTCGTTG 780 CTGCACCGGC TGCGCCCGGT CGCGGACGGT CTGGGGCTGA CCCTGGCACA GCTCGCGCTC 840 GCCTGGCTGC ACCACCGGGG CGAGGACGTC GTCCCGATCC CGGGCACCGC GAACCCGGCC 900 CATCTCGCGG ACAATCTCGC CGCCGCCTCG ATCCGGCTGG ACGACCGGTC CCTCGCGGAG 960 GTGACGGCCG CGATCTCCCA CCCGGTGTCC GGGGAGCGGT ACACCCCGGC ATTGCTCGCC 1020 ATGATCGGCA AC 1032 (2) INFORMATION FOR SEQ ID NO: 5: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 984 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: Other (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5 GTGGAATGCC GCATATTCGA GATCGACGAA CTGCCGTTGC TGGACGGGGA GGTCCTGCGG 60 GACGCCCGGA TCGGTTACGC CATGTACGGC ACGCCGAACG CCGACGGGAC GAACGTGGTG 120 CTCTGTCCGT CGTTCTTCGG CCGGGACCAC ACCGGGTACG ACTGGCTGAT CGGTGCGGGG 180 CTGCCGCTGG ACACCCGGCG GTACTGCGTC GTCACCGCCG GACTCTTCGG CAACGGGGTC 240 TCCAGCTCGC CCGGCAACCA CCCGTCGGGG TCCCGCTTTC CGCTGATCAC TCCGCAGGAC 300 AATGTCGCGG CGCAGCACCG GCTGCTCACC GAGGAGCTGG GGGTACGGGA ACTGGCCCTG 360 GTCACGGGCT GGTCGATGGG CGCGGCCCAC GCCTACCAGT GGGCCGTGTC GCATCCGGGG 420 ATGGTGCGCC GGATCGCCCC GATCTGCGGG GCGCCGGTGA GCAGCCCGCA CAGCCTGGTC 480 CTGCTGTCCG GTCTGGCCGC GGCGCTCAGC GCCGACGCCG GGGAGCGGGG GCGGAAGGCG 540 GCGGGCCGGG TGTTCGCCGG GTGGGGGACC TCGCGTTCCT TCTGGGCCCG CCGTGCCCAC 600 CGGGAGCTGG GTTTCGCCAC CCGCGAGGAG TACCTCACCG GCTTCTGGGA GCAGGTCTTC 660 CTCTCCGGGC CCGGCGCCGC GGATCTGCTC ACCATGGTGC GCACCTGGGA GAACACGGAT 720 GTGGGGGCGA CACCCGGGGC CGGGGGGAGC GTCGAGGCGG CGCTGGCCTC CGTCACGGCG 780 CGGGCCGTGG TGCTGCCGGG CGCCCTGGAC GTGTGTTTCG CCGTCGAGGA CGAGAAGCGG 840 GTGGCCGATC TGCTGCCGTA TGCCTCGCTG GAGGTGATCC CGGGAGTGTG GGGGCATCTC 900 GCGGGGTCCG GGGGGTCGGC CGCCGACCGG GAGTTCATCG GGGGCGCGCT GCGGCGGCTG 960 CTGGACAGCC CGGTGGACGG GGGC 984 (2) INFORMATION FOR SEQ ID NO: 6: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 1182 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: Other (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6 GTGAAGTCCA TTCTCTTCTA TCTGCCAACG GTCGGCAGTC ATGCGCAGGT CCAGCGGGGT 60 ATGGCGGGGG TCAATCCGCA GAACTACCAG AACATGCTCC GGCAGCTCAC CCGGCAGGCG 120 CAGGCGGCCG ACGAACTCGG CTACTGGGGA CTGTCCTTCA CCGAGCACCA CTTCCACACC 180 GAGGGTTTCG AGGTCTCCAA CAACCCGATC ATGCTGGGGC TCTACCTCGG CATGCAGACC 240 CGGCACATCC GGGTCGGCCA GATGGCCAAC GTCCTGCCGC TGCACAATCC GCTGCGGCTG 300 GCCGAGGATC TGGCGATGCT CGACCACATG ACCCGGGGCC GCGCCTTCGT CGGGATCGCG 360 CGCGGGTTCC AGAAGCGCTG GGCCGACATC ATGGGGCAGG TGTACGGGGT CGGCGGCACC 420 CTGTCCGACG CCGGGGAGCG GGACCGGCGC AATCGTGCCC TCTTCGAGGA GCACTGGGAG 480 ATCATCAAGA AGGCGTGGAC GACCGAGACG TTCACCCACT CCGGGGAGCA GTGGACGATC 540 CCGGTGCCGG ACCTGGAGTT CCCCTACGAG GCGGTGCGCC GCTACGGCCG GGGCCTCGAC 600 GAGAACGGCG TCATCCGCGA GGTGGGCATC GCGCCCAAGC CCTACCAGCG CCCCCACCCG 660 CCCGTCTTCC AGCCGTTCAG CTTCAGTGAG GACACGTTCC GGTTCTGTGC CCGGGAGGGC 720 GTGGTGCCGA TCCTGATGAA CACCGACGAC CAGATCGTCG CCCGGCTGAT GGACATCTAC 780 CGGGAGGAGG CCGAGGCGGC GGGCCACGGC ACCCTGCGGC GGGGCGAGCG GGTCGGGGTG 840 ATGAAGGACG TCCTGGTCTC CCGGGACTCC GGCGAGGCCC ACCACTGGGC GTCCCGCGGC 900 GGCGGCTTCA TCTTCGAGAA CTGGTTCGGC CCCATGGGCT TCACCGAGGC GCTGCGCGCG 960 ACCGGCGAGA CGGGTCCGAT CGGCTCGGAC TACAAGACCC TGGTCGACCG GGGGCTGGAG 1020 TGGGTCGGCA CCCCGGACGA CATCAACCGC ATGATCGAGA AGCTGGTGGA GCGGCACGAT 1080 CCGGAGTATC TGCTCCAGTG CCAGTACTCC GGGCTGATCC CGCACGATGT CCAGCTGCGC 1140 AGCCTGGAGC TGTGGGCCAC CGAGATCGCC CCCAACTGGC TC 1182 (2) INFORMATION FOR SEQ ID NO: 7: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 660 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: Other (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7 GTGCCCGGCT CCGGACTCGA AGCACTGGAC CGTGCCACCC TCATCCACCC CACCCTCTCC 60 GGAAACACCG CGGAACGGAT CGTGCTGACC TCGGGGTCCG GCAGCCGGGT CCGCGACACC 120 GACGGCCGGG AGTACCTGGA CGCGAGCGCC GTCCTCGGGG TGACCCAGGT GGGCCACGGC 180 CGGGCCGAGC TGGCCCGGGT CGCGGCCGAG CAGATGGCCC GGCTGGAGTA CTTCCACACC 240 TGGGGGACGA TCAGCAACGA CCGGGCGGTG GAGCTGGCGG CACGGCTGGT GGGGCTGAGC 300 CCGGAGCCGC TGACCCGCGT CTACTTCACC AGCGGCGGGG CCGAGGGCAA CGAGATCGCC 360
CTGCGGATGG CCCGGCTCTA CCACCACCGG CGCGGGGAGT CCGCCCGTAC CTGGATACTC 420 TCCCGCCGGT CGGCCTACCA CGGCGTCGGA TACGGCAGCG GCGGCGTCAC CGGCTTCCCC 480 GCCTACCACC AGGGCTTCGG CCCCTCCCTC CCGGACGTCG ACTTCCTGAC CCCGCCGCAG 540 CCCTACCGCC GGGAGCTGTT CGCCGGTTCC GACGTCACCG ACTTCTGCCT CGCCGAACTG 600 CGCGAGACCA TCGACCGGAT CGGCCCGGAG CGGATCGCGG CGATGATCGG CGAGCCGATC 660 (2) INFORMATION FOR SEQ ID NO: 8: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 25 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: Other (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8 CTGACGCTGC AGGAGGAAGT CCCGC 25 (2) INFORMATION FOR SEQ ID NO: 9: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 25 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: Other (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9 CGGGGCGAGG ACGTCGTCCC GATCC 25 (2) INFORMATION FOR SEQ ID NO: 10: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 25 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: Other (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10 GAGCCCCTGG ACGTCGGCGG TGTCC 25 (2) INFORMATION FOR SEQ ID NO: 11: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 25 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: Other (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11 GACGGTGCAT GCTCAGCAGG GAGCG 25 (2) INFORMATION FOR SEQ ID NO: 12: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 972 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: Other (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12 ATGACCTCAG TGGACTGCAC CGCGTACGGC CCCGAGCTGC GCGCGCTCGC CGCCCGGCTG 60 CCCCGGACCC CCCGGGCCGA CCTGTACGCC TTCCTGGACG CCGCGCACAC AGCCGCCGCC 120 TCGCTCCCCG GCGCCCTCGC CACCGCGCTG GACACCTTCA ACGCCGAGGG CAGCGAGGAC 180 GGCCATCTGC TGCTGCGCGG CCTCCCGGTG GAGGCCGACG CCGACCTCCC CACCACCCCG 240 AGCAGCACCC CGGCGCCCGA GGACCGCTCC CTGCTGACCA TGGAGGCCAT GCTCGGACTG 300 GTGGGCCGCC GGCTCGGTCT GCACACGGGG TACCGGGAGC TGCGCTCGGG CACGGTCTAC 360 CACGACGTGT ACCCGTCGCC CGGCGCGCAC CACCTGTCCT CGGAGACCTC CGAGACGCTG 420 CTGGAGTTCC ACACGGAGAT GGCCTACCAC CGGCTCCAGC CGAACTACGT CATGCTGGCC 480 TGCTCCCGGG CCGACCACGA GCGCACGGCG GCCACACTCG TCGCCTCGGT CCGCAAGGCG 540 CTGCCCCTGC TGGACGAGAG GACCCGGGCC CGGCTCCTCG ACCGGAGGAT GCCCTGCTGC 600 GTGGATGTGG CCTTCCGCGG CGGGGTGGAC GACCCGGGCG CCATCGCCCA GGTCAAACCG 660 CTCTACGGGG ACGCGGACGA TCCCTTCCTC GGGTACGACC GCGAGCTGCT GGCGCCGGAG 720 GACCCCGCGG ACAAGGAGGC CGTCGCCGCC CTGTCCAAGG CGCTCGACGA GGTCACGGAG 780 GCGGTGTATC TGGAGCCCGG CGATCTGCTG ATCGTCGACA ACTTCCGCAC CACGCACGCG 840 CGGACGCCGT TCTCGCCCCG CTGGGACGGG AAGGACCGCT GGCTGCACCG CGTCTACATC 900 CGCACCGACC GCAATGGACA GCTCTCCGGC GGCGAGCGCG CGGGCGACGT CGTCGCCTTC 960 ACACCGCGCG GC 972 (2) INFORMATION FOR SEQ ID NO: 13: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 48 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13 Met Thr Arg Pro Pro Gly Leu Ser Ala His Thr His Gly Ser Val Ser 1 5 10 15 Gly Ser Leu Leu Arg Arg Val Ala Gly His Tyr Pro Thr Gly Val Val 20 25 30 Leu Val Thr Gly Pro Ala Glu Ala Pro Gly Gln Pro Pro Pro Ala Met 35 40 45 (2) INFORMATION FOR SEQ ID NO: 14: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 151 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14 Met Ser Val Ala Ser Ala Gly Met Thr Asp Glu Gln Arg Lys Ala Val 1 5 10 15 Ile Thr Ala Tyr Phe Lys Ala Phe Asp Asn Gly Gly Val Gly Ser Asp 20 25 30 Gly Thr Pro Ala Ile Asp Tyr Phe Ala Glu Asp Ala Val Phe Phe Phe 35 40 45 Pro Lys Trp Gly Leu Ala Arg Gly Lys Ser Glu Ile Ala Arg Leu Phe 50 55 60 Asp Asp Leu Gly Gly Thr Ile Arg Ser Ile Thr His His Leu Trp Ser 65 70 75 80 Val Asn Trp Ile Leu Thr Gly Thr Glu Leu Leu Ala Ala Glu Gly Thr 85 90 95 Thr His Gly Glu His Arg Asp Gly Pro Trp Arg Ala Gly Asp Pro Glu 100 105 110 Trp Ala Ala Gly Arg Trp Cys Thr Val Tyr Glu Val Arg Asp Phe Leu 115 120 125 Val His Arg Ala Phe Val Tyr Leu Asp Pro Asp Tyr Ala Gly Lys Asp 130 135 140 Thr Ala Arg Tyr Pro Trp Leu 145 150 (2) INFORMATION FOR SEQ ID NO: 15: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 344 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15 Met Ser Arg Ser Pro Pro Glu Ser Pro Ala Gly Ser Val Ser Ala Ala 1 5 10 15 Val Pro Arg Pro Pro Val Arg Ala Leu Arg Asp Leu Pro Val Ser Ala 20 25 30 Gln Gly Leu Gly Cys Leu Pro Thr Thr Asp Phe Tyr Gly Arg Pro Asp 35 40 45 Arg Ala Arg Ala Thr Ala Thr Ile Arg Ala Ala Val Asp Ala Gly Val 50 55 60 Thr Leu Leu Asp Thr Ala Asp Val Gln Gly Leu Gly Ala Gly Glu Glu 65 70 75 80 Leu Leu Gly Arg Ala Val Ala Gly Arg Arg Asp Glu Val Leu Ile Ala 85 90 95 Thr Lys Phe Gly Met Val Arg Ser Ser Asp Gly Ala Ser Gln Gly Leu 100 105 110 Cys Gly Glu Pro Ser Tyr Val Arg Ala Ala Cys Glu Arg Ser Leu Arg 115 120 125 Arg Leu Gly Thr Asp Arg Ile Asp Leu Tyr Tyr Gln His Trp Thr Asp 130 135 140 Pro Ala Val Pro Ile Glu Glu Thr Val Gly Ala Val Ala Glu Leu Val 145 150 155 160 Arg Glu Gly Lys Val Arg Arg Leu Gly Leu Ser Glu Pro Ser Ala Ala 165 170 175 Thr Leu Arg Arg Ala Asp Ala Val His Pro Val Thr Ala Val Gln Ser 180 185 190 Glu Trp Ser Leu Trp Ser Arg Gly Ile Glu Asp Glu Val Val Pro Val 195 200 205 Cys Arg Glu Leu Gly Ile Gly Ile Val Ala Tyr Ala Pro Leu Gly Arg 210 215 220 Gly Phe Leu Thr Gly Thr Ile Arg Thr Thr Asp Asp Leu Gly Asp Glu 225 230 235 240 Asp Phe Arg Arg Gly Gln Pro Arg Phe Ser Ala Pro Ala Leu Ala Arg 245 250 255 Asn Arg Ser Leu Leu His Arg Leu Arg Pro Val Ala Asp Gly Leu Gly 260 265 270 Leu Thr Leu Ala Gln Leu Ala Leu Ala Trp Leu His His Arg Gly Glu 275 280 285 Asp Val Val Pro Ile Pro Gly Thr Ala Asn Pro Ala His Leu Ala Asp 290 295 300 Asn Leu Ala Ala Ala Ser Ile Arg Leu Asp Asp Arg Ser Leu Ala Glu 305 310 315 320 Val Thr Ala Ala Ile Ser His Pro Val Ser Gly Glu Arg Tyr Thr Pro 325 330 335 Ala Leu Leu Ala Met Ile Gly Asn 340 (2) INFORMATION FOR SEQ ID NO: 16: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 328 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16 Met Glu Cys Arg Ile Phe Glu Ile Asp Glu Leu Pro Leu Leu Asp Gly 1 5 10 15 Glu Val Leu Arg Asp Ala Arg Ile Gly Tyr Ala Met Tyr Gly Thr Pro 20 25 30 Asn Ala Asp Gly Thr Asn Val Val Leu Cys Pro Ser Phe Phe Gly Arg 35 40 45 Asp His Thr Gly Tyr Asp Trp Leu Ile Gly Ala Gly Leu Pro Leu Asp 50 55 60 Thr Arg Arg Tyr Cys Val Val Thr Ala Gly Leu Phe Gly Asn Gly Val 65 70 75 80 Ser Ser Ser Pro Gly Asn His Pro Ser Gly Ser Arg Phe Pro Leu Ile 85 90 95 Thr Pro Gln Asp Asn Val Ala Ala Gln His Arg Leu Leu Thr Glu Glu 100 105 110 Leu Gly Val Arg Glu Leu Ala Leu Val Thr Gly Trp Ser Met Gly Ala 115 120 125 Ala His Ala Tyr Gln Trp Ala Val Ser His Pro Gly Met Val Arg Arg 130 135 140 Ile Ala Pro Ile Cys Gly Ala Pro Val Ser Ser Pro His Ser Leu Val 145 150 155 160 Leu Leu Ser Gly Leu Ala Ala Ala Leu Ser Ala Asp Ala Gly Glu Arg 165 170 175 Gly Arg Lys Ala Ala Gly Arg Val Phe Ala Gly Trp Gly Thr Ser Arg 180 185 190 Ser Phe Trp Ala Arg Arg Ala His Arg Glu Leu Gly Phe Ala Thr Arg 195 200 205 Glu Glu Tyr Leu Thr Gly Phe Trp Glu Gln Val Phe Leu Ser Gly Pro 210 215 220 Gly Ala Ala Asp Leu Leu Thr Met Val Arg Thr Trp Glu Asn Thr Asp 225 230 235 240 Val Gly Ala Thr Pro Gly Ala Gly Gly Ser Val Glu Ala Ala Leu Ala 245 250 255 Ser Val Thr Ala Arg Ala Val Val Leu Pro Gly Ala Leu Asp Val Cys 260 265 270 Phe Ala Val Glu Asp Glu Lys Arg Val Ala Asp Leu Leu Pro Tyr Ala 275 280 285 Ser Leu Glu Val Ile Pro Gly Val Trp Gly His Leu Ala Gly Ser Gly 290 295 300 Gly Ser Ala Ala Asp Arg Glu Phe Ile Gly Gly Ala Leu Arg Arg Leu 305 310 315 320 Leu Asp Ser Pro Val Asp Gly Gly 325 (2) INFORMATION FOR SEQ ID NO: 17: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 394 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17 Met Lys Ser Ile Leu Phe Tyr Leu Pro Thr Val Gly Ser His Ala Gln 1 5 10 15 Val Gln Arg Gly Met Ala Gly Val Asn Pro Gln Asn Tyr Gln Asn Met 20 25 30 Leu Arg Gln Leu Thr Arg Gln Ala Gln Ala Ala Asp Glu Leu Gly Tyr 35 40 45 Trp Gly Leu Ser Phe Thr Glu His His Phe His Thr Glu Gly Phe Glu 50 55 60 Val Ser Asn Asn Pro Ile Met Leu Gly Leu Tyr Leu Gly Met Gln Thr 65 70 75 80 Arg His Ile Arg Val Gly Gln Met Ala Asn Val Leu Pro Leu His Asn 85 90 95 Pro Leu Arg Leu Ala Glu Asp Leu Ala Met Leu Asp His Met Thr Arg 100 105 110 Gly Arg Ala Phe Val Gly Ile Ala Arg Gly Phe Gln Lys Arg Trp Ala 115 120 125 Asp Ile Met Gly Gln Val Tyr Gly Val Gly Gly Thr Leu Ser Asp Ala 130 135 140 Gly Glu Arg Asp Arg Arg Asn Arg Ala Leu Phe Glu Glu His Trp Glu 145 150 155 160 Ile Ile Lys Lys Ala Trp Thr Thr Glu Thr Phe Thr His Ser Gly Glu 165 170 175 Gln Trp Thr Ile Pro Val Pro Asp Leu Glu Phe Pro Tyr Glu Ala Val 180 185 190 Arg Arg Tyr Gly Arg Gly Leu Asp Glu Asn Gly Val Ile Arg Glu Val 195 200 205 Gly Ile Ala Pro Lys Pro Tyr Gln Arg Pro His Pro Pro Val Phe Gln 210 215 220 Pro Phe Ser Phe Ser Glu Asp Thr Phe Arg Phe Cys Ala Arg Glu Gly 225 230 235 240 Val Val Pro Ile Leu Met Asn Thr Asp Asp Gln Ile Val Ala Arg Leu 245 250 255 Met Asp Ile Tyr Arg Glu Glu Ala Glu Ala Ala Gly His Gly Thr Leu
260 265 270 Arg Arg Gly Glu Arg Val Gly Val Met Lys Asp Val Leu Val Ser Arg 275 280 285 Asp Ser Gly Glu Ala His His Trp Ala Ser Arg Gly Gly Gly Phe Ile 290 295 300 Phe Glu Asn Trp Phe Gly Pro Met Gly Phe Thr Glu Ala Leu Arg Ala 305 310 315 320 Thr Gly Glu Thr Gly Pro Ile Gly Ser Asp Tyr Lys Thr Leu Val Asp 325 330 335 Arg Gly Leu Glu Trp Val Gly Thr Pro Asp Asp Ile Asn Arg Met Ile 340 345 350 Glu Lys Leu Val Glu Arg His Asp Pro Glu Tyr Leu Leu Gln Cys Gln 355 360 365 Tyr Ser Gly Leu Ile Pro His Asp Val Gln Leu Arg Ser Leu Glu Leu 370 375 380 Trp Ala Thr Glu Ile Ala Pro Asn Trp Leu 385 390 (2) INFORMATION FOR SEQ ID NO: 18: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 220 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18 Met Pro Gly Ser Gly Leu Glu Ala Leu Asp Arg Ala Thr Leu Ile His 1 5 10 15 Pro Thr Leu Ser Gly Asn Thr Ala Glu Arg Ile Val Leu Thr Ser Gly 20 25 30 Ser Gly Ser Arg Val Arg Asp Thr Asp Gly Arg Glu Tyr Leu Asp Ala 35 40 45 Ser Ala Val Leu Gly Val Thr Gln Val Gly His Gly Arg Ala Glu Leu 50 55 60 Ala Arg Val Ala Ala Glu Gln Met Ala Arg Leu Glu Tyr Phe His Thr 65 70 75 80 Trp Gly Thr Ile Ser Asn Asp Arg Ala Val Glu Leu Ala Ala Arg Leu 85 90 95 Val Gly Leu Ser Pro Glu Pro Leu Thr Arg Val Tyr Phe Thr Ser Gly 100 105 110 Gly Ala Glu Gly Asn Glu Ile Ala Leu Arg Met Ala Arg Leu Tyr His 115 120 125 His Arg Arg Gly Glu Ser Ala Arg Thr Trp Ile Leu Ser Arg Arg Ser 130 135 140 Ala Tyr His Gly Val Gly Tyr Gly Ser Gly Gly Val Thr Gly Phe Pro 145 150 155 160 Ala Tyr His Gln Gly Phe Gly Pro Ser Leu Pro Asp Val Asp Phe Leu 165 170 175 Thr Pro Pro Gln Pro Tyr Arg Arg Glu Leu Phe Ala Gly Ser Asp Val 180 185 190 Thr Asp Phe Cys Leu Ala Glu Leu Arg Glu Thr Ile Asp Arg Ile Gly 195 200 205 Pro Glu Arg Ile Ala Ala Met Ile Gly Glu Pro Ile 210 215 220 (2) INFORMATION FOR SEQ ID NO: 19: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 324 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protei (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19 Met Thr Ser Val Asp Cys Thr Ala Tyr Gly Pro Glu Leu Arg Ala Leu 1 5 10 15 Ala Ala Arg Leu Pro Arg Thr Pro Arg Ala Asp Leu Tyr Ala Phe Leu 20 25 30 Asp Ala Ala His Thr Ala Ala Ala Ser Leu Pro Gly Ala Leu Ala Thr 35 40 45 Ala Leu Asp Thr Phe Asn Ala Glu Gly Ser Glu Asp Gly His Leu Leu 50 55 60 Leu Arg Gly Leu Pro Val Glu Ala Asp Ala Asp Leu Pro Thr Thr Pro 65 70 75 80 Ser Ser Thr Pro Ala Pro Glu Asp Arg Ser Leu Leu Thr Met Glu Ala 85 90 95 Met Leu Gly Leu Val Gly Arg Arg Leu Gly Leu His Thr Gly Tyr Arg 100 105 110 Glu Leu Arg Ser Gly Thr Val Tyr His Asp Val Tyr Pro Ser Pro Gly 115 120 125 Ala His His Leu Ser Ser Glu Thr Ser Glu Thr Leu Leu Glu Phe His 130 135 140 Thr Glu Met Ala Tyr His Arg Leu Gln Pro Asn Tyr Val Met Leu Ala 145 150 155 160 Cys Ser Arg Ala Asp His Glu Arg Thr Ala Ala Thr Leu Val Ala Ser 165 170 175 Val Arg Lys Ala Leu Pro Leu Leu Asp Glu Arg Thr Arg Ala Arg Leu 180 185 190 Leu Asp Arg Arg Met Pro Cys Cys Val Asp Val Ala Phe Arg Gly Gly 195 200 205 Val Asp Asp Pro Gly Ala Ile Ala Gln Val Lys Pro Leu Tyr Gly Asp 210 215 220 Ala Asp Asp Pro Phe Leu Gly Tyr Asp Arg Glu Leu Leu Ala Pro Glu 225 230 235 240 Asp Pro Ala Asp Lys Glu Ala Val Ala Ala Leu Ser Lys Ala Leu Asp 245 250 255 Glu Val Thr Glu Ala Val Tyr Leu Glu Pro Gly Asp Leu Leu Ile Val 260 265 270 Asp Asn Phe Arg Thr Thr His Ala Arg Thr Pro Phe Ser Pro Arg Trp 275 280 285 Asp Gly Lys Asp Arg Trp Leu His Arg Val Tyr Ile Arg Thr Asp Arg 290 295 300 Asn Gly Gln Leu Ser Gly Gly Glu Arg Ala Gly Asp Val Val Ala Phe 305 310 315 320 Thr Pro Arg Gly
* * * * *
Field of search:
No:252.3 - Transformants (e.g., recombinant DNA or vector or foreign or exogenous gene containing, fused bacteria, etc.)
No:252.35 - Streptomyces
No:320.1 - VECTOR, PER SE (E.G., PLASMID, HYBRID PLASMID, COSMID, VIRAL VECTOR, BACTERIOPHAGE VECTOR, ETC.)
No:240 - Chemistry: molecular biology and microbiology
No:23.2 - Encodes an enzyme
No:23.7 - Encodes a microbial polypeptide
Browse by classes
Automotives and Transportation
Business and Commerce
Fashion and Accessories
Hardware and Tools
Health and Medicine
Materials and Material Science
Paper and Office Materials