Home
Agents
Assignees
Inventors
Examiners
Contact
Links

Nucleic acid and amino acid sequences relating to pseudomonas aeruginosa for diagnostics and therapeutics


No:

6551795 -

Application no:

09252991 -

Filed date:

1999-02-18 -

Issue date:

2003-04-22



Abstract:


The invention provides isolated polypeptide and nucleic acid sequences derived from Pseudomonas aeruginosa that are useful in diagnosis and therapy of pathological conditions; antibodies against the polypeptides; and methods for the production of the polypeptides. The invention also provides methods for the detection, prevention and treatment of pathological conditions resulting from bacterial infection.

US Classes:



Inventors:



Agents:


Assignees:


Claims:


What is claimed is:

1. An isolated nucleic acid comprising a nucleotide sequence encoding a P. aeruginosa polypeptide selected from the group consisting of SEQ ID NO: 17411, SEQ ID NO: 17456, SEQ ID NO: 18830, SEQ ID NO: 21016, SEQ ID NO: 21380, SEQ ID NO: 22234, SEQ ID NO: 24200, SEQ ID NO: 27183, SEQ ID NO: 27280, and SEQ ID NO: 32329.

2. A recombinant expression vector comprising the nucleic acid of claim 1 operably linked to a transcription regulatory element.

3. A cell comprising a recombinant expression vector of claim 2.

4. A method for producing a P. aeruginosa polypeptide comprising culturing a cell of claim 3 under conditions that permit expression of the polypeptide.

5. An isolated nucleic acid comprising a nucleotide sequence encoding a P. aeruginosa polypeptide, said nucleic acid selected from the group consisting of SEQ ID NO: 840, SEQ ID NO: 885, SEQ ID NO: 2259, SEQ ID NO: 4445, SEQ ID NO: 4809, SEQ ID NO: 5663, SEQ ID NO: 7629, SEQ ID NO: 10612, SEQ ID NO: 10709, and SEQ ID NO: 15758.

6. A recombinant expression vector comprising the nucleic acid of claim 5 operably linked to a transcription regulatory element.

7. A cell comprising a recombinant expression vector of claim 6.

8. A method for producing a P. aeruginosa polypeptide comprising culturing a cell of claim 7 under conditions that permit expression of the polypeptide.

9. A probe comprising a nucleotide sequence of at least 30 nucleotides of a nucleotide sequence selected from the group consisting of SEQ ID NO: 4445, SEQ ID NO: 5663, SEQ ID NO: 10612, SEQ ID NO: 10709, and SEQ ID NO: 15758.

10. An isolated nucleic acid or the complement thereof comprising a nucleotide sequence of at least 40 nucleotides in length selected from the group consisting of SEQ ID NO: 4445, SEQ ID NO: 5663, SEQ ID NO: 10612, SEQ ID NO: 10709, and SEQ ID NO: 15758, wherein the sequence is hybridizable under conditions of high stringency to a nucleic acid having a nucleotide sequence selected from the group consisting of SEQ ID NO: 4445, SEQ ID NO: 5663, SEQ ID NO: 10612, SEQ ID NO: 10709, and SEQ ID NO: 15758, or the complement thereof.

11. An isolated nucleic acid comprising a nucleotide sequence encoding a P. aeruginosa polypeptide selected from the group consisting of SEQ ID NO: 17456, SEQ ID NO: 18830, SEQ ID NO: 21016, SEQ ID NO: 21380, SEQ ID NO: 22234, SEQ ID NO: 27183, SEQ ID NO: 27280, and SEQ ID NO: 32329.

12. A recombinant expression vector comprising the nucleic acid of claim 11 operably linked to a transcription regulatory element.

13. A cell comprising a recombinant expression vector of claim 12.

14. A method for producing a P. aeruginosa polypeptide comprising culturing a cell of claim 13 under conditions that permit polypeptide expression.

15. An isolated nucleic acid comprising a nucleotide sequence with at least 80% sequence identity to a nucleotide sequence selected from the group consisting of SEQ ID NO: 885, SEQ ID NO: 2259, SEQ ID NO: 4445, SEQ ID NO: 4809, SEQ ID NO: 5663, SEQ ID NO: 10612, SEQ ID NO: 10709, and SEQ ID NO: 15758, and wherein said isolated nucleic acid encodes a P. aeruginosa polypeptide selected from the group consisting of SEQ ID NO: 17456, SEQ ID NO: 18830, SEQ ID NO: 21016, SEQ ID NO: 21380, SEQ ID NO: 22234, SEQ ID NO: 27183, SEQ ID NO: 27280, and SEQ ID NO: 32329, respectively.

16. A recombinant expression vector comprising the nucleic acid of claim 15 operably linked to a transcription regulatory element.

17. A cell comprising a recombinant expression vector of claim 16.

18. A method for producing a P. aeruginosa polypeptide comprising culturing a cell of claim 17 under conditions that permit polypeptide expression.

19. An isolated nucleic acid comprising a nucleotide sequence with at least 90% sequence identity to a nucleotide sequence selected from the group consisting of SEQ ID NO: 885, SEQ ID NO: 2259, SEQ ID NO: 4445, SEQ ID NO: 4809, SEQ ID NO: 5663, SEQ ID NO: 10612, SEQ ID NO: 10709, and SEQ ID NO: 15758 and wherein said isolated nucleic acid encodes a P. aeruginosa polypeptide selected from the group consisting of SEQ ID NO: 17456, SEQ ID NO: 18830, SEQ ID NO: 21016, SEQ ID NO: 21380, SEQ ID NO: 22234, SEQ ID NO: 27183, SEQ ID NO: 27280, and SEQ ID NO: 32329.

20. A recombinant expression vector comprising the nucleic acid of claim 19 operably linked to a transcription regulatory element.

21. A cell comprising a recombinant expression vector of claim 20.

22. A method for producing a P. aeruginosa polypeptide comprising culturing a cell of claim 21 under conditions that permit polypeptide expression.

23. An isolated nucleic acid comprising a nucleotide sequence encoding a P. aeruginosa polypeptide, said nucleic acid selected from the group consisting of SEQ ID NO: 885, SEQ ID NO: 2259, SEQ ID NO: 4445, SEQ ID NO: 4809, SEQ ID NO: 5663, SEQ ID NO: 10612, SEQ ID NO: 10709, and SEQ ID NO: 15758.

24. A recombinant expression vector comprising the nucleic acid of claim 23 operably linked to a transcription regulatory element.

25. A cell comprising a recombinant expression vector of claim 23.

26. A method for producing a P. aeruginosa polypeptide comprising culturing a cell of claim 25 under conditions that permit polypeptide expression.


Text:


FIELD OF THE INVENTION

The invention relates to isolated nucleic acids and polypeptides derived from Pseudomonas aeruginosa that are useful as molecular targets for diagnostics, prophylaxis and treatment of pathological conditions, as well as materials and methods for the diagnosis, prevention, and amelioration of pathological conditions resulting from bacterial infection.

BRIEF DESCRIPTION OF THE SEQUENCE LISTING

Incorporated herein by reference in its entirety is a Sequence Listing, comprising SEQ ID NO: 1 to SEQ ID NO: 33,142. The Sequence Listing is contained on a CD-ROM, three copies of which are filed, the Sequence Listing being in a computer-readable ASCII file named “Path9904.pto”, created on Oct. 20, 2000 and of 68,982,000 bytes in size, in IBM-PC Windows®NT v4.0 format.

BACKGROUND OF THE INVENTION

Pseudomonas aeruginosa (P. aeruginosa) is an aerobic, motile, gram-negative, rod. P. aeruginosa normally inhabits soil, water, and vegetation. Although it seldom causes disease in healthy people, P. aeruginosa is an opportunistic pathogen which accounts for ˜10% of all nosocomial infections (National Nosocomial Infection Survey report-Data Summary from October 1986-April 1996). P. aeruginosa is the most common pathogen affecting Cystic Fibrosis patients with 61% of the specimens culturing positive (Govan, J. R. W. and V. Deretic, 1996, Microbiol. Reviews, 60(3):530-574) as well as one of the two most common pathogens observed in intensive care units (Jarvis, W. R. et al., 1992, J. Antimicrob. Chemother., 29(a supp.):19-24). Mortality from some P. aeruginosa infections can be as high as 50%. Presently, P. aeruginosa infection can still be effectively controlled by antibiotics particularly using a combination of drugs. However, resistance to several of the common antibiotics has been shown and is particularly problematic in ICUs (Archibald, L. et al., 1997, Clin. Infectious Dis., 24(2):211-215; Fish, D. N., et al., 1995, Pharmacotherapy, 15(3):279-291). In addition, P. aeruginosa has already demonstrated mechanisms for acquiring plasmids containing antibiotic resistance genes (Jakoby, G. A. (1986), The bacteria, Vol. X, The biology of Pseudomonas, pp. 265-294, J. R. Sokach (ed.) Academic Press, London) and at present thare are no approved vaccines for Pseudomonas infection.

Like many other bacterial species, strain variability in P. aeruginosa is quite significant. Variability has been shown to occur by a number of different mechanisms, these include but are not limited to the integration into the genome of prophages (Zierdt, C. H. and P. J. Schmidt, 1964, J. Bacteriol. 87:1003-1010), the addition of the cytotoxin gene and pyocins from bacteriophages (Hayashi, T., et al., 1994, FEMS Microbiol. Lett. 122:239-244) and via transposons (Sinclair, M. I. and B. W. Holloway, 1982, J. Bacteriol. 151:569-579). Through this type of diversity, new pathogenic mechanisms have been incorporated into P. aeruginosa. These and other transitions such as the conversion to the mucoidy phenotype commonly seen in CF clearly illustrate the need for continued vigilance.

These concerns point to the need for diagnostic tools and therapeutics aimed at proper identification of strain and eradication of virulence. The design of vaccines that will limit the spread of infection and halt transfer of resistance factors is very desirable.

SUMMARY OF THE INVENTION

The present invention fulfills the need for diagnostic tools and therapeutics by providing bacterial-specific compositions and methods for detecting Pseudomonas species including P. aeruginosa, as well as compositions and methods useful for treating and preventing Pseudomonas infection, in particular, P. aeruginosa infection, in vertebrates including mammals.

The present invention encompasses isolated nucleic acids and polypeptides derived from P. aeruginosa that are useful as reagents for diagnosis of bacterial disease, components of effective antibacterial vaccines, and/or as targets for antibacterial drugs including anti-P. aeruginosa drugs. They can also be used to detect the presence of P. aeruginosa and other Pseudomonas species in a sample; and in screening compounds for the ability to interfere with the P. aeruginosa life cycle or to inhibit P. aeruginosa infection. They also has use as biocontrol agents for plants.

In one aspect, the invention features compositions of nucleic acids corresponding to entire coding sequences of P. aeruginosa proteins, including surface or secreted proteins or parts thereof, nucleic acids capable of binding mRNA from P. aeruginosa proteins to block protein translation, and methods for producing P. aeruginosa proteins or parts thereof using peptide synthesis and recombinant DNA techniques. This invention also features antibodies and nucleic acids useful as probes to detect P. aeruginosa infection. In addition, vaccine compositions and methods for protection against or treatment of infection by P. aeruginosa are within the scope of this invention.

The nucleotide sequences provided in SEQ ID NO: 1-SEQ ID NO: 16571, a fragment thereof, or a nucleotide sequence at least about 99.5% identical to a sequence contained within SEQ ID NO: 1-SEQ ID NO: 16571 may be “provided” in a variety of medias to facilitate use thereof. As used herein, “provided” refers to a manufacture, other than an isolated nucleic acid molecule, which contains a nucleotide sequence of the present invention, i.e., the nucleotide sequence provided in SEQ ID NO: 1-SEQ ID NO: 16571, a fragment thereof, or a nucleotide sequence at least about 99.5% identical to a sequence contained within SEQ ID NO: 1-SEQ ID NO: 16571. Uses for and methods for providing nucleotide sequences in a variety of media are well known in the art (see e.g., EPO Publication No. EP 0 756 006).

In one application of this embodiment, a nucleotide sequence of the present invention can be recorded on computer readable media. As used herein, “computer readable media” refers to any media which can be read and accessed directly by a computer. Such media include, but are not limited to: magnetic storage media, such as floppy discs, hard disc storage media, and magnetic tape; optical storage media such as CD-ROM; electrical storage media such as RAM and ROM; and hybrids of these categories such as magnetic/optical storage media. A person skilled in the art can readily appreciate how any of the presently known computer readable media can be used to create a manufacture comprising computer readable media having recorded thereon a nucleotide sequence of the present invention.

As used herein, “recorded” refers to a process for storing information on computer readable media. A person skilled in the art can readily adopt any of the presently known methods for recording information on computer readable media to generate manufactures comprising the nucleotide sequence information of the present invention.

A variety of data storage structures are available to a person skilled in the art for creating a computer readable media having recorded thereon a nucleotide sequence of the present invention. The choice of the data storage structure will generally be based on the means chosen to access the stored information. In addition, a variety of data processor programs and formats can be used to store the nucleotide sequence information of the present invention on computer readable media. The sequence information can be represented in a word processing text file, formatted in commercially-available software such as WordPerfect and Microsoft Word, or represented in the form of an ASCII file, stored in a database application, such as DB2, Sybase, Oracle, or the like. A person skilled in the art can readily adapt any number of data processor structuring formats (e.g. text file or database) in order to obtain computer readable media having recorded thereon the nucleotide sequence information of the present invention.

By providing the nucleotide sequence of SEQ ID NO: 1-SEQ ID NO: 16571, a fragment thereof, or a nucleotide sequence at least about 99.5% identical to SEQ ID NO: 1-SEQ ID NO: 16571 in computer readable form, a person skilled in the art can routinely access the coding sequence information for a variety of purposes. Computer software is publicly available which allows a person skilled in the art to access sequence information provided in a computer readable media. Examples of such computer software include programs of the “Staden Package”, “DNA Star”, “MacVector”, GCG “Wisconsin Package” (Genetics Computer Group, Madison, Wis.) and “NCBI Toolbox” (National Center For Biotechnology Information). Suitable programs are described, for example, in Martin J. Bishop, ed., Guide to Human Genome Computing, 2d Edition, Academic Press, San Diego, Calif. (1998); and Leonard F. Peruski, Jr., and Anne Harwood Peruski, The Internet and the New Biology: Tools for Genomic and Molecular Research, American Society for Microbiology, Washington, D.C. (1997).

Computer algorithms enable the identification of P. aeruginosa open reading frames (ORFs) within SEQ ID NO: 1-SEQ ID NO: 16571 which contain homology to ORFs or proteins from other organisms. Examples of such similarity-search algorithms include the BLAST [Altschul et al., J. Mol. Biol. 215:403-410 (1990)] and Smith-Waterman [Smith and Waterman (1981) Advances in Applied Mathematics, 2:482-489] search algorithms. Suitable search algorithms are described, for example, in Martin J. Bishop, ed., Guide to Human Genome Computing, 2d Edition, Academic Press, San Diego, Calif. (1998); and Leonard F. Peruski, Jr., and Anne Harwood Peruski, The Internet and the New Biology: Tools for Genomic and Molecular Research, American Society for Microbiology, Washington, D.C. (1997). Such algorithms are utilized on computer systems as exemplified below. The ORFs so identified represent protein encoding fragments within the P. aeruginosa genome and are useful in producing commercially important proteins such as enzymes used in fermentation reactions and in the production of commercially useful metabolites.

The present invention further provides systems, particularly computer-based systems, which contain the sequence information described herein. Such systems are designed to identify commercially important fragments of the P. aeruginosa genome. As used herein, “a computer-based system” refers to the hardware means, software means, and data storage means used to analyze the nucleotide sequence information of the present invention. The minimum hardware means of the computer-based systems of the present invention comprises a central processing unit (CPU), input means, output means, and data storage means. A person skilled in the art can readily appreciate that any one of the currently available computer-based systems is suitable for use in the present invention. The computer-based systems of the present invention comprise a data storage means having stored therein a nucleotide sequence of the present invention and the necessary hardware means and software means for supporting and implementing a search means. As used herein, “data storage means” refers to memory which can store nucleotide sequence information of the present invention, or a memory access means which can access manufactures having recorded thereon the nucleotide sequence information of the present invention.

As used herein, “search means” refers to one or more programs which are implemented on the computer-based system to compare a target sequence or target structural motif with the sequence information stored within the data storage means. Search means are used to identify fragments or regions of the P. aeruginosa genome which are similar to, or “match”, a particular target sequence or target motif. A variety of known algorithms are known in the art and have been disclosed publicly, and a variety of commercially available software for conducting homology-based similarity searches are available and can be used in the computer-based systems of the present invention. Examples of such software includes, but is not limited to, FASTA (GCG Wisconsin Package), Bic_SW (Compugen Bioccelerator), BLASTN2, BLASTP2, BLASTD2 (NCBI) and Motifs (GCG). Suitable software programs are described, for example, in Martin J. Bishop, ed., Guide to Human Genome Computing, 2d Edition, Academic Press, San Diego, Calif. (1998); and Leonard F. Peruski, Jr., and Anne Harwood Peruski, The Internet and the New Biology: Tools for Genomic and Molecular Research, American Society for Microbiology, Washington, D.C. (1997). A person skilled in the art can readily recognize that any one of the available algorithms or implementing software packages for conducting homology searches can be adapted for use in the present computer-based systems.

As used herein, a “target sequence” can be any DNA or amino acid sequence of six or more nucleotides or two or more amino acids. A person skilled in the art can readily recognize that the longer a target sequence is, the less likely a target sequence will be present as a random occurrence in the database. The most preferred sequence length of a target sequence is from about 10 to 100 amino acids or from about 30 to 300 nucleotide residues. However, it is well recognized that many genes are longer than 500 amino acids, or 1.5 kb in length, and that commercially important fragments of the P. aeruginosa genome, such as sequence fragments involved in gene expression and protein processing, will often be shorter than 30 nucleotides.

As used herein, “a target structural motif,” or “target motif,” refers to any rationally selected sequence or combination of sequences in which the sequence(s) are chosen based on a specific functional domain or three-dimensional configuration which is formed upon the folding of the target polypeptide. There are a variety of target motifs known in the art. Protein target motifs include, but are not limited to, enzymatic active sites, membrane-spanning regions, and signal sequences. Nucleic acid target motifs include, but are not limited to, promoter sequences, hairpin structures and inducible expression elements (protein binding sequences).

A variety of structural formats for the input and output means can be used to input and output the information in the computer-based systems of the present invention. A preferred format for an output means ranks fragments of the P. aeruginosa genome possessing varying degrees of homology to the target sequence or target motif. Such presentation provides a person skilled in the art with a ranking of sequences which contain various amounts of the target sequence or target motif and identifies the degree of homology contained in the identified fragment.

A variety of comparing means can be used to compare a target sequence or target motif with the data storage means to identify sequence fragments of the P. aeruginosa genome. In the present examples, implementing software which implement the BLASTP2 and bic_SW algorithms (Altschul et al., J Mol. Biol. 215:403-410 (1990); Compugen Biocellerator) was used to identify open reading frames within the P. aeruginosa genome. A person skilled in the art can readily recognize that any one of the publicly available homology search programs can be used as the search means for the computer-based systems of the present invention. Suitable programs are described, for example, in Martin J. Bishop, ed., Guide to Human Genome Computing, 2d Edition, Academic Press, San Diego, Calif. (1998); and Leonard F. Peruski, Jr., and Anne Harwood Peruski, The Internet and the New Biology: Tools for Genomic and Molecular Research, American Society for Microbiology, Washington, D.C. (1997).

The invention features P. aeruginosa polypeptides, preferably a substantially pure preparation of a P. aeruginosa polypeptide, or a recombinant P. aeruginosa polypeptide. In preferred embodiments: the polypeptide has biological activity; the polypeptide has an amino acid sequence at least about 60%, 70%, 80%, 90%, 95%, 98%, or 99% identical to an amino acid sequence of the invention contained in the Sequence Listing, preferably it has about 65% sequence identity with an amino acid sequence of the invention contained in the Sequence Listing, and most preferably it has about 92% to about 99% sequence identity with an amino acid sequence of the invention contained in the Sequence Listing; the polypeptide has an amino acid sequence essentially the same as an amino acid sequence of the invention contained in the Sequence Listing; the polypeptide is at least about 5, 10, 20, 50, 100, or 150 amino acid residues in length; the polypeptide includes at least about 5, preferably at least about 10, more preferably at least about 20, more preferably at least about 50, 100, or 150 contiguous amino acid residues of the invention contained in the Sequence Listing. In yet another preferred embodiment, the amino acid sequence which differs in sequence identity by about 7% to about 8% from the P. aeruginosa amino acid sequences of the invention contained in the Sequence Listing is also encompassed by the invention.

In preferred embodiments: the P. aeruginosa polypeptide is encoded by a nucleic acid of the invention contained in the Sequence Listing, or by a nucleic acid having at least about 60%, 70%, 80%, 90%, 95%, 98%, or 99% homology with a nucleic acid of the invention contained in the Sequence Listing.

In a preferred embodiment, the subject P. aeruginosa polypeptide differs in amino acid sequence at 1, 2, 3, 5, 10 or more residues from a sequence of the invention contained in the Sequence Listing. The differences, however, are such that the P. aeruginosa polypeptide exhibits a P. aeruginosa biological activity, e.g., the P. aeruginosa polypeptide retains a biological activity of a naturally occurring P. aeruginosa enzyme.

In preferred embodiments, the polypeptide includes all or a fragment of an amino acid sequence of the invention contained in the Sequence Listing; fused, in reading frame, to additional amino acid residues, preferably to residues encoded by genomic DNA 5′ or 3′ to the genomic DNA which encodes a sequence of the invention contained in the Sequence Listing.

In yet other preferred embodiments, the P. aeruginosa polypeptide is a recombinant fusion protein having a first P. aeruginosa polypeptide portion and a second polypeptide portion, e.g., a second polypeptide portion having an amino acid sequence unrelated to P. aeruginosa. The second polypeptide portion can be, e.g., any of glutathione-S-transferase, a DNA binding domain, or a polymerase activating domain. In preferred embodiment the fusion protein can be used in a two-hybrid assay.

Polypeptides of the invention include those which arise as a result of alternative transcription events, alternative RNA splicing events, and alternative translational and postranslational events.

In a preferred embodiment, the encoded P. aeruginosa polypeptide differs (e.g., by amino acid substitution, addition or deletion of at least one amino acid residue) in amino acid sequence at 1, 2, 3, 5, 10 or more residues, from a sequence of the invention contained in the Sequence Listing. The differences, however, are such that: the P. aeruginosa encoded polypeptide exhibits a P. aeruginosa biological activity, e.g., the encoded P. aeruginosa enzyme retains a biological activity of a naturally occurring P. aeruginosa.

In preferred embodiments, the encoded polypeptide includes all or a fragment of an amino acid sequence of the invention contained in the Sequence Listing; fused, in reading frame, to additional amino acid residues, preferably to residues encoded by genomic DNA 5′ or 3′ to the genomic DNA which encodes a sequence of the invention contained in the Sequence Listing.

The P. aeruginosas strain from which the nucleotide sequences of the invention have been sequenced was deposited on Jul. 18, 1997 in the American Type Culture Collection (ATCC #202004) as strain 19804.

Included in the invention are: allelic variations; natural mutants; induced mutants; proteins encoded by DNA that hybridize under high or low stringency conditions to a nucleic acid which encodes a polypeptide of the invention contained in the Sequence Listing (for definitions of high and low stringency see Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1989, 6.3.1-6.3.6, hereby incorporated by reference); and, polypeptides specifically bound by antisera to P. aeruginosa polypeptides, especially by antisera to an active site or binding domain of P. aeruginosa polypeptide. The invention also includes fragments, preferably biologically active fragments. These and other polypeptides are also referred to herein as P. aeruginosa polypeptide analogs or variants.

The invention further provides nucleic acids, e.g., RNA or DNA, encoding a polypeptide of the invention. This includes double stranded nucleic acids as well as coding and antisense single strands.

In preferred embodiments, the subject P. aeruginosa nucleic acid will include a transcriptional regulatory sequence, e.g. at least one of a transcriptional promoter or transcriptional enhancer sequence, operably linked to the P. aeruginosa gene sequence, e.g., to render the P. aeruginosa gene sequence suitable for expression in a recombinant host cell.

In yet a further preferred embodiment, the nucleic acid which encodes a P. aeruginosa polypeptide of the invention, hybridizes under stringent conditions to a nucleic acid probe corresponding to at least about 8 consecutive nucleotides of the invention contained in the Sequence Listing; more preferably to at least about 12 consecutive nucleotides of the invention contained in the Sequence Listing; more preferably to at least about 20 consecutive nucleotides of the invention contained in the Sequence Listing; more preferably still to at least about 40 consecutive nucleotides of the invention contained in the Sequence Listing.

In another aspect, the invention provides a substantially pure nucleic acid having a nucleotide sequence which encodes a P. aeruginosa polypeptide. In preferred embodiments: the encoded polypeptide has biological activity; the encoded polypeptide has an amino acid sequence at least about 60%, 70%, 80%, 90%, 95%, 98%, or 99% homologous to an amino acid sequence of the invention contained in the Sequence Listing; the encoded polypeptide has an amino acid sequence essentially the same as an amino acid sequence of the invention contained in the Sequence Listing; the encoded polypeptide is at least about 5, 10, 20, 50, 100, or 150 amino acids in length; the encoded polypeptide comprises at least about 5, preferably at least about 10, more preferably at least about 20, still more preferably at least about 50, 100, or 150 contiguous amino acids of the invention contained in the Sequence Listing.

In another aspect, the invention encompasses: a vector including a nucleic acid which encodes a P. aeruginosa polypeptide or a P. aeruginosa polypeptide variant as described herein; a host cell transfected with the vector; and a method of producing a recombinant P. aeruginosa polypeptide or P. aeruginosa polypeptide variant; including culturing the cell, e.g., in a cell culture medium, and isolating an P. aeruginosa or P. aeruginosa polypeptide variant, e.g., from the cell or from the cell culture medium.

One embodiment of the invention is directed to substantially isolated nucleic acids. Nucleic acids of the invention include sequences comprising at least about 8 nucleotides in length, more preferably at least about 12 nucleotides in length, even more preferably at least about 15-20 nucleotides in length, that correspond to a subsequence of any one of SEQ ID NO: 1-SEQ ID NO: 16571 or complements thereof. Alternatively, the nucleic acids comprise sequences contained within any ORF (open reading frame), including a complete protein-coding sequence, of which any of SEQ ID NO: 1-SEQ ID NO: 16571 forms a part. The invention encompasses sequence-conservative variants and function-conservative variants of these sequences. The nucleic acids may be DNA, RNA, DNA/RNA duplexes, protein-nucleic acid (PNA), or derivatives thereof.

In another aspect, the invention features, a purified recombinant nucleic acid having at least about 50%, 60%, 70%, 80%, 90%, 95%, 98%, or 99% homology with a sequence of the invention contained in the Sequence Listing

The invention also encompasses recombinant DNA (including DNA cloning and expression vectors) comprising these P. aeruginosa-derived sequences; host cells comprising such DNA, including fungal, bacterial, yeast, plant, insect, and mammalian host cells; and methods for producing expression products comprising RNA and polypeptides encoded by the P. aeruginosa sequences. These methods are carried out by incubating a host cell comprising a P. aeruginosa-derived nucleic acid sequence under conditions in which the sequence is expressed. The host cell may be native or recombinant. The polypeptides can be obtained by (a) harvesting the incubated cells to produce a cell fraction and a medium fraction; and (b) recovering the P. aeruginosa polypeptide from the cell fraction, the medium fraction, or both. The polypeptides can also be made by in vitro translation.

In another aspect, the invention features nucleic acids capable of binding mRNA of P. aeruginosa. Such nucleic acid is capable of acting as antisense nucleic acid to control the translation of mRNA of P. aeruginosa. A further aspect features a nucleic acid which is capable of binding specifically to a P. aeruginosa nucleic acid. These nucleic acids are also referred to herein as complements and have utility as probes and as capture reagents.

In another aspect, the invention features an expression system comprising an open reading frame corresponding to P. aeruginosa nucleic acid. The nucleic acid further comprises a control sequence compatible with an intended host. The expression system is useful for making polypeptides corresponding to P. aeruginosa nucleic acid.

In another aspect, the invention encompasses: a vector including a nucleic acid which encodes a P. aeruginosa polypeptide or a P. aeruginosa polypeptide variant as described herein; a host cell transfected with the vector; and a method of producing a recombinant P. aeruginosa polypeptide or P. aeruginosa polypeptide variant; including culturing the cell, e.g., in a cell culture medium, and isolating the P. aeruginosa or P. aeruginosa polypeptide variant, e.g., from the cell or from the cell culture medium.

In yet another embodiment of the invention encompasses reagents for detecting bacterial infection, including P. aeruginosa infection, which comprise at least one P. aeruginosa-derived nucleic acid defined by any one of SEQ ID NO: 1-SEQ ID NO: 16571, or sequence-conservative or function-conservative variants thereof. Alternatively, the diagnostic reagents comprise polypeptide sequences that are contained within any open reading frames (ORFs), including complete protein-coding sequences, contained within any of SEQ ID NO: 1-SEQ ID NO: 16571, or polypeptide sequences contained within any of SEQ ID NO: 16572-SEQ ID NO: 33142, or polypeptides of which any of the above sequences forms a part, or antibodies directed against any of the above peptide sequences or function-conservative variants and/or fragments thereof.

The invention further provides antibodies, preferably monoclonal antibodies, which specifically bind to the polypeptides of the invention. Methods are also provided for producing antibodies in a host animal. The methods of the invention comprise immunizing an animal with at least one P. aeruginosa-derived immunogenic component, wherein the immunogenic component comprises one or more of the polypeptides encoded by any one of SEQ ID NO: 1-SEQ ID NO: 16571 or sequence-conservative or function-conservative variants thereof; or polypeptides that are contained within any ORFs, including complete protein-coding sequences, of which any of SEQ ID NO: 1-SEQ ID NO: 16571 forms a part; or polypeptide sequences contained within any of SEQ ID NO: 16572-SEQ ID NO: 33142; or polypeptides of which any of SEQ ID NO: 16572-SEQ ID NO: 33142 forms a part. Host animals include any warm blooded animal, including without limitation mammals and birds. Such antibodies have utility as reagents for immunoassays to evaluate the abundance and distribution of P. aeruginosa-specific antigens.

In yet another aspect, the invention provides diagnostic methods for detecting P. aeruginosa antigenic components or anti-P. aeruginosa antibodies in a sample. P. aeruginosa antigenic components are detected by a process comprising: (i) contacting a sample suspected to contain a bacterial antigenic component with a bacterial-specific antibody, under conditions in which a stable antigen-antibody complex can form between the antibody and bacterial antigenic components in the sample; and (ii) detecting any antigen-antibody complex formed in step (i), wherein detection of an antigen-antibody complex indicates the presence of at least one bacterial antigenic component in the sample. In different embodiments of this method, the antibodies used are directed against a sequence encoded by any of SEQ ID NO: 1-SEQ ID NO: 16571 or sequence-conservative or function-conservative variants thereof, or against a polypeptide sequence contained in any of SEQ ID NO: 16572-SEQ ID NO: 33142 or function-conservative variants thereof.

In yet another aspect, the invention provides a method for detecting antibacterial-specific antibodies in a sample, which comprises: (i) contacting a sample suspected to contain antibacterial-specific antibodies with a P. aeruginosa antigenic component, under conditions in which a stable antigen-antibody complex can form between the P. aeruginosa antigenic component and antibacterial antibodies in the sample; and (ii) detecting any antigen-antibody complex formed in step (i), wherein detection of an antigen-antibody complex indicates the presence of antibacterial antibodies in the sample. In different embodiments of this method, the antigenic component is encoded by a sequence contained in any of SEQ ID NO: 1-SEQ ID NO: 16571 or sequence-conservative and function-conservative variants thereof, or is a polypeptide sequence contained in any of SEQ ID NO: 16572-SEQ ID NO: 33142 or function-conservative variants thereof.

In another aspect, the invention features a method of generating vaccines for immunizing an individual against P. aeruginosa. The method includes: immunizing a subject with a P. aeruginosa polypeptide, e.g., a surface or secreted polypeptide, or a combination of such peptides or active portion(s) thereof, and a pharmaceutically acceptable carrier. Such vaccines have therapeutic and prophylactic utilities.

In another aspect, the invention features a method of evaluating a compound, e.g. a polypeptide, e.g., a fragment of a host cell polypeptide, for the ability to bind a P. aeruginosa polypeptide. The method includes: contacting the Pseudomonas compound with a P. aeruginosa polypeptide and determining if the compound binds or otherwise interacts with a P. aeruginosa polypeptide. Compounds which bind P. aeruginosa are candidates as activators or inhibitors of the bacterial life cycle. These assays can be performed in vitro or in vivo.

In another aspect, the invention features a method of evaluating a compound, e.g. a polypeptide, e.g., a fragment of a host cell polypeptide, for the ability to bind a P. aeruginosa nucleic acid, e.g., DNA or RNA. The method includes: contacting the Pseudomonas compound with a P. aeruginosa nucleic acid and determining if the compound binds or otherwise interacts with a P. aeruginosa polypeptide. Compounds which bind P. aeruginosa are candidates as activators or inhibitors of the bacterial life cycle. These assays can be performed in vitro or in vivo.

A particularly preferred embodiment of the invention is directed to a method of screening test compounds for anti-bacterial activity, which method comprises: selecting as a target a bacterial specific sequence, which sequence is essential to the viability of a bacterial species; contacting a test compound with said target sequence; and selecting those test compounds which bind to said target sequence as potential anti-bacterial candidates. In one embodiment, the target sequence selected is specific to a single species, or even a single strain, i.e., the P. aeruginosa 19804. In a second embodiment, the target sequence is common to at least two species of bacteria. In a third embodiment, the target sequence is common to a family of bacteria. The target sequence may be a nucleic acid sequence or a polypeptide sequence. Methods employing sequences common to more than one species of microorganism may be used to screen candidates for broad spectrum anti-bacterial activity.

The invention also provides methods for preventing or treating disease caused by certain bacteria, including P. aeruginosa, which are carried out by administering to an animal in need of such treatment, in particular a warm-blooded vertebrate, including but not limited to birds and mammals, a compound that specifically inhibits or interferes with the function of a bacterial polypeptide or nucleic acid. In a particularly preferred embodiment, the mammal to be treated is human.

DETAILED DESCRIPTION OF THE INVENTION

The sequences of the present invention include the specific nucleic acid and amino acid sequences set forth in the Sequence Listing that forms a part of the present specification, and which are designated SEQ ID NO: 1-SEQ ID NO: 33142. Use of the terms “SEQ ID NO: 1-SEQ ID NO: 16571”, “SEQ ID NO: 16572-SEQ ID NO: 33142”, “the sequences depicted in Table 2”, and the like, is intended, for convenience, to refer to each individual SEQ ID NO individually, and is not intended to refer to the genus of these sequences unless such reference would be indicated. In other words, it is a shorthand for listing all of these sequences individually. The invention encompasses each sequence individually, as well as any combination thereof.

Definitions

“Nucleic acid” or “polynucleotide” as used herein refers to purine- and pyrimidine-containing polymers of any length, either polyribonucleotides or polydeoxyribonucleotides or mixed polyribo-polydeoxyribo nucleotides. This includes single- and double-stranded molecules, i.e., DNA—DNA, DNA-RNA and RNA—RNA hybrids, as well as “protein nucleic acids” (PNA) formed by conjugating bases to an amino acid backbone. This also includes nucleic acids containing modified bases.

A nucleic acid or polypeptide sequence that is “derived from” a designated sequence refers to a sequence that corresponds to a region of the designated sequence. For nucleic acid sequences, this encompasses sequences that are homologous or complementary to the sequence, as well as “sequence-conservative variants” and “function-conservative variants.” For polypeptide sequences, this encompasses “function-conservative variants.” Sequence-conservative variants are those in which a change of one or more nucleotides in a given codon position results in no alteration in the amino acid encoded at that position. Function-conservative variants are those in which a given amino acid residue in a polypeptide has been changed without altering the overall conformation and function of the native polypeptide, including, but not limited to, replacement of an amino acid with one having similar physico-chemical properties (such as, for example, acidic, basic, hydrophobic, and the like). “Function-conservative” variants also include any polypeptides that have the ability to elicit antibodies specific to a designated polypeptide.

An “P. aeruginosa-derived” nucleic acid or polypeptide sequence may or may not be present in other bacterial species, and may or may not be present in all P. aeruginosa strains. This term is intended to refer to the source from which the sequence was originally isolated. Thus, a P. aeruginosa-derived polypeptide, as used herein, may be used, e.g., as a target to screen for a broad spectrum antibacterial agent, to search for homologous proteins in other species of bacteria or in eukaryotic organisms such as bacteria humans, etc.

A purified or isolated polypeptide or a substantially pure preparation of a polypeptide are used interchangeably herein and, as used herein, mean a polypeptide that has been separated from other proteins, lipids, and nucleic acids with which it naturally occurs. Preferably, the polypeptide is also separated from substances, e.g., antibodies or gel matrix, e.g., polyacrylamide, which are used to purify it. Preferably, the polypeptide constitutes at least about 10, 20, 50 70, 80 or 95% dry weight of the purified preparation. Preferably, the preparation contains: sufficient polypeptide to allow protein sequencing; at least about 1, 10, or 100 mg of the polypeptide.

A purified preparation of cells refers to, in the case of plant or animal cells, an in vitro preparation of cells and not an entire intact plant or animal. In the case of cultured cells or microbial cells, it consists of a preparation of at least about 10% and more preferably 50% of the subject cells.

A purified or isolated or a substantially pure nucleic acid, e.g., a substantially pure DNA, (are terms used interchangeably herein) is a nucleic acid which is one or both of the following: not immediately contiguous with both of the coding sequences with which it is immediately contiguous (i.e., one at the 5′ end and one at the 3′ end) in the naturally-occurring genome of the organism from which the nucleic acid is derived; or which is substantially free of a nucleic acid with which it occurs in the organism from which the nucleic acid is derived. The term includes, for example, a recombinant DNA which is incorporated into a vector, e.g., into an autonomously replicating plasmid or virus, or into the genomic DNA of a prokaryote or eukaryote, or which exists as a separate molecule (e.g., a cDNA or a genomic DNA fragment produced by PCR or restriction endonuclease treatment) independent of other DNA sequences. Substantially pure DNA also includes a recombinant DNA which is part of a hybrid gene encoding additional P. aeruginosa DNA sequence.

A “contig” as used herein is a nucleic acid representing a continuous stretch of genomic sequence of an organism.

An “open reading frame”, also referred to herein as ORF, is a region of nucleic acid which encodes a polypeptide. This region may represent a portion of a coding sequence or a total sequence and can be determined from a stop to stop codon or from a start to stop codon.

As used herein, a “coding sequence” is a nucleic acid which is transcribed into messenger RNA and/or translated into a polypeptide when placed under the control of appropriate regulatory sequences. The boundaries of the coding sequence are determined by a translation start codon at the five prime terminus and a translation stop code at the three prime terminus. A coding sequence can include but is not limited to messenger RNA, synthetic DNA, and recombinant nucleic acid sequences.

A “complement” of a nucleic acid as used herein refers to an anti-parallel or antisense sequence that participates in Watson-Crick base-pairing with the original sequence.

A “gene product” is a protein or structural RNA which is specifically encoded by a gene.

As used herein, the term “probe” refers to a nucleic acid, peptide or other chemical entity which specifically binds to a molecule of interest. Probes are often associated with or capable of associating with a label. A label is a chemical moiety capable of detection. Typical labels comprise dyes, radioisotopes, luminescent and chemiluminescent moieties, fluorophores, enzymes, precipitating agents, amplification sequences, and the like. Similarly, a nucleic acid, peptide or other chemical entity which specifically binds to a molecule of interest and immobilizes such molecule is referred herein as a “capture ligand”. Capture ligands are typically associated with or capable of associating with a support such as nitro-cellulose, glass, nylon membranes, beads, particles and the like. The specificity of hybridization is dependent on conditions such as the base pair composition of the nucleotides, and the temperature and salt concentration of the reaction. These conditions are readily discernable to one of ordinary skill in the art using routine experimentation.

“Homologous” refers to the sequence similarity or sequence identity between two polypeptides or between two nucleic acid molecules. When a position in both of the two compared sequences is occupied by the same base or amino acid monomer subunit, e.g., if a position in each of two DNA molecules is occupied by adenine, then the molecules are homologous at that position. The percent of homology between two sequences is a function of the number of matching or homologous positions shared by the two sequences divided by the number of positions compared×100. For example, if 6 of 10 of the positions in two sequences are matched or homologous then the two sequences are 60% homologous. By way of example, the DNA sequences ATTGCC and TATGGC share 50% homology. Generally, a comparison is made when two sequences are aligned to give maximum homology.

Nucleic acids are hybridizable to each other when at least one strand of a nucleic acid can anneal to the other nucleic acid under defined stringency conditions. Stringency of hybridization is determined by: (a) the temperature at which hybridization and/or washing is performed; and (b) the ionic strength and polarity of the hybridization and washing solutions. Hybridization requires that the two nucleic acids contain complementary sequences; depending on the stringency of hybridization, however, mismatches may be tolerated. Typically, hybridization of two sequences at high stringency (such as, for example, in a solution of 0.5X SSC, at 65° C.) requires that the sequences be essentially completely homologous. Conditions of intermediate stringency (such as, for example, 2X SSC at 65° C.) and low stringency (such as, for example 2X SSC at 55° C.), require correspondingly less overall complementarity between the hybridizing sequences. (1X SSC is 0.15 M NaCl, 0.015 M Na citrate).

The terms peptides, proteins, and polypeptides are used interchangeably herein.

As used herein, the term “surface protein” refers to all surface accessible proteins, e.g. inner and outer membrane proteins, proteins adhering to the cell wall, and secreted proteins.

A polypeptide has P. aeruginosa biological activity if it has one, two and preferably more of the following properties: (1) if when expressed in the course of a P. aeruginosa infection, it can promote, or mediate the attachment of P. aeruginosa to a cell; (2) it has an enzymatic activity, structural or regulatory function characteristic of a P. aeruginosa protein; (3) or the gene which encodes it can rescue a lethal mutation in a P. aeruginosa gene. A polypeptide has biological activity if it is an antagonist, agonist, or super-agonist of a polypeptide having one of the above-listed properties.

A biologically active fragment or analog is one having an in vivo or in vitro activity which is characteristic of the P. aeruginosa polypeptides of the invention contained in the Sequence Listing, or of other naturally occurring P. aeruginosa polypeptides, e.g., one or more of the biological activities described herein. Especially preferred are fragments which exist in vivo, e.g., fragments which arise from post transcriptional processing or which arise from translation of alternatively spliced RNA's. Fragments include those expressed in native or endogenous cells as well as those made in expression systems, e.g., in CHO (Chinese Hamster Ovary) cells. Because peptides such as P. aeruginosa polypeptides often exhibit a range of physiological properties and because such properties may be attributable to different portions of the molecule, a useful P. aeruginosa fragment or P. aeruginosa analog is one which exhibits a biological activity in any biological assay for P. aeruginosa activity. Most preferably the fragment or analog possesses 10%, preferably 40%, more preferably 60%, 70%, 80% or 90% or greater of the activity of P. aeruginosa, in any in vivo or in vitro assay.

Analogs can differ from naturally occurring P. aeruginosa polypeptides in amino acid sequence or in ways that do not involve sequence, or both. Non-sequence modifications include changes in acetylation, methylation, phosphorylation, carboxylation, or glycosylation. Preferred analogs include P. aeruginosa polypeptides (or biologically active fragments thereof) whose sequences differ from the wild-type sequence by one or more conservative amino acid substitutions or by one or more non-conservative amino acid substitutions, deletions, or insertions which do not substantially diminish the biological activity of the P. aeruginosa polypeptide. Conservative substitutions typically include the substitution of one amino acid for another with similar characteristics, e.g., substitutions within the following groups: valine, glycine; glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid; asparagine, glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine. Other conservative substitutions can be made in view of the table below.

TABLE 1
CONSERVATIVE AMINO ACID REPLACEMENTS
For Amino AcidCodeReplace with any of
AlanineAD-Ala, Gly, beta-Ala, L-Cys, D-Cys
ArginineRD-Arg, Lys, D-Lys, homo-Arg, D-homo-Arg,
Met,
AsparagineND-Asn, Asp, D-Asp, Glu, D-Glu, Gln, D-Gln
Aspartic AcidDD-Asp, D-Asn, Asn, Glu, D-Glu, Gln, D-Gln
CysteineCD-Cys, S-Me-Cys, Met, D-Met, Thr, D-Thr
GlutamineQD-Gln, Asn, D-Asn, Glu, D-Glu, Asp, D-Asp
Glutamic AcidED-Glu, D-Asp, Asp, Asn, D-Asn, Gln, D-Gln
GlycineGAla, D-Ala, Pro, D-Pro, β-Ala, Acp
IsoleucineID-Ile, Val, D-Val, Leu, D-Leu, Met, D-Met
LeucineLD-Leu, Val, D-Val, Leu, D-Leu, Met, D-Met
LysineKD-Lys, Arg, D-Arg, homo-Arg, D-homo-Arg,
Met,
MethionineMD-Met, S-Me-Cys, Ile, D-IIe, Leu, D-Leu, Val,
D-
Phenylal anineFD-Phe, Tyr, D-Thr, L-Dopa, His, D-His, Trp,
D-
Proline< /td>PD-Pro, L-I-thioazolidine-4-carboxylic acid,
D-or L-
SerineSD-Ser, Thr, D-Thr, allo-Thr, Met, D-Met,
Met(O),
T hreonineTD-Thr, Ser, D-Ser, allo-Thr, Met, D-Met,
Met(O),
T yrosineYD-Tyr, Phe, D-Phe, L-Dopa, His, D-His
ValineVD-Val, Leu, D-Leu, Ile, D-Ile, Met, D-Met

Other analogs within the invention are those with modifications which increase peptide stability; such analogs may contain, for example, one or more non-peptide bonds (which replace the peptide bonds) in the peptide sequence. Also included are: analogs that include residues other than naturally occurring L-amino acids, e.g., D-amino acids or non-naturally occurring or synthetic amino acids, e.g., β or γ amino acids; and cyclic analogs.

As used herein, the term “fragment”, as applied to a P. aeruginosa analog, will ordinarily be at least about 20 residues, more typically at least about 40 residues, preferably at least about 60 residues in length. Fragments of P. aeruginosa polypeptides can be generated by methods known to those skilled in the art. The ability of an Pseudomonas fragment to exhibit a biological activity of P. aeruginosa polypeptide can be assessed by methods known to those skilled in the art as described herein. Also included are P. aeruginosa polypeptides containing residues that are not required for biological activity of the peptide or that result from alternative mRNA splicing or alternative protein processing events.

An “immunogenic component” as used herein is a moiety, such as a P. aeruginosa polypeptide, analog or fragment thereof, that is capable of eliciting a humoral and/or cellular immune response in a host animal.

An “antigenic component” as used herein is a moiety, such as a P. aeruginosa polypeptide, analog or fragment thereof, that is capable of binding to a specific antibody with sufficiently high affinity to form a detectable antigen-antibody complex.

The term “antibody” as used herein is intended to include fragments thereof which are specifically reactive with P. aeruginosa polypeptides.

As used herein, the term “cell-specific promoter” means a DNA sequence that serves as a promoter, i.e., regulates expression of a selected DNA sequence operably linked to the promoter, and which effects expression of the selected DNA sequence in specific cells of a tissue. The term also covers so-called “leaky” promoters, which regulate expression of a selected DNA primarily in one tissue, but cause expression in other tissues as well.

Misexpression, as used herein, refers to a non-wild type pattern of gene expression. It includes: expression at non-wild type levels, i.e., over or under expression; a pattern of expression that differs from wild type in terms of the time or stage at which the gene is expressed, e.g., increased or decreased expression (as compared with wild type) at a predetermined developmental period or stage; a pattern of expression that differs from wild type in terms of increased expression (as compared with wild type) in a predetermined cell type or tissue type; a pattern of expression that differs from wild type in terms of the splicing size, amino acid sequence, post-translational modification, or biological activity of the expressed polypeptide; a pattern of expression that differs from wild type in terms of the effect of an environmental stimulus or extracellular stimulus on expression of the gene, e.g., a pattern of increased or decreased expression (as compared with wild type) in the presence of an increase or decrease in the strength of the stimulus.

As used herein, “host cells” and other such terms denoting microorganisms or higher eukaryotic cell lines cultured as unicellular entities refers to cells which can become or have been used as recipients for a recombinant vector or other transfer DNA, and include the progeny of the original cell which has been transfected. It is understood by individuals skilled in the art that the progeny of a single parental cell may not necessarily be completely identical in genomic or total DNA compliment to the original parent, due to accident or deliberate mutation.

As used herein, the term “control sequence” refers to a nucleic acid having a base sequence which is recognized by the host organism to effect the expression of encoded sequences to which they are ligated. The nature of such control sequences differs depending upon the host organism; in prokaryotes, such control sequences generally include a promoter, ribosomal binding site, terminators, and in some cases operators; in eukaryotes, generally such control sequences include promoters, terminators and in some instances, enhancers. The term control sequence is intended to include at a minimum, all components whose presence is necessary for expression, and may also include additional components whose presence is advantageous, for example, leader sequences.

As used herein, the term “operably linked” refers to sequences joined or ligated to function in their intended manner. For example, a control sequence is operably linked to coding sequence by ligation in such a way that expression of the coding sequence is achieved under conditions compatible with the control sequence and host cell.

The “metabolism” of a substance, as used herein, means any aspect of the expression, function, action, or regulation of the substance. The metabolism of a substance includes modifications, e.g., covalent or non-covalent modifications of the substance. The metabolism of a substance includes modifications, e.g., covalent or non-covalent modification, the substance induces in other substances. The metabolism of a substance also includes changes in the distribution of the substance. The metabolism of a substance includes changes the substance induces in the distribution of other substances.

A “sample” as used herein refers to a biological sample, such as, for example, tissue or fluid isloated from an individual (including without limitation plasma, serum, cerebrospinal fluid, lymph, tears, saliva and tissue sections) or from in vitro cell culture constituents, as well as samples from the environment.

Technical and scientific terms used herein have the meanings commonly understood by one of ordinary skill in the art to which the present invention pertains, unless otherwise defined. Reference is made herein to various methodologies known to those of skill in the art. Publications and other materials setting forth such known methodologies to which reference is made are incorporated herein by reference in their entireties as though set forth in full. The practice of the invention will employ, unless otherwise indicated, conventional techniques of chemistry, molecular biology, microbiology, recombinant DNA, and immunology, which are within the skill of the art. Such techniques are explained fully in the literature. See e.g., Sambrook, Fritsch, and Maniatis, Molecular Cloning; Laboratory Manual 2nd ed. (1989); DNA Cloning, Volumes I and II (D. N Glover ed. 1985); Oligonucleotide Synthesis (M. J. Gait ed, 1984); Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 1984); the series, Methods in Enzymology (Academic Press, Inc.), particularly Vol. 154 and Vol. 155 (Wu and Grossman, eds.); PCR-A Practical Approach (McPherson, Quirke, and Taylor, eds., 1991); Immunology, 2d Edition, 1989, Roitt et al., C.V. Mosby Company, and New York; Advanced Immunology, 2d Edition, 1991, Male et al., Grower Medical Publishing, New York.; DNA Cloning: A Practical Approach, Volumes I and II, 1985 (D. N. Glover ed.); Oligonucleotide Synthesis, 1984, (M. L. Gait ed); Transcription and Translation, 1984 (Hames and Higgins eds.); Animal Cell Culture, 1986 (R. I. Freshney ed.); Immobilized Cells and Enzymes, 1986 (IRL Press); Perbal, 1984, A Practical Guide to Molecular Cloning; and Gene Transfer Vectors for Mammalian Cells, 1987 (J. H. Miller and M. P. Calos eds., Cold Spring Harbor Laboratory);

Any suitable materials and/or methods known to those of skill can be utilized in carrying out the present invention: however preferred materials and/or methods are described. Materials, reagents and the like to which reference is made in the following description and examples are obtainable from commercial sources, unless otherwise noted.

P. aeruginosa Genomic Sequence

This invention provides nucleotide sequences of the genome of P. aeruginosa which thus comprises a DNA sequence library of P. aeruginosa genomic DNA. The detailed description that follows provides nucleotide sequences of P. aeruginosa, and also describes how the sequences were obtained and how ORFs and protein-coding sequences were identified. Also described are methods of using the disclosed P. aeruginosa sequences in methods including diagnostic and therapeutic applications. Furthermore, the library can be used as a database for identification and comparison of medically important sequences in this and other strains of P. aeruginosa.

To determine the genomic sequence of P. aeruginosa, DNA from strain 19804 of P. aeruginosa was isolated after Zymolyase digestion, sodium dodecyl sulfate lysis, potassium acetate precipitation, phenol:chloroform extraction and ethanol precipitation (Soll, D. R., T. Srikantha and S. R. Lockhart: Characterizing Developmentally Regulated Genes in P. aeruginosa, In, Microbial Genome Methods. K. W. Adolph, editor. CRC Press. New York. p 17-37.). DNA was sheared hydrodynamically using an HPLC (Oefner, et. al., 1996) to an insert size of 2000-3000 bp. After size fractionation by gel electrophoresis the fragments were blunt-ended, ligated to adapter oligonucleotides and cloned into the pGTC (Thomann) vector to construct a “shotgun” subclone library

DNA sequencing was achieved using established ABI sequencing methods on ABI377 automated DNA sequencers. The cloning and sequencing procedures are described in more detail in the Exemplification.

Individual sequence reads were assembled using PHRAP (P. Green, Abstracts of DOE Human Genome Program Contractor-Grantee Workshop V, January 1996, p.157). The average contig length was about 3-4 kb.

All subsequent steps were based on sequencing by ABI377 automated DNA sequencing methods. The cloning and sequencing procedures are described in more detail in the Exemplification.

A variety of approaches can be used to order the contigs so as to obtain a continuous sequence representing the entire P. aeruginosa genome. Synthetic oligonucleotides are designed that are complementary to sequences at the end of each contig. These oligonucleotides may be hybridized to libaries of P. aeruginosa genomic DNA in, for example, lambda phage vectors or plasmid vectors to identify clones that contain sequences corresponding to the junctional regions between individual contigs. Such clones are then used to isolate template DNA and the same oligonucleotides are used as primers in polymerase chain reaction (PCR) to amplify junctional fragments, the nucleotide sequence of which is then determined.

The P. aeruginosa sequences were analyzed for the presence of open reading frames (ORFs) comprising at least 180 nucleotides. As a result of the analysis of ORFs based on stop-to-stop codon reads, it should be understood that these ORFs may not correspond to the ORF of a naturally-occurring P. aeruginosa polypeptide. These ORFs may contain start codons which indicate the initiation of protein synthesis of a naturally-occurring P. aeruginosa polypeptide. Such start codons within the ORFs provided herein were identified by those of ordinary skill in the relevant art, and the resulting ORF and the encoded P. aeruginosa polypeptide is within the scope of this invention. For example, within the ORFs a codon such as AUG or GUG (encoding methionine or valine) which is part of the initiation signal for protein synthesis were identified and the portion of an ORF to corresponding to a naturally-occurring P. aeruginosa polypeptide was recognized. The predicted coding regions were defined by evaluating the coding potential of such sequences with the program GENEMARK™ (Borodovsky and McIninch, 1993, Comp. . 17:123).

Each predicted ORF amino acid sequence was compared with all sequences found in current GENBANK, SWISS-PROT, and PIR databases using the BLAST algorithm. BLAST identifies local alignments occurring by chance between the ORF sequence and the sequence in the databank (Altschal et al., 1990, L Mol. Biol. 215:403-410). Homologous ORFs (probabilities less than 10−5 by chance) and ORF's that are probably non-homologous (probabilities greater than 10−5 by chance) but have good codon usage were identified. Both homologous, sequences and non-homologous sequences with good codon usage, are likely to encode proteins and are encompassed by the invention.

P. aeruginosa Nucleic Acids

The present invention provides a library of P. aeruginosa_-derived nucleic acid sequences. The libraries provide probes, primers, and markers which are used as markers in epidemiological studies. The present invention also provides a library of P. aeruginosa-derived nucleic acid sequences which comprise or encode targets for therapeutic drugs.

The nucleic acids of this invention may be obtained directly from the DNA of the above referenced P. aeruginosa strain by using the polymerase chain reaction (PCR). See “PCR, A Practical Approach” (McPherson, Quirke, and Taylor, eds., IRL Press, Oxford, UK, 1991) for details about the PCR. High fidelity PCR is used to ensure a faithful DNA copy prior to expression. In addition, the authenticity of amplified products is verified by conventional sequencing methods. Clones carrying the desired sequences described in this invention may also be obtained by screening the libraries by means of the PCR or by hybridization of synthetic oligonucleotide probes to filter lifts of the library colonies or plaques as known in the art (see, e.g., Sambrook et al., Molecular Cloning, A Laboratory Manual 2nd edition, 1989, Cold Spring Harbor Press, NY).

It is also possible to obtain nucleic acids encoding P. aeruginosa polypeptides from a cDNA library in accordance with protocols herein described. A cDNA encoding a P. aeruginosa polypeptide can be obtained by isolating total mRNA from an appropriate strain. Double stranded cDNAs can then be prepared from the total mRNA. Subsequently, the cDNAs can be inserted into a suitable plasmid or viral (e.g., bacteriophage) vector using any one of a number of known techniques. Genes encoding P. aeruginosa polypeptides can also be cloned using established polymerase chain reaction techniques in accordance with the nucleotide sequence information provided by the invention. The nucleic acids of the invention can be DNA or RNA. Preferred nucleic acids of the invention are contained in the Sequence Listing.

The nucleic acids of the invention can also be chemically synthesized using standard techniques. Various methods of chemically synthesizing polydeoxynucleotides are known, including solid-phase synthesis which, like peptide synthesis, has been fully automated in commercially available DNA synthesizers (See e.g., Itakura et al. U.S. Pat. No. 4,598,049; Caruthers et al. U.S. Pat. No. 4,458,066; and Itakura U.S. Pat. Nos. 4,401,796 and 4,373,071, incorporated by reference herein).

In another example, DNA can be chemically synthesized using, e.g., the phosphoramidite solid support method of Matteucci et al., 1981, J. Am. Chem. Soc. 103:3185, the method of Yoo et al., 1989, J. Biol. Chem. 764:17078, or other well known methods. This can be done by sequentially linking a series of oligonucleotide cassettes comprising pairs of synthetic oligonucleotides, as described below.

Nucleic acids isolated or synthesized in accordance with features of the present invention are useful, by way of example, without limitation, as probes, primers, capture ligands, antisense genes and for developing expression systems for the synthesis of proteins and peptides corresponding to such sequences. As probes, primers, capture ligands and antisense agents, the nucleic acid normally consists of all or part (approximately twenty or more nucleotides for specificity as well as the ability to form stable hybridization products) of the nucleic acids of the invention contained in the Sequence Listing. These uses are described in further detail below.

Probes

A nucleic acid isolated or synthesized in accordance with the sequence of the invention contained in the Sequence Listing can be used as a probe to specifically detect P. aeruginosa. With the sequence information set forth in the present application, sequences of twenty or more nucleotides are identified which provide the desired inclusivity and exclusivity with respect to P. aeruginosa, and extraneous nucleic acids likely to be encountered during hybridization conditions. More preferably, the sequence will comprise at least about twenty to thirty nucleotides to convey stability to the hybridization product formed between the probe and the intended target molecules.

Sequences larger than 1000 nucleotides in length are difficult to synthesize but can be generated by recombinant DNA techniques. Individuals skilled in the art will readily recognize that the nucleic acids, for use as probes, can be provided with a label to facilitate detection of a hybridization product.

Nucleic acid isolated and synthesized in accordance with the sequence of the invention contained in the Sequence Listing can also be useful as probes to detect homologous regions (especially homologous genes) of other Pseudomonas species using appropriate stringency hybridization conditions as described herein.

Capture Ligand

For use as a capture ligand, the nucleic acid selected in the manner described above with respect to probes, can be readily associated with a support. The manner in which nucleic acid is associated with supports is well known. Nucleic acid having twenty or more nucleotides in a sequence of the invention contained in the Sequence Listing have utility to separate P. aeruginosa nucleic acid from one strain from the nucleic acid of other another strain as well as from other organisms. Nucleic acid having twenty or more nucleotides in a sequence of the invention contained in the Sequence Listing can also have utility to separate other Pseudomonas species from each other and from other organisms. Preferably, the sequence will comprise at least about twenty nucleotides to convey stability to the hybridization product formed between the probe and the intended target molecules. Sequences larger than 1000 nucleotides in length are difficult to synthesize but can be generated by recombinant DNA techniques.

Primers

Nucleic acid isolated or synthesized in accordance with the sequences described herein have utility as primers for the amplification of P. aeruginosa nucleic acid. These nucleic acids may also have utility as primers for the amplification of nucleic acids in other Pseudomonas species. With respect to polymerase chain reaction (PCR) techniques, nucleic acid sequences of ≧ about 10-15 nucleotides of the invention contained in the Sequence Listing have utility in conjunction with suitable enzymes and reagents to create copies of P. aeruginosa nucleic acid. More preferably, the sequence will comprise twenty or more nucleotides to convey stability to the hybridization product formed between the primer and the intended target molecules. Binding conditions of primers greater than 100 nucleotides are more difficult to control to obtain specificity. High fidelity PCR can be used to ensure a faithful DNA copy prior to expression. In addition, amplified products can be checked by conventional sequencing methods.

The copies can be used in diagnostic assays to detect specific sequences, including genes from P. aeruginosa and/or other Pseudomonas species. The copies can also be incorporated into cloning and expression vectors to generate polypeptides corresponding to the nucleic acid synthesized by PCR, as is described in greater detail herein.

The nucleic acids of the present invention find use as templates for the recombinant production of P. aeruginosa-derived peptides or polypeptides

Antisense

Nucleic acid or nucleic acid-hybridizing derivatives isolated or synthesized in accordance with the sequences described herein have utility as antisense agents to prevent the expression of P. aeruginosa genes. These sequences also have utility as antisense agents to prevent expression of genes of other Pseudomonas species.

In one embodiment, nucleic acid or derivatives corresponding to P. aeruginosa nucleic acids is loaded into a suitable carrier such as a liposome or bacteriophage for introduction into bacterial cells. For example, a nucleic acid having twenty or more nucleotides is capable of binding to bacteria nucleic acid or bacteria messenger RNA. Preferably, the antisense nucleic acid is comprised of 20 or more nucleotides to provide necessary stability of a hybridization product of non-naturally occurring nucleic acid and bacterial nucleic acid and/or bacterial messenger RNA. Nucleic acid having a sequence greater than 1000 nucleotides in length is difficult to synthesize but can be generated by recombinant DNA techniques. Methods for loading antisense nucleic acid in liposomes is known in the art as exemplified by U.S. Pat. No. 4,241,046 issued Dec. 23, 1980 to Papahadjopoulos et al.

The present invention encompasses isolated polypeptides and nucleic acids derived from P. aeruginosa that are useful as reagents for diagnosis of bacterial infection, components of effective anti-bacterial vaccines, and/or as targets for anti-bacterial drugs, including anti-P. aeruginosa drugs.

Expression of P. aeruginosa Nucleic Acids

Table 2, which is appended herewith and which forms part of the present specification, provides a list of open reading frames (ORFs) in both strands and a putative identification of the particular function of a polypeptide which is encoded by each ORF, based on the homology match (determined by the BLAST algorithm) of the predicted polypeptide with known proteins encoded by ORFs in other organisms. An ORF is a region of nucleic acid which encodes a polypeptide. This region may represent a portion of a coding sequence or a total sequence and was determined from stop to stop codons. Each contig represents a continuous stretch of the genomic sequence of the organism. The first column lists the ORF designation. The second and third columns list the SEQ ID numbers for the nucleic acid and amino acid sequences corresponding to each ORF, respectively. The fourth and fifth columns list the length of the nucleic acid ORF and the length of the amino acid ORF, respectively. The nucleotide sequence corresponding to each ORF begins at the first nucleotide immediately following a stop codon and ends at the nucleotide immediately preceding the next downstream stop codon in the same reading frame. It will be recognized by one skilled in the art that the natural translation initiation sites will correspond to ATG, GTG, or TTG codons located within the ORFs. The natural initiation sites depend not only on the sequence of a start codon but also on the context of the DNA sequence adjacent to the start codon. Usually, a recognizable ribosome binding site is found within 20 nucleotides upstream from the initiation codon. In some cases where genes are translationally coupled and coordinately expressed together in “operons”, ribosome binding sites are not present, but the initiation codon of a downstream gene may occur very close to, or overlap, the stop codon of the an upstream gene in the same operon. The correct start codons can be generally identified without undue experimentation because only a few codons need be tested. It is recognized that the translational machinery in bacteria initiates all polypeptide chains with the amino acid methionine, regardless of the sequence of the start codon. In some cases, polypeptides are post-translationally modified, resulting in an N-terminal amino acid other than methionine in vivo. The sixth and seventh columns provide metrics for assessing the likelihood of the homology match (determined by the BLASTP2 algorithm), as is known in the art, to the genes indicated in the tenth column when the designated ORF was compared against a non-redundant comprehensive protein database. Specifically, the sixth column represents the “Blast Score” for the match (a higher score is a better match), and the seventh column represents the “P-value” for the match (the probability that such a match can have occurred by chance; the lower the value, the more likely the match is valid). If a BLASTP2 score of less than 46 was obtained, no value is reported in the table the “P-value”. Column eight provides the name of the organism that was identified as having the closest homology match. The ninth column provides, where available, either a public database accession number or our own sequence name. The tenth column provides, where available, the Swissprot accession number (SP),(SP), the locus name (LN), the Organism (OR), Source of variant (SR), E.C. number (EC), the gene name (GN), the product name (PN), the Function Description (FN), Left End (LE), Right End (RE), Coding Direction (DI), and the description (DE) or notes (NT) for each ORF. Information that is not preceded by a code designation in the tenth column represents a description of the ORF. This information allows one of ordinary skill in the art to determine a potential use for each identified coding sequence and, as a result, allows to use the polypeptides of the present invention for commercial and industrial purposes.

Using the information provided in SEQ ID NO: 1-SEQ ID NO: 16571, SEQ ID NO: 16572-SEQ ID NO: 33142 and in Table 2 together with routine cloning and sequencing methods, one of ordinary skill in the art will be able to clone and sequence all the nucleic acid fragments of interest including open reading frames (ORFs) encoding a large variety proteins of P. aeruginosa.

Nucleic acid isolated or synthesized in accordance with the sequences described herein have utility to generate polypeptides. The nucleic acid of the invention exemplified in SEQ ID NO: 1-SEQ ID NO: 16571 and in Table 2 or fragments of said nucleic acid encoding active portions of P. aeruginosa polypeptides can be cloned into suitable vectors or used to isolate nucleic acid. The isolated nucleic acid is combined with suitable DNA linkers and cloned into a suitable vector.

The function of a specific gene or operon can be ascertained by expression in a bacterial strain under conditions where the activity of the gene product(s) specified by the gene or operon in question can be specifically measured. Alternatively, a gene product may be produced in large quantities in an expressing strain for use as an antigen, an industrial reagent, for structural studies, etc. This expression can be accomplished in a mutant strain which lacks the activity of the gene to be tested, or in a strain that does not produce the same gene product(s). This includes, but is not limited to, Eucaryotic species such as the yeast Saccharomyces cerevisiae, Methanobacterium strains or other Archaea, and Eubacteria such as E. coli, B. Subtilis, S. Aureus, S. Pneumonia or Pseudomonas putida. In some cases the expression host will utilize the natural P. aeruginosa promoter whereas in others, it will be necessary to drive the gene with a promoter sequence derived from the expressing organism (e.g., an E. coli beta-galactosidase promoter for expression in E. coli).

To express a gene product using the natural P. aeruginosa promoter, a procedure such as the following can be used. A restriction fragment containing the gene of interest, together with its associated natural promoter element and regulatory sequences (identified using the DNA sequence data) is cloned into an appropriate recombinant plasmid containing an origin of replication that functions in the host organism and an appropriate selectable marker. This can be accomplished by a number of procedures known to those skilled in the art. It is most preferably done by cutting the plasmid and the fragment to be cloned with the same restriction enzyme to produce compatible ends that can be ligated to join the two pieces together. The recombinant plasmid is introduced into the host organism by, for example, electroporation and cells containing the recombinant plasmid are identified by selection for the marker on the plasmid. Expression of the desired gene product is detected using an assay specific for that gene product.

In the case of a gene that requires a different promoter, the body of the gene (coding sequence) is specifically excised and cloned into an appropriate expression plasmid. This subcloning can be done by several methods, but is most easily accomplished by PCR amplification of a specific fragment and ligation into an expression plasmid after treating the PCR product with a restriction enzyme or exonuclease to create suitable ends for cloning.

A suitable host cell for expression of a gene can be any procaryotic or eucaryotic cell. Suitable methods for transforming host cells can be found in Sambrook et al. (Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory Press (1989)), and other laboratory textbooks.

For example, a host cell transfected with a nucleic acid vector directing expression of a nucleotide sequence encoding a P. aeruginosa polypeptide can be cultured under appropriate conditions to allow expression of the polypeptide to occur. Suitable media for cell culture are well known in the art. Polypeptides of the invention can be isolated from cell culture medium, host cells, or both using techniques known in the art for purifying proteins including ion-exchange chromatography, gel filtration chromatography, ultrafiltration, electrophoresis, and immunoaffinity purification with antibodies specific for such polypeptides. Additionally, in many situations, polypeptides can be produced by chemical cleavage of a native protein (e.g., tryptic digestion) and the cleavage products can then be purified by standard techniques.

In the case of membrane bound proteins, these can be isolated from a host cell by contacting a membrane-associated protein fraction with a detergent forming a solubilized complex, where the membrane-associated protein is no longer entirely embedded in the membrane fraction and is solubilized at least to an extent which allows it to be chromatographically isolated from the membrane fraction. Chromatographic techniques which can be used in the final purification step are known in the art and include hydrophobic interaction, lectin affinity, ion exchange, dye affinity and immunoaffinity.

One strategy to maximize recombinant P. aeruginosa peptide expression in E. coli is to express the protein in a host bacteria with an impaired capacity to proteolytically cleave the recombinant protein (Gottesman, S., Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990) 119-128). Another strategy would be to alter the nucleic acid encoding a P. aeruginosa peptide to be inserted into an expression vector so that the individual codons for each amino acid would be those preferentially utilized in highly expressed E. coli proteins (Wada et al., (1992) Nuc. Acids Res. 20:2111-2118). Such alteration of nucleic acids of the invention can be carried out by standard DNA synthesis techniques.

The nucleic acids of the invention can also be chemically synthesized using standard techniques. Various methods of chemically synthesizing polydeoxynucleotides are known, including solid-phase synthesis which, like peptide synthesis, has been fully automated in commercially available DNA synthesizers (See, e.g., Itakura et al. U.S. Pat. No. 4,598,049; Caruthers et al. U.S. Pat. No. 4,458,066; and Itakura U.S. Pat. Nos. 4,401,796 and 4,373,071, incorporated by reference herein).

The present invention provides a library of P. aeruginosa-derived nucleic acid sequences. The libraries provide probes, primers, and markers which can be used as markers in epidemiological studies. The present invention also provides a library of P. aeruginosa-derived nucleic acid sequences which comprise or encode targets for therapeutic drugs.

Nucleic acids comprising any of the sequences disclosed herein or sub-sequences thereof can be prepared by standard methods using the nucleic acid sequence information provided in SEQ ID NO: 1-SEQ ID NO:16571. For example, DNA can be chemically synthesized using, e.g., the phosphoramidite solid support method of Matteucci et al., 1981, J. Am. Chem. Soc. 103:3185, the method of Yoo et al., 1989, J. Biol. Chem. 764:17078, or other well known methods. This can be done by sequentially linking a series of oligonucleotide cassettes comprising pairs of synthetic oligonucleotides, as described below.

Of course, due to the degeneracy of the genetic code, many different nucleotide sequences can encode polypeptides having the amino acid sequences defined by SEQ ID NO: 16572-SEQ ID NO: 33142 or sub-sequences thereof. The codons can be selected for optimal expression in prokaryotic or eukaryotic systems. Such degenerate variants are also encompassed by this invention.

Insertion of nucleic acids (typically DNAs) encoding the polypeptides of the invention into a vector is easily accomplished when the termini of both the DNAs and the vector comprise compatible restriction sites. If this cannot be done, it may be necessary to modify the termini of the DNAs and/or vector by digesting back single-stranded DNA overhangs generated by restriction endonuclease cleavage to produce blunt ends, or to achieve the same result by filling in the single-stranded termini with an appropriate DNA polymerase.

Alternatively, any site desired may be produced, e.g., by ligating nucleotide sequences (linkers) onto the termini. Such linkers may comprise specific oligonucleotide sequences that define desired restriction sites. Restriction sites can also be generated by the use of the polymerase chain reaction (PCR). See, e.g., Saiki et al., 1988, Science 239:48. The cleaved vector and the DNA fragments may also be modified if required by homopolymeric tailing.

The nucleic acids of the invention may be isolated directly from cells. Alternatively, the polymerase chain reaction (PCR) method can be used to produce the nucleic acids of the invention, using either chemically synthesized strands or genomic material as templates. Primers used for PCR can be synthesized using the sequence information provided herein and can further be designed to introduce appropriate new restriction sites, if desirable, to facilitate incorporation into a given vector for recombinant expression.

The nucleic acids of the present invention may be flanked by natural P. aeruginosa regulatory sequences, or may be associated with heterologous sequences, including promoters, enhancers, response elements, signal sequences, polyadenylation sequences, introns, 5′- and 3′-noncoding regions, and the like. The nucleic acids may also be modified by many means known in the art. Non-limiting examples of such modifications include methylation, “caps”, substitution of one or more of the naturally occurring nucleotides with an analog, internucleotide modifications such as, for example, those with uncharged linkages (e.g., methyl phosphonates, phosphotriesters, phosphoroamidates, carbamates, etc.) and with charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.). Nucleic acids may contain one or more additional covalently linked moieties, such as, for example, proteins (e.g., nucleases, toxins, antibodies, signal peptides, poly-L-lysine, etc.), intercalators (e.g., acridine, psoralen, etc.), chelators (e.g., metals, radioactive metals, iron, oxidative metals, etc.), and alkylators. PNAs are also included. The nucleic acid may be derivatized by formation of a methyl or ethyl phosphotriester or an alkyl phosphoramidate linkage. Furthermore, the nucleic acid sequences of the present invention may also be modified with a label capable of providing a detectable signal, either directly or indirectly. Exemplary labels include radioisotopes, fluorescent molecules, biotin, and the like.

The invention also provides nucleic acid vectors comprising the disclosed P. aeruginosa-derived sequences or derivatives or fragments thereof. A large number of vectors, including plasmid and bacterial vectors, have been described for replication and/or expression in a variety of eukaryotic and prokaryotic hosts, and may be used for cloning or protein expression.

The encoded P. aeruginosa polypeptides may be expressed by using many known vectors, such as pUC plasmids, pET plasmids (Novagen, Inc., Madison, Wis.), or pRSET or pREP (Invitrogen, San Diego, Calif.), and many appropriate host cells, using methods disclosed or cited herein or otherwise known to those skilled in the relevant art. The particular choice of vector/host is not critical to the practice of the invention.

Recombinant cloning vectors will often include one or more replication systems for cloning or expression, one or more markers for selection in the host, e.g. antibiotic resistance, and one or more expression cassettes. The inserted P. aeruginosa coding sequences may be synthesized by standard methods, isolated from natural sources, or prepared as hybrids, etc. Ligation of the P. aeruginosa coding sequences to transcriptional regulatory elements and/or to other amino acid coding sequences may be achieved by known methods. Suitable host cells may be transformed/transfected/infected as appropriate by any suitable method including electroporation, CaCl2 mediated DNA uptake, bacterial infection, microinjection, microprojectile, or other established methods.

Appropriate host cells include bacteria, archebacteria, fungi, especially yeast, and plant and animal cells, especially mammalian cells. Of particular interest are P. aeruginosa, E. coli, B. Subtilis, Saccharomyces cerevisiae, Saccharomyces carlsbergensis, Schizosaccharomyces pombi, SF9 cells, C129 cells, 293 cells, Neurospora, and CHO cells, COS cells, HeLa cells, and immortalized mammalian myeloid and lymphoid cell lines. Preferred replication systems include M13, ColE1, SV40, baculovirus, lambda, adenovirus, and the like. A large number of transcription initiation and termination regulatory regions have been isolated and shown to be effective in the transcription and translation of heterologous proteins in the various hosts. Examples of these regions, methods of isolation, manner of manipulation, etc. are known in the art. Under appropriate expression conditions, host cells can be used as a source of recombinantly produced P. aeruginosa-derived peptides and polypeptides.

Advantageously, vectors may also include a transcription regulatory element (i.e., a promoter) operably linked to the P. aeruginosa portion. The promoter may optionally contain operator portions and/or ribosome binding sites. Non-limiting examples of bacterial promoters compatible with E. coli include: b-lactamase (penicillinase) promoter; lactose promoter; tryptophan (trp) promoter; araBAD (arabinose) operon promoter; lambda-derived P1 promoter and N gene ribosome binding site; and the hybrid tac promoter derived from sequences of the trp and lac UV5 promoters. Non-limiting examples of yeast promoters include 3-phosphoglycerate kinase promoter, glyceraldehyde-3-phosphate dehydrogenase (GAPDH) promoter, galactokinase (GAL1) promoter, galactoepimerase promoter, and alcohol dehydrogenase (ADH) promoter. Suitable promoters for mammalian cells include without limitation viral promoters such as that from Simian Virus 40 (SV40), Rous sarcoma virus (RSV), adenovirus (ADV), and bovine papilloma virus (BPV). Mammalian cells may also require terminator sequences, polyA addition sequences and enhancer sequences to increase expression. Sequences which cause amplification of the gene may also be desirable. Furthermore, sequences that facilitate secretion of the recombinant product from cells, including, but not limited to, bacteria, yeast, and animal cells, such as secretory signal sequences and/or prohormone pro region sequences, may also be included. These sequences are well described in the art.

Nucleic acids encoding wild-type or variant P. aeruginosa-derived polypeptides may also be introduced into cells by recombination events. For example, such a sequence can be introduced into a cell, and thereby effect homologous recombination at the site of an endogenous gene or a sequence with substantial identity to the gene. Other recombination-based methods such as nonhomologous recombinations or deletion of endogenous genes by homologous recombination may also be used.

The nucleic acids of the present invention find use as templates for the recombinant production of P. aeruginosa-derived peptides or polypeptides.

Identification and Use of P. aeruginosa Nucleic Acid Sequences

The disclosed P. aeruginosa polypeptide and nucleic acid sequences, or other sequences that are contained within ORFs, including complete protein-coding sequences, of which any of the disclosed P. aeruginosa-specific sequences forms a part, are useful as target components for diagnosis and/or treatment of P. aeruginosa-caused infection

It will be understood that the sequence of an entire protein-coding sequence of which each disclosed nucleic acid sequence forms a part can be isolated and identified based on each disclosed sequence. This can be achieved, for example, by using an isolated nucleic acid encoding the disclosed sequence, or fragments thereof, to prime a sequencing reaction with genomic P. aeruginosa DNA as template; this is followed by sequencing the amplified product. The isolated nucleic acid encoding the disclosed sequence, or fragments thereof, can also be hybridized to P. aeruginosa genomic libraries to identify clones containing additional complete segments of the protein-coding sequence of which the shorter sequence forms a part. Then, the entire protein-coding sequence, or fragments thereof, or nucleic acids encoding all or part of the sequence, or sequence-conservative or function-conservative variants thereof, may be employed in practicing the present invention.

Preferred sequences are those that are useful in diagnostic and/or therapeutic applications. Diagnostic applications include without limitation nucleic-acid-based and antibody-based methods for detecting bacterial infection. Therapeutic applications include without limitation vaccines, passive immunotherapy, and drug treatments directed against gene products that are both unique to bacteria and essential for growth and/or replication of bacteria.

Identification of Nucleic Acids Encoding Vaccine Components and Targets for Agents Effective Against P. aeruginosa

The disclosed P. aeruginosa genome sequence includes segments that direct the synthesis of ribonucleic acids and polypeptides, as well as origins of replication, promoters, other types of regulatory sequences, and intergenic nucleic acids. The invention encompasses nucleic acids encoding immunogenic components of vaccines and targets for agents effective against P. aeruginosa. Identification of said immunogenic components involved in the determination of the function of the disclosed sequences, which can be achieved using a variety of approaches. Non-limiting examples of these approaches are described briefly below.

Homology to known sequences:

Computer-assisted comparison of the disclosed P. aeruginosa sequences with previously reported sequences present in publicly available databases is useful for identifying functional P. aeruginosa nucleic acid and polypeptide sequences. It will be understood that protein-coding sequences, for example, may be compared as a whole, and that a high degree of sequence homology between two proteins (such as, for example, >80-90%) at the amino acid level indicates that the two proteins also possess some degree of functional homology, such as, for example, among enzymes involved in metabolism, DNA synthesis, or cell wall synthesis, and proteins involved in transport, cell division, etc. In addition, many structural features of particular protein classes have been identified and correlate with specific consensus sequences, such as, for example, binding domains for nucleotides, DNA, metal ions, and other small molecules; sites for covalent modifications such as phosphorylation, acylation, and the like; sites of protein:protein interactions, etc. These consensus sequences may be quite short and thus may represent only a fraction of the entire protein-coding sequence. Identification of such a feature in a P. aeruginosa sequence is therefore useful in determining the function of the encoded protein and identifying useful targets of antibacterial drugs.

Of particular relevance to the present invention are structural features that are common to secretory, transmembrane, and surface proteins, including secretion signal peptides and hydrophobic transmembrane domains. P. aeruginosa proteins identified as containing putative signal sequences and/or transmembrane domains are useful as immunogenic components of vaccines.

Targets for therapeutic drugs according to the invention include, but are not limited to, polypeptides of the invention, whether unique to P. aeruginosa or not, that are essential for growth and/or viability of P. aeruginosa under at least one growth condition. Polypeptides essential for growth and/or viability can be determined by examining the effect of deleting and/or disrupting the genes, i.e., by so-called gene “knockout”. Alternatively, genetic footprinting can be used (Smith et al., 1995, Proc. Natl. Acad. Sci. USA 92:5479-6433; Published International Application WO 94/26933; U.S. Pat. No. 5,612,180). Still other methods for assessing essentiality includes the ability to isolate conditional lethal mutations in the specific gene (e.g., temperature sensitive mutations). Other useful targets for therapeutic drugs, which include polypeptides that are not essential for growth or viability per se but lead to loss of viability of the cell, can be used to target therapeutic agents to cells.

Strain-specific sequences:

Because of the evolutionary relationship between different P. aeruginosa strains, it is believed that the presently disclosed P. aeruginosa sequences are useful for identifying, and/or discriminating between, previously known and new P. aeruginosa strains. It is believed that other P. aeruginosa strains will exhibit at least about 70% sequence homology with the presently disclosed sequence. Systematic and routine analyses of DNA sequences derived from samples containing P. aeruginosa strains, and comparison with the present sequence allows for the identification of sequences that can be used to discriminate between strains, as well as those that are common to all P. aeruginosa strains. In one embodiment, the invention provides nucleic acids, including probes, and peptide and polypeptide sequences that discriminate between different strains of P. aeruginosa. Strain-specific components can also be identified functionally by their ability to elicit or react with antibodies that selectively recognize one or more P. aeruginosa strains.

In another embodiment, the invention provides nucleic acids, including probes, and peptide and polypeptide sequences that are common to all P. aeruginosa strains but are not found in other bacterial species.

P. aeruginosa Polypeptides

This invention encompasses isolated P. aeruginosa polypeptides encoded by the disclosed P. aeruginosa genomic sequences, including the polypeptides of the invention contained in the Sequence Listing. Polypeptides of the invention are preferably at least about 5 amino acid residues in length. Using the DNA sequence information provided herein, the amino acid sequences of the polypeptides encompassed by the invention can be deduced using methods well-known in the art. It will be understood that the sequence of an entire nucleic acid encoding a P. aeruginosa polypeptide can be isolated and identified based on an ORF that encodes only a fragment of the cognate protein-coding region. This can be achieved, for example, by using the isolated nucleic acid encoding the ORF, or fragments thereof, to prime a polymerase chain reaction with genomic P. aeruginosa DNA as template; this is followed by sequencing the amplified product.

The polypeptides of the present invention, including function-conservative variants of the disclosed ORFs, may be isolated from wild-type or mutant P. aeruginosa cells, or from heterologous organisms or cells (including, but not limited to, bacteria, fungi, insect, plant, and mammalian cells) including P. aeruginosa into which a P. aeruginosa-derived protein-coding sequence has been introduced and expressed. Furthermore, the polypeptides may be part of recombinant fusion proteins.

P. aeruginosa polypeptides of the invention can be chemically synthesized using commercially automated procedures such as those referenced herein, including, without limitation, exclusive solid phase synthesis, partial solid phase methods, fragment condensation or classical solution synthesis. The polypeptides are preferably prepared by solid phase peptide synthesis as described by Merrifield, 1963, J. Am. Chem. Soc. 85:2149. The synthesis is carried out with amino acids that are protected at the alpha-amino terminus. Trifunctional amino acids with labile side-chains are also protected with suitable groups to prevent undesired chemical reactions from occurring during the assembly of the polypeptides. The alpha-amino protecting group is selectively removed to allow subsequent reaction to take place at the amino-terminus. The conditions for the removal of the alpha-amino protecting group do not remove the side-chain protecting groups.

Methods for polypeptide purification are well-known in the art, including, without limitation, preparative disc-gel electrophoresis, isoelectric focusing, HPLC, reversed-phase HPLC, gel filtration, ion exchange and partition chromatography, and countercurrent distribution. For some purposes, it is preferable to produce the polypeptide in a recombinant system in which the P. aeruginosa protein contains an additional sequence tag that facilitates purification, such as, but not limited to, a polyhistidine sequence. The polypeptide can then be purified from a crude lysate of the host cell by chromatography on an appropriate solid-phase matrix. Alternatively, antibodies produced against a P. aeruginosa protein or against peptides derived therefrom can be used as purification reagents. Other purification methods are possible.

The present invention also encompasses derivatives and homologues of P. aeruginosa-encoded polypeptides. For some purposes, nucleic acid sequences encoding the peptides may be altered by substitutions, additions, or deletions that provide for functionally equivalent molecules, i.e., function-conservative variants. For example, one or more amino acid residues within the sequence can be substituted by another amino acid of similar properties, such as, for example, positively charged amino acids (arginine, lysine, and histidine); negatively charged amino acids (aspartate and glutamate); polar neutral amino acids; and non-polar amino acids.

The isolated polypeptides may be modified by, for example, phosphorylation, sulfation, acylation, or other protein modifications. They may also be modified with a label capable of providing a detectable signal, either directly or indirectly, including, but not limited to, radioisotopes and fluorescent compounds.

To identify P. aeruginosa-derived polypeptides for use in the present invention, essentially the complete genomic sequence of a virulent, methicillin-resistant isolate of Pseudomonas aeruginosa isolate was analyzed. While, in very rare instances, a nucleic acid sequencing error may be revealed, resolving a rare sequencing error is well within the art, and such an occurrence will not prevent one skilled in the art from practicing the invention.

Also encompassed are any P. aeruginosa polypeptide sequences that are contained within the open reading frames (ORFs), including complete protein-coding sequences, of which any of SEQ ID NO: 1-SEQ ID NO: 16571 forms a part. Table 2, which is appended herewith and which forms part of the present specification, provides a putative identification of the particular function of a polypeptide which is encoded by each ORF, based on the homology match (determined by the BLAST algorithm) of the predicted polypeptide with known proteins encoded by ORFs in other organisms. As a result, one skilled in the art can use the polypeptides of the present invention for commercial and industrial purposes consistent with the type of putative identification of the polypeptide.

The present invention provides a library of P. aeruginosa-derived polypeptide sequences, and a corresponding library of nucleic acid sequences encoding the polypeptides, wherein the polypeptides themselves, or polypeptides contained within ORFs of which they form a part, comprise sequences that are contemplated for use as components of vaccines. Non-limiting examples of such sequences are listed by SEQ ID NO in Table 2, which is appended herewith and which forms part of the present specification.

The present invention also provides a library of P. aeruginosa-derived polypeptide sequences, and a corresponding library of nucleic acid sequences encoding the polypeptides, wherein the polypeptides themselves, or polypeptides contained within ORFs of which they form a part, comprise sequences lacking homology to any known prokaryotic or eukaryotic sequences. Such libraries provide probes, primers, and markers which can be used to diagnose P. aeruginosa infection, including use as markers in epidemiological studies. Non-limiting examples of such sequences are listed by SEQ ID NO in Table 2, which is appended hereto and a part hereof.

The present invention also provides a library of P. aeruginosa-derived polypeptide sequences, and a corresponding library of nucleic acid sequences encoding the polypeptides, wherein the polypeptides themselves, or polypeptides contained within ORFs of which they form a part, comprise targets for therapeutic drugs.

SPECIFIC EXAMPLE

Determination of Pseudomonas Protein Antigens for Antibody and Vaccine Development

The selection of Pseudomonas protein antigens for vaccine development can be derived from the nucleic acids encoding P. aeruginosa polypeptides. First, the ORF's can be analyzed for homology to other known exported or membrane proteins and analyzed using the discriminant analysis described by Klein, et al. (Klein, P., Kanehsia, M., and DeLisi, C. (1985) Biochimica et Biophysica Acta 815, 468-476) for predicting exported and membrane proteins.

Homology searches can be performed using the BLAST algorithm contained in the Wisconsin Sequence Analysis Package (Genetics Computer Group, University Research Park, 575 Science Drive, Madison, Wis. 53711) to compare each predicted ORF amino acid sequence with all sequences found in the current GenBank, SWISS-PROT and PIR databases. BLAST searches for local alignments between the ORF and the databank sequences and reports a probability score which indicates the probability of finding this sequence by chance in the database. ORF's with significant homology (e.g. probabilities lower than 1×10−6 that the homology is only due to random chance) to membrane or exported proteins represent protein antigens for vaccine development. Possible functions can be provided to P. aeruginosa genes based on sequence homology to genes cloned in other organisms.

Discriminant analysis (Klein, et al. supra) can be used to examine the ORF amino acid sequences. This algorithm uses the intrinsic information contained in the ORF amino acid sequence and compares it to information derived from the properties of known membrane and exported proteins. This comparison predicts which proteins will be exported, membrane associated or cytoplasmic. ORF amino acid sequences identified as exported or membrane associated by this algorithm are likely protein antigens for vaccine development.

Production of Fragments and Analogs of P. aeruginosa Nucleic Acids and Polypeptides

Based on the discovery of the P. aeruginosa gene products of the invention provided in the Sequence Listing, one skilled in the art can alter the disclosed structure of P. aeruginosa genes, e.g., by producing fragments or analogs, and test the newly produced structures for activity. Examples of techniques known to those skilled in the relevant art which allow the production and testing of fragments and analogs are discussed below. These, or analogous methods can be used to make and screen libraries of polypeptides, e.g., libraries of random peptides or libraries of fragments or analogs of cellular proteins for the ability to bind P. aeruginosa polypeptides. Such screens are useful for the identification of inhibitors of P. aeruginosa.

Generation of Fragments

Fragments of a protein can be produced in several ways, e.g., recombinantly, by proteolytic digestion, or by chemical synthesis. Internal or terminal fragments of a polypeptide can be generated by removing one or more nucleotides from one end (for a terminal fragment) or both ends (for an internal fragment) of a nucleic acid which encodes the polypeptide. Expression of the mutagenized DNA produces polypeptide fragments. Digestion with “end-nibbling” endonucleases can thus generate DNAs which encode an array of fragments. DNAs which encode fragments of a protein can also be generated by random shearing, restriction digestion or a combination of the above-discussed methods.

Fragments can also be chemically synthesized using techniques known in the art such as conventional Merrifield solid phase f-Moc or t-Boc chemistry. For example, peptides of the present invention may be arbitrarily divided into fragments of desired length with no overlap of the fragments, or divided into overlapping fragments of a desired length.

Alteration of Nucleic Acids and Polypeptides: Random Methods

Amino acid sequence variants of a protein can be prepared by random mutagenesis of DNA which encodes a protein or a particular domain or region of a protein. Useful methods include PCR mutagenesis and saturation mutagenesis. A library of random amino acid sequence variants can also be generated by the synthesis of a set of degenerate oligonucleotide sequences. (Methods for screening proteins in a library of variants are elsewhere herein).

PCR Mutagenesis

In PCR mutagenesis, reduced Taq polymerase fidelity is used to introduce random mutations into a cloned fragment of DNA (Leung et al., 1989, Technique 1:11-15). The DNA region to be mutagenized is amplified using the polymerase chain reaction (PCR) under conditions that reduce the fidelity of DNA synthesis by Taq DNA polymerase, e.g., by using a dGTP/dATP ratio of five and adding Mn2+ to the PCR reaction. The pool of amplified DNA fragments are inserted into appropriate cloning vectors to provide random mutant libraries.

Saturation Mutagenesis

Saturation mutagenesis allows for the rapid introduction of a large number of single base substitutions into cloned DNA fragments (Mayers et al., 1985, Science 229:242). This technique includes generation of mutations, e.g., by chemical treatment or irradiation of single-stranded DNA in vitro, and synthesis of a complimentary DNA strand. The mutation frequency can be modulated by modulating the severity of the treatment, and essentially all possible base substitutions can be obtained. Because this procedure does not involve a genetic selection for mutant fragments both neutral substitutions, as well as those that alter function, are obtained. The distribution of point mutations is not biased toward conserved sequence elements.

Degenerate Oligonucleotides

A library of homologs can also be generated from a set of degenerate oligonucleotide sequences. Chemical synthesis of a degenerate sequences can be carried out in an automatic DNA synthesizer, and the synthetic genes then ligated into an appropriate expression vector. The synthesis of degenerate oligonucleotides is known in the art (see for example, Narang, S A (1983) Tetrahedron 39:3; Itakura et al. (1981) Recombinant DNA, Proc 3rd Cleveland Sympos. Macromolecules, ed. A G Walton, Amsterdam: Elsevier pp273-289; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477. Such techniques have been employed in the directed evolution of other proteins (see, for example, Scott et al. (1990) Science 249:386-390; Roberts et al. (1992) PNAS 89:2429-2433; Devlin et al. (1990) Science 249: 404-406; Cwirla et al. (1990) PNAS 87: 6378-6382; as well as U.S. Pat. Nos. 5,223,409, 5,198,346, and 5,096,815).

Alteration of Nucleic Acids and Polypeptides: Methods for Directed Mutagenesis

Non-random or directed, mutagenesis techniques can be used to provide specific sequences or mutations in specific regions. These techniques can be used to create variants which include, e.g., deletions, insertions, or substitutions, of residues of the known amino acid sequence of a protein. The sites for mutation can be modified individually or in series, e.g., by (1) substituting first with conserved amino acids and then with more radical choices depending upon results achieved, (2) deleting the target residue, or (3) inserting residues of the same or a different class adjacent to the located site, or combinations of options 1-3.

Alanine Scanning Mutagenesis

Alanine scanning mutagenesis is a useful method for identification of certain residues or regions of the desired protein that are preferred locations or domains for mutagenesis, Cunningham and Wells (Science 244:1081-1085, 1989). In alanine scanning, a residue or group of target residues are identified (e.g., charged residues such as Arg, Asp, His, Lys, and Glu) and replaced by a neutral or negatively charged amino acid (most preferably alanine or polyalanine). Replacement of an amino acid can affect the interaction of the amino acids with the surrounding aqueous environment in or outside the cell. Those domains demonstrating functional sensitivity to the substitutions are then refined by introducing further or other variants at or for the sites of substitution. Thus, while the site for introducing an amino acid sequence variation is predetermined, the nature of the mutation per se need not be predetermined. For example, to optimize the performance of a mutation at a given site, alanine scanning or random mutagenesis may be conducted at the target codon or region and the expressed desired protein subunit variants are screened for the optimal combination of desired activity.

Oligonucleotide-Mediated Mutagenesis

Oligonucleotide-mediated mutagenesis is a useful method for preparing substitution, deletion, and insertion variants of DNA, see, e.g., Adelman et al., (DNA 2:183, 1983). Briefly, the desired DNA is altered by hybridizing an oligonucleotide encoding a mutation to a DNA template, where the template is the single-stranded form of a plasmid or bacteriophage containing the unaltered or native DNA sequence of the desired protein. After hybridization, a DNA polymerase is used to synthesize an entire second complementary strand of the template that will thus incorporate the oligonucleotide primer, and will code for the selected alteration in the desired protein DNA. Generally, oligonucleotides of at least about 25 nucleotides in length are used. An optimal oligonucleotide will have 12 to 15 nucleotides that are completely complementary to the template on either side of the nucleotide(s) coding for the mutation. This ensures that the oligonucleotide will hybridize properly to the single-stranded DNA template molecule. The oligonucleotides are readily synthesized using techniques known in the art such as that described by Crea et al. (Proc. Natl. Acad. Sci. USA, 75: 5765 [1978]).

Cassette Mutagenesis

Another method for preparing variants, cassette mutagenesis, is based on the technique described by Wells et al. (Gene, 34:315 [1985]). The starting material is a plasmid (or other vector) which includes the protein subunit DNA to be mutated. The codon(s) in the protein subunit DNA to be mutated are identified. There must be a unique restriction endonuclease site on each side of the identified mutation site(s). If no such restriction sites exist, they may be generated using the above-described oligonucleotide-mediated mutagenesis method to introduce them at appropriate locations in the desired protein subunit DNA. After the restriction sites have been introduced into the plasmid, the plasmid is cut at these sites to linearize it. A double-stranded oligonucleotide encoding the sequence of the DNA between the restriction sites but containing the desired mutation(s) is synthesized using standard procedures. The two strands are synthesized separately and then hybridized together using standard techniques. This double-stranded oligonucleotide is referred to as the cassette. This cassette is designed to have 3′ and 5′ ends that are comparable with the ends of the linearized plasmid, such that it can be directly ligated to the plasmid. This plasmid now contains the mutated desired protein subunit DNA sequence.

Combinatorial Mutagenesis

Combinatorial mutagenesis can also be used to generate mutants (Ladner et al., WO 88/06630). In this method, the amino acid sequences for a group of homologs or other related proteins are aligned, preferably to promote the highest homology possible. All of the amino acids which appear at a given position of the aligned sequences can be selected to create a degenerate set of combinatorial sequences. The variegated library of variants is generated by combinatorial mutagenesis at the nucleic acid level, and is encoded by a variegated gene library. For example, a mixture of synthetic oligonucleotides can be enzymatically ligated into gene sequences such that the degenerate set of potential sequences are expressible as individual peptides, or alternatively, as a set of larger fusion proteins containing the set of degenerate sequences.

Other Modifications of P. aeruginosa Nucleic Acids and Polypeptides

It is possible to modify the structure of a P. aeruginosa polypeptide for such purposes as increasing solubility, enhancing stability (e.g., shelf life ex vivo and resistance to proteolytic degradation in vivo). A modified P. aeruginosa protein or peptide can be produced in which the amino acid sequence has been altered, such as by amino acid substitution, deletion, or addition as described herein.

An P. aeruginosa peptide can also be modified by substitution of cysteine residues preferably with alanine, serine, threonine, leucine or glutamic acid residues to minimize dimerization via disulfide linkages. In addition, amino acid side chains of fragments of the protein of the invention can be chemically modified. Another modification is cyclization of the peptide.

In order to enhance stability and/or reactivity, a P. aeruginosa polypeptide can be modified to incorporate one or more polymorphisms in the amino acid sequence of the protein resulting from any natural allelic variation. Additionally, D-amino acids, non-natural amino acids, or non-amino acid analogs can be substituted or added to produce a modified protein within the scope of this invention. Furthermore, an P. aeruginosa polypeptide can be modified using polyethylene glycol (PEG) according to the method of A. Sehon and co-workers (Wie et al., supra) to produce a protein conjugated with PEG. In addition, PEG can be added during chemical synthesis of the protein. Other modifications of P. aeruginosa proteins include reduction/alkylation (Tarr, Methods of Protein Microcharacterization, J. E. Silver ed., Humana Press, Clifton N.J. 155-194 (1986)); acylation (Tarr, supra); chemical coupling to an appropriate carrier (Mishell and Shiigi, eds, Selected Methods in Cellular Immunology, W H Freeman, San Francisco, Calif. (1980), U.S. Pat. No. 4,939,239; or mild formalin treatment (Marsh, (1971) Int. Arch. of Allergy and Appl. Immunol., 41: 199-215).

To facilitate purification and potentially increase solubility of a P. aeruginosa protein or peptide, it is possible to add an amino acid fusion moiety to the peptide backbone. For example, hexa-histidine can be added to the protein for purification by immobilized metal ion affinity chromatography (Hochuli, E. et al., (1988) Bio/Technology, 6: 1321-1325). In addition, to facilitate isolation of peptides free of irrelevant sequences, specific endoprotease cleavage sites can be introduced between the sequences of the fusion moiety and the peptide.

To potentially aid proper antigen processing of epitopes within an P. aeruginosa polypeptide, canonical protease sensitive sites can be engineered between regions, each comprising at least one epitope via recombinant or synthetic methods. For example, charged amino acid pairs, such as KK or RR, can be introduced between regions within a protein or fragment during recombinant construction thereof. The resulting peptide can be rendered sensitive to cleavage by cathepsin and/or other trypsin-like enzymes which would generate portions of the protein containing one or more epitopes. In addition, such charged amino acid residues can result in an increase in the solubility of the peptide.

Primary Methods for Screening Polypeptides and Analogs

Various techniques are known in the art for screening generated mutant gene products. Techniques for screening large gene libraries often include cloning the gene library into replicable expression vectors, transforming appropriate cells with the resulting library of vectors, and expressing the genes under conditions in which detection of a desired activity, e.g., in this case, binding to P. aeruginosa polypeptide or an interacting protein, facilitates relatively easy isolation of the vector encoding the gene whose product was detected. Each of the techniques described below is amenable to high through-put analysis for screening large numbers of sequences created, e.g., by random mutagenesis techniques.

Two Hybrid Systems

Two hybrid assays such as the system described below (as with the other screening methods described herein), can be used to identify polypeptides, e.g., fragments or analogs of a naturally-occurring P. aeruginosa polypeptide, e.g., of cellular proteins, or of randomly generated polypeptides which bind to an P. aeruginosa protein. (The P. aeruginosa domain is used as the bait protein and the library of variants are expressed as prey fusion proteins.) In an analogous fashion, a two hybrid assay (as with the other screening methods described herein), can be used to find polypeptides which bind a P. aeruginosa polypeptide.

Display Libraries

In one approach to screening assays, the Pseudomonas peptides are displayed on the surface of a cell or viral particle, and the ability of particular cells or viral particles to bind an appropriate receptor protein via the displayed product is detected in a “panning assay”. For example, the gene library can be cloned into the gene for a surface membrane protein of a bacterial cell, and the resulting fusion protein detected by panning (Ladner et al., WO 88/06630; Fuchs et al. (1991) Bio/Technology 9:1370-1371; and Goward et al. (1992) TIBS 18:136-140). In a similar fashion, a detectably labeled ligand can be used to score for potentially functional peptide homologs. Fluorescently labeled ligands, e.g., receptors, can be used to detect homologs which retain ligand-binding activity. The use of fluorescently labeled ligands, allows cells to be visually inspected and separated under a fluorescence microscope, or, where the morphology of the cell permits, to be separated by a fluorescence-activated cell sorter.

A gene library can be expressed as a fusion protein on the surface of a viral particle. For instance, in the filamentous phage system, foreign peptide sequences can be expressed on the surface of infectious phage, thereby conferring two significant benefits. First, since these phage can be applied to affinity matrices at concentrations well over 1013 phage per milliliter, a large number of phage can be screened at one time. Second, since each infectious phage displays a gene product on its surface, if a particular phage is recovered from an affinity matrix in low yield, the phage can be amplified by another round of infection. The group of almost identical E. coli filamentous phages, M13, fd., and f1, are most often used in phage display libraries. Either of the phage gIII or gVIII coat proteins can be used to generate fusion proteins without disrupting the ultimate packaging of the viral particle. Foreign epitopes can be expressed at the NH2-terminal end of pIII and phage bearing such epitopes recovered from a large excess of phage lacking this epitope (Ladner et al. PCT publication WO 90/02909; Garrard et al., PCT publication WO 92/09690; Marks et al. (1992) J. Biol. Chem. 267:16007-16010; Griffiths et al. (1993) EMBO J 12:725-734; Clackson et al. (1991) Nature 352:624-628; and Barbas et al. (1992) PNAS 89:4457-4461).

A common approach uses the maltose receptor of E. coli (the outer membrane protein, LamB) as a peptide fusion partner (Charbit et al. (1986) EMBO 5, 3029-3037). Oligonucleotides have been inserted into plasmids encoding the LamB gene to produce peptides fused into one of the extracellular loops of the protein. These peptides are available for binding to ligands, e.g., to antibodies, and can elicit an immune response when the cells are administered to animals. Other cell surface proteins, e.g., OmpA (Schorr et al. (1991) Vaccines 91, pp. 387-392), PhoE (Agterberg, et al. (1990) Gene 88, 37-45), and PAL (Fuchs et al. (1991) Bio/Tech 9, 1369-1372), as well as large bacterial surface structures have served as vehicles for peptide display. Peptides can be fused to pilin, a protein which polymerizes to form the pilus-a conduit for interbacterial exchange of genetic information (Thiry et al. (1989) Appl. Environ. Microbiol. 55, 984-993). Because of its role in interacting with other cells, the pilus provides a useful support for the presentation of peptides to the extracellular environment. Another large surface structure used for peptide display is the bacterial motive organ, the flagellum. Fusion of peptides to the subunit protein flagellin offers a dense array of many peptide copies on the host cells (Kuwajima et al. (1988) Bio/Tech. 6, 1080-1083). Surface proteins of other bacterial species have also served as peptide fusion partners. Examples include the Staphylococcus protein A and the outer membrane IgA protease of Neisseria (Hansson et al. (1992) J. Bacteriol. 174, 4239-4245 and Klauser et al. (1990) EMBO J. 9, 1991-1999).

In the filamentous phage systems and the LamB system described above, the physical link between the peptide and its encoding DNA occurs by the containment of the DNA within a particle (cell or phage) that carries the peptide on its surface. Capturing the peptide captures the particle and the DNA within. An alternative scheme uses the DNA-binding protein LacI to form a link between peptide and DNA (Cull et al. (1992) PNAS USA 89:1865-1869). This system uses a plasmid containing the LacI gene with an oligonucleotide cloning site at its 3′-end. Under the controlled induction by arabinose, a LacI-peptide fusion protein is produced. This fusion retains the natural ability of LacI to bind to a short DNA sequence known as LacO operator (LacO). By installing two copies of LacO on the expression plasmid, the LacI-peptide fusion binds tightly to the plasmid that encoded it. Because the plasmids in each cell contain only a single oligonucleotide sequence and each cell expresses only a single peptide sequence, the peptides become specifically and stablely associated with the DNA sequence that directed its synthesis. The cells of the library are gently lysed and the peptide-DNA complexes are exposed to a matrix of immobilized receptor to recover the complexes containing active peptides. The associated plasmid DNA is then reintroduced into cells for amplification and DNA sequencing to determine the identity of the peptide ligands. As a demonstration of the practical utility of the method, a large random library of dodecapeptides was made and selected on a monoclonal antibody raised against the opioid peptide dynorphin B. A cohort of peptides was recovered, all related by a consensus sequence corresponding to a six-residue portion of dynorphin B. (Cull et al. (1992) Proc. Natl. Acad. Sci. U.S.A. 89-1869)

This scheme, sometimes referred to as peptides-on-plasmids, differs in two important ways from the phage display methods. First, the peptides are attached to the C-terminus of the fusion protein, resulting in the display of the library members as peptides having free carboxy termini. Both of the filamentous phage coat proteins, pIII and pVIII, are anchored to the phage through their C-termini, and the guest peptides are placed into the outward-extending N-terminal domains. In some designs, the phage-displayed peptides are presented right at the amino terminus of the fusion protein. (Cwirla, et al. (1990) Proc. Natl. Acad. Sci. U.S.A 87, 6378-6382) A second difference is the set of biological biases affecting the population of peptides actually present in the libraries. The LacI fusion molecules are confined to the cytoplasm of the host cells. The phage coat fusions are exposed briefly to the cytoplasm during translation but are rapidly secreted through the inner membrane into the periplasmic compartment, remaining anchored in the membrane by their C-terminal hydrophobic domains, with the N-termini, containing the peptides, protruding into the periplasm while awaiting assembly into phage particles. The peptides in the LacI and phage libraries may differ significantly as a result of their exposure to different proteolytic activities. The phage coat proteins require transport across the inner membrane and signal peptidase processing as a prelude to incorporation into phage. Certain peptides exert a deleterious effect on these processes and are underrepresented in the libraries (Gallop et al. (1994) J. Med. Chem. 37(9):1233-1251). These particular biases are not a factor in the LacI display system.

The number of small peptides available in recombinant random libraries is enormous. Libraries of 107-109 independent clones are routinely prepared. Libraries as large as 1011 recombinants have been created, but this size approaches the practical limit for clone libraries. This limitation in library size occurs at the step of transforming the DNA containing randomized segments into the host bacterial cells. To circumvent this limitation, an in vitro system based on the display of nascent peptides in polysome complexes has recently been developed. This display library method has the potential of producing libraries 3-6 orders of magnitude larger than the currently available phage/phagemid or plasmid libraries. Furthermore, the construction of the libraries, expression of the peptides, and screening, is done in an entirely cell-free format.

In one application of this method (Gallop et al. (1994) J. Med. Chem. 37(9):1233-1251), a molecular DNA library encoding 1012 decapeptides was constructed and the library expressed in an E. coli S30 in vitro coupled transcription/translation system. Conditions were chosen to stall the ribosomes on the mRNA, causing the accumulation of a substantial proportion of the RNA in polysomes and yielding complexes containing nascent peptides still linked to their encoding RNA. The polysomes are sufficiently robust to be affinity purified on immobilized receptors in much the same way as the more conventional recombinant peptide display libraries are screened. RNA from the bound complexes is recovered, converted to cDNA, and amplified by PCR to produce a template for the next round of synthesis and screening. The polysome display method can be coupled to the phage display system. Following several rounds of screening, cDNA from the enriched pool of polysomes was cloned into a phagemid vector. This vector serves as both a peptide expression vector, displaying peptides fused to the coat proteins, and as a DNA sequencing vector for peptide identification. By expressing the polysome-derived peptides on phage, one can either continue the affinity selection procedure in this format or assay the peptides on individual clones for binding activity in a phage ELISA, or for binding specificity in a completion phage ELISA (Barret, et al. (1992) Anal. Biochem 204,357-364). To identify the sequences of the active peptides one sequences the DNA produced by the phagemid host.

Secondary Screening of Polypeptides and Analogs

The high through-put assays described above can be followed by secondary screens in order to identify further biological activities which will, e.g., allow one skilled in the art to differentiate agonists from antagonists. The type of a secondary screen used will depend on the desired activity that needs to be tested. For example, an assay can be developed in which the ability to inhibit an interaction between a protein of interest and its respective ligand can be used to identify antagonists from a group of peptide fragments isolated though one of the primary screens described above.

Therefore, methods for generating fragments and analogs and testing them for activity are known in the art. Once the core sequence of interest is identified, it is routine for one skilled in the art to obtain analogs and fragments.

Peptide Mimetics of P. aeruginosa Polypeptides

The invention also provides for reduction of the protein binding domains of the subject P. aeruginosa polypeptides to generate mimetics, e.g. peptide or non-peptide agents. The peptide mimetics are able to disrupt binding of a polypeptide to its counter ligand, e.g., in the case of a P. aeruginosa polypeptide binding to a naturally occurring ligand. The critical residues of a subject P. aeruginosa polypeptide which are involved in molecular recognition of a polypeptide can be determined and used to generate P. aeruginosa-derived peptidomimetics which competitively or noncompetitively inhibit binding of the P. aeruginosa polypeptide with an interacting polypeptide (see, for example, European patent applications EP-412,762A and EP-B31,080A).

For example, scanning mutagenesis can be used to map the amino acid residues of a particular P. aeruginosa polypeptide involved in binding an interacting polypeptide, peptidomimetic compounds (e.g. diazepine or isoquinoline derivatives) can be generated which mimic those residues in binding to an interacting polypeptide, and which therefore can inhibit binding of a P. aeruginosa polypeptide to an interacting polypeptide and thereby interfere with the function of P. aeruginosa polypeptide. For instance, non-hydrolyzable peptide analogs of such residues can be generated using benzodiazepine (e.g., see Freidinger et al. in Peptides: Chemistry and Biology, G. R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), azepine (e.g., see Huffman et al. in Peptides: Chemistry and Biology, G. R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), substituted gama lactam rings (Garvey et al. in Peptides: Chemistry and Biology, G. R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), keto-methylene pseudopeptides (Ewenson et al. (1986) J Med Chem 29:295; and Ewenson et al. in Peptides: Structure and Function (Proceedings of the 9th American Peptide Symposium) Pierce Chemical Co. Rockland, Ill., 1985), b-turn dipeptide cores (Nagai et al. (1985) Tetrahedron Lett 26:647; and Sato et al. (1986) J Chem Soc Perkin Trans 1:1231), and b-aminoalcohols (Gordon et al. (1985) Biochem Biophys Res Commun 126:419; and et al. (1986) Biochem Biophys Res Commun 134:71).

Vaccine Formulations for P. aeruginosa Nucleic Acids and Polypeptides

This invention also features vaccine compositions for protection against infection by P. aeruginosa or for treatment of P. aeruginosa infection. In one embodiment, the vaccine compositions contain one or more immunogenic components such as a surface protein from P. aeruginosa, or portion thereof, and a pharmaceutically acceptable carrier. Nucleic acids within the scope of the invention are exemplified by the nucleic acids of the invention contained in the Sequence Listing which encode P. aeruginosa surface proteins. Any nucleic acid encoding an immunogenic P. aeruginosa protein, or portion thereof, which is capable of expression in a cell, can be used in the present invention. These vaccines have therapeutic and prophylactic utilities.

One aspect of the invention provides a vaccine composition for protection against infection by P. aeruginosa which contains at least one immunogenic fragment of an P. aeruginosa protein and a pharmaceutically acceptable carrier. Preferred fragments include peptides of at least about 10 amino acid residues in length, preferably about 10-20 amino acid residues in length, and more preferably about 12-16 amino acid residues in length.

Immunogenic components of the invention can be obtained, for example, by screening polypeptides recombinantly produced from the corresponding fragment of the nucleic acid encoding the full-length P. aeruginosa protein. In addition, fragments can be chemically synthesized using techniques known in the art such as conventional Merrifield solid phase f-Moc or t-Boc chemistry.

In one embodiment, immunogenic components are identified by the ability of the peptide to stimulate T cells. Peptides which stimulate T cells, as determined by, for example, T cell proliferation or cytokine secretion are defined herein as comprising at least one T cell epitope. T cell epitopes are believed to be involved in initiation and perpetuation of the immune response to the protein allergen which is responsible for the clinical symptoms of allergy. These T cell epitopes are thought to trigger early events at the level of the T helper cell by binding to an appropriate HLA molecule on the surface of an antigen presenting cell, thereby stimulating the T cell subpopulation with the relevant T cell receptor for the epitope. These events lead to T cell proliferation, lymphokine secretion, local inflammatory reactions, recruitment of additional immune cells to the site of antigen/T cell interaction, and activation of the B cell cascade, leading to the production of antibodies. A T cell epitope is the basic element, or smallest unit of recognition by a T cell receptor, where the epitope comprises amino acids essential to receptor recognition (e.g., approximately 6 or 7 amino acid residues). Amino acid sequences which mimic those of the T cell epitopes are within the scope of this invention.

Screening immunogenic components can be accomplished using one or more of several different assays. For example, in vitro, peptide T cell stimulatory activity is assayed by contacting a peptide known or suspected of being immunogenic with an antigen presenting cell which presents appropriate MHC molecules in a T cell culture. Presentation of an immunogenic P. aeruginosa peptide in association with appropriate MHC molecules to T cells in conjunction with the necessary co-stimulation has the effect of transmitting a signal to the T cell that induces the production of increased levels of cytokines, particularly of interleukin-2 and interleukin-4. The culture supernatant can be obtained and assayed for interleukin-2 or other known cytokines. For example, any one of several conventional assays for interleukin-2 can be employed, such as the assay described in Proc. Natl. Acad. Sci USA, 86: 1333 (1989) the pertinent portions of which are incorporated herein by reference. A kit for an assay for the production of interferon is also available from Genzyme Corporation (Cambridge, Mass.).

Alternatively, a common assay for T cell proliferation entails measuring tritiated thymidine incorporation. The proliferation of T cells can be measured in vitro by determining the amount of 3H-labeled thymidine incorporated into the replicating DNA of cultured cells. Therefore, the rate of DNA synthesis and, in turn, the rate of cell division can be quantified.

Vaccine compositions of the invention containing immunogenic components (e.g., P. aeruginosa polypeptide or fragment thereof or nucleic acid encoding an P. aeruginosa polypeptide or fragment thereof) preferably include a pharmaceutically acceptable carrier. The term “pharmaceutically acceptable carrier” refers to a carrier that does not cause an allergic reaction or other untoward effect in patients to whom it is administered. Suitable pharmaceutically acceptable carriers include, for example, one or more of water, saline, phosphate buffered saline, dextrose, glycerol, ethanol and the like, as well as combinations thereof. Pharmaceutically acceptable carriers may further comprise minor amounts of auxiliary substances such as wetting or emulsifying agents, preservatives or buffers, which enhance the shelf life or effectiveness of the antibody. For vaccines of the invention containing P. aeruginosa polypeptides, the polypeptide is co-administered with a suitable adjuvant.

It will be apparent to those of skill in the art that the therapeutically effective amount of DNA or protein of this invention will depend, inter alia, upon the administration schedule, the unit dose of antibody administered, whether the protein or DNA is administered in combination with other therapeutic agents, the immune status and health of the patient, and the therapeutic activity of the particular protein or DNA.

Vaccine compositions are conventionally administered parenterally, e.g., by injection, either subcutaneously or intramuscularly. Methods for intramuscular immunization are described by Wolff et al. (1990) Science 247: 1465-1468 and by Sedegah et al. (1994) Immunology 91: 9866-9870. Other modes of administration include oral and pulmonary formulations, suppositories, and transdermal applications. Oral immunization is preferred over parenteral methods for inducing protection against infection by P. aeruginosa. Cain et. al. (1993) Vaccine 11: 637-642. Oral formulations include such normally employed excipients as, for example, pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, and the like.

The vaccine compositions of the invention can include an adjuvant, including, but not limited to aluminum hydroxide; N-acetyl-muramyl--L-threonyl-D-isoglutamine (thr-MDP); N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637, referred to as nor-MDP); N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1′-2 ′-dipalmitoyl-sn-glycero-3-hydroxyphos-phoryloxy)-ethylami ne (CGP 19835A, referred to a MTP-PE); RIBI, which contains three components from bacteria; monophosphoryl lipid A; trehalose dimycoloate; cell wall skeleton (MPL+TDM+CWS) in a 2% squalene/Tween 80 emulsion; and cholera toxin. Others which may be used are non-toxic derivatives of cholera toxin, including its B subunit, and/or conjugates or genetically engineered fusions of the P. aeruginosa polypeptide with cholera toxin or its B subunit, procholeragenoid, fungal polysaccharides, including schizophyllan, muramyl dipeptide, muramyl dipeptide derivatives, phorbol esters, labile toxin of E. coli, non-P. aeruginosa bacterial lysates, block polymers or saponins.

Other suitable delivery methods include biodegradable microcapsules or immuno-stimulating complexes (ISCOMs), cochleates, or liposomes, genetically engineered attenuated live vectors such as viruses or bacteria, and recombinant (chimeric) virus-like particles, e.g., bluetongue. The amount of adjuvant employed will depend on the type of adjuvant used. For example, when the mucosal adjuvant is cholera toxin, it is suitably used in an amount of 5 mg to 50 mg, for example 10 mg to 35 mg. When used in the form of microcapsules, the amount used will depend on the amount employed in the matrix of the microcapsule to achieve the desired dosage. The determination of this amount is within the skill of a person of ordinary skill in the art.

Carrier systems in humans may include enteric release capsules protecting the antigen from the acidic environment of the stomach, and including P. aeruginosa polypeptide in an insoluble form as fusion proteins. Suitable carriers for the vaccines of the invention are enteric coated capsules and polylactide-glycolide microspheres. Suitable diluents are 0.2 N NaHCO3 and/or saline.

Vaccines of the invention can be administered as a primary prophylactic agent in adults or in children, as a secondary prevention, after successful eradication of P. aeruginosa in an infected host, or as a therapeutic agent in the aim to induce an immune response in a susceptible host to prevent infection by P. aeruginosa. The vaccines of the invention are administered in amounts readily determined by persons of ordinary skill in the art. Thus, for adults a suitable dosage will be in the range of 10 mg to 10 g, preferably 10 mg to 100 mg. A suitable dosage for adults will also be in the range of 5 mg to 500 mg. Similar dosage ranges will be applicable for children. Those skilled in the art will recognize that the optimal dose may be more or less depending upon the patient's body weight, disease, the route of administration, and other factors. Those skilled in the art will also recognize that appropriate dosage levels can be obtained based on results with known oral vaccines such as, for example, a vaccine based on an E. coli lysate (6 mg dose daily up to total of 540 mg) and with an enterotoxigenic E. coli purified antigen (4 doses of 1 mg) (Schulman et al., J. Urol. 150:917-921 (1993); Boedecker et al., American Gastroenterological Assoc. 999:A-222 (1993)). The number of doses will depend upon the disease, the formulation, and efficacy data from clinical trials. Without intending any limitation as to the course of treatment, the treatment can be administered over 3 to 8 doses for a primary immunization schedule over 1 month (Boedeker, American Gastroenterological Assoc. 888:A-222 (1993)).

In a preferred embodiment, a vaccine composition of the invention can be based on a killed whole E. coli preparation with an immunogenic fragment of a P. aeruginosa protein of the invention expressed on its surface or it can be based on an E. coli lysate, wherein the killed E. coli acts as a carrier or an adjuvant.

It will be apparent to those skilled in the art that some of the vaccine compositions of the invention are useful only for preventing P. aeruginosa infection, some are useful only for treating P. aeruginosa infection, and some are useful for both preventing and treating P. aeruginosa infection. In a preferred embodiment, the vaccine composition of the invention provides protection against P. aeruginosa infection by stimulating humoral and/or cell-mediated immunity against P. aeruginosa. It should be understood that amelioration of any of the symptoms of P. aeruginosa infection is a desirable clinical goal, including a lessening of the dosage of medication used to treat P. aeruginosa-caused disease, or an increase in the production of antibodies in the serum or mucous of patients.

Antibodies Reactive with P. aeruginosa Polypeptides

The invention also includes antibodies specifically reactive with the subject P. aeruginosa polypeptide. Anti-protein/anti-peptide antisera or monoclonal antibodies can be made by standard protocols (See, for example, Antibodies: A Laboratory Manual ed. by Harlow and Lane (Cold Spring Harbor Press: 1988)). A mammal such as a mouse, a hamster or rabbit can be immunized with an immunogenic form of the peptide. Techniques for conferring immunogenicity on a protein or peptide include conjugation to carriers or other techniques well known in the art. An immunogenic portion of the subject P. aeruginosa polypeptide can be administered in the presence of adjuvant. The progress of immunization can be monitored by detection of antibody titers in plasma or serum. Standard ELISA or other immunoassays can be used with the immunogen as antigen to assess the levels of antibodies.

In a preferred embodiment, the subject antibodies are immunospecific for antigenic determinants of the P. aeruginosa polypeptides of the invention, e.g. antigenic determinants of a polypeptide of the invention contained in the Sequence Listing, or a closely related human or non-human mammalian homolog (e.g., about 90% homologous, more preferably at least about 95% homologous). In yet a further preferred embodiment of the invention, the anti-P. aeruginosa antibodies do not substantially cross react (i.e., react specifically) with a protein which is for example, less than 80% percent homologous to a sequence of the invention contained in the Sequence Listing. By “not substantially cross react”, it is meant that the antibody has a binding affinity for a non-homologous protein which is less than 10 percent, more preferably less than 5 percent, and even more preferably less than 1 percent, of the binding affinity for a protein of the invention contained in the Sequence Listing. In a most preferred embodiment, there is no cross-reactivity between bacterial and mammalian antigens.

The term antibody as used herein is intended to include fragments thereof which are also specifically reactive with P. aeruginosa polypeptides. Antibodies can be fragmented using conventional techniques and the fragments screened for utility in the same manner as described above for whole antibodies. For example, F(ab′)2 fragments can be generated by treating antibody with pepsin. The resulting F(ab′)2 fragment can be treated to reduce disulfide bridges to produce Fab′ fragments. The antibody of the invention is further intended to include bispecific and chimeric molecules having an anti-P. aeruginosa portion.

Both monoclonal and polyclonal antibodies (Ab) directed against P. aeruginosa polypeptides or P. aeruginosa polypeptide variants, and antibody fragments such as Fab′ and F(ab′)2, can be used to block the action of P. aeruginosa polypeptide and allow the study of the role of a particular P. aeruginosa polypeptide of the invention in aberrant or unwanted intracellular signaling, as well as the normal cellular function of the P. aeruginosa and by microinjection of anti-P. aeruginosa polypeptide antibodies of the present invention.

Antibodies which specifically bind P. aeruginosa epitopes can also be used in immunohistochemical staining of tissue samples in order to evaluate the abundance and pattern of expression of P. aeruginosa antigens. Anti-P. aeruginosa polypeptide antibodies can be used diagnostically in immuno-precipitation and immuno-blotting to detect and evaluate P. aeruginosa levels in tissue or bodily fluid as part of a clinical testing procedure. Likewise, the ability to monitor P. aeruginosa polypeptide levels in an individual can allow determination of the efficacy of a given treatment regimen for an individual afflicted with such a disorder. The level of a P. aeruginosa polypeptide can be measured in cells found in bodily fluid, such as in urine samples or can be measured in tissue, such as produced by gastric biopsy. Diagnostic assays using anti-P. aeruginosa antibodies can include, for example, immunoassays designed to aid in early diagnosis of P. aeruginosa infections. The present invention can also be used as a method of detecting antibodies contained in samples from individuals infected by this bacterium using specific P. aeruginosa antigens.

Another application of anti-P. aeruginosa polypeptide antibodies of the invention is in the immunological screening of cDNA libraries constructed in expression vectors such as λgt11, λgt18-23, λZAP, and λORF8. Messenger libraries of this type, having coding sequences inserted in the correct reading frame and orientation, can produce fusion proteins. For instance, λgt11 will produce fusion proteins whose amino termini consist of β-galactosidase amino acid sequences and whose carboxy termini consist of a foreign polypeptide. Antigenic epitopes of a subject P. aeruginosa polypeptide can then be detected with antibodies, as, for example, reacting nitrocellulose filters lifted from infected plates with anti-P. aeruginosa polypeptide antibodies. Phage, scored by this assay, can then be isolated from the infected plate. Thus, the presence of P. aeruginosa gene homologs can be detected and cloned from other species, and alternate isoforms (including splicing variants) can be detected and cloned.

Kits Containing Nucleic Acids, Polypeptides or Antibodies of the Invention

The nucleic acid, polypeptides and antibodies of the invention can be combined with other reagents and articles to form kits. Kits for diagnostic purposes typically comprise the nucleic acid, polypeptides or antibodies in vials or other suitable vessels. Kits typically comprise other reagents for performing hybridization reactions, polymerase chain reactions (PCR), or for reconstitution of lyophilized components, such as aqueous media, salts, buffers, and the like. Kits may also comprise reagents for sample processing such as detergents, chaotropic salts and the like. Kits may also comprise immobilization means such as particles, supports, wells, dipsticks and the like. Kits may also comprise labeling means such as dyes, developing reagents, radioisotopes, fluorescent agents, luminescent or chemiluminescent agents, enzymes, intercalating agents and the like. With the nucleic acid and amino acid sequence information provided herein, individuals skilled in art can readily assemble kits to serve their particular purpose. Kits further can include instructions for use.

Bio Chip Technology

The nucleic acid sequence of the present invention may be used to detect P. aeruginosa or other species of Pseudomonas acid sequence using bio chip technology. Bio chips containing arrays of nucleic acid sequence can also be used to measure expression of genes of P. aeruginosa or other species of Pseudomonas. For example, to diagnose a patient with a P. aeruginosa or other Pseudomonas infection, a sample from a human or animal can be used as a probe on a bio chip containing an array of nucleic acid sequence from the present invention. In addition, a sample from a disease state can be compared to a sample from a non-disease state which would help identify a gene that is up-regulated or expressed in the disease state. This would provide valuable insight as to the mechanism by which the disease manifests. Changes in gene expression can also be used to identify critical pathways involved in drug transport or metabolism, and may enable the identification of novel targets involved in virulence or host cell interactions involved in maintenance of an infection. Procedures using such techniques have been described by Brown et al., 1995, Science 270: 467-470.

Bio chips can also be used to monitor the genetic changes of potential therapeutic compounds including, deletions, insertions or mismatches. Once the therapeutic is added to the patient, changes to the genetic sequence can be evaluated for its efficacy. In addition, the nucleic acid sequence of the present invention can be used to determine essential genes in cell cycling. As described in Iyer et al., 1999 (Science, 283:83-87) genes essential in the cell cycle can be identified using bio chips. Furthermore, the present invention provides nucleic acid sequences which can be used with bio chip technology to understand regulatory networks in bacteria, measure the response to environmental signals or drugs as in drug screening, and study virulence induction. (Mons et al., 1998, Nature Biotechnology, 16: 45-48. Patents teaching this technology include U.S. Pat. Nos. 5,445,934, 5,744,305, and 5,800,992.

Drug Screening Assays Using P. aeruginosa Polypeptides

By making available purified and recombinant P. aeruginosa polypeptides, the present invention provides assays which can be used to screen for drugs which are either agonists or antagonists of the normal cellular function, in this case, of the subject P. aeruginosa polypeptides, or of their role in intracellular signaling. Such inhibitors or potentiators may be useful as new therapeutic agents to combat P. aeruginosa infections in humans. A variety of assay formats will suffice and, in light of the present inventions, will be comprehended by the person skilled in the art.

In many drug screening programs which test libraries of compounds and natural extracts, high throughput assays are desirable in order to maximize the number of compounds surveyed in a given period of time. Assays which are performed in cell-free systems, such as may be derived with purified or semi-purified proteins, are often preferred as “primary” screens in that they can be generated to permit rapid development and relatively easy detection of an alteration in a molecular target which is mediated by a test compound. Moreover, the effects of cellular toxicity and/or bioavailability of the test compound can be generally ignored in the in vitro system, the assay instead being focused primarily on the effect of the drug on the molecular target as may be manifest in an alteration of binding affinity with other proteins or change in enzymatic properties of the molecular target. Accordingly, in an exemplary screening assay of the present invention, the compound of interest is contacted with an isolated and purified P. aeruginosa polypeptide.

Screening assays can be constructed in vitro with a purified P. aeruginosa polypeptide or fragment thereof, such as a P. aeruginosa polypeptide having enzymatic activity, such that the activity of the polypeptide produces a detectable reaction product. The efficacy of the compound can be assessed by generating dose response curves from data obtained using various concentrations of the test compound. Moreover, a control assay can also be performed to provide a baseline for comparison. Suitable products include those with distinctive absorption, fluorescence, or chemiluminescence properties, for example, because detection may be easily automated. A variety of synthetic or naturally occurring compounds can be tested in the assay to identify those which inhibit or potentiate the activity of the P. aeruginosa polypeptide. Some of these active compounds may directly, or with chemical alterations to promote membrane permeability or solubility, also inhibit or potentiate the same activity (e.g., enzymatic activity) in whole, live P. aeruginosa cells.

Overexpression Assays

Overexpression assays are based on the premise that overproduction of a protein would lead to a higher level of resistance to compounds that selectively interfere with the function of that protein. Overexpression assays may be used to identify compounds that interfere with the function of virtually any type of protein, including without limitation enzymes, receptors, DNA- or RNA-binding proteins, or any proteins that are directly or indirectly involved in regulating cell growth.

Typically, two bacterial strains are constructed. One contains a single copy of the gene of interest, and a second contains several copies of the same gene. Identification of useful inhibitory compounds of this type of assay is based on a comparison of the activity of a test compound in inhibiting growth and/or viability of the two strains. The method involves constructing a nucleic acid vector that directs high level expression of a particular target nucleic acid. The vectors are then transformed into host cells in single or multiple copies to produce strains that express low to moderate and high levels of protein encoding by the target sequence (strain A and B, respectively). Nucleic acid comprising sequences encoding the target gene can, of course, be directly integrated into the host cell.

Large numbers of compounds (or crude substances which may contain active compounds) are screened for their effect on the growth of the two strains. Agents which interfere with an unrelated target equally inhibit the growth of both strains. Agents which interfere with the function of the target at high concentration should inhibit the growth of both strains. It should be possible, however, to titrate out the inhibitory effect of the compound in the overexpressing strain. That is, if the compound is affecting the particular target that is being tested, it should be possible to inhibit the growth of strain A at a concentration of the compound that allows strain B to grow.

Alternatively, a bacterial strain is constructed that contains the gene of interest under the control of an inducible promoter. Identification of useful inhibitory agents using this type of assay is based on a comparison of the activity of a test compound in inhibiting growth and/or viability of this strain under both inducing and non-inducing conditions. The method involves constructing a nucleic acid vector that directs high-level expression of a particular target nucleic acid. The vector is then transformed into host cells that are grown under both non-inducing and inducing conditions (conditions A and B, respectively).

Large numbers of compounds (or crude substances which may contain active compounds) are screened for their effect on growth under these two conditions. Agents that interfere with the function of the target should inhibit growth under both conditions. It should be possible, however, to titrate out the inhibitory effect of the compound in the overexpressing strain. That is, if the compound is affecting the particular target that is being tested, it should be possible to inhibit growth under condition A at a concentration that allows the strain to grow under condition B.

Ligand-binding Assays

Many of the targets according to the invention have functions that have not yet been identified. Ligand-binding assays are useful to identify inhibitor compounds that interfere with the function of a particular target, even when that function is unknown. These assays are designed to detect binding of test compounds to particular targets. The detection may involve direct measurement of binding. Alternatively, indirect indications of binding may involve stabilization of protein structure or disruption of a biological function. Non-limiting examples of useful ligand-binding assays are detailed below.

A useful method for the detection and isolation of binding proteins is the Biomolecular Interaction Assay (BIAcore) system developed by Pharmacia Biosensor and described in the manufacturer's protocol (LKB Pharmacia, Sweden). The BIAcore system uses an affinity purified anti-GST antibody to immobilize GST-fusion proteins onto a sensor chip. The sensor utilizes surface plasmon resonance which is an optical phenomenon that detects changes in refractive indices. In accordance with the practice of the invention, a protein of interest is coated onto a chip and test compounds are passed over the chip. Binding is detected by a change in the refractive index (surface plasmon resonance).

A different type of ligand-binding assay involves scintillation proximity assays (SPA, described in U.S. Pat. No. 4,568,649).

Another type of ligand binding assay, also undergoing development, is based on the fact that proteins containing mitochondrial targeting signals are imported into isolated mitochondria in vitro (Hurt et al., 1985, Embo J. 4:2061-2068; Eilers and Schatz, Nature, 1986, 322:228-231). In a mitochondrial import assay, expression vectors are constructed in which nucleic acids encoding particular target proteins are inserted downstream of sequences encoding mitochondrial import signals. The chimeric proteins are synthesized and tested for their ability to be imported into isolated mitochondria in the absence and presence of test compounds. A test compound that binds to the target protein should inhibit its uptake into isolated mitochondria in vitro.

Another ligand-binding assay is the yeast two-hybrid system (Fields and Song, 1989, Nature 340:245-246). The yeast two-hybrid system takes advantage of the properties of the GAL4 protein of the yeast Saccharomyces cerevisiae. The GAL4 protein is a transcriptional activator required for the expression of genes encoding enzymes of galactose utilization. This protein consists of two separable and functionally essential domains: an N-terminal domain which binds to specific DNA sequences (UASG); and a C-terminal domain containing acidic regions, which is necessary to activate transcription. The native GAL4 protein, containing both domains, is a potent activator of transcription when yeast are grown on galactose media. The N-terminal domain binds to DNA in a sequence-specific manner but is unable to activate transcription. The C-terminal domain contains the activating regions but cannot activate transcription because it fails to be localized to UASG. In the two-hybrid system, a system of two hybrid proteins containing parts of GAL4: (1) a GAL4 DNA-binding domain fused to a protein ‘X’ and (2) a GAL4 activation region fused to a protein ‘Y’. If X and Y can form a protein—protein complex and reconstitute proximity of the GAL4 domains, transcription of a gene regulated by UASG occurs. Creation of two hybrid proteins, each containing one of the interacting proteins X and Y, allows the activation region of UASG to be brought to its normal site of action.

The binding assay described in Fodor et al., 1991, Science 251:767-773, which involves testing the binding affinity of test compounds for a plurality of defined polymers synthesized on a solid substrate, may also be useful.

Compounds which bind to the polypeptides of the invention are potentially useful as antibacterial agents for use in therapeutic compositions.

Pharmaceutical formulations suitable for antibacterial therapy comprise the antibacterial agent in conjunction with one or more biologically acceptable carriers. Suitable biologically acceptable carriers include, but are not limited to, phosphate-buffered saline, saline, deionized water, or the like. Preferred biologically acceptable carriers are physiologically or pharmaceutically acceptable carriers.

The antibacterial compositions include an antibacterial effective amount of active agent. Antibacterial effective amounts are those quantities of the antibacterial agents of the present invention that afford prophylactic protection against bacterial infections or which result in amelioration or cure of an existing bacterial infection. This antibacterial effective amount will depend upon the agent, the location and nature of the infection, and the particular host. The amount can be determined by experimentation known in the art, such as by establishing a matrix of dosages and frequencies and comparing a group of experimental units or subjects to each point in the matrix.

The antibacterial active agents or compositions can be formed into dosage unit forms, such as for example, creams, ointments, lotions, powders, liquids, tablets, capsules, suppositories, sprays, aerosols or the like. If the antibacterial composition is formulated into a dosage unit form, the dosage unit form may contain an antibacterial effective amount of active agent. Alternatively, the dosage unit form may include less than such an amount if multiple dosage unit forms or multiple dosages are to be used to administer a total dosage of the active agent. Dosage unit forms can include, in addition, one or more excipient(s), diluent(s), disintegrant(s), lubricant(s), plasticizer(s), colorant(s), dosage vehicle(s), absorption enhancer(s), stabilizer(s), bactericide(s), or the like.

For general information concerning formulations, see, e.g., Gilman et al. (eds.), 1990, Goodman and Gilman's: The Pharmacological Basis of Therapeutics, 8th ed., Pergamon Press; and Remington's Pharmaceutical Sciences, 17th ed., 1990, Mack Publishing Co., Easton, Pa.; Avis et al. (eds.), 1993, Pharmaceutical Dosage Forms: Parenteral Medications, Dekker, New York; Lieberman et al (eds.), 1990, Pharmaceutical Dosage Forms: Disperse Systems, Dekker, New York.

The antibacterial agents and compositions of the present invention are useful for preventing or treating P. aeruginosa infections. Infection prevention methods incorporate a prophylactically effective amount of an antibacterial agent or composition. A prophylactically effective amount is an amount effective to prevent P. aeruginosa infection and will depend upon the specific bacterial strain, the agent, and the host. These amounts can be determined experimentally by methods known in the art and as described above.

P. aeruginosa infection treatment methods incorporate a therapeutically effective amount of an antibacterial agent or composition. A therapeutically effective amount is an amount sufficient to ameliorate or eliminate the infection. The prophylactically and/or therapeutically effective amounts can be administered in one administration or over repeated administrations. Therapeutic administration can be followed by prophylactic administration, once the initial bacterial infection has been resolved.

The antibacterial agents and compositions can be administered topically or systemically. Topical application is typically achieved by administration of creams, ointments, lotions, or sprays as described above. Systemic administration includes both oral and parental routes. Parental routes include, without limitation, subcutaneous, intramuscular, intraperitoneal, intravenous, transdermal, inhalation and intranasal administration.

EXEMPLIFICATION

Cloning and Sequencing P. aeruginosa Genomic Sequence

This invention provides nucleotide sequences of the genome of P. aeruginosa which thus comprises a DNA sequence library of P. aeruginosa genomic DNA. The detailed description that follows provides nucleotide sequences of P. aeruginosa, and also describes how the sequences were obtained and how ORFs (Open Reading Frames) and protein-coding sequences can be identified. Also described are methods of using the disclosed P. aeruginosa sequences in methods including diagnostic and therapeutic applications. Furthermore, the library can be used as a database for identification and comparison of medically important sequences in this and other strains of P. aeruginosa as well as other species of Pseudomonas.

Chromosomal DNA from strain 19804 of P. aeruginosa was isolated after Zymolyase digestion, sodium dodecyl sulfate lysis, potassium acetate precipitation, phenol:chloroform extraction and ethanol precipitation (Soll, D. R., T. Srikantha and S. R. Lockhart: Characterizing Developmentally Regulated Genes in P. aeruginosa. In Microbial Genome Methods. K. W. Adolph, editor. CRC Press. New York. p 17-37.). Genomic P. aeruginosa DNA was hydrodynamically sheared in an HPLC and then separated on a standard 1% agarose gel. Fractions corresponding to 2500-3000 bp in length were excised from the gel and purifed by the GeneClean procedure (Bio101, Inc.).

The purified DNA fragments were then blunt-ended using T4 DNA polymerase. The healed DNA was then ligated to unique BstXI-linker adapters (5′-GTCTTCACCACGGGG-3′ and 5′-GTGGTGAAGAC-3′ in 100-1000 fold molar excess). These linkers are complimentary to the BstXI-cut pGTC vector, while the overhang is not self-complimentary. Therefore, the linkers will not concatermerize nor will the cut-vector religate itself easily. The linker-adapted inserts were separated from the unincorporated linkers on a 1% agarose gel and purified using GeneClean. The linker-adapted inserts were then ligated to BstXI-cut vector to construct a “shotgun” sublclone libraries.

Only major modifications to the protocols are highlighted. Briefly, the library was then transformed into DH5á competent cells (Gibco/BRL, DH5á transformation protocol). It was assessed by plating onto antibiotic plates containing ampicillin and IPTG/Xgal. The plates were incubated overnight at 37° C. Transformants were then used for plating of clones and picking for sequencing. The cultures were grown overnight at 37° C. DNA was purified using a silica bead DNA preparation (Engelstein, 1996) method. In this manner, 25 μg of DNA was obtained per clone.

These purified DNA samples were then sequenced using primarily ABI dye-terminator chemistry. All subsequent steps were based on sequencing by ABI377 automated DNA sequencing methods. The ABI dye terminator sequence reads were run on ABI377 machines and the data was transferred to UNIX machines following lane tracking of the gels. Base calls and quality scores were determined using the program PHRED (Ewing et al., 1998, Genome Res. 8: 175-185; Ewing and Green, 1998, Genome Res. 8: 685-734). Reads were assembled using PHRAP (P. Green, Abstracts of DOE Human Genome Program Contractor-Grantee Workshop V, January 1996, p.157) with default program parameters and quality scores. The initial assembly was done at 6-fold coverage and yielded 162 contigs.

Finishing can follow the initial assembly. Missing mates (sequences from clones that only gave reads from one end of the Pseudomonas DNA inserted in the plasmid) can be identified and sequenced with ABI technology to allow the identification of additional overlapping contigs.

End-sequencing of randomly picked genomic lambda was also performed. Sequencing on a both sides was done for all lambda sequences. The lambda library backbone helped to verify the integrity of the assembly and allowed closure of some of the physical gaps.Primers for walking off the ends of contigs would be selected using pick_primer (a GTC program) near the ends of the clones to facilitate gap closure. These walks can be sequenced using the selected clones and primers. These data are then reassembled with PHRAP. Additional sequencing using PCR-generated templates and screened and/or unscreened lambda templates can be done in addition.

To identify P. aeruginosa polypeptides the complete genomic sequence of P. aeruginosa were analyzed essentially as follows: First, all possible stop-to-stop open reading frames (ORFs) greater than 180 nucleotides in all six reading frames were translated into amino acid sequences. Second, the identified ORFs were analyzed for homology to known (archeabacter, prokaryotic and eukaryotic) protein sequences. Third, the coding potential of non-homologous sequences were evaluated with the program GENEMARKTM (Borodovsky and McIninch, 1993, Comp. Chem. 17:123).

Identification, Cloning and Expression of P. aeruginosa Nucleic Acids

Expression and purification of the P. aeruginosa polypeptides of the invention can be performed essentially as outlined below.

To facilitate the cloning, expression and purification of membrane and secreted proteins from P. aeruginosa, a gene expression system, such as the pET System (Novagen), for cloning and expression of recombinant proteins in E. coli, is selected. Also, a DNA sequence encoding a peptide tag, the His-Tag, is fused to the 3′ end of DNA sequences of interest in order to facilitate purification of the recombinant protein products. The 3′ end is selected for fusion in order to avoid alteration of any 5′ terminal signal sequence.

PCR Amplification and Cloning of Nucleic Acids Containing ORF's Encoding Enzymes

Nucleic acids chosen (for example, from the nucleic acids set forth in SEQ ID NO: 1-SEQ ID NO:16571) for cloning from the 19804 strain of P. aeruginosa are prepared for amplification cloning by polymerase chain reaction (PCR). Synthetic oligonucleotide primers specific for the 5′ and 3′ ends of open reading frames (ORFs) are designed and purchased from GibcoBRL Life Technologies (Gaithersburg, Md., USA). All forward primers (specific for the 5′ end of the sequence) are designed to include an NcoI cloning site at the extreme 5′ terminus. These primers are designed to permit initiation of protein translation at a methionine residue followed by a valine residue and the coding sequence for the remainder of the native P. aeruginosa DNA sequence. All reverse primers (specific for the 3′ end of any P. aeruginosa ORF) include a EcoRI site at the extreme 5′ terminus to permit cloning of each P. aeruginosa sequence into the reading frame of the pET-28b. The pET-28b vector provides sequence encoding an additional 20 carboxy-terminal amino acids including six histidine residues (at the extreme C-terminus), which comprise the His-Tag.

Genomic DNA prepared from the 19804 strain of P. aeruginosa is used as the source of template DNA for PCR amplification reactions (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). To amplify a DNA sequence containing a P. aeruginosa ORF, genomic DNA (50 nanograms) is introduced into a reaction vial containing 2 mM MgCl2, 1 micromolar synthetic oligonucleotide primers (forward and reverse primers) complementary to and flanking a defined P. aeruginosa ORF, 0.2 mM of each deoxynucleotide triphosphate; dATP, dGTP, dCTP, dTTP and 2.5 units of heat stable DNA polymerase (Amplitaq, Roche Molecular Systems, Inc., Branchburg, N.J., USA) in a final volume of 100 microliters.

Upon completion of thermal cycling reactions, each sample of amplified DNA is washed and purified using the Qiaquick Spin PCR purification kit (Qiagen, Gaithersburg, Md., USA). All amplified DNA samples are subjected to digestion with the restriction endonucleases, e.g., NcoI and EcoRI (New England BioLabs, Beverly, Mass., USA)(Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). DNA samples are then subjected to electrophoresis on 1.0% NuSeive (FMC BioProducts, Rockland, Me. USA) agarose gels. DNA is visualized by exposure to ethidium bromide and long wave uv irradiation. DNA contained in slices isolated from the agarose gel is purified using the Bio 101 GeneClean Kit protocol (Bio 101 Vista, Calif., USA).

Cloning of P. aeruginosa Nucleic Acids into an Expression Vector

The pET-28b vector is prepared for cloning by digestion with restriction endonucleases, e.g., NcoI and EcoRI (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). The pET-28a vector, which encodes a His-Tag that can be fused to the 5′ end of an inserted gene, is prepared by digestion with appropriate restriction endonucleases.

Following digestion, DNA inserts are cloned (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994) into the previously digested pET-28b expression vector. Products of the ligation reaction are then used to transform the BL21 strain of E. coli (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994) as described below.

Transformation of Competent Bacteria with Recombinant Plasmids

Competent bacteria, E. coli strain BL21 or E. coli strain BL21(DE3), are transformed with recombinant pET expression plasmids carrying the cloned P. aeruginosa sequences according to standard methods (Current Protocols in Molecular, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). Briefly, 1 microliter of ligation reaction is mixed with 50 microliters of electrocompetent cells and subjected to a high voltage pulse, after which, samples are incubated in 0.45 milliliters SOC medium (0.5% yeast extract, 2.0% tryptone, 10 mM NaCl, 2.5 mM KCl, 10 mM MgCl2, 10 mM MgSO4 and 20, mM glucose) at 37° C. with shaking for 1 hour. Samples are then spread on LB agar plates containing 25 microgram/ml kanamycin sulfate for growth overnight. Transformed colonies of BL21 are then picked and analyzed to evaluate cloned inserts as described below.

Identification of Recombinant Expression Vectors with P. aeruginosa Nucleic Acids

Individual BL21 clones transformed with recombinant pET-28b P. aeruginosa ORFs are analyzed by PCR amplification of the cloned inserts using the same forward and reverse primers, specific for each P. aeruginosa sequence, that were used in the original PCR amplification cloning reactions. Successful amplification verifies the integration of the P. aeruginosa sequences in the expression vector (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994).

Isolation and Preparation of Nucleic Acids from Transformants

Individual clones of recombinant pET-28b vectors carrying properly cloned P. aeruginosa ORFs are picked and incubated in 5 mls of LB broth plus 25 microgram/ml kanamycin sulfate overnight. The following day plasmid DNA is isolated and purified using the Qiagen plasmid purification protocol (Qiagen Inc., Chatsworth, Calif., USA).

Expression of Recombinant P. aeruginosa Sequences in E. coli

The pET vector can be propagated in any E. coli K-12 strain e.g. HMS174, HB101, JM109, DH5, etc. for the purpose of cloning or plasmid preparation. Hosts for expression include E. coli strains containing a chromosomal copy of the gene for T7 RNA polymerase. These hosts are lysogens of bacteriophage DE3, a lambda derivative that carries the lacI gene, the lacUV5 promoter and the gene for T7 RNA polymerase. T7 RNA polymerase is induced by addition of isopropyl-B-D-thiogalactoside (IPTG), and the T7 RNA polymerase transcribes any target plasmid, such as pET-28b, carrying its gene of interest. Strains used include: BL21(DE3) (Studier, F. W., Rosenberg, A. H., Dunn, J. J., and Dubendorff, J. W. (1990) Meth. Enzymol. 185, 60-89).

To express recombinant P. aeruginosa sequences, 50 nanograms of plasmid DNA isolated as described above is used to transform competent BL21(DE3) bacteria as described above (provided by Novagen as part of the pET expression system kit). The lacZ gene (beta-galactosidase) is expressed in the pET-System as described for the P. aeruginosa recombinant constructions. Transformed cells are cultured in SOC medium for 1 hour, and the culture is then plated on LB plates containing 25 micrograms/ml kanamycin sulfate. The following day, bacterial colonies are pooled and grown in LB medium containing kanamycin sulfate (25 micrograms/ml) to an optical density at 600 nM of 0.5 to 1.0 O.D. units, at which point, 1 millimolar IPTG was added to the culture for 3 hours to induce gene expression of the P. aeruginosa recombinant DNA constructions.

After induction of gene expression with IPTG, bacteria are pelleted by centrifugation in a Sorvall RC-3B centrifuge at 3500× g for 15 minutes at 4° C. Pellets are resuspended in 50 milliliters of cold 10 mM Tris-HCl, pH 8.0, 0.1 M NaCl and 0.1 mM EDTA (STE buffer). Cells are then centrifuged at 2000× g for 20 min at 4° C. Wet pellets are weighed and frozen at −80° C. until ready for protein purification.

A variety of methodologies known in the art can be utilized to purify the isolated proteins. (Current Protocols in Protein Science, John Wiley and Sons, Inc., J. E. Coligan et al., eds., 1995). For example, the frozen cells may be thawed, resupended in buffer and ruptured by several passages through a small volume microfluidizer (Model M-110S, Microfluidics International Corporation, Newton, Mass.). The resultant homogenate may be centrifuged to yield a clear supernatant (crude extract) and following filtration the crude extract may be fractionated over columns. Fractions may be monitored by absorbance at OD280 nm and peak fractions may analyzed by SDS-PAGE.

The concentrations of purified protein preparations may be quantified spectrophotometrically using absorbance coefficients calculated from amino acid content (Perkins, S. J. 1986 Eur. J. Biochem. 157, 169-180). Protein concentrations are also measured by the method of Bradford, M. M. (1976) Anal. Biochem. 72, 248-254, and Lowry, O. H., Rosebrough, N., Farr, A. L. & Randall, R. J. (1951) J. Biol. Chem. 193, pages 265-275, using bovine serum albumin as a standard.

SDS-polyacrylamide gels of various concentrations may be purchased from BioRad (Hercules, Calif., USA), and stained with Coomassie blue. Molecular weight markers may include rabbit skeletal muscle myosin (200 kDa), E. coli (-galactosidase (116 kDa), rabbit muscle phosphorylase B (97.4 kDa), bovine serum albumin (66.2 kDa), ovalbumin (45 kDa), bovine carbonic anhydrase (31 kDa), soybean trypsin inhibitor (21.5 kDa), egg white lysozyme (14.4 kDa) and bovine aprotinin (6.5 kDa).

EQUIVALENTS

Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments and methods described herein. The specific embodiments described herein are offered by way of example only, and the invention is to limited only by the terms of the appended claims, along with the full scope of equivalents to which such claims are entitled.

110< td>1657916588Trypanosoma cruzi< td>minor jackknife< td>23112< td>27< td>297−39< td>longfin squid< td>Homo sapiens16623Pseudomonas26823916_f3_8Oryctolagus73−2< td>SaccharomycesPseudomonas putida86< td>−3MycobacteriumPseudomonas putida16661Q4506897< td>−3Pseudomonas putida−1010435664792_c3_84< td>209< td>Hordeum vulgare57011012365831_c1_31< td>195−5< td>−9123ChlamydomonasKlebsiella122< td>Enterobacter cloacae12616701< td>XanthomonasHerpesvirus papioBoreogadus saida1670813914014182143< td>1401671816719GTC ORF with score 612 to:< td>−515516727159160< td>−5163164165169< td>Enterobacter cloacae171Klebsiella174175< td>1674916751< /tr>1035P40976189Caenorhabditis16764194< td>−2< td>−10197198< td>16770(de:streptomyces coelicolor cosmid16781211Pyrococcusmice|C57BL/6xCBA/Caenorhabditis< td>16798< td>229< td>Klebsiella−107silkworm16808168091681016811(de:mycobacterium tuberculosis< td>1681716820< td>16821< td>(de:caenorhabditis elegans cosmid< td>16822< tr>< /tr>25316825< td/>1682716829GTC ORF with score 148 to:25916831< td>Mucuna hassjoo< /tr>< td>−43266< td>267−6593< td>16843< td>275276< td>277< td>Klebsiella< td>279< td>282< td>283< td>16859290< td>291< td>Aspergillus< td>294< td>Enterobacter cloacae300301302< td>652304< td>Escherichia colimice|C57BL/6xCBA/Myxococcus xanthus< tr>< td>Klebsiella321Pseudomonas< td>325< td>326< td>327< td>Brassica napus< td>16901< td>GTC ORF with score 203 to:< td>337P19706< td>341< td>4588505_c2_158< td>34416105455_c2_164< td>347< td>Escherichia coli< td>255< td>Klebsiella< td>−7Escherichia coli< td>16927< td>357< td>358< td>−82< td>Pseudomonas< td>720< td>OrgyiaLysobacter< td>16941< td>−11Human16946< td>Acanthamoeba37916952Chinese oak< td>Pseudomonas1695738713416967< td/>< td>399< td>402< td>403404405< td>408−59< td>Contig807816984(de:pseudomonas aeruginosa pilk< td>415< td>14179055_13_189< tr>16991< td>GTC ORF with score 1123 to:< td>−6742316535207_c1_229427< td>no gb taxonomy< td>17000Saccharomyces< td>Pyrococcus433< td/>< td>17008< td>438< td>439< td>440southeastern AsianGTC ORF with score 278 to:−15< td>446186< td>17019Pseudomonas putidaCaenorhabditisVolvox carteriCaenorhabditis< td>Homo sapiens< td/>< td>460Streptomyces fradiae< td>17039Klebsiella< td>470471< td>472< td>474< td>10447811223392_c3_36717051< td>481< td>48317057487Micrococcus luteus662(de: streptomyces coelicolor cosmid150U41162AP000004137< td>51517087< td>GTC ORF with score 445 to:< td>−45< td>Thermoanaerobacter< /tr>171031710517107< td>538< td>12973202_f3_153−2< td>Homo sapiens< td>17117< td>CONTIG707< td>554< td>559< td>−5< td>17133< /tr>189Enterobacter cloacae< td>574< td>−5< td>576533Aspergillus< td>582< td>−8< td>586249Enterobacter cloacae< td>589< td>592< td>21650811_c3_322< td>17167< td>(de:streptomyces coelicolor cosmid< td>Klebsiella< td>598< td>602African clawed frog< td>Mycobacterium< td>17181< td>612< td>613< td>614< td>Mycobacterium17190621P14328< td>17197139< td>17202Sus scrofa domesticaHaloferax sp.< td>639< td>278< td>642< td>33690625_f2_114644< td>645< td>−27< td>651< td>653< td>654< td>655< td>658< td>659343−10< td>17237GTC ORF with score 129 to:667< td>668< td>669< td>31922092_f3_194< td>human herpesvirus< td>malaria parasite< td>67533708542_f3_214< td>685< td>686< td>687< td>688< td>15723886_f3_248< td>1726417266blue mussellongfin squid< td>Plasmodium vivaxDrosophila710299< td>116< td>713< td>107< td>716< td>90< td>720721< td>723< td>725< td>726< td>727< td>110< td>Homo sapiens17307(de:streptomyces coelicolor cosmid< td>human herpesvirus< /tr>< td>17316(de:acanthamoeba castellanii< td>746< td>749< td>DictyosteliumChlamydomonasEdwardsiella ictaluri173241732517326< td>756759< td>Escherichia coli17340< td>Enterococcus775< td>11717348< td>778< td>779< td>780< td>78178778878979079214338907_f1_50< td>397796< td>−1799< td>Z95620< td>8053254141_f2_106< td>258< td>814< td>816< td>817< td>−16< td>−21< td>17391< td>827< td>Clostridium< td>829< td>17401Escherichia coli83217404Homo sapiens< td>Escherichia coli< td>Boreogadus saidaSaccharomyces< td>845< td>849Boreogadus saidaEscherichia coli8531742517426< td>14017431< td>Legionella< td>17439< td>17440870871−787492Boreogadus saidaBoreogadus saida< td>Streptomyces fradiae< td>885(cl:response regulator homology)< td/>142< td/>< td>Homo sapiensHome sapiensEnterobacter cloacae898GTC ORF with score 124 to−13904< td>17477Aspergillus< /tr>< /tr>< td>MycobacteriumKlebsiella< td>39817494250927< td>Orf virus< td>1302941402P45862Contig543A−70Escherichia coli< td>112< td>960< td>96117537< td>mice17539< td>1553< td>971< tr>< td>Boreogadus saida16217550< td>9821755517557< td>GTC ORF with score 196 to:< td>992< td>993995< td>17570103< td>1004Epstein-Barr virusVibrio< td>1011< td>(de:pseudomonas tolaasii cprs gene.)< td>17588< td>(sr:caenorhabditis elegans strain=bristol n2)(sr:, baker's yeast) (de:precursor)< td>175921759517597Boreogadus saida< td>Klebsiella< td>10331034< td>13016018_c1_4441039< td>17611< td>17615(de:cuticle collagen 12 precursor)64417617−38Dictyostelium< td>−91054< td>Enterobacter cloacae105617628< td>1058< td>1063< td>Boreogadus saida< td>mice|C57BL/6xCBA/< td>1069< td>17643< td>1073< td>17646< td>GTC ORF with score 389 to:< td>1076< td>Saimiriine< td>1081< td>35448283_c2_610< td>Escherichia coli< td>1085< td>1086< td>X99514< td>1089< td>1090< td>−6917663< td>1523P10569149< td>A69860< td>17676< td>(de:caenorhabditis elegans cosmidOrf virus< td>1108< td>1109< td>1110< td>equine herpesvirus< td>1114−5< td>1116Enterobacter cloacae< td>1118< td>137717692< td>17693< td>(ec:4.3.2.2) (de:adenylosuccinateNephila clavipes< td>1127< td>1128< td>17701Klebsiella< td>1133< td>17706< td>GTC ORF with score 106 to:1136< td>1138144< td>17713< td/>1153Klebsiella17727< td>17731Pseudomonas< td>Pseudomonas< td>17742< td>1172< td>1173(sr:house mouse) (de:mus musculus< td>Micrococcus luteus< td>longfin squid< td>Indian corn< td>118317760< td>1190< td>17762< td>Schizosaccharomyces< td>17767< td>Pisum sativum< td>Enterobacter cloacae< td>1203< td>−13< td>1209< td>Escherichia coli< /tr>< td>1213< td>177861779399629871083_f2_28917798< td>17803< td>−7< td>22033629040_f3_384< td>17812(de:caenorhabditis elegans cosmid< td>17815< td>1245Gallus gallus< td>17818< td>Klebsiella< td>−34< td>17827< td>−84Homo sapiensequine herpesvirus145< td>Bacillus< td>1266AF05830299316< td>H69874< tr>< td>1921285< td>209111< td>Klebsiella< td>1291S746531293< td>common tobacco17867< td>1297< td>1298< td>human herpesvirus< td>DictyosteliumParacoccus< td>Plasmodium vivax< td>1315162−6< td>17902< td>Pseudomonas17905GTC ORF with score 177 to:133517907< td>123< td>1342< td>17914(cl:herpesvirus immediate-early< td>Escherichia coli17919< td>113< td>1352< td>17925< td>1357Klebsiella< td>−1217931< /tr>< td>Litomosoides< /tr>< td>7921369< td>17942< td>(de:porin o precursor)Azospirillum< td>1375Enterobacter cloacae< td>1379< td>1380B40505< td>Canis familiarisEpstein-Barr virus< td>17956< td>−18< td>−10595< td>1397< td>13981797017971−16735417982< td>Rickettsia prowazekii strain< td>17989< td>10614221424< td>−8< td>−181269150_f1_82−15−7< td>−32< td>1801218013< td>GTC ORF with score 543 to:< td>1443< td>−34< td>18016< td>1447Xanthomonas12681451< td>1453< td>18026< td>1456< td>1458< td>1798914221424< td>−8< td>−181269150_f1_82−15−7< td>−49< td>−32< td>1801218013< td>1443< td>−34< td>1447Xanthomonas campestris1451< td>1453−8< td>18026< td>1456< td>1458−14< td>36718036−15< td>146818040Caenorhabditis elegans< td>1474< td>14886550_f2_176< td>1478< td>1479< td>18051< td>18052< /tr>< td>417< td>1489< td>1490< td>Herpes simplex virus (type< td>1493< td>180651496< td>1497< td>−21< td>1499180711501< td>−30−81507Caenorhabditis elegans9361513< td>180871524< td>152518098< td>Oenothera picensis1530< /tr>1810218104< td>Boreogadus saida< td>equine herpesvirus type 4Haloferax sp.154213772831_f3_452< td>−72< td>1548< td>−41Klebsiella pneumoniae< td>Escherichia coli< td>Homo sapiens< td>1564< td>1570< td>18143< td>1574< td>1575Klebsiella pneumoniae< td>−6< td>18151< td>Proteus mirabilis< td>−193< td>18157< td>Thermus thermophilus/18161< td>192< td>18165< td>181691600< td>1601< td>1602< td>18179< td>16091610< td>1611< td>1616< td>18191Klebsiella pneumoniae< td>−6< td>1623< td>1624< td>−3Zymomonas mobilis1635< td>18207< td>Salmonella choleraesuis18210238< td>18215< td>Epstein-Barr virus< td>Salmonella choleraesuis18222< td>18223< td>195< td>165618229< td>Brodetella pertussis199< td>18234< td>−39< td>1666< td>32160207_c3_856human herpesvirus18244−4< td>18247< td>Haemophilus influenzae< td>Homo sapiens< tr>< td>1692< /tr>< td>35911691< td>−6< td>18268< td>−3Enterobacter cloacae< td>1705(sr:,mouse) (de:repair1711< td>18284< td>(de:m.ammoniaphilum genes< td>1720< td>Escherichia coli< td>18297Saccharomyces cerevisiae17331736173718309< td>1739< td>−21< td>−518313< td>Achromobacter< tr>< td>Klebsiella pneumoniae< td>−1017471748< td>1751< td>175218329< td>1763< td>30360207_f1_162< td>176618339< td>1770< td>Escherichia coli1774< td>18347−818353183541784< td>18357< td>1787no gb taxonomy match< td>18364< td>1794180124691283_f2_334< td>13918378< td>1808< td>183831813< td>1814< td>1822< td>1823< td>−10< td>1825< td>1826< td>182918401< td>183518408< td>184091841218413< td>1844< td>1845< td>18419308185318431< td>Globodera pallidaKlebsiella pneumoniae186913142956_f3_597< td>22136006_f3_600Klebsiella pneumoniae< td>1874712< td>Caenorhabditis elegans< td>Enterobacter cloacae< td>188018453−43< td>−3< td>Mycobacterium avium18461< td>1892< td>1893Cyanobacterium synechocystis< td>1898< td>1899< td>−6< td>19031904< td>mice|C57BL/6xCBA/CaJ< td>19081910121191618488< td>18489< td>1919< td>−13< td>1924< td>1925< td>2603913_f3_773< td>Escherichia coli< td>86318504< td>193913172307_c1_870< td>18512< td>18514< td>18518< td>1853018534< td>1966< td>16658261_c1_965−29< td>1854019701971< td>197218544< td>1977−4−35< td>−14< td>1981< td>Staphylococcus epidermidis< td>1984185561986Pseudomonas aeruginosa< td>1994< td>−20< td>1997< td>18569−8< td>18589< /tr>185911859611072581_c2_1187< td>Mycobacterium tuberculosis18615< td>1613< td>Homo sapiens−9999< td>20503251711_c2_122018624< td>Streptomyces roseofulvusHaemophilus influenzae< td>−38−71206618640< td>18645< td>18649
TABLE 2
NTAA
SEQSEQNTAA
Orf NameIDIDLengthLengthScoreExponentOrganismAccessi onDescription
10954914_f3_411657224079
11976006_f3_3216573330253−21Cyanobacteri umJQ1236
synechocystis
5989443_c1_531657424682200−16Vibrio choleraeAF069055(de:vibr io cholerae sigma54-
dependent transcriptional activator
1(s54act1) gene, partial cds.)
< td/>(nt:catalytic domain)
15830341_c3_44165 7520167
5083327_f2_2 516576432144121 −7mice|C57BL/6xCBA/A38346(s r:, house mouse)
C aJ hybrid
26276456_c1_461657 7510170457−43 PseudomonasAF055999(de:< HIL>pseudomonas aeruginosa hemin
< td/>aeruginosauptake locus, hypothctical
proteinphuw (phuw), atpase
component (phuv), abc-type
permease (phuu), periplasmic binding
protein (phut), hemin degrading
factor (phus),and outer membrane
hemin receptor (ph . . .
24786541_c2_5716578477159203−15minor jackknifeL41834(sr:ensis minor (clone: 1/6) male
cla madult gonads cdna to mrna) (de:ensis
minor (clone 1/6) nuclear protein
mrna, complete cds.) (nt:putative)
24488558_c3_68507169103−3Epstein-Barr virusP03211(sr:b95-8,human herpesvirus 4)
(de:ebna-1 nuclear protein)
31363556_c2_5916 580474157122−7minor jackknifeLA1834(sr:ensis minor (clone: 1/6) male
cla madult gonads cdna to mrna) (de:ensis
minor (clone 1/6) nuclear protein
mrna, complete cds.) (nt:putative)
16038336_c3_610 16581462153107−4Homo sapiensQ08170(sr:,human) (de:pre-mrna splicing
factor srp75)
32150880_c2_311165 8223477131−8< HIL>PseudomonasAF010181(fn:tr ansfers d-rhamnose in an alpha1-2
aeruginosalinkage) (de:pseudomonas aeruginosa
< td/>glycosyltransferase wbpx (wbpx)
gene, complete cds.) (nt:one of three
< td/>transferases which function to)
35242666_f1_21216583< /td>414138108−5Boreogadus saidaU43200(dc:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
4552288_f3_4131658423778
25909416_f3_5141658520166
26 660177_c1_61516586741246 226−17Homo sapiensAB002322(sr:homo sapiens male brain cdna to
mrna, clone_lib:pbluescriptii s)
(de:human mrna for kiaa0324
gene, partial cds.)
34628910_c2_7161658 7762254106−3< HIL>SaimiriineQ01042(sr:11,) (de:immediate-early protein)
herpesvirus 2
25525216_c1_417405134
14253951_c2 _51816589483160
10250000_f1_11916590636< /td>211325−29Escher ichia coliP32695(de:hypothetical 36.8 kd protein in
dinf-qor intergenic region)
36414711_c1_72016 591630209169−11A44937(c1:kinetoplast-assoc iated protein)
32605438_c2_8211 6592723240186−14L41834(sr:ensis minor (clone: 1/6) male
cla madult gonads cdna to mrna) (de:ensis
minor (clone 1/6) nuclear protein
mrna, completc cds.) (nt:putative)
16973468_c3_922 16593576191148−9no gb taxonomyY15174(de:human papillomavirus type 76 e6,
matc he7, e1, e2, e4, 12, and 11 genes.)
(nt:putative)
16893767_f2_216594717239161 −10AeromonasU56832 (de:aeromonas hydrophila fk506
< td/>hydrophilabinding protein (fkpa) gene, complete
cds in 3.9 kb fragment.) (nt:orf5; no
significant similarity with known)
12272027_f3_324165 95339112153−11Aquifex acolicusG70456
36567881_ c2_62516596381126−7Klebsiella Contig531AGTC ORF with score 134 to:
pneumoniae(ai:7000766992) (or:Pseudomonas
aeruginosa)
12911468_c3_8261659 7666221452−43 Rhizobium melilotiP13632(de:c4-dicarboxyla te transport
(megaplasmidtranscriptional regulatory protein
pRME41B SYM)dcdt)
42558468_f1_116598741246546 −51PseudomonasS539 99(c1:gramicidin s synthetase i repeat
< HIL>aeruginosahomology:acetate--c oa ligase
homology:acyl carrier protein
homology)
9769817_f2_528 16599336111
2668 9416_c1_92916600378126
15394580_f2_1301660198375−34 Escherichia coliU41560(sr:escherichi a coli strain=k-12)
(ec:4.2.1.3) (de:escherichia coli
aconitase b (acnb) gene, partial cds.)
21562683_f2_2311660 2483160789−78 Escherichia coliP36683(ec:4.2.1.3) (de:(aconitase 2))
14932213_c1_43216603< /td>519172351−32Enterobacter cloacaeCONTIG503GTC ORF with score 351 to:
(ai:7000756806) (or:Pseudomonas
aeruginosa)
31726378_c3_5331660 4369122248−20 Enterobacter cloacaeCONTIG503GTC ORF with scorc 2125 to:
(ai:7501776800)
(or:Klebsiella pneumoniae)
31677056_c3_63416605423140418KlebsiellaContig5 30AGTC ORF with score 457 to:
pneumoniae(ai:7000802959) (or:Pseudomonas aeruginosa)
7145416_f3_2 351660621671
148 63202_f1_23616607414138< /td>509−49Pseudomonas putidaP13456(de:recf protein)
35804706_c3_1037 16608393131200−16 Escherichia coliS70160
48828120_f3_3 381660933311099 −4Haemophilus P45112(ec:3.1.—.—) (de:single-stranded-
< td/>influenzaedna-s pecific exonuclease recj,)
25862692_f2_239166 1025283178−12 Escherichia coliY09439(de:e. coli uup gene, partial.)
11955217_f3_440 16611345115289−25 BacillusP42100(de:hy pothetical 39.4 kd protein in
subtilis/Bacillusgntr-htpg intergenic region)
globigii
3256700_c3_74116612474158233−20Enterobacter cloacaeCONTIG505GTC ORF with score 474 to:
(ai:7501788553) (or:Klebsiella
pneumoniae)
24110217_f1_14216613 2919696−4Pseudomonas putidaD85415(sr:pseudomo nas putida
< td/>(strain:ucc22) dna) (de:pseudomonas
putida gene for conversion of aniline
to catechol.) (nt:amino group
< td/>transfer)
14866531_c1_243< /td>166141836098−4 AcinetobacterCONTIG221GTC ORF with score 2023 to:
baumanniiC(ai:98446) (or:Escherichia coli)
< td/>(sr:escherichia coli (strain:k12) dna
clone_lib:kohara lambda minise)
(de:e. coli genomic dna, kohara clone
< td/>#278(33.3-33.7 min.).)
(nt:orf_id:o277#10;
similar to (swissprot accession)
22004128_fI_14416615495164316−27PseudomonasP48632(d e:ferripyoverdine receptor precursor)
< td>aeruginosa
13147877_f2 _5451661622574
32678902_c3_104616617516< /td>171132−8Pseudom onasP32977(de:porin o precursor)
< td>aeruginosa
36620643_c1 _3471661822274
36047181_c1_2481661928896
36583431_f1_149 16620750249118−7S56117(sr:, longfin squid)
17082311_f2_350166 21642213163−11CaenorhabditisZ77655(de :caenorhabditis elegans cosmid
elegansc56a3, complete sequence.) (nt:weak
similarity to human calcium-
dependent proctase)
9769449_c1_6511 6622957319172−10O00268(sr:, human) (de:(tafii135) (tafii-130)
34167937_c3_952726242101−2Micrococcus luteusJQ0406
22315763_f1 _1531662426186
12893777_f1_25416625792264178−12mice|C57BL/6xCB A/A28996(c1:proline-rich protein) (sr:,
Ca J hybridhouse mouse)
13017587_f2_555166 26513170156−11Rattus norvegicusM64793(sr:rat (sprague-dawley) liver dna)
(de:rat salivary proline-rich protein
(rp15) gene, complete cds.)
25672037_f3_6561662 71146382195−14Rattus norvegicusB39066(c1:proline-rich protein) (sr:,
< td/>norway rat)
31347694_c1_75716628 1104367101−1m edicinal leechU92813(sr:medicinal leech) (de:hirudo
< td/>medicinalis tractin mrna,
< td/>complete cds.)
7320438_c2_85816629 1002334583−56 Escherichia coliP20966(ec:2.7.1.69) (de:(ec 2.7.1.69)
(eii-fru))
7128152_f2_2591663023477
53 39715_c3_5601663128595118−6PseudomonasAF051692(de:pseudomonas aeruginosa lema
aeruginosa(lema) gene, partial cds; and
responseregulator (pirr), histidine
protein kinase (pirs), and phenolate-
< td/>type ferrisiderophore receptor (pira)
genes, complete cds.) (nt:pirs;
transmembrane sensor component
34453318_f2_261 1663220468107−5P14756(ec:3 .4.24.26) (de:metalloproteinase))
aeruginosa
621663325283
11875776_f1_16316634< /td>1509502222−15< HIL>KlebsiellaContig554AGTC ORF with score 483 to:
pneumoniae(ai:7000831452) (or:Enterobacter cloacae)
22910840_f1_364 166351368455108 −5KlebsiellaContig503A GTC ORF with scorc 129 to:
pneumoniae(ai:7000772531) (or:Pseudomonas aeruginosa)
33417277_f1_5651663636841227135 −6KlebsiellaContig 554AGTC ORF with score 362 to:
pneumoniae(ai:7000831455) (or:Entcrobacter cloacae)
31900181_f1_666 1663729799132∠’8KlebsiellaContig554AGTC ORF with score 483 to:
pneumoniae(ai:7000831452) (or:Enterobacter cloacae)
10650693_f3_246 716638570189106 −3Canis familiarisA45195(c1:guanylate cyclase catalytic
domain homology) (sr:, dog)
15804056_f3_26681663 9537178148−9< HIL>Murine herpesvirusU97553(de:mur ine herpesvirus 68 strain
6 8wums, complete genome.)
31845968_f3_2769 1664028895106−5U46069(sr:e uropean rabbit) (de:oryctolagus
cuniculuscuniculus fertilin alpha subunit mrna,
< td/>complete cds.) (nt:sperm surface
protein with metalloproteinase and)
31486567_f3_37701664 1480160159−10 equine herpesvirusAF030027(fn:very large tegument protein)
type 4 EHV-4(de:equine herpesvirus 4 strain
ns80567, complete genome.)
(nt:counterpart of hsv-1 gene u136
and vzv gene 22)
26844849_c1_387116642 752725093311−9999 PseudomonasS53999(c1 :gramicidin s synthetase i repeat
< HIL>aeruginosahomology:acetate--c oa ligase
homology:acyl carrier protein
homology)
13942533_c2_45 721664350116698 −5AspergillusContig863 5GTC ORF with score 98 to:
fumigatus(ai:7000756994) (or:Pseudomonas aeruginosa)
13142958_c3_541664448916296PlasmodiumP08674(sr:gombak,) (de:circumsporozoite
< td/>cynomolgiprotei n precursor (cs))
32632056_c3_6574166 45720239150−8 AcanthamoebaP19706(sr:, amoeba) (de:myosin heavy chain
castellaniiib (myosin heavy chain iI))
29932330_f1_17516646 465154143−9Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
13806881_f1_27616647< /td>1131376177−11< HIL>MolluscumU60315(de:< i>molluscum contagiosum virus
< td/>contagiosum virussubtype 1, complete genome.)
subtype 1(nt:putative dna topoisomerase i;
description: homolog)
34609680_f1_4771 664820166
35682205_f1_878166491245414
12007067_f1_97916650435144100−3Caenorhabd itisAF045646(sr:caenorha bditis elegans
elegansstrain=bristol n2) (de:caenorhabditis
elegans cosmid f56b3.)
(nt:contains similarity to collagens)
1429168_f2_158016651663220104−3 Homo sapiensAF022224(fn:anti-apoptoti c protein)
(sr:human) (de:homo sapiens
bc1-2-binding protein (bag-1) mrna,
< td/>complete cds.) (nt:bag-11; long form
of bag-1; contains nuclear)
22135833_f2_1681 16652435144111−5P08640(s r:, baker's yeast) (ec:3.2.1.3)
cerevisiae(de:glucosida se) (1,4-alpha-d-glucan
glucohydrolase))
1606 9832_f2_178216653435144< /td>101−3equine herpesvirusD88734(sr:equ ine herpesvirus 1
type 1 EVH-1(strain:bk343, isolate:3f clone) dna)
(de:equine herpesvirus 1 dna for
membrane glycoprotein,
complete cds.)
35828876_f2_2183166 5414494821635−168 Pseudomonas putidaM57613(sr:pseudomo nas putida (strain
ppg2) dna) (de:pseudomonas putida
branched-chain keto acid
dehydrogenase operon(bkda1, bkda1
< td/>and bkda2), transacylase e2 (bkdb),
bkdr andlipoamide dehydrogenase
(lpdv) genes, complete cds.)
21986702_f2_2284166 552934977432−39P09062(ec:2.3.1.—) (de:chain transacylase))
14975718_f2_238516656441146110−6 KlebsiellaContig527AGTC ORF with score 168 to:
pneumoniae(ai:7000807184)
(or: Pseudomonas aeruginosa)
32204137_f2_2416657978325107mice|C57BL/6xCBA/S04336(c1 :unassigned ribonucleoprotein
CaJ hybridrepeat-containing proteins:
ribonucleoprotein repeat
homology) (sr:, house mouse)
16933157_f3_278716 658762253374−36AL123456( de:mycobacterium tuberculosis
< td>tuberculosish37rv complete genome; segment
149/162.) (nt:rv3537,
(mtcy03c7.19c), len: 563 aa.
similar eg to)
32596957_f3_288816659 1353450135−6< HIL>Rattus norvegicusB39066(c1:proline-rich protein)
(sr:, norway rat)
6738891_f3_308916660 13954641730−178P09060(ec:1.2.4.4) (de:(bckdh e1-alpha))
34613530_f3_329010653541162−118 Pseudomonas putidaP09062(ec:2.3.1.—) (de:chain transacylase))
22150333_f3_33911666213984651743∠’179Pseudomonas putidaP09063(ec:1.8.1.4) (de:dehydrogenase)
(lpd-val))
3160032_f3_ 3592166631206401832−83Bacillus(de:amino acid carricr protein alst)
subtilis/Bacillus
globigii
32522792_c1_42931666475 3250152−8Myxoc occus xanthusAF055904(de:myxoc occus xanthus
acetylornithine deacetylase (arge)
gene, complete cds; and unknown
gene.) (nt:orf2; no developmental
phenotype)
30369762_c1_449416665756251101< /td>−3Alphaherpesvirus P52506(sr:indiana-funkhauser/becker, prv)
pseudorabies virus(ec:3.2.2.—) (de:uracil-dna
PRVglycosylase, (udg))
6439575_c1_5195166 6632410794−6< HIL>MycobacteriumAL123456(de: mycobacterium tuberculosis
tuberculosish37rv complete genome; segment
159/162.) (nt:rv3850,
(mtcy01a6.18c), len: 218.
unknown.)
16304783_c2_5796< /td>166671158385243⠈’21Enterobacter cloacaeCONTIG393GTC ORF with score 348 to:
(ai:7501750253)
(or:Klebsiella pneumoniae)
22681466_c2_5816668549182
14868913_c2_5998166691038345164−10KlebsiellaContig530AGTC ORF with score 1135 to:
pneumoniae(ai:7000842307)
(or: Enterobacter cloacae)
2082901_c2_6099 16670993330151⠈’8Enterobacter cloacaeCONTIG484GTC ORF with score 1098 to:
(ai:7501740473)
(or:Klebsiella pneumoniae)
3254126_c2_6110016671669222104Rattus norvegicusM64793(sr:rat (sprague-dawley) liver dna)
(de:rat salivary proline-rich protein
(rp15) gene, complete cds.)
22464186_c2_6510116 672480159321−29P42179(de:bkd operon transcriptional
< td/>regulator)
10050816_c3_77 102166738462811 07−6StaphylococcusCONTIG030GTC ORF with score 179 to:
epid ermidisC(ai:4500688693) (or:Enterococcus faccalis)
12791456_c3_79 10316674483160140Enterobactcr cloacaeCONTIG484GTC ORF with score 465 to:
(ai:7501740484)
(or:Klebsiella pneumoniae)
11213183_c3_80166751629542
10516676630126−5Epstein-Barr virusP03211(sr:b95-8, human herpesvirus 4)
(de:cbna-1 nuclear protein)
31656308_f1_8106 16677408135124−8X68600(sr:barley) (de:h. vulgare pze40
< td/>gene.)
4777041_f2_1610716678624207250−2 1AcinetobacterCONTIG170< /td>GTC ORF with scorc 250 to:
baumanniiC(ai:7000757116)
(or:Pseudomonas aeruginosa)
6770808_f3_20108166792664887
5181283_f3_23109166801713749−74Acinetobacte rCONTIG170GTC ORF with scorc 749 to:
baumanniiC(ai:7000757123)
(or:Pseudomonas aeruginosa)
31289543_c1_30166811293430
11116682585113−4Rhesus Epstein BarrU93909(sr:rhesus epstein barr virus)
virus(de:cercopithecine herpesvirus
15 nuclear antigen ebna-1
gene, complete cds.)
26649143_c2_3711216 683549182117−4AcanthamoebaAF085185(de :acanthamoeba castellanii
castellaniimyo sin-ia (mia) gene, complete
cds.) (nt:myosin-i)
32517041_c2_38113166842553851655− 64Escherichia coliB64836
29298930_c3_3 911416685495164 125−8Plasmid pAH4JC2322
24900040_c3_41 11516686663220123Nephila clavipesU20329(fn:spider silk) (de:nephila clavipes
spidroin 1 mrna, partial cds.)
< td/>(nt:fibroin)
16494831_c3_4211616687432143136Aquifex aeolicusG70322
34479187_ f1_21171668824982−7XanthomonasZ95386(de:x.campestris exbd1, exbd2, exbb
campestrisand tonb genes.)
10410181_f1_31181 6689495164132−8S50755
reinhardtii strain
UTEX 1061
11980342_f1_41191669 0417138263−23 NeisseriaU79563(de: neisseria gonorrhoeae tonb
gonorrhoeae(tonb), exbb (exbb) and exbd (exbd)
genes, complete cds.)
15911466_f1_8120166 912457818690−68U95087(fn:in volved in biosynthesis of the
pneumoniaeprosthetic) (de:klebsiella pneumoniae
< td/>malonate decarboxylase gene cluster
(mdca, mdcb, mdcc, mdcd, mdce,
< td/>mdcf, mdcg, mdch, mdcr) genes,
complete cds.) (nt:similar to citg
proteins of citrate lyases f . . .
6353840_f1_101211669227992168−13Enterobactcr cloacaeCONTIG457GTC ORF with score 301 to:
(ai:7501783383)
(or:Klebsiella pneumoniae)
16145756_f1_1216693822273157 −8blue musselAF029249(sr:blue mussel) (de:mytilus edulis
precollagen d (precol-d) mrna,
< td/>complete cds.)
35251001 _f1_1312316694417138128−7Epstein-Barr virusP03211(sr:b95-8, human herpesvirus 4)
(de:ebna-1 nuclear protein)
13016666_f1_1412416695519172298−26KlebsiellaU56096(fn :unknown) (de:klebsiella
pneumoniaepneumoniae mdca, mdcb, mdcc,
< td/>mdcd, mdce, mdcf, and mdcg, genes,
complete cds.) (nt:similar to
malonyl coa-acyl carrier protein)
15711007_f2_171251669682527492−3CONTIG292GTC ORF with score 92 to:
(ai:7000757163)
(or:Pseudomonas aeruginosa)
13160950_f2_211669717285752207−229KlebsiellaU9 5087(de:klebsiella pneumoniac malonate
pneumoniaedecarbo xylase gene cluster (mdca,
mdcb, mdcc, mdcd, mdce, mdcf,
< td/>mdcg, mdch, mdcr) genes,
complete cds.) (nt:acyl carrier protein
transferase: alpha subunit of)
12359582_f2_231271669 8396131268−23 KlebsiellaU95087(de:klebsiella pneumoniae
< td/>pneumoniaemalon ate decarboxylase gene cluster
(mdca, mdcb, mdcc, mdcd, mdce,
< td/>mdcf, mdcg, mdch, mdcr) genes,
complete cds.) (nt:acyl carrier
protein; delta subunit of malonate)
3308541_f2_2412816699510169104−4 KlebsiellaContig501A GTC ORF with scorc 1258 to:
pneumoniae(ai:1500687053)
(or: Escherichia coli)
< td/>(ec:2.7.4.7) (de:(hmp-p kinase))
2525893_f2_25129 16700975324758−75 KlebsiellaU56096(fn: putative enzyme subunit involved
pneumoniaein malonate) (de:klebsiella
pneumoniae mdca, mdcb, mdcc,
< td/>mdcd, mdce, mdcf, and mdcg,
< td/>genes, complete cds.) (nt:an acetyl
coenzyme a carboxylase carboxyl)
32635752_f2_26130708235342−31KlebsiellaU95087(f n:involved in formation of thc
pneumoniaeholo-acyl carrier) (de:klebsiella
pneumoniae malonate decarboxylase
gene cluster (mdca, mdcb, mdcc,
< td/>mdcd, mdce, mdcf, mdcg, mdch,
< td/>mdcr) genes, complete cds.)
11911025_f3_2813116 7021083360347−31Z95386(de: x. campestris exbd1, exbd2, exbb
cam pestrisand tonb genes.)
1974136_f3_291321 6703717238105−3U23857(fn:binds to orip to permit replication
of the) (de:herpesvirus papio
< td/>brrf2 homolog gene, partial cds,
ebna1, bkrf2homolog and bkrf3
< td/>homolog genes, complete cds, and
homolog gene, partial cds.)
< td/>(nt:similar to ebna1
< td/>of epstein-barr v . . .
5012_f3_3013316704 660219210−17< i>ClostridiumZ96934(de:c lostridium beijerinekii
beijerinekiifms gene.)
10819842_f3_331341 6705513170120−5U43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
12348955_f3_341351670 61029342183−12Orf virusB34768
16929207_f3_35136167071011336968 −97KlebsiellaU9508 7(de:klebsiella pneumoniae malonate
pneumoniaedecarbo xylase gene cluster (mdca,
mdcb, mdcc, mdcd, mdce, mdcf,
< td/>mdcg, mdch, mdcr) genes,
completecds.) (nt:decarboxylase
subunit; beta subunit of malonate)
30562584_c1_401371209402156−11Enterobacter cloacaeCONTIG457GTC ORF with score 156 to:
(ai:7000757186)
(or:Pseudomonas aeruginosa)
16980201 _c1_4713816709819272261−23Klebsiella< /td>Contig541AGTC ORF with score 261 to:
pneumoniae(ai:7000757193)
(or: Pseudomonas aeruginosa)
35659652_c1_4816710402133460 −44KlebsiellaConti g541AGTC ORF with score 460 to:
pneumoniae(ai:7000757194)
(or: Pseudomonas aeruginosa)
16058203_c1_5016711426141382 −35Enterobacter cloacaeCONTIG457GTC ORF with score 432 to:
(ai:7501783623)
(or:Klebsiella pneumoniae)
34114655_c1_5316712312103
26603958_c2_5814216713249154−11KlebsiellaContig541AGTC ORF with score 154 to:
pneumoniae(ai:7000757204)
(or: Pseudomonas aeruginosa)
14332833_c2_5916714429142117 −5Alphaherpesvirus P33479(sr:kaplan, prv) (de:immediate-early protein
pseudorabies virusie180)
< td/>PRV
32682280_c2 _62144167152073690−6Homo sapiensS16506(sr:, man)
15102041_c2_63145167 16501166134−7 human herpesvirusU13194(fn:transcripti onal regulation)
type 6 HHV-6(de:human herpesvirus 6
replication origin-binding protein
(hdrfo), partial cds, helicase-primase
component (hdrf1), virion protein
(hdlf1), putative helicase (hdrf2),
putative phosphoprotein(edrf1),
replica . . .
9863400_c2_70146167171035344159−10Enterobacter cloacaeCONTIG473GTC ORF with score 140 to:
(ai:236122) (or:Escherichia coli)
< td/>(sr:escherichia coli dna)
(de:escherichia coli tolqra gene
cluster dna.) (nt:orf4; putative)
17081408_c3_721471038345107−2mice|C57BL/6xCBA/AF062655(sr:house mouse) (de:mus musculus
CaJ hybridplenty-of-prolines-101 mrna,
< td/>complete cds.) (nt:binds to several
sh3 domain containing proteins)
26031633_c3_73148936311424−40KlebsiellaContig541A
pneumoniae(ai:7000830999)
(or: Enterobacter cloacae)
10817541_c3_741 4916720984327112−6longfin squidS56117(sr:, longfin squid)
12595843_c3_761501 67211908635713−71 KlebsiellaContig541A GTC ORF with score 713 to:
pneumoniae(ai:7000757222)
(or: Pseudomonas aeruginosa)
24707281_f1_115116722540179119Nephila clavipesAF027973(de:neph ila clavipes flagelliform silk
protein (flag) mrna, partialcds.)
20992033_f1_2152 167231419472
10400468 _f1_6153167241029342377−35BradyrhizobiumAF007569(de:bradyrhizobium japonicum gstr
japonicum(gstr) gene, partial cds, and succinate
dehydrogenase membrane anchor
subunit (sdhc), membrane anchor
subunit (sdhd), flavoprotein
subunit (sdha) and iron-sulfurprotein
subunit (sdhb) genes, complete c . . .
14338332_f1_8154167251176391243−19KlebsiellaContig547AGTC ORF with score 243 to:
pneumoniae(ai:7000757234)
(or: Pseudomonas aeruginosa)
13105413_f1_10167261686561679−67Escherichia coliP24217(ec:2.7.1.69) (de:hpr) (ciii-fru)
< td/>(fructose pts diphosphoryl
transfer protein))
34455281_f1_16156567188144−10Escherichia coliP32673(ec:2.7.1.69) (de:ii, b component),)
4941456_f2_17157 1672816925631912−1 97PseudomonasP28812 (de:hypothetical protein in mmsb 3′
aeruginosaregion (orf1) (fragment))
32227268_f2_18158 167291455484343−31 Enterobacter cloacaeCONTIG469GTC ORF with score 343 to:
(ai:7000757244)
(or:Pseudomonas aeruginosa)
30339468_f2_2216730495164160 −10AcanthamoebaAF0 85185(de:acanthamoeba castellanii
castellaniimyosin-ia (mia) gene, complete cds.)
< td/>(nt:myosin-i)
10395933_f2_271673139012998longfin squidS56117(sr:, longfin squid)
15755381_f2_291611 67321041346257−22 HaemophilusP44330(ec :2.7.1.56) (de:1-
< HIL>influenzaephosphofructokinase , (fructose
1-phosphate kinase))
34114468_f3_36162167331074357128−8KlebsiellaContig542AGTC ORF with score 128 to:
pneumoniae(ai:7000757262)
(or: Pseudomonas aeruginosa)
20210312_f3_38167341251416172−9AcanthamoebaAF0 85185(de:acanthamoeba castellanii
castellaniimyo sin-ia (mia) gene, complete cds.)
< td/>(nt:myosin-i)
16503555_f3_3916735807268223 −19Enterobacter cloacaeCONTIG469GTC ORF with score 495 to:
(ai:7501735739)
(or:Klebsiella pneumoniae)
21877318_f3_42167361320439241−17Murine herpesvirusU97553(de:mur ine herpesvirus 68 strain
6 8wums, complete genome.)
13023957_f3_431661673715605191004−101< /td>RhodobacterP23388(sr:, rhodopseudomonas capsulata)
< td/>capsulatus(ec:2 .7.3.9:2.7.1.69) (de:(eiii-fru)))
16928841_f3_44167 16738780259385⠈’35Escherichia coliP23539(ec:2.7.1.56) (de:1-
phosphofructokinase, (fructose
1-phosphate kinase))
16910055_c1_4916816739663220158−12KlebsiellaContig245AGTC ORF with scorc 158 to:
pneumoniae(ai:7000757275)
(or: Pseudomonas aeruginosa)
15678905_c1_5116740363120132 −7Rana catesbeianaD88764(sr:ran a catesbeiana larva tail cdna
to mrna) (de:rana catesbeiana mrna
for alpha 2 type i collagen,
complete cds.)
32676943_c1_5317016 7411803600169−12CONTIG266GTC ORF with score 209 to:
(ai:7501730540)
(or:Klebsiella pneumoniae)
13087530_c1_54167421032343699−69Escherichia coliP21168(de:fructose repressor (catabolite
repressor/activator))
2213040 8_c1_5517216743948315149−8Homo sapiensS62928(sr:human salivary) (de:prb1m=prb1
medium length copy {exon 3}
(human, salivary, genomic, 924 nt).)
< td/>(nt:basic proline-rich
proteins (ps, pmf, pms, and pe))
33689590_c1_56173167 441110369361−33Contig501AGT C ORF with score 361 to:
pneumoniae(ai:7000757282)
(or: Pseudomonas aeruginosa)
12127218_c2_63167451176391100−2Enterobacter cloacaeCONTIG266GTC ORF with score 572 to:
(ai:7501730539)
(or:Klebsiella pneumoniae)
10835080_c2_64167462610869244−17Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
14557080_c2_671761674 7891296502−48 Escherichia coliP39408(de:hypothetical 28.9 kd protein in
osmy-deoc intergenic region)
4395776_c2_701771 674863321092−3common sunflowerS46272(sr:, common sunflower)
12369441_c2_72178528175124−6Homo sapiensPN0099(sr:, man)
12191068_c3_73179167 50501166118−5 mice|C57BL/6xCBA/AF062655(sr:house mouse) (de:mus musculus
CaJ hybridplenty-of-prolines-101 mrna,
< td/>complete cds.) (nt:binds to several
sh3 domain containing proteins)
31755291_c3_741801461486171−9Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
9846943_c3_7518116752 828275127−5Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
12350785_c3_811821675 3999332115−4< HIL>Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
16260280_c3_821831675 41611536236−17Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
21484756_c3_831841675 5354117142−10 BacillusP94425(de:hypoth etical 10.9 kd protein)
subtilis/Bacillus
globigii
11728918_c3_8418516756344336−30StreptomycesAF072709(de:streptomyces lividans amplifiable
lividanselemen t aud4: putative transcriptional
< td/>regulator, putative ferredoxin,
putative cytochromep450
oxidoreductase, and putative
oxidoreductase genes, complete cds;
and unknown genes.)
(nt:orf9; similar . . .
12788582_c3_8518616757< /td>1698565326−27< HIL>KlebsiellaContig499AGTC ORF with score 1051 to:
pneumoniae(ai:7000798862)
(or: Pseudomonas aeruginosa)
33869656_f1_4187167581317438138 −5Schizosaccharomyce(sr:, fission yeast) (ec:1.2.1.31)
s pombede:aminoadipate reductase)
< td/>(alpha-ar))
31895753 _f1_818816759348115 151−11KlebsiellaContig253AGTC ORF with score 151 to:
pneumoniae(ai:7000757319)
(or: Pseudomonas aeruginosa)
35650293_f1_12167601101366838−83Escherichia coliP32066(de:hypothetical 41.9 kd protein in
fucr-gcva intergenic region (orf3))
11072968_f1_13190 16761324107205−16 Escherichia coliP37618(de:hypothetical 9.1 kd protein in
ftsy-nika intergenic region)
31847590_f1_28191 167621014338552−53PyrococcusAP000006( sr:pyrococcus horikoshii (str:ot3)
horikoshiidna, c1:pyrococcus horikoshi)
< td/>(de:pyrococcus horikoshii ot3
genomic dna, 1166001-1485000
< td/>nt. position (6/7).)
4774205_f2_301921 6763693230137−7AF000198 (sr:caenorhabditis elegans
elegansstrain=bristol n2) (de:caenorhabditis
elegans cosmid t28f2.)
(nt:similar to cuticular collagen)
32675683_f2_331931698565176−11Enterobacter cloacaeCONTIG490GTC ORF with score 300 to:
(ai:7000797020)
(or:Pseudomonas aeruginosa)
35449062_f2_34167652784927558−53KlebsiellaCont ig253AGTC ORF with score 558 to:
pneumoniae(ai:7000757345)
(or: Pseudomonas aeruginosa)
5178955_f2_3619516766101433791KlebsiellaContig5 19AGTC ORF with score 275 to:
pneumoniae(ai:7000787170)
(or: Pseudomonas aeruginosa)
3306418_f2_3719616767408135149KlebsiellaContig 532AGTC ORF with score 632 to:
pneumoniae(ai:7000771478)
(or: Pseudomonas aeruginosa)
10259456_f2_39167681488495179−13KlebsiellaCont ig532AGTC ORF with score 181 to:
pneumoniae(ai:7000795298)
(or: Pseudomonas aeruginosa)
35572936_f2_49167691020339136−7mice|C57BL/6xCBA/P26687( sr:,mouse) (de:twist related
CaJ hybridprotein (m-twist))
31453387_f3_53199690229221−20MycobacteriumAL123456 (de:mycobacterium tuberculosis
tuberculosish 37rv complete genome; segment
21/162.) (nt:rv0421c,
(mtcy22g10.18c), len: 209, unknown)
1432630_f3_55200 167711182393127−5 DissostichusU43149(d e:dissostichus mawsoni antifreeze
< td/>mawsoniglycopeptide afgp
polyproteinprecursor gene,
< td/>complete cds.)
15836706_f3_6420116 772411136104−6KlebsiellaContig544AGTC ORF with score 160 to:
pneu moniae(ai:69657)
< td/>(or:Human herpesvirus 4)
(cl:epstein-barr virus
< td/>nuclear antigen)
12900407_f3_7320216773810269116−5 miceS50883(sr:mice macrophage) (de:putative
transcription regulator {clone t2,
repetitive sequence}(micc,
< td/>macrophage, mrna, 1263 nt).)
< td/>(nt:method: conceptual translation
supplied by author.)
10972567_f3_7520316774888295371−34ChromatiumP25544(de :rubisco operon transcriptional
< td/>vinosumreg ulator)
14332715_f3_77204 16775618206158−10 Epstein-Barr virusP03211(sr:b95-8, human herpesvirus 4)
(de:cbna-1 nuclear protein)
13020905_c1_7920516776501166160−10mice|C57BL/6xCBA/AF062655(sr:house mouse) (de:mus musculus
CaJ hybridplenty-of-prolines-101 mrna,
< td/>complete cds.) (nt:binds to several
sh3 domain containing proteins)
162537_c1_81206 167771890629256−20ChlamydiaAE001273(d e:chlamydia trachomatis section
trachomatis52 of 87 of the complete genome.)
12978830_c1_82207167781434477495−47StreptomycesAL031225
coelicolor8b7.) (nt:sc8b7.07c, possible
oxidoreductase,: 475 aa;)
34478965_c1_85208167 79591196155−10equine herpesvirusD88734(sr:equ ine herpesvirus 1
type 1 EVH-1(strain:bk343, isolate:3f clone) dna)
(de:equine herpesvirus 1 dna for
membrane glycoprotein,
complete cds.)
30598783_c1_8620916 780357118126−7Pseudomonas putidaD85415(sr:pseudomonas putida (strain:ucc22)
< td>dna) (de:pseudomonas putida
gene for conversion of aniline to
catechol.) (nt:amino group transfer)
21772952_c1_902101146381279−25Enterobacter cloacaeCONTIG456GTC ORF with score 376 to:
(ai:7501743772)
(or:Klebsiella pneumoniae)
33850706_c1_9216782516171138 −9Homo sapiensA48018(sr:, man) (mp:4q13-4q21)
12002018_c1_9721216783447148138− 9southern root-knotU68729(sr:southern root-knot nematode)
nematode(de:meloidogyne incognita cuticle
preprocollagen (col-2) mrna,
< td/>complete cds.)
30682032_c2_1002131 6784615204100−5AP000006(sr: pyrococcus horikoshii (str:ot3)
horikoshiidna, c1:pyrococcus horikoshi)
< td/>(de:pyrococcus horikoshii ot3
genomic dna, 1166001-1485000
< td/>nt. position(6/7).)
29869505_c2_104214 167851077358211 −16RickettsiaAJ235269< /td>Rickettsia prowazekii strain
prowazckiiMadrid E, complete genome.
182312_c2_1092151 678625584
22133333_c2_111 216167877562511 32−6KlebsiellaContig554AGTC ORF with score 1366 to:
pneumoniae(ai:71316) (or:Escherichia coli)
< td/>(de:escherichia coli genomic
sequence of minutes 9 to 12.)
25495906_c2_11221716 7881425474104−1U76716(sr:house mouse) (de:mus musculus
CaJ hybridvoltage-sensitive calcium channel
alpha 1 a (ccha1 a)mrna, complete
cds.) (nt:ion channel)
35254591 _c2_117218167892325774155−7mice|C57BL/6xCBA/AF 062655(sr:house mouse) (de:mus musculus
CaJ hybridplenty-of prolines-101 mrna,
< td/>complete cds.) (nt:binds to several
sh3 domain containing proteins)
562702_c2_119219167901701566149−6Homo sapiensM96943(sr:homo sapiens (library: emb13,
clonetics) epidermal dna) (de:human
profilaggrin gene exons 1-3, 5′ end.
29566530_c2_12122016 7911317438
29960456_c3_12 622116792318105
33675666_c3_129222167932 7089
32522705_c3_13122316794516171233− 19HaemophiiusC64173
infl uenzae
22145213_c3_13222 416795150650192 −3Hordeum vulgareX68600(sr:barley) (de:h. vulgare
pze40 gene.)
21738400_c3_133225 1679631141037799−79Escherichia coliP37906(ec: 1.—.—.—) (de:probable
oxidoreductase ord1,)
33721007_c3_136226 1679735711895−4AF000198 (sr:caenorhabditis elegans
elegansstrain=bristol n2) (de:caenorhabditis
elegans cosmid t28f2.)
(nt:similar to cuticuiar collagen)
14703217_c3_13822729469812872−29 9XanthomonasAJ002070(fn:regulator of pathogenicity factors
campestrisand) (ec:4.2.1.3) (de:xanthomonas
campestris rpfa gene, complete.)
35833418_c3_139228 1679919386451035−1 04Escherichia coliJQ1475(cl:methyl-accepting chemotaxis
< td/>protein)
26605081_c3_14516800927308948−95RhizobiumZ8033 9(fn:subunit i of terminal cytochrome
< td>leguminosarumc oxidase) (de:r. leguminosarum
fixnd and fixod genes.)
13175818_f1_12301 6801684227216−18Contig522AG TC ORF with score 323 to:
pneumoniae(ai:7000829656)
(or: Enterobacter cloacae)
16307666_f1_223 11680219626531064CyanobacteriumS7 7243(sr:pcc 6803, , pcc 6803)
synechocystis(sr:pcc 6803,)
26048818_f1_323216 803786261181−12S42886(c1:unassigned collagens) (sr:,
< td/>silkworm)
5192913_f1_5233< /td>16804543180200∠’16Enterobacter cloacaeCONTIG248GTC ORF with score 200 to:
(ai:7000757461)
(or:Pseudomonas aeruginosa)
7300803_f1_7 2341680519865
10 182080_f1_11235168064651 5490−2Homo sapiensY13709(sr:human) (de:homo sapiens
caudal-type homeobox gene 2 (cdx2)
sequence.)
22944501 _f1_162361680763321097−2CaenorhabditisAF026211(sr:caenorhabditis elegans
elegansstrain=bristol n2) (de:caenorhabditis
elegans cosmid t13b5.)
(nt:similar to cuticular collagen)
35285207_f1_19237894297390−36Pseudomonas putidaAF075709(de:pseudo monas putida lsfa
(lsfa), complete cds; and ssu locus,
complete sequence.) (nt:ssua)
31899130_f1_2023820766911162−118 Pseudomonas putidaAF075709(de:pseudo monas putida lsfa (lsfa),
complete cds; and ssu locus,
complete sequence.) (nt:ssuc)
34459405_f1_22239444147319−28Pseudomonas putidaAF075709(de:pseudo monas putida lsfa (lsfa),
complete cds; and ssu locus,
complete sequence.) (nt:ssuf)
17049033_f1_27240777258105−2MycobacteriumAL021942
tuberculosish 37rv complete genome; segment
29/162.) (nt:rv0578c, (mtv039.16c),
len: 1306. member of)
11064775_f1_282411681 2525174267−23 Escherichia coliB65061(cl:mioc protein)
31752281_f2_3124216813615204118−4 human herpesvirusU92288(fn:helicase, helicase-primase
type 6 HHV-6complex) (de:human herpesvirus
6 serotype b putative major
< td/>immediate-earlygenes.) (nt:similar to
hhv6a u86, region ie-b)
33839080_f2_3224316 814456151125−7DictyosteliumP14328(sr: , slime mold) (de:spore coat
discoideumprotein sp96)
9766458_f2_3724416815696231370−34Escherichia coliP29217(de:hypothetical 24.2 kd protein in
rimj-mvim intergenic region (g20.3))
14547965_f2_42245168166692221007−101Pseudomonas putidaAF075709(de:pseudo monas putida lsfa (lsfa),
complete cds; and ssu locus,
complete sequence.)
31338262_f2_442461590529184−11< /td>Epstein-Barr virusP03211(sr:b95-8, human herpesvirus 4)
(de:ebna-1 nuclear protein)
24510091_f2_4524716818894297494−47BacillusG69816
subtilis< /HIL>/Bacillus
globigii
12697956_f2_472481681919 416461848−191P seudomonas putidaAF075709(de:pseudo monas putida lsfa (lsfa),
complete cds; and ssu locus,
complete sequence.) (nt:ssud)
34503957_f2_492491017338153−8PseudomonasA36128
aeruginos a
7204092_f3_612501200399111−3CaenorhabditisZ81503
elegansf14f7, complete sequence.)
< td/>(nt:predicted using
< td/>genefinder; similar to collagen;)
31353580_f3_632511275424905−91< /td>BacillusD70021
subtilis/Bacillus
< td/>globigii
7164581_f3_6525216823 1737578478−46K lebsiellaContig541AGTC ORF with score 478 to:
pneumoniae(ai:7000757521)
(or: Pseudomonas aeruginosa)
22129126_f3_68168241062353385−35Pseudomonas putidaAF075709(de:pseudo monas putida lsfa (lsfa),
complete cds; and ssu locus,
complete sequence.) (nt:ssua)
33650283_f3_69254828275591−57BacillusP40401(de: probable abc transporter permease
subtilis/Bacillusprotein (orf1))
globigii
12206568_f3_7225516826606201705 −69Pseudomonas putidaAF075709(de:pseudo monas putida lsfa (lsfa),
complete cds; and ssu locus,
complete sequence.) (nt:ssue)
31894155_f3_7325651317094−2 AspergillusContig2120GTC ORF with score 130 to:
fumigatus(ai:58877)
(or:< i>Emericella nidulans)
(sr:, aspergillus nidulans)
(de:sterigmatocystin biosynthesis
regulatory protein)
12230380_f3_7425716828417138513−49Pseudomonas putidaAF075709(de:pseudo monas putida lsfa (lsfa),
complete cds; and ssu locus,
complete sequence.) (nt:ssua)
10270830_f3_75258516171148−11KlebsiellaContig450A
pneumoniae(ai:7000757531)
(or: Pseudomonas aeruginosa)
16823878_f3_761683015245071117−113Pseudomonas putidaAF075709(de:pseudo monas putida lsfa (lsfa),
complete cds; and ssu locus,
complete sequence.) (nt:ssub)
33650342_f3_79260756251294−26Synechococcus sp.U59236(de:synechococcus pcc7942
(strain PCC 7942)ribosomal protein s1 of 30s ribosome
(rps1), orf271, orf231, orf341,
carboxyltransferase alpha subunit
(acca), orf245, orf227, and gtp
cyclohydrolase i (fole) genes,
complete cds, and orf205 gene,
< td/>partial cds,) (pt . . .
19714783_c1_8626116832< /td>810269196−15Sy ncchococcus sp.P37372(sr:pcc 7942, anacystis nidulans r2)
(de:hypothetical 23.0 kd protein in
syna 5′ region (orf2)
31424137_c1_892621 6833486161244−21S54845(ec:3.5.4.16)
31298912_c1_9326316834471156130−7Bo reogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
1301681_c1_9426416835 831276151−11< HIL>Enterobacter cloacaeCONTIG375GTC ORF with score 151 to:
(ai:7000757550)
(or:Pseudomonas aeruginosa)
5276706_c1_9526516836504167456KlebsiellaContig 450AGTC ORF with score 456 to:
pneumoniae(ai:7000757551)
(or: Pseudomonas aeruginosa)
16886542_c1_9816837465154132 −8KlebsiellaContig 496AGTC ORF with score 193 to:
pneumoniae(ai:7000807650)
(or: Pseudomonas aeruginosa)
17057067_c1_10016838402133137−10KlebsiellaCont ig385AGTC ORF with scorc 552 to:
pneumoniae(ai:7000831233)
(or: Enterobacter cloacae)
14666267_c1_104 26816839735244112AspergillusContig7 664GTC ORF with score 112 to:
fumigatus(ai:7000757560)
(or:< HIL>Pseudomonas aeruginosa)
479756_c1_114269168401140380673 −66Escherichia coliB65005
35331963_c2_1 20270168411290429−58Cyanobacterium Q55759(sr:pcc 6803,) (cc:3.5.4.16) (de:gtp
synechocystiscyclohydrolase i, (gtp-ch-i))
14972886_c2_12127116842813270334−30 MethylobacteriumU87316(de:methylobacterium extorquens
< td/>extorquensputat ive glycerate kinase and
pyruvatekinase (pyka) genes,
complete cds.) (nt:orf2; putative)
17051418_c2_122272390129
35558537_c 2_12427316844813270 106−3Mycobacterium AF027770(de:mycobacterium smegmatis iron
smegmatisuptake genes, fxba (fxba) gene,
< td/>partial cds; and fxta (fxta), fxtb
(fxtb), fxbb (fxbb), fxbc (fxbc), fxtc
(fxtc), fxtd (fxtd), fxte (fxte), and fxtf
(fxtf) genes, complete cds.)
< td/>(nt:similar to membrane b . . .
33792902_c2_12527416845 495164377−35< HIL>KlebsiellaContig450AGTC ORF with score 377 to:
pneumoniae(ai:7000757581)
(or: Pseudomonas aeruginosa)
29585126_c2_12616846453150119−6AcanthamoebaAF0 85185(de:acanthamoeba castellanii
castellaniimyo sin-ia (mia) gene, complete cds.)
< td/>(nt:myosin-i)
6111393_c2_12916847672223309 −28KlebsiellaConti g450AGTC ORF with score 309 to:
pneumoniae(ai:7000757585)
(or: Pseudomonas aeruginosa)
31891431_c2_13216848816271166−11KlebsiellaCont ig506AGTC ORF with score 104 to:
pneumoniae(ai:7500984984)
(or: Dictyostelium discoideum)
(sr:dictyostelium discoideum (str:ax2)
dna) (de:dictyostelium discoideum
< td/>gene for trfa, complete cds.)
32604540_c2_1332781 68491200399157−9Contig496AG TC ORF with score 209 to:
pneu moniae(ai:7000807650)
< td/>(or:Pseudomona s aeruginosa)
26344162_c2_1351685017375781854−191Escherichia coliP33940(de:hypothetical 60.2 kd protein
in eco-alkb intergenic region)
32661416_c2_13628016851435144119−8 KlebsiellaContig554A GTC ORF with score 112 to:
pneumoniae(ai:6000692284)
(or: Physarum polycephalum)
(sr:slime mold) (de:physarum
< td/>polycephalum dna topoisomerase i
(top 1) mrna, complete cds.)
26694468_c2_1372811 68521785594159−11 KlebsiellaContig347A GTC ORF with score 349 to:
pneumoniae(ai:7000795297)
(or: Pseudomonas aeruginosa)
29926906_c2_14216853411136105−6Enterobacter cloacaeCONTIG449GTC ORF with score 105 to:
(ai:7000757598)
(or:Pseudomonas aeruginosa)
23690807_c3_14416854870289166−9mice|C57BL/6xCBA/AF062655(sr:house mouse) (de:mus musculus
CaJ hybridplenty-of-prolines-101 mrna,
< td/>complete cds.) (nt:binds to several
sh3 domain containing proteins)
135916_c3_14528416855780259518−50Escherichia coliP52109(ec:1.—.—.—) (de:(ec 1.—.—.—))
573282_c3_1462851685638712891−3 SchizosaccharomyceZ95620 (sr:fission yeast) (de:s. pombe
< td/>s pombechromosome ii cosmid c3d6.)
(nt:spbc3d6.14c, unknown;
partial; serine rich,)
16695455_c3_147286 16857390129370−34 Escherichia coliP80449(ec:5.—.—.—) (de:(dihydroneopterin
triphospha te 2′-epimerase)
13144683_c3_1492871685843214396< /td>−2DrosophilaQ0 5319(sr:, fruit fly) (ec:3.4.21.—)
< td/>melanogaster(de:serine proteinase
< td/>stubble, (stubble-stubbloid protein))
32597917_c3_150288837278149−7human herpesvirusU92288(fn:helicase, helicase-primase
type 6 HHV-6complex) (de:human herpesvirus
6 serotype b putative major
< td/>immediate-earlygenes.) (nt:similar to
hhv6a u86, region ie-b)
15913330_c3_1532891 68601365454475−45 KlebsiellaContig450A GTC ORF with score 475 to:
pneu moniae(ai:70000757609)
(or:Pseudomon as aeruginosa)
1417537_c3_15516861417138179 −14KlebsiellaConti g496AGTC ORF with score 209 to:
pneumoniae(ai:7000807650)
(or: Pseudomonas aeruginosa)
10276086_c3_15816862993330209−17KlebsiellaCont ig545AGTC ORF with score 598 to:
pneumoniae(ai:7000845728)
(or: Enterobacter cloacae)
22146033_c3_159 2921686344714896−2LeishmaniaU78522(de:leishmania donovani histidine
(Leishmania)secretory acid phosphatase (sacp-1)
donovanigene, complete cds.)
12987962_c3_1612931 6864780259177−12Contig5048 GTC ORF with score 177 to:
fumigatus(ai:7000757617)
(or:< HIL>Pseudomonas aeruginosa)
32660331_c3_16716865492163230−19KlebsiellaCont ig560AGTC ORF with score 106 to:
pneumoniae(ai:1500692508)
(or: Borcogadus saida)
(de:boreogadus saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of
polyprotein at conserved spacers r or)
30364786_c3_169295168 66426141105−4 AlphaherpesvirusP11675(s r:indiana-funkhauser/beckcr, prv)
pseudorabies virus(de:immediate-early protein ie180)
P RV
10651012_c3_1702961686 71041347105−2 AcanthamoebaP10569(sr:,a moeba) (de:myosin ic heavy
castellaniichain)
16272582_f1_329716868510169112−4Klebsiella Contig465AGTC ORF with score 123 to:
pneumoniae(ai:7000702062)
(de: pseudomonas fluorescens alkaline
protease, protease inhibitor,
< td/>zinc-protease transporter (aprd),
zinc-protease transporter (apre),
and zinc-protease transporter (aprf)
genes complete cds.)
1448462_f1_13298168 69729242286−25Escherichia coliP77216(de:hypothetical 47.8 kd protein in
csta-dsbg intergenic region)
15752316_f1_16299 16870672223156−9CONTIG450GTC ORF with score 1547 to:
(ai:7000795773)
(or:Pseudomonas aeruginosa)
24112581_f1_1716871444147124 −7Enterobacter cloacaeCONTIG412GTC ORF with score 304 to:
(ai:7000758236)
(or:Pseudomonas aeruginosa)
22913252_f1_18168722337778133−6KlebsiellaConti g536AGTC ORF with score 662 to:
pneumoniae(ai:7000758235)
(or: Pseudomonas aeruginosa)
25502061_f1_2816873402133
21535400_f1_31303168741959163−11Aspergillus Contig7727GTC ORF with score 163 to:
fumigatus(ai:7000757657)
(or:< HIL>Pseudomonas aeruginosa)
31379825_f1_35168751380459872−87Ralstonia eutrophaX99639(fn:transport of
4-methylmuconolactone)
(de:ralstonia eutropha mmlh,
< td/>mmli & mmlj genes.) (nt:putative)
31847283_f1_36305168761254417951− 95Pseudomonas putidaAF029714(de:pseudo monas putida repressor
(phan), regulatory protein (pham),
enoyl-coa hydratase i (phaa),
enoyl-coa hydratase ii (phab),3-
hydroxyacyl-coa dehydrogenase
(phac), ketothiolase (phad),
phenylacetyl-coa ligase
(phae), ring-oxidation
5917638_f2_41306168772250749113− 3Euprymna scolopesJN0867
31760430_ f2_4730716878843280 356−32Escherichia coliP77216(de:hypothetical 47.8 kd protein in
csta-dsbg intergenic region)
4401568_f2_493081 6879645214534−51P77174(de:hypothetical 23.9 kd protein in
csta-dsbg intergenic region)
16901087_f2_51309 16880759252183−14 AzospirillumX70360(d e:a. brasilense carr gene.) (nt:orf2)
brasilense
11183441_f2_ 5831016881606201105−3Canis familiarisA45195(cl:guan ylate cyclase catalytic
domain homology) (sr:, dog)
33397215_f2_59311168 821008335187−11A45748(cl:unassigned collagens)
< td>CaJ hybrid(sr:, house mouse)
4380068_f2_6131216 88311103691709−176Ralstonia eutrophaP14940(ec:1.1.1.1) (de:alcohol
dehydrogenase,)
6511542_f2_62 313168847772581 57−10mice|C57BL/6xCBA/A24264(cl:proline-rich protein)
CaJ hybrid(sr:, housc mouse)
35364182_f2_633141 6885402133117−6AF055904(de:myxoc occus xanthus
acetylornithine deacetylase (arge)
gene, complete cds; and unknown
gene.) (nt:orf2; no developmental
phenotype)
31378965_f2_64315168861014337
32682080_f2_67316168871218 405
12994405_f2_6931716888435144155−10 Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
22676592_f2_713181688 9534177140−10 KlebsiellaContig349AGTC ORF with score 149 to:
pneumoniae(ai:7000836613)
(or: Enterobacter cloacae)
3380216_f2_7231 9168901026341318−28PyrococcusAP000004 (sr:pyrococcus horikoshii (str:ot3)
horikoshiidna) (de:pyrococcus horikoshii ot3
genomic dna, 777001-994000
nt. position(4/7).) (nt:similar
to swiss p:p42967 percent ident:)
26681528_f3_76320 16891140146698−4Contig409AG TC ORF with score 98 to:
pneumoniae(ai:7000757702)
(or: Pseudomonas aeruginosa)
30725905_f3_8616892639212103 −3Homo sapiensA47234(cl:unassigned homeobox
proteins:homeobox homology)
(sr:, man)
33495655_f3_91322168 93774257100−5 longfin squidS56117(sr:, longfin squid)
4303193_f3_9532316 894915304442−42AF087482(de :pseudomonas aeruginosa clcc and
aeruginosaohbh genes, lys-r type
regulatoryprotein (clcr),
chlorocatechol-1,2-dioxygenase
< td/>(clca), chloromuconate
cycloisomerase (clcb), dienelactone
hydrolase (clcd), maleylacetate
reductase (clce). transposas . . .
29584653_f3_9632416895< /td>576191371−34AcinetobacterCONTIG195GTC ORF with score 371 to:
baumanniiC(ai:7000757722)
(or:Pseudomonas aeruginosa)
10647667_f3_10016896411136164−12KlebsiellaCont ig117AGTC ORF with score 164 to:
pneumoniae(ai:7000757726)
(or: Pseudomonas aeruginosa)
32552326_f3_1011689793331098 −3KlebsiellaContig 117AGTC ORF with score 164 to:
pneumoniae(ai:7000757726)
(or: Pseudomonas aeruginosa)
12975755_f3_10216898546181154−9CaenorhabditisU 80846(sr:caenorhabditis elegans
elegansstrain=bris tol n2) (de:caenorhabditis
elegans cosmid k06a9.)
(nt:partial cds; coded for by c.
elegans cdna yk50c7.5)
502158_f3_108328168992085694109−5longfin squidS56117(sr:, longfin squid)
16583301_f3_109329 16900909302108−3U59446(sr:rape) (de:brassica napus
< td/>myrosinase-binding protein related
protein mrna, partial cds.)
< td/>(nt:divergently related to myrosinase
< td/>binding protein,)
14742256_f3_110330759252586−57Bordetella pertussisAF006000(de:bor detella pertussis
d-3-phosphoglycerate dehydrogenase
homolog(scra) and brg1 (brg1) genes,
complete cds.) (nt:orf7; similar to b.
< td/>subtilis yesf)
12552141_c1_116331 1690231411046137−5African clawed frogU85970(sr:african clawed frog) (de:xenopus
laevis middle molecular weight
neurofilament proteinnf-m(2) mrna,
< td/>complete cds.) (nt:neuronal
intermediate filament protein;
duplicated)
30682817_c1_117 33216903936311702−69BacillusP4296 6(de:hypothetical 28.1 kd protein in
subtilis/Bacillussipu-pbpc intergenic region)
globigii
4033408_c1_11933316904142247313 5−5Gallus gallusA90458(cl:collagen alpha 1(i) chain:fibrillar
< td/>domesticuscollagen carboxyl-terminal
homology:von willebrand factor
type c repeat homology)
(sr:, chicken)
3400782_c1_12233416905414137117−7 StaphylococcusCONTIG080
epidermidisC(ai:7000771238)< /td>
(or:Pseudomonas aeruginosa)
36429131 _c1_12633516906792263190−13Homo sapiensX98705(sr:human) (de:h. sapiens dna
sequence of collal gene fused with
intron 1 of pdgfbgene.)
4315927_c1_127336 169071350449103−5< /td>KlebsiellaContig532A GTC ORF with score 110 to:
pneu moniae(ai:7000806016)
< td/>or:Pseudomonas aeruginosa)
31723766_c1_130169081677558654−64Rhodococcus sp.P46371(sr:ni86/21,) (de:hypothetical 53.0
kd gmc-type oxidoreductase in thca
5′ region (orf2))
26738456_c1_131338169091470489829−83BacillusP71016(ec: 1.2.1.8) (de:betaine aldehyde
subtilis/Bacillusdehydrogenase, (badh))
globigii
32711537_c2_148< /td>3391691065421711 4−3Acanthamoeba(sr:, amoeba) (de:myosin heavy chain
castellaniiib (myosin heavy chain il))
25807681 _c2_14934016911549182198−16Enterobacter cloacaeCONTIG484GTC ORF with score 248 to:
(ai:7501736005)
(or:Klebsiella pneumoniae)
13150250_c2_154169121263420
342169131113370978−98Escherich ia coliM10315(sr:e. coli (strain b/r) dna, clone
< td/>pcs68) (de:e. coli (strain b) ada
gene coding for ada polyprotein,
regulatoryprotein of adaptive
response.) (nt:ada polyprotein)
16275655_c2_16134316914933310139−7 KlebsiellaContig557AGTC ORF with score 331 to:
pneumoniae(ai:7000776648)
(or: Pseudomonas aeruginosa)
31720655_c2_16316915795264
34516916969 32244141HaemophilusU20964(de:haemophilus influenzae dna
influenzaetopoisomeras e i (topa) gene, complete
cds, putative pyridine nucleotide
< td/>transhydrogenase beta subunit(pntb)
gene, partial cds, orf2 and orf3 genes,
complete cds andputative
threonyl-trna synthetase (thrs) ge . . .
20973781 _c2_169346169172137093−5Enterobacter cloacaeCONTIG450GTC ORF with score 146 to:
(ai:7000808962)
(or:Pseudomonas aeruginosa)
35447962_c2_17916918594197230−19Escherichia coliL43373(sr:escherichi a coli (strain 31a/o6)
dna) (de:escherichia coli
(clone 20kpil) pilin 20k
gene complete cds.)
16932077_c2_1803481 6919834277398−37L77091(fn:stabilization of major subunit
proteins) (sr:escherichia coli
(individual_isoIate natural isolate,
strai) (de:escherichia coli f17d
fimbrial gene clustcr encoding the
majorfimbrial subunit protein
(f17d-a), the chaperone protein
f17d . . .
22785066_c2_18234916920 2943981186−10 Micrococcus luteusJQ0405
5156563_c3_ 189350169211074357−22Bacillus C69635
< td>subtilis/Bacillus
glo bigii
16103956_c3_190351 169222016671
108 08456_c3_192352169237172 3893−2Schizosaccharomyc eZ95620(sr:fission yeast) (de:s. pombe
< td/>s pombechromosome ii cosmid c3d6.)
(nt:spbc3d6.14c, unknown;
partial; serine rich,)
2393831_c3_1933531 69241719572121−4Contig469AG TC ORF with score 476 to:
pneu moniae(ai:7000841954)
< td/>(or:Enterobact er cloacae)
12630068_c3_201 354169251482493126longfin sguidS56117(sr: longfin squid
16307276_c3_2023551 6926168956294−1D90890(pn:succinate-semialde hyde
dehydrogenase (nadp+) (ec)
(sr:eschcrichia coli (strain:k12)
dna, clone_lib:kohara lambda
minise) (de:e. coli genomic dna,
kohara clone #443(59.8-60.2 min.).)
(nt:similar to swissprot
accession number p25526):)
35258530_c3_2033561014337190−15< /td>KlebsiellaContig554A GTC ORF with score 190 to:
pneu moniae(ai:7000757829)
< td/>(or:Pseudomona s aeruginosa)
13001666_c3_20616928678225130−7KlebsiellaConti g554AGTC ORF with score 130 to:
pneu moniae(ai:7000757832)
< td/>(or:Pseudomona s aeruginosa)
13791555_c3_2111692926528831127−114Escherichia coliP75857(de:region precursor)
5183453_f1_135916930879292169−11Enterobacter cloacaeCONTIG480GTC ORF with score 707 to:
(ai:7501752834)
(or:Klebsiella pneumoniae)
13027080_f1_236016931501166824PseudomonasP2456 0(de:hypothctical 17.0 kd protein in
aeruginosapilt 5′ region (orf1))
24713125_f1_33611 6932372123437−41P24561(de: hypothetical 8.9 kd protein in
aeruginosapilt 5′ region (orf2))
11996075_fI_43621 693310473481728−178PseudomonasP24559( de:twitching mobility protein)
aeruginosa
24817828_f1_5 363169341164387 1937−200PseudomonasS54702
aeruginosa
3367078_f 1_9364169351437478−71Mycobacterium AL021897(de:mycobacterium tuberculosis
tuberculosish 37rv complete genome; segment
48/162.) (nt:rv1077, (mtv017.30),
cysm, len: 464. cysm2,)
14958408_f1_10365 16936429142138−9O10341(sr:, opmnpv) (de:hypothetical 29.3 kd protein
pseudotsugata(orf92))
multinucleocap sid
nucl ear polyhedrosis
virus OpMNPV
11744805_f1_173661 6937738245100−2A31772(cl:streptomyces proteinase
< td/>enzymogenesa:tr ypsin homology) (ec:3.4.21.12)
5960761_f1_26367169381347448103− 2CaenorhabditisU23520(sr:caenorhabditis elegans
elegansstrain=bristol n2)
(de:caenorhabditis elegans
cosmid c35b8.) (nt:similar to
cuticular collagen)
2991706_f1_2836816939561186123−5 Nephila clavipesAF027972(de:neph ila clavipes flagelliform
silk protein (flag) mrna, partialcds.)
22439766_f1_29369169401470489159−9 CaenorhabditisZ81503(de:caenorhabditis elegans cosmid
elegansf14f7, complete sequence.)
< td/>(nt:predicted using
< td/>genefinder similar to collagen;)
35369716_f1_313702019672213−13< /td>Nephila clavipesAF027735(de:neph ila clavipes minor ampullate
silk protein mispl mrna, partialcds.)
32047627_f1_3237116942582193135−7< /td>Epstein-Barr virusS27923
16660458_f1_3337216943561186167blue musselAF029249(sr:blue mussel) (de:mytilus edulis
precollagen d (precol-d) mrna,
< td/>complete cds.)
12321041_f1_3437316 944444147107−6longfin squidS56117(sr:, longfin squid)
16484455_f1_373741 6945915304114−4P50809(de: regulatory protein e2)
papillomavirus type
36
14563156_f1_38375618205259−22PseudomonasU79580( de:pseudomonas aeruginosa pilk
aeruginosagene, partial cds; and pill, chpa,
< td/>chpb, chpc, chpd, and chpe genes,
complete cds.) (nt:similar to
xyls/arac protein family)
12161030_f1_39376 169476422131050−106PseudomonasU79580( de:pseudomonas aeruginosa pilk
aeruginosagene, partial cds; and pill, chpa,
< td/>chpb, chpc, chpd, and chpe
genes, complete cds.) (nt:similar to
bacillus subtilis ycgf and
unknown orf)
33614582_f1_40377169 48522173103−2 StrongylocentrotusS23809 (cl:collagen alpha 2(i) chain:fibrillar
< td/>purpuratuscollagen carboxyl-terminal
homology) (sr:, purple urchin)
15886531_f1_42378 16949564187145−9AF085185( de:acanthamoeba castellanii
castellaniimyo sin-ia (mia) gene, complete cds.)
< td/>(nt:myosin-i)
14145658_f1_4416950537178268 −23KlebsiellaConti g526AGTC ORF with score 825 to:
pneumoniae(ai:7000844364)
(or: Enterobacter cloacae)
17064555_f1_453 8016951426141125−7CaenorhabditisAF022 985(sr:caenorhabditis elegans
elegansstrain=bris tol n2) (de:caenorhabditis
elegans cosmid t15b7.)
(nt:similar to collagen)
12679763_f1_463811617538247−20MycobacteriumAL123456 (de:mycobacterium tuberculosis
tuberculosish 37rv complete genome; segment
137/162.) (nt:rv3170, (mtv014.14),
len: 448. probable)
2541631_f1_4738216953675224636−62Escherichia coliF64848
30213580_f1_5 53831695415845272415−251PseudomonasAB011381(sr:pseudomonas aeruginosa
< td/>aeruginosa(str: tnp058) dna) (de:pseudomonas
aeruginosa gene for oprm,
< td/>complete cds.)
30191575_f1_5838416 9551446481149−6AF083334(sr:chinese oak silkmoth)
silkmoth(de:antheraea pernyi
fibroin gene, complete cds.)
10633293_f2_6538516 9561164387638−62M55524(sr: pseudomonas aeruginosa (strain
aeruginosapaol) (library: banhi genomi)
(de:pseudomonas aeruginosa
< td/>twitching motility protein (pilt)
gene, andorf's 1,2,4,5,6 and 7,
complete cds.) nt:orf4: putative)
22784416_f2_69386774257178−14Enterobacter cloacaeCONTIG361GTC ORF with score 178 to:
(ai:7000757907)
(or:Pseudomonas aeruginosa)
21619756_f2_7016958453150
12995955_f2_7138816959405154−11KlebsiellaContig332AGTC ORF with score 263 to:
pneu moniae(ai:7000822288)
< td/>(or:Enterobact er cloacae)
14661633_f2_753 891696029196245 −20CaenorhabditisZ8310 6(de:caenorhabditis elegans cosmid
elegansf22b8, complete sequence.)
< td/>(nt:predicted using genefinder;
similar to cystathione)
7042918_f2_76390 16961417138119−6Canis familiarisA45195(cl:guan ylate cyclase catalytic
domain homology) (sr:, dog)
35438883_f2_80391169 62561186167−13KlebsiellaContig526AGTC ORF with score 237 to:
pneumoniae(ai:7000844516)
(or: Enterobacter cloacae)
33255461_f2_863 9216963471156681−67PseudomonasP46384< /td>(de:pilg protein)
aeruginosa
31847526_f2_8 739316964927308 938−94Pseudomonas P43502(de:pili protein)
aeruginosa
31446011_f2_8 83941696520916963275−9999Pseudomonas< /td>P42257(de:pilj protein)
aeruginosa
13152083_f2_8 939516966912303 1346−137PseudomonasS61498(cl:protein-glutamate
aeruginosao-methyltransferase homology)
33616093_f2_90396744924828157−99 99PseudomonasU79580 (de:pseudomonas aeruginosa pilk
aeruginosagene, partial cds; and pill, chpa,
< td/>chpb, chpc, chpd, and chpe genes,
complete cds.) (nt:cheay homolog)
32605191_f2_9339716968708235823−82PseudomonasU79580(f n:unknown) (de:pseudomonas
aeruginosaaeruginosa pilk gene, partial
cds; and pill, chpa, chpb, chpc,
< td/>chpd, and chpe genes, complete cds.)
12603875_f2_1023981 69691995664289−25 Enterobacter cloacaeCONTIG439GTC ORF with score 727 to:
(ai:7501761442)
(or:Klebsiella pneumoniae)
32203875_f2_1071697046515493 −3mice|C57BL/6xCBA/Q06666(s r:, mouse) (de:octapeptide-repeat
CaJ hybridprotein t2)
26377143_f2_114400169 711011336120−4Escherichia coliS18537
3379541_f2_11 540116972486161 133−9Enterobacter cloacaeCONTIG410GTC ORF with score 133 to:
(ai:7000757953)
(or:Pseudomonas aeruginosa)
35829541_f2_11616973633210224−19Enterobacter cloacaeCONTIG410GTC ORF with score 224 to:
(ai:7000757954)
(or:Pseudomonas aeruginosa)
12632067_f2_12116974462153196−16KlebsiellaCont ig525AGTC ORF with score 275 to:
pneu moniae(ai:7000775494)
< td/>(or:Pseudomona s aeruginosa)
6036628_f2_12416975972323198 −15Enterobacter cloacaeCONTIG481GTC ORF with score 520 to:
(ai:7501762729)
(or:Klebsiella pneumoniae)
9859718_f3_1301697620016662910−9999Pseudomonas P24563(de:hypothetical 57.4 kd protein in
aeruginosapilt region (orf4))
31273328_f3_13240616977684227250−20PseudomonasP24563(d e:hypothetical 57.4 kd protein in
aeruginosapilt region (orf4))
15728966_f3_13940716978732243151−11KlebsiellaContig332AGTC ORF with score 151 to:
pneu moniae(ai:7000757977)
< td/>(or:Pseudomona s aeruginosa)
12277036_f3_1401697913924631112−113StenotrophomonasAF031709(de:stenotrophomonas maltophilia
maltophiliacys tathionine gamma-lyase-like
protein(cys1) gene, complete cds.)
32683318_f3_1524091 69801326441550−53 KlebsiellaContig526A GTC ORF with score 825 to:
pneu moniae(ai:7000844364)
< td/>(or:Enterobact er cloacae)
31256290_f3_153 41016981390129611PseudomonasP43501 (de:pilh protein)
aeruginosa
31289537_f3_1 654111698251317095−4AspergillusGTC ORF with score 219 to:
fumigatus(ai:175260)
(or: Volvox carteri)
35797680_f3_174 4121698311553841806 −186PseudomonasU79 580(de:pseudomonas aeruginosa pilk
aeruginosagene, partial cds; and pill, chpa,
< td/>chpb, chpc, chpd, and chpe genes,
complete cds.) (nt:cheb homolog)
11042255_f3_17641316565511067−108 PseudomonasU79580
aeruginosagene, partial cds; and pill, chpa,
< td/>chpb, chpc, chpd, and chpe genes,
complete cds.) (nt:similar to xyls/arac
protein family)
34636380_f3_180414169851089362121−5Enterobacter cloacaeCONTIG508GTC ORF with score 460 to:
(ai:7501774708)
(or:Klebsiella pneumoniae)
35707307_f3_185169861254417
41616987603200770−76Escherich ia coliP37904(de:18.7 kd protein in htrb-dini
intergenic region precursor)
10550950_f3_191417 1698813624531890−1 95PseudomonasP52477 (de:multidrug resistance protein mexa
aeruginosaprecursor)
1074055_f3_192418169893156 10515248−9999P seudomonasP52002(de:multidrug resistance protein mexb
aeruginosa(multidrug-efflux transporter mexb))
12988966_f3_194419 169901038345181−10Homo sapiensZ74615(sr:human) (de:h. sapiens mrna for
prepro-alpha1(i) collagen.)
261393_f3_1954201473490284−23KlebsiellaContig525A
pneumoniae(ai:7000829218)
(or: Enterobacter cloacae)
33863875_c1_208 421169921488495686Escherichia coliP25888(de:putative atp-dependent ma
helicase rhle)
25910937_c1_2104221 69931557518282−24 KlebsiellaContig558A GTC ORF with score 282 to:
pneumoniae(ai:7000758048)
(or: Pseudomonas aeruginosa)
3395753_c1_219169941569522
42416995489 162134−8Caenorhabdi tisP34391(de:putative cuticle collagen f09g8.6)
elegans
24083256_c1_232< /td>4251699655218315 1−9Homo sapiensAF048977(fn:splicing factor) (sr:human)
< td/>(de:homo sapiens ser/arg-related
< td/>nuclear matrix protein (srm 160)
mrna, complete cds.) (nt:160 kda)
11212705_c1_23542616 997531176119−8AcinetobacterCONTIG220G TC ORF with score 119 to:
baumanniiC(ai:7000758073)
(or:Pseudomonas aeruginosa)
5354791_c1_23616998984327161 −8Herpes simplex virusAF015297(de:human herpesvirus 6 (strain
(type 6/strainuganda-1102) ie2hom mrna, complete
Uganda-1102)cds.) (nt:similar to the
immediate-early 2 protein of human)
34064092_c1_237428 16999414137116−6P28284(sr:type 2/hg52,) (de:trans-acting
matchtranscriptional protein icp0
(vmw 118 protein))
33676092_c1_238429537178143−8SaccharomycesX89715(sr:baker's yeast) (de:s. cerevisiae
< td/>cerevisiaeaob56 7, aof1001, aoe110, aoe264
and aoe130 genes.)
10283415_c1_24043017001837278151−8 DictyosteliumP14328( sr:, slime mold) (de:spore coat
discoideumprotein sp96)
7072716_c1_24143117 002981326282−23X89715(sr :baker's yeast) (de:s. cerevisiae
< td/>cerevisiaeaob56 7, aof1001, aoe110, aoe264
and aoe130 genes.)
1307341_c1_247432 17003705234127−8AP000002(sr :pyrococcus horikoshii (str:ot3) dna)
horikoshii(de:pyrococcu s horikoshii ot3 genomic dna,
287001-544000 nt. position(2/7).)
< td/>(nt:motif=prokaryotic membrane lipoprotein
lipid)
21563886_c1_256170041524507145−7Haloferax sp.P21561(sr:aa 2.2,) (de:hypothetical 50.6 kd
protein in the 5′ region of gyra and
gyrb (orf 3)
36120791_c1_2574341700 5618205399−37 Escherichia coliP52049(de:hypotheiical 20.7 kd protein in
gshb-ansb intergenic region (o211))
21978293_c1_259435170062181726754−75PseudomonasL19649( fn:unknown) (sr:pseudomonas
aeruginosaaeruginosa (strain pao) dna)
(de:pseudomonas aeruginosa
< td/>aspartate transcarbamoylase (pyrb)
and dihydroorotase-like (pyrx) genes,
complete cds's.) (nt:dihydroorotase-
like
1305416_c1_26143617007519172104 −3Brassica napusU59446(sr:rape) (de:brassica napus
< td/>myrosinase-binding protein related
protein mrna, partial cds.)
< td/>(nt:divergently related to myrosinase
< td/>binding protein:)
32511656_c1_2644371413470589−57< /td>KlebsiellaContig332A GTC ORF with score 589 to:
pneumoniae(ai:7000758102)
(or: Pseudomonas aeruginosa)
12632162_c1_265170091068355192−15Enterobacter cloacaeCONTIG508GTC ORF with score 257 to:
(ai:7000758105)
(or:Pseudomonas aeruginosa)
20051938_c1_26717010768255257−22Enterobacter cloacaeCONTIG508GTC ORF with score 257 to:
(ai:7000758105)
(or:Pseudomonas aeruginosa)
31773288_c1_2691701114194721321−135Pseudomonas JQ0418(ec:1.5.1.2)
aeruginosa
31739416_c2_27544117012357< /td>118100−4Beta vulgarisS51939(sr:, beet) (ec:3.2.1.14)
14708291_c2_276442170131278425567∠’55Escherichia coliP25888(de:putative atp-dependent rna
helicase rhle)
33636705_c2_2804431 7014286295399−1U70657(sr:southeastern asian house mouse)
h ouse mouse(de:mus musculus castaneus sex
determining protein (sry) gene,
< td/>completccds.) (nt:hmg box
transcription factor)
15644805_c2_282444170151251416212−17KlebsiellaContig425A
pneumoniae(ai:7000819900)
(or: Enterobacter cloacae)
33985693_c2_284 44517016669222185Enterobacter cloacaeCONTIG358GTC ORF with score 185 to:
(ai:7000758122)
(or:Pseudomonas aeruginosa)
36033512_c2_292170171077358537−52Escherichia coliA65080
31722635_c2_2 93447170181101366−12Epstein-Barr virusM17294(sr:human herpesvirus 4 (clone: h6)
dna) (de:epstein-barr virus bamhi-h
(bhlfl) region encoding an orf, partialcds.)
(nt:orf; putative)
32438537_c2_294448417138137−8Canis familiarisS33121(cl:homeotic protein cdp:cut repeat
homology:homeobox homology)
(sr:, dog)
33864656_c2_29544917 0202949791−4< HIL>AspergillusContig5344GTC ORF with score 103 to:
fumigatus(ai:120246) (or:Mycoplasma
pneumoniae) (de:mycoplasma
pneumoniae section 38 of 63 of the
complete genome.) (nt:similar to
genbank accession number
138997 6, from)
16932306_c2_3004501 7021537178119−5X80272(de:p. putida pprb gene.)
16526583_c2_307451 1702225283113−6AF000298 (sr:caenorhabditis elegans strain=
elegansbristol n2) (de:caenorhabditis elegans
cosmid w03d2.) (nt:weak similarity
< td/>to collagens; glycine- and)
12626080_c2_31045217 023489162190−14S22697
36033262_c 2_31145317024465154 132−8Homo sapiensU89278(fn:interacts with the vertebrate
< td/>polycomb-group) (sr:human)
< td/>(de:human polyhomeotic 2 homology
(hph2) mrna, complete cds.)
36072933_c2_3154541 7025363120105−4Z78418(d e:caenorhabditis elegans cosmid
elegansf25d7, complete sequence.)
< td/>(nt:similar to claustrin
like: cdna cst ccmsh64f comes)
10001942_c2_319455 17026555184104−3S65954(sr:, man)
32282161_c2_32145617 02711073681048−106Escherichia coliP04425(ec:6.3.2.3) (de:synthetase) (gsh-s)
(gshase))
4877332_c2_326 4571702810953641666 −171PseudomonasQ59 653(ec:2.1.3.2) (de:transcarbamylase)
aeruginosa(atc ase))
9901031_c2_32745817 0298912961354−138 PseudomonasL19649(fn :unknown) (sr:pseudomonas
aeruginosaaeruginosa (strain pao) dna)
(de:pseudomonas aeruginosa
< td/>aspartate transcarbamoylase (pyrb)
and dihydroorotase-like (pyrx) genes,
complete cds's.)
(nt:dihydroorotase-like
15838330_ c2_329459170301434477499−48Klebsiella Contig332AGTC ORF with score 499 to:
pneumoniae(ai:7000758167)
(or: Pseudomonas aeruginosa)
35681966_c2_33417031309102173−13PseudomonasP24 564(de:hypothetical 19.5 kd protein in
aeruginosapilt region (orf6))
16225152_c2_33646117032654217896−90PseudomonasP24564(d e:hypothetical 19.5 kd protein in
aeruginosapilt region orf6))
30208402_c2_338462 1703310593521157−117PseudomonasP24562 (de:hypothctical 24.5 kd protein in
aeruginosapilt 5′ region orf5))
10641330_c2_339463 1703423176112−6P20186(de:hypothetical 35.5 kd protein in
transposon tn4556)
5350961_c3_341464 17035603200154−10 Aquifex acolicusA70373
14539205_ c3_34246517036900299602−58Aquifex acolicusD70424
32130160_ c3_343466170371140379
6520431_c3_34846717038 636211174−13< i>Drosophila erectaP13730(sr:, fruit fly) (de:salivary glue
protein sgs-3 precursor
31377258_c3_349468483160150−9Gallus gallusM25984(sr:gallus gallus dna) (de:chicken alpha-2
domesticuscollagen gene type i gene, exon 52.)
13958412_c3_35146917 040483160333−30Contig545AGT C ORF with score 333 to:
pneumoniae(ai:7000758189)
(or: Pseudomonas aeruginosa)
21976701_c3_35217041636211295−26Enterobacter cloacaeCONTIG438GTC ORF with score 295 to:
(ai:7000758190)
(or:Pseudomonas aeruginosa)
7129027_c3_35317042426141531 −51Enterobacter cloacaeCONTIG314GTC ORF with score 531 to:
(ai:7000758191)
(or:Pseudomonas aeruginosa)
14300901_c3_3561704347715897 −3KlebsiellaContig 171AGTC ORF with score 726 to:
pneumoniae(ai:7000824242)
(or: Enterobacter cloacae)
12926582_c3_357 4731704424681109−7KlebsiellaContig262 AGTC ORF with score 159 to:
pneumoniae(ai:7000804762)
(or: Pseudomonas aeruginosa)
16879378_c3_35917045462153757−75PseudomonasP52 003(de:multidrug resistance operon repressor)
< td>aeruginosa
14714657_c3 _36247517046435144−4PleuronectesU39735(de:pleuronectes americanus sperm
< td/>americanuschromatin protein hmrbnp-1 mrna,
< td/>partial cds.) (nt:hmrbnp-1;
crosslinks nucleosomes in
sperm)
5181258_c3_363476 1704714344771268−1 29Bacillus sphaericusP22805(cc:2.6.1.62) (de:aminotransferase))
12994457_c3_364 47717048447148117−7Homo sapiensS53363(sr:, man) (mp:11p15.5-11p15.5)
30730016_c3_365170491491496
47917050492 163110−4Dictyosteli umP14328(sr:, slime mold) (de:spore coat protein sp96)
di scoideum
34480056_c3_372480948315115−4Enterobacter cloacaeCONTIG456GTC ORF with score 239 to:
(ai:7000801274)
(or:Pseudomonas aeruginosa)
32595161_c3_3731705274192472100−1DrosophilaU65 431(sr:fruit fly) (de:drosophila
melanogaster melanogaster collagen type iv alpha
< td/>2 (dmcola2) mrna, complete cds.)
< td/>(nt:d.m. collagen typc iv
alpha 2)
6379806_c3_37448217053 2208735155−8< HIL>Enterobacter cloacaeCONTIG490GTC ORF with score 300 to:
(ai:7000797020)
(or:Pseudomonas aeruginosa)
12984382_c3_378170542001666367−33Escherichia coliU28377(de:escherichi a coli k-12 genome;
approximately 65 to 68 minutes.)
(nt:orf_o180;
< td/>was also orf_o62p before splice)
16540906_c3_37948417055624207316−28ThermusY09536(de:t. aquaticus pyrr, pyrb, bbc and pyrc genes.)
thermophilus/T. aqua
ticus/T. flavus
14722958_c3_38048 51705657018994⠈’2Gallus gallusS53710(cl:unassigned ribonucleoprotein
domesticusrepeat-c ontaining proteins:
ribonucleoprotein repeat
homology) (sr:, chicken)
17072781_c3_381486564187131−7KlebsiellaContig332AGTC ORF with score 499 to:
pneu moniae(ai:7000758167)
< td/>(or:Pseudomona s aeruginosa)
1406412_c3_386170581563520970−97Ralstonia eutrophaP13512(de:cation efflux system protein czcd)
10275958_c3_3904881 7059633210141−7JQ0405
11914682_f1 _548917060438145299−26HaemophilusP44886(de:hypothetical protein hi0827 precursor)
< td>influenzae
32245207_f1 _6490170611758585−65KlebsiellaContig536AGTC ORF with score 662 to:
pneu moniae(ai:7000758235)
< td/>(or:Pseudomona s aeruginosa)
32527036_f1_7491170622313770159 −10AzospirillumX70 360(de:a. brasilense carr gene.)
< HIL>brasilense
10260206_f1_949217063339112108 −5Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
16660216_f1_114931706 42181726282−24Enterobacter cloacaeCONTIG431GTC ORF with score 117 to:
(ai:99504) (or: Mus musculus
domesticus) cl: unassigned
< td/>hmg box proteins: hmg box
homology) (sr:, western european
house mouse) (mp:y)
12891340_f1_144941 706516385451424−146StreptomycesAL031184
coelicolor2a11.) (nt: sc2a11.03c, sdaa,
< td/>probable 1-serine
dehydratase.)
12611068_f1_16495170661116371621< /td>−60Pseudomonas AF012537(de:pseudomonas aeruginosa
< td/>aeruginosaacety l-coa synthetase gene, partial
cds, andarginine and ornithine
binding protein (aotj), membrane
protein (aotq), membrane protein
(aotm), aoto (aoto), atpase (aotp),
and argr (argr) genes,
complete cds.) (. . .
26760443_f1_2049617067< /td>1299432201−12< HIL>Micrococcus luteusJQ0405
13151387_f1 _2149717068420139−10equine herpesvirusD88685(sr:equ ine herpesvirus 1 (strain:hh1)
type 1 EVH-1dna) (de:equine herpesvirus 1
dna for tegument protein, partial cds.)
< td/>(nt:kpn i subfragment of orf24)
24658262_f1_224981 706991530497−2Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
30339792_f1_264991707 01620539
16284705_f1_285001707152817593< /td>−2PseudomonasS 29309
aeruginosa
12598790_f1_3150117072672223159 −10Burkholderia(sr:burkholderia cepacia
< td>cepaciastrain=17616) (de:burkholderia
cepacia d-serine deaminase
(dsd) gene, complete cds.)
< td/>(nt:unidentified orf)
33337705_f1_33502170 732100699
22906291_f1_35< /td>5031707410953642 94−26Pyrococcus(sr:pyrococcus horikoshii (str:ot3)
horikoshiidna) (de:pyrococcus horikoshii ot3
genomic dna, 777001-994000 nt.
position (4.7).) (nt:similar
to:Imrpongen percent ident: 37.019)
22010151_f1_36504 17075657218
34198533_f1_3 7505170762049682626−61ArchacoglobusG69306
fulgidus
32510 41_f1_38506170772100699< /td>632−62Bacillus P45866(de:hypothetical 79.2 kd protein in
subtilis/Bacillusacda 5′ region)
globigii
12125841_f1_3950717078816271183 −12PseudomonasZ54213(de:p. aeruginosa algy gene.)
< HIL>aeruginosa
31429758_f1_41< /td>5081707951617110 9−4Orf virusB34768
20391463_f1_ 455091708022875 218−18BacillusC69931
subtilis/Bacillus
globig ii
14583537_f1_4751017081411136144−10 Bos primigeniusA39762(cl:collagen alpha 1(xiv)
t auruschain:fibronectin type iii repeat
homology:von willebrand factor type
a repeat homology) (sr:, cattle)
31915916_f1_48511 1708229497
33712530_f1_49 512170838942971 14−3no gb taxonomyU52064(de:kaposi's sarcoma-associated
matchherpes-like virus orf73 homolog
gene, complete cds.)
< td/>(nt:herpesvirus saimiri
orf73 homolog)
16900817_f1_5851317084522173149−10Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
36062662_fI _6051417085924307−6StreptomycesZ46913(de:s. ambofaciens gene for
ambofacienshypothetica l polyketide gene.)
(nt:putative)
31505287_f1_6117086513170113−5CaenorhabditisA F000198(sr:caenorhabditis elegans strain=
elegansbristol n2) (de:caenorhabditis
elegans cosmid t28f2.)
(nt:similar to cuticular collagen)
13101592_f2_665161275424445−42KlebsiellaContig451A
pneumoniae(ai:7000758295)
(or: Pseudomonas aeruginosa)
3239706_f2_7151717088444147469Enterobacter cloacaeCONTIG459GTC ORF with score 173 to:
(ai:212910)
(or:Azospirillum brasilense)
(de:a. brasilense carr gene.)
(nt:orf2)
3019812_f2_7451 8170891662553
36 456955_f2_76519170901206 401133−8Pyrococcus< /HIL>AP000006(sr:pyrococcus horikoshii (str:ot3)
horikoshiidna, cl:pyrococcus horikoshi)
< td/>(de:pyrococcus horikoshii ot3
genomic dna, 1166001-1485000
< td/>nt. nosition (6/7).)
15085218_f2_89520 170911182393929−93Escherichia coliP33019(de:hypothetical 36.9 kd protein in lysp-nfo
intergenic region)
30199066_f2_93521 1709231771058561−53PseudomonasAF012537(de:pseudomonas aeruginosa acetyl-coa
< td/>aeruginosasynth etase gene, partial cds; and arginine and
ornithine binding protein (aotj), membrane
protein (aotq), membrane protein (aotm), aoto
(aoto), atpase (aotp), and argr (argr) genes,
complete cds.) (. . .
25522558_f2_9852217093< /td>1434477
7244203_f2_10252317094678225490−47Streptomyces AF072709(de:streptomyces lividans amplifiable
lividanselement aud4: putativetranscriptional
regulator, putative ferredoxin,
putative cytochrome450
oxidoreductase, and putative
oxidoreductase genes, completecds;
and unknown genes.)
(nt:orf1; hypothe . . .
16253843_f2_10552417095 411136114−5SaccharomycesP08640(sr:, baker's yeast) (cc:3.2.1.3)
cerevisiae(de:glucosida se) (1,4-alpha-d-glucan
glucohydrolase))
1195 7292_f2_1065251709633611 1110−5Sus scrofa domesticaI47141(sr:, domestic pig)
10819580_f2_10752617 0971062353335−29S35706
brock ii
22370206_f2_114527170982067688
341196 92_f2_11752817099540179< /td>
11886080_f2_11852917100576191
31659381_f2_121 530171011341446
30267558_f2_122531171021983660105−3Aspergil lusContig8591GTC ORF with score 247 to:
fumigatus(ai:405746)
(or:Mus sp.) (sr:mice macrophage)
(de:putative transcription regulator {
clone t2, repetitive sequence}(mice,
< td/>macrophage, mrna, 1263 nt).)
< td/>(nt:method: conceptual translation
supplied by author.)
16275091_f2_12453283127692−1 DictyosteliumAJ224893(fn:spore differentiation)
discoidcum(de: dictyostelium discoideum
< td/>srfa gene.
3160408_f3_13853317 10416415461221−124Escherichia coliP17447(de:high-affinity choline transport protein)
31535332_f3_1435341695564
15750841_f 3_144535171061146381125−4mice|C57BL/6xCBA/U767 16(sr:house mouse) (de:mus musculus
CaJ hybridvoltage-sensitive calcium channel
alpha 1 a (ccha1a) mrna, complete
cds.) (nt:ion channel)
25488876_f3_1465361518505
2353968_f3 _147537171081056351 143−9KlebsiellaContig523AGTC ORF with score 206 to:
pneumoniae(ai:7000796429)
(or: Pseudomonas aeruginosa)
36069557_f3_152171091623540
5391711072023996−2equine herpesvirusP28968(sr:ab4p,chv-1) (de:glycoprotein x
type 1 EVH-1precursor)
16611466_f3_1 5854017111972323197−16Klebsiella Contig441AGTC ORF with score 536 to:
pneumoniae(ai:7000835893)
(or: Enterobacter cloacae)
36066391_f3_174 54117112196865590KlebsiellaContig34 6AKlebsiella pneumoniae, GTC,
pneumoniaerel 1.0, 9812146
32620916_f1_175542171131002333200−14SchizosaccharomyceP78790(sr:, fission yeast) (de:(alpha-etf))
s pombe
11895750_f3_177543 17114609202
1666 1415_f3_1795441711556418 7120−5CaenorhahditisZ78418(de:caenorhabditis elegans
eleganscosmid f25d7, complete sequence.)
< td/>(nt:similar to claustrin
like; cdna est ccmsh64f comes)
29927206_f3_180545 17116705234133−6X87248(sr:human) (de:h. sapiens mrna for
hp8 protein.)
10808332_f3_182546462153101−3AlphaherpesvirusS04713 (cl:herpesvirus immediate-early
< td/>pseudorabies virusprotein ie175
PR V
22161577_f3_18654717118 1176391333−30 MycobacteriumZ92774(de:< HIL>mycobacterium tuberculosis
tuberculosish 37rv complete genome; segment
150/162.) (nt:rv3571,
(mtcy06g11.18), len: 358.
electron)
33704590_f3_18854 81711966322099⠈’2SaccharomycesP08640(sr:,baker's yeast) (ec:3.2.1.3)
cerevisiae(de:glucosida se) (1,4-alpha-d-glucan
glucohydrolase))
1512 1018_c1_2025491712061520 4166−12MycobacteriumZ95554(de:mycobacterium tuberculosis
tuberculosish 37rv complete genome; segment
72/162.) (nt:rv1624c,
(mtcy01b2.16c), len: 195.
function:)
31808406_c1_2105 5017121540179112−4human herpesvirusU92288(fn:helicase, helicase-primase
type 6 HHV-6complex) (de:human herpesvirus
6 serotype b putative major
< td/>immediate-earlygenes.) (nt:similar to
hhv6a u86, region ic-b)
2853792_c1_21255117 1221908635
1286532_c1_214 552171236692221 05−6EnterococcusGTC ORF with score 107 to:
faccalis(ai:7500720569)
(or:Clostridium acetobutylicum)
6433286_c1_215553171242871956154 −8Aspergillus Contig8948GTC ORF with score 249 to:
fumigatus(ai:7000782680)
(or:< HIL>Pseudomonas aeruginosa)
15052158_c1_2161712517405791735−179Pseudomonas putidaP31048(de:hypothetical 54.3 kd protein in
lpd-3 5′ region (orf2))
10800331_c1_21955517126789262154−8 Myxococcus xanthusAF055904(de:myxoc occus xanthus
acetylornithine deacetylase (arge)
gene, complete cds; and unknown
gene.) (nt:orf2; no developmental
phenotype)
10260406_c1_222< /td>556171271320439
10272555_c1_2275571712838 4127111−5Chlam ydomonasS50755
reinhardtii strain
UTEX 1061
31423966_c1_229558< /td>17129927308123∠’7Enterobacter cloacaeCONTIG509GTC ORF with score 123 to:
(ai:7000758458)
(or:Pseudomonas aeruginosa)
31644791_c1_23017130525174103−6Treponema pallidumAE001200(de:trep onema pallidum section
16 of 87 of the complete genome.)
(nt:similar to gb:142023
sp:p44679 pid:1003656)
34511283_c1_232560171311038345240− 20KlebsiellaContig523AGTC ORF with score 518 to:
pneu moniae(ai:7000817451)
< td/>(or:Enterobact er cloacae)
12392028_c1_235 561171321053350122Brassica napusY08986(sr:rape) (de:b. napus gene
encoding oleosin-like protein.)
15113455_c1_2365621401466
14978466_ c1_237563171341659552605−59Klebsiella Contig496AGTC ORF with score 897 to:
pneumoniae(ai:7000824038)
(or: Enterobacter cloacae)
5097840_c1_2405 6417135852283598−58BacillusP46921(de:glycine betaine transport system
< HIL>subtilis/Bacilluspermease protein opuab)
< HIL>globigii
15744033_c1_24156517136118239371 6−71SalmonellaP17328(de:glycine betaine/l-proline transport atp-
choleraesuisbinding protein prov)
serotype
typhimurium
7120766_c1_246 566171376392125 56−54Escherichia coliP17446(de:regulatoty protein beti)
12695818_c1_2475671 713815335101978−204Escherichia coliS15181(cl:aldehyde dehydrogenase (nad+):
aldehyde dehydrogenase homology)
(cc:1.2.1.8) (mp:7.5 min)
11192931_c1_24856817 13917945972433−253Escherichia coliP17444(ec:1.1.99.1) (de:choline
dehydrogenase, (chd))
16930456_c2_259569 171401608535
25891068_c2_ 261570171411458485
16219592_c2_2625711714262
32672956_c2_26957 2171431209402101−4Chrysemys pictaA58208(cl:sperm histone) (sr:, painted turtle)
5212535_c2_275573 1714428293109−7CONTIG464GTC ORF with score 109 to:
(ai:7000758504)
or:Pseudomonas aeruginosa)
33839405_c2_281171452197299SchizophyllumAF00 5405(fn:oxidation of the 5′-hydroxymethyl
communeof) (de:schizophyllum commune
b2-aldehyde-forming enzyme mrna,
< td/>completecds.) (nt:secreted enzyme)
35789182_c2_28557517146699232103−3 AspergillusContig9071GTC ORF with score 103 to:
fumigatus(ai:7000758514)
or:Pseudomonas aeruginosa)
34269756_c2_28717147825274150−10Homo sapiensP52758(sr:, human) (de:homology))
32552318_c2_288577< /td>171481899632586⠈’57Myxococcus xanthusAF055904(de:myxoc occus xanthus
acetylornithine deacetylase (arge)
gene, complete cds; and unknown
gene.) (nt:arginine biosynthetic
protein; arge)
33720808_c2_2915781 71491353450
21735767_c2_2 92579171501890629−51Archaeoglobus< /td>E69400
fulgidus
31879205_ c2_29358017151663220137−7PlasmodiumP08675(sr:london,) (de:circumsporozoite
< td/>cynomolgiprotei n precursor (cs))
32511255_c2_2945811 7152549182123−8Contig9654G TC ORF with score 123 to:
fumigatus(ai:7000758523)
(or:< HIL>Pseudomonas aeruginosa)
24422677_c2_29517153921306684−67Escherichia coliP32484(de:hypothetical transcriptional
< td/>regulator in lysp-nfo intergenic region
261278_c2_29658317 154654217103−2Nephila clavipesAF027735(de:neph ila clavipes minor ampullate silk
protein misp1 mrna, partialcds.)
7125308_c2_30158417155435144142−10 ClostridiumContig192HGTC ORF with score 205 to:
acetobutylicum(ai:4500689127)
(or:Enterococcus faccalis)
10167793_c2_30358517156402133121Enterobacter cloacaeCONTIG456GTC ORF with score 146 to:
(ai:7000780481)
(or:Pseudomonas aeruginosa)
15838458_c2_305171571071356266−23Lyme diseaseH70117(sr:, lyme disease spirochete)
spirochete
10291040_c2_30658717158750112−4Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
35754766_c2_307588171 591428475317−28CONTIG459GTC ORF with score 533 to:
(ai:7501748159)
(or:Klebsiella pneumoniae)
29808406_c2_31117160570189107−4Homo sapiensA48018(sr:, man) (mp:4q13−4q21)
10675958_c2_31359 017161534177163 −11Borcogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
10726692_c2_315591171 62570189205−17KlebsiellaContig451AGTC ORF with score 205 to:
pneumoniae(ai:7000758544)
(or: Pseudomonas aeruginosa)
16800800_c3_318171632478825
593171641350449351−32Escheric hia coliP76253(ec:1.14.1.—)
(de: putative dioxygenase alpha
< td/>subunit yeaw,)
7161407_c3_3315941 71651245414
32110068_c3_3 3259517166456151112−5Homo sapiensX63071(sr:human) (de:h. sapiens mrna
for novel dna binding protein.)
34620793_c3_3335961779592120−4StreptomycesAL022268
coelicolor4h2.) (nt:sc4h2.20, probable
aminotransferase, len: 532);
31489441_c3_3385971 7168771256354−33Contig498AG TC ORF with score 354 to:
pneumoniae(ai:7000758567)
(or: Pseudomonas aeruginosa)
21961391_c3_33917169660219124−5Homo sapiensY13247(sr:human) (de:homo sapiens
fb19 mrna.)
13791681_c3_341599 171701458485683−67CyanobacteriumS77027(sr:pcc 6803, , pcc 6803) (sr:pcc 6803,)
< HIL>syncchocystis
14963275_c3_ 34360017171711236
7235411_c3_34860117172 852283116−7Ent erobacter cloacaeCONTIG436GTC ORF with score 325 to:
(ai:7501791606)
(or:Klebsiella pneumoniae)
36535831_c3_34917173957318225−19Lyme diseaseH70117(sr:, lyme disease spirochete)
spirochete
4007281_c3_35060317174957318317−42Mycobacterium Z81360(de:mycobacterium tuberculosis
tuberculosish 37rv complete genome; segment
78/162.) (nt:rv1718, (mtcy04c12.03),
< td/>len: 272. similar to)
11812625_c3_351604171 75342113119−6 StreptomycesL20249(sr:streptomyces coriofaciens (library:
coriofaciensisp 5485) dna) (de:streptomyces
coriofaciens beta-ketoacyl synthase
homologue gene, partial cds.)
< td/>(nt:homologous to saccharopolyspora
< td/>erythraea< /HIL>)
36226643_c3_354605 171761245414430−40MycobacteriumZ80108 (de:mycobacterium tuberculosis
tuberculosish 37rv complete genome; segment
62/162.) (nt:rv 1400c,
(mtcy21b4.17c), len: 320.
possible)
22161711_c3_35560 617177423140192 −15KlebsiellaContig523 AGTC ORF with score 97 to:
pneumoniae(ai:73282)
(or: Dictyostelium discoideum)
(sr:,slime mold) (de:g-box binding
factor (gbf))
6738336_c3_3566071 7178606201109−3U85970(sr:african clawed frog) (de:xenopus
laevis middle molecular weight
neurofilament proteinnf-m(2) mrna,
< td/>complete cds.) (nt:neuronal
intermediate filament protein;
duplicated)
32667580_c3_358 60817179912303114−4equine herpesvirusD88685(sr:equ ine herpesvirus 1 (strain:hh1)
type 1 EVH-1dna) (de:equine herpesvirus 1 dna
for tegument protein, partial cds.)
< td/>(nt:kpn i subfragment of orf24)
11851430_c3_367609 17180219673196−1AL009198 (de:mycobacterium tuberculosis
tuberculosish 37rv complete genome; segment
144/162.) (nt:rv3370c, (mtv004.28c),
len: 1079, dnae, probable)
23628207_c3_3716101932643133−8PseudomonasU54795 (de:pseudomonas aeruginosa
< td/>aeruginosabetai ne semialdehyde dehydrogenase
(betb) gene, partial cds.)
32305191 _c3_37261117182333110231−19Klebsiella Contig536AGTC ORF with score 231 to:
pneumoniae(ai:7000758601)
(or: Pseudomonas aeruginosa)
32707256_c3_37517183543180184−14Enterobacter cloacaeCONTIG459GTC ORF with score 184 to:
(ai:7000758604)
(or:Pseudomonas aeruginosa)
11723913_c3_37617184396131222−19Enterobacter cloacaeCONTIG459GTC ORF with score 222 to:
(ai:7000758605)
(or:Pseudomonas aeruginosa)
26360957_c3_377171851341446592−59RickettsiaAJ2 35269Rickettsia prowazekii strain
prowazekiiMadrid E, complete genome.
32620840_f1_16151 71861278425239−20 KlebsiellaContig551A GTC ORF with score 239 to:
pneumoniae(ai:7000758607)
(or: Pseudomonas aeruginosa)
23521883_f1_5616171871728575111 −4KlebsiellaContig 549AGTC ORF with score 124 to:
pneumoniae(ai:405746) (or:Mus sp.)
(sr:mice macrophage) (de:putative
transcription regulator (clone t2,
repetitive sequence) (mice,
macrophage, mrna, 1263 nt).)
< td/>(nt:method: conceptual translation
supplied by author)
16491467_f1_86171 7188849282646−63AL021958 (de:mycobacterium tuberculosis
tuberculosish 37rv complete genome; segment
35/162.) (nt:rv0753c, (mtv04l.27c),
len: 510. mmsa, probable)
14534842_f1_961817189519172433−41Escherichia coliP25527(de:permease))
7225330_f1_16619345114118−7Rhizobium sp.P55533(sr:ngr234,) (de:hypothetical 9.2 kd protein
y4ko)
2551651_f1_1762017191366121672− 66EnterococcusCONTIG108< /td>GTC ORF with score 672 to:
faecalis(ai:7000758623)
(or:Pseudomonas aeruginosa)
35682333_f1_2017192573190145 −10mice|C57BL/6xCBA/D29149( cl:proline-rich protein) (sr:, house
Ca J hybridmouse)
33830291_f1_2262217193741246100 −2Dictyostelium(sr:,slime mold) (de:spore coat protein sp96)
< td/>discoideum
403543 6_f1_25623171941488495
33675913_f1_2762417195582193429−40 Escherichia coliP52077(de:elaa protein)
26052206_fI_30625171961584527121−3MolluscumL10127(sr: molluscum contagiosum virus
< td/>contagiosum virustype 1 dna) (de:molluscum
subtype 1contagiosum virus type 1 orf1
and orf2 dna.) (nt:orf17)
16276081_f1_42626702233
36134828_f 1_47627171981500499
13927281 _f1_6362817199486161136−9AchromobacterA61183
georgiopolitanum
32166380_f1_6562917200420103−5Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
35604030_f1_696301720 11356451151−7 Escherichia coliD90774(sr:escherichi a coli (strain:k12) dna,
clone_lib:kohara lambda minise)
(de:e. coli genomic dna, kohara
clone #263(30.5-30.9
min.).) (nt:orf_id:o263#22; similar to
(swissprot accession)
10817661_f1_866311677558225−18< /td>OrgyiaO10341(sr: ,opmnpv) (de:hypothetical 29.3 kd
pseudotsugataprotein (or(92))
multinuclcocapsid
nuclear polyhedrosis
virus OpMNPV
12620317_f1_876321 7203456151133−8I47141(sr:, domestic pig)
16925681_f1_92633172 04354117177−13Aquifex aeolicusF70386
16541290_ f1_9663417205456151 138−8human herpesvirusU92288(fn:helicase, helicase-primase
type 6 HHV-6complex) (de:human herpesvirus 6
serotype b putative major immediate-
< td/>earlygenes.) (nt:similar to
hhv6a u86, region ie-b)
16995656_f1_9963517 2061626541161−8P21561(sr:aa 2.2,) (de:hypothetical 50.6 kd
protein in the 5′ region of gyra
and gyrb (orf 3))
34192942_f2_101636172 0714044671134−115 Escherichia coliP76641(de:hypothetical 50.2 kd protein in
kdui-lyss intergenic region)
11848405_f2_1046371720814074681862−192< /td>Pseudomonas putidaA42800(cl:beta-alanine--py ruvate
transaminase) (ec:2.6.1.18)
16510305_f2_10563817209900299160− 12KlebsiellaContig471AGTC ORF with score 160 to:
pneumoniae(ai:7000758711)
(or: Pseudomonas aeruginosa)
26567030_f2_1091721025584
14195840_f2_11064017211837775−77Escherichia coliP25527(de:permease))
31817028_f2_11164117212 468155209−17En terobacter cloacaeCONTIG482GTC ORF with score 209 to:
(ai:7000758717)
(or:Pseudomonas aeruginosa)
12971893_f2_112172131314437
64317214948315427−40Escherich ia coliP45691(de:hypothetical transcriptional
< td/>regulator in argr-cafa intergenic
< td/>region)
32673291_f2_12217215702233131 −7KlebsiellaContig 551AGTC ORF with score 131 to:
pneumoniae(ai:7000758728)
(or: Pseudomonas aeruginosa)
33869583_f2_12717216110436798−2NocardioidesZ93 338(de:a. simplex ksdi genes and three
simplexopen reading frames.) (nt:low
similarity to phytoene
dehydrogenase from)
10626887_f2_1336461 72171641546
25886265_f2_1 3564717218696231129−5Herpes simplex virusAF015297(de:human herpesvirus 6 (strain
(type 6/strainuganda-1102) ie2hom mrna, complete
Uganda-1102)cds.) (nt:similar to the
immediate-early 2 protein of human)
820833_f2_14164817 219435144101−3DictyosteliumP14328(sr: , slime mold) (de:spore coat
discoideumprotein sp96)
35425755_f2_1426491 722055818592−4KlebsiellaContig517AGTC ORF with score 333 to:
pneumoniae(ai:7000822737)
(or: Enterobacter cloacae)
25798577_f2_150 650172212367788314Enterobacter cloacaeCONTIG370GTC ORF with score 322 to:
(ai:7000807782)
(or:Pseudomonas aeruginosa)
32130263_f2_15217222837278199−14Microbacterium X79027(de:m. ammoniaphilum genes
< td/>ammoniaphilummamir and mamim.)
35603251_f2_15465217223387128133−9 Enterobacter cloacaeCONTIG222GTC ORF with score 133 to:
(ai:7000758760)
(or:Pseudomonas aeruginosa)
16144708_f2_15517224540179101−3Enterobacter cloacaeCONTIG370GTC ORF with score 322 to:
(ai:7000807782)
(or:Pseudomonas aeruginosa)
16300201_f2_156172251581526227−18Enterobacter cloacaeCONTIG435GTC ORF with score 227 to:
(ai:7000758762)
(or:Pseudomonas aeruginosa)
31339558_f2_15817226408135135−8Streptomyccs fradiaeP20186(de:hypothetical 35.5 kd protein in
transposon tn4556)
30760205_f2_16565617227630209117−5 infectious bovineP30022(sr:p8-2,) (de:tegument protein ul49
rhinotracheitis virushomolog)
35679 206_f2_17065717228621206 163−12Enterobacter cloacaeCONTIG436GTC ORF with score 163 to:
(ai:7000758776)
(or:Pseudomonas aeruginosa)
21738816_f2_171172292019672750−74Enterobacter cloacaeCONTIG436GTC ORF with score 750 to:
(ai:7000758777)
(or:Pseudomonas aeruginosa)
16897933_f2_17217230468155122−6Homo sapiensAB002322(sr:homo sapiens male brain edna
to mrna, clone_lib:pbluescriptii s)
(de:human mrna for kiaa0324
gene, partial cds.)
6142706_f2_17866017 23168852294295−21 Micrococcus luteusJQ0405
5101443_f2_ 17966117232738245−32Mycobacterium< /td>AL123456(de:mycobacterium tuberculosis
tuberculosish 37rv complete genome; segment
2/162.) (nt:rv0018c, (mtcy 10h4. 18c),
< td>len: 514,highly similar to)
35336663_f2_180662172 3340213497−4m igratory locustAJ000390(sr:migratory locust) (de:locusta
migratoria mrna for nachr alpha1
subunit.)
35286580_f3_183 66317234618205154Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
24110042_f3_187664172 3545014997−2< HIL>CaenorhabditisZ75539(de:< HIL>caenorhabditis elegans cosmid
elegansf28c1, complete sequence.)
< td/>(nt:predicted using
< td/>genefinder, cdna est embl:c 13354)
17070467_f3_188665 17236861286708−71 MycobacteriumAL123456(de:mycobacterium tubcrculosis
tuberculosish37rv complete genome; segment
35/162.) (nt:rv0753c, (mtv041.27c),
len: 510. mmsa, probable)
12994215_f3_189666723240129−8KlebsiellaContig471A
pneumoniae(ai:7000758795)
(or: Pseudomonas aeruginosa)
2585191_f3_19017238564187112 −7KlebsiellaContig 493AGTC ORF with score 195 to:
pneumoniae(ai:7000812226)
(or: Pseudomonas aeruginosa)
12578816_f3_19117239468155176−14AcinetobacterC ONTIG220GTC ORF with score 176 to:
baumanniiC(ai:7000758797)
(or:Pseudomonas aeruginosa)
31744043_f3_193172401104367
67017241351116
488532_f3_198671 17242459152123−6U92288(fn:helicase, helicase-primase
type 6 HHV-6complex) (de:human herpesvirus
6 serotype b putative major
< td/>immediate-earlygenes.) (nt:similar to
hhv6a u86, region ic-b)
3367207_f3_19967217 2432316771194−12X69926(sr:malari a parasite)
(de:p. falciparum (b3)
hrpii gene, exon 2.)
29767706_f3_205673172 44543180126−6 AcanthamoebaP10569(sr:, amoeba) (de:myosin ic heavy
castellaniichain)
6850791_f3_20767417245714237121−6Enterobac ter cloacaeCONTIG430GTC ORF with score 485 to:
(ai:7000776269)
(or:Pseudomonas aeruginosa)
32707076_f3_21117246456151
676172471587528184−10Oryctolag usP16230(sr:, rabbit) (de:precursor (hep))
< HIL>cuniculus
22142255_f3_218< /td>677172481350449
3322665_f3_22167817249483 16091−3Enterob acter cloacaeCONTIG500GTC ORF with score 107 to:
(ai:7000723201)
(or:no gb taxonomy match)
(de:human papillomavirus type 80 e6,
e7, e1, e2, e4, 12, and 11 genes.)
(nt:putative)
31891467_f3_222679172501800599148< /td>−6equine herpesvirusAF030027(fn:very large tegument protein)
type 4 EHV-4(de:equine herpesvirus 4 strain
ns80567, complete genome.)
(nt:counterpart of hsv-1 gene u136
and vzv gene 22)
7301083_f3_2256801725 1672223106−4< HIL>Gallus gallusK02113(sr:chicken) (de:gallus gallus
domesticusv itellogenin gene coding for
phosvitin, exons 23 and 24.)
10438276_f3_23068117 252867288159−9Homo sapiensZ34277(sr:human) (de:h. sapiens (jer47)
muc5ac mrna for mucin (partial).)
24782151_f3_23268217253468155
31807905 _f3_233683172541308435
31910958_f3_237684172552955984603−58Enterobacter cloacaeCONTIG435GTC ORF with score 603 to:
(ai:7000758843)
(or:Pseudomonas aeruginosa)
11988266_f3_23817256444147146−9AcanthamoebaAF0 85185(de:acanthamoeba castellanii
castellaniimyo sin-ia (mia) gene, complete cds.)
< td/>(nt:myosin-i)
34258293_f3_239172572379792289−25Enterobacter cloacaeCONTIG435GTC ORF with score 289 to:
(ai:7000758845)
(or:Pseudomonas aeruginosa)
16277168_f3_2421725828895196 −16AcinetobacterCO NTIG187GTC ORF with score 196 to:
baumanniiC(ai:7000758848)
(or:Pseudomonas aeruginosa)
16037583_f3_246172591266421
689172601674557159−8Murine herpesvirusU97553(de:mur ine herpesvirus 68 strain
6 8wums, (complete genome.)
6925032_f3_249690172611350449337−30Vibrio choleraeAJ231091(de:vibr io cholerae z29f gene.)
12932183_f3_250691 172621131376641−63Enterobacter cloacaeCONTIG436GTC ORF with score 216 to:
(ai:4000713594) (or:Vibrio
alginolyticus) (sr:vibrio alginolyticus
(strain:vi05) dna) (de:vibrio
alginolyticus dna for poma, pomb,
< td/>complete cds.) (nt:essential for rotation
of the sodium-driven polar)
16898316_f3_251692 1726337561251250−17LegionellaY15044(d e:legionella pneumophila 22kb dna
pneumophilafragment from icm gene cluster.)
13026031_f3_254693453150196−14BacillusH69878
subtilis/Bacillus
globigii
5989705 _c1_269694172651605534155−10AspergillusContig8591GTC ORF with score 247 to:
fumigatus(ai:405746)
(or:Mus sp.) (sr:mice macrophage)
(de:putative transcription regulator
{clone t2, repetitive sequence}
(mice, macrophage, mrna, 1263 nt).)
< td/>(nt:method: conceptual translation
supplied by author.)
12757691_c1_271695747248147−7Homo sapiensAB002322(sr:homo sapiens male brain cdna to
mrna, clone_lib:pbluescriptii s)
(de:human mrna for kiaa0324 gene,
< td/>partial cds.)
2598556_c1_28169617 2672499832144−6AF015539(sr:blue mussel) (de:mytilus edulis
precollagen p (precol-p) mrna,
< td/>complete cds.)
11930441_c1_2836971 7268693230109−6S56117(sr:, longfin squid)
22744581_c1_284698 17269801266154−9P08677(de:circumsporozoite protein precursor (cs))
16902205_c1_2906991 7270411136127−8P13238(sr:, fruit fly) (de:sv23))
< td>melanogaster
14960033_ c1_29770017271621206169−12Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopcptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
16847555_c1_301701172 721206401
32286291_c1_307 702172731311436
21769081_c1_308703172748 37278567−55Esc herichia coliP17582(ec:4.2.1.1) (de:carbonic anhydrase,)
20174207_c1_309704172751392463569−5 7MycobacteriumAL123456(de:mycobacterium tuberculosis
tuberculosish 37rv complete genome; segment
141/162.) (nt:rv3273, (mtcy71.13),
len: 764, membrane protein,)
480055_c1_314705172761359452138−8Glossus humanusAF008298(sr:gloss us humanus)
(de:glossus humanus
cytochrome c oxidase subunit i (coi)
< td/>gene, mitochondrial gene encoding
mitochondrial protein, partial cds.)
13776916_c1_3157061 72771086361190−25 ParacoccusP06030(ec: 1.9.3.1) (de:subunit 3)
denitrificans(oxidase aa(3) subunit 3))
24072842_c1_318707172 781179392150−8Aquifex acolicusD70314
33848780_ c1_32070817279504167275−24Pseudomonas P47206(sr:, pseudomonas perfectomarina)
< td/>stutzeri(d e:hypothetical protein in apt 3′
region (fragment))
2422650_c1_321709 17280642213240−20< /td>KlebsiellaContig302A GTC ORF with score 240 to:
pneumoniae(ai:7000758927)
(or: Pseudomonas aeruginosa)
2041301_c1_32317281780259275 −24RhizobiumS72165
leguminosarum bv.
viciae
32135155_c1_ 32471117282612203−26Rhizobium S72164
< td>leguminosarum bv.
viciae
36148506_c1_ 326712172831080359−7Enterobacter cloacaeCONTIG451GTC ORF with score 181 to:
(ai:7000767457)
(or:Pseudomonas aeruginosa)
35725168_c1_32917284369122112−6AchromobacterA6 1183
georgiopolitanum
14707183_c1 _34071417285477158−4infectious bovineZ78205(de:bovine herpesvirus type 1
< i>rhinotracheitis virusul22-35 genes.) (nt:homolog of
icp18.5 or hsv-1)
24417700_c1_341715 17286741246379−35 KlebsiellaContig471A GTC ORF with score 379 to:
pneumoniae(ai:7000758947)
(or: Pseudomonas aeruginosa)
35345287_c1_34417287543180142−9AchromobacterA6 1183
georgiopolitanum
31276666_c1 _34671717288336111−5Apergillus Contig9020GTC ORF with score 126 to:
fumigatus(ai:113583)
(or: Saccharomyces cerevisiae)
de:(yjr151c) (pn:hypothetical
118:similarity to mucin proteins,
yk1224c, sta1p) (gn:j2223)
< td/>(gtcfc:11.1) (ec:) (yj9p_yeast)
(keggfc:11.2) (sgdfc:9.1.0)
(db:gtc-saccharomyces cerevisiae))
5901055—c2_35171817289408135108−5Homo sapiensPN0099(sr:, man)
31892191_c2_35271917 29037741257376−33 Enterobacter cloacaeCONTIG436GTC ORF with score 376 to:
(ai:7000758958)
(or:Pseudomonas aeruginosa)
16849041_c2_3531729130541017219−17Enterobacter cloacaeCONTIG436GTC ORF with score 219 to:
(ai:7000758959)
(or:Pseudomonas aeruginosa)
5183326_c2_35517292816271229 −18silkwormS74439(sr:silkwo rm) (de:silk fibroin heavy
< td/>chain {3′ region} (bombyx
mori=silkworms, mrnapartial, 2008
nt).) (nt:this sequence comes
< td/>from FIG. 1c.)
33853968_c2_357 722172931074357471−45Enterobacter cloacaeCONTIG436GTC ORF with score 471 to:
(ai:7000758963)
(or:Pseudomonas aeruginosa)
16928291_c2_35817294681226120−6KlebsiellaConti g217AGTC ORF with score 282 to:
pneumoniae(ai:5500695239)
(or: Alcelaphine herpesvirus 1)
(sr:wildebeest herpesvirus)
(de:alcelaphine herpesvirus 1 1-dna,
complete sequence.) (nt:orf73; similar
to h, saimiri and kshv orf73)
13067628_c2_359724 172951485494343−31Enterobacter cloacaeCONTIG436GTC ORF with score 343 to:
(ai:7000758965)
(or:Pseudomonas aeruginosa)
36428877_c2_36117296882293385−36Enterobacter cloacaeCONTIG435GTC ORF with score 385 to:
(ai:7000758967)
(or:Pseudomonas aeruginosa)
14708511_c2_3621729723677881353−138Enterobacter cloacaeCONTIG435GTC ORF with score 1353 to:
(ai:7000758968)
(or:Pseudomonas aeruginosa)
34635030_c2_36517298315010491418< /td>−145Cyanobacterium S76431(cl:atp-dependent clp proteinase
< td>synechocystischain a) (sr:pcc 6803,, pcc 6803)
< td/>(sr:pcc 6803,) (ec:3.4.21.—)
32523556_c2_369728 172991056351166 −9Micrococcus luteusJQ0405
35367276_c2 _37272917300780259−3Homo sapiensM77663(sr:homo sapiens cdna to mrna)
< td/>(de:human keratin 10 mrna, 3′ end.)
14345125_c2_3737301 73011863620165−8Q07283(sr:, human) (de:trichohyalin)
35414756_c2_3747 311730270523493 −4SaccharomycesS66936< /td>(rnp:15r)
cerevisiae
3636 4466_c2_3757321730352217 3125−6Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
32696006_c2_377733173 041863620
26740706_c2_378 73417305939312
13025805_c2_3827351730644 1146106−6Trypa nosoma cruziU61533(de:trypanoso ma cruzi mucin-like
< td/>protein (muc.t0-1) gene, complete
cds.) (nt:mucin-like protein)
16535705_c2_387736732243323−29StreptomycesAL023517
coelicolor1b5.) (nt:sc1b5.14c, probable
transmembrane transport)
15730157_c2_388737 173081524507
31901080 _c2_3897381730912784251357−138RhizobiumQ08855(ec:1.9.3.1) (de:subunit 1))
leguminosarum
16300656_c2_392 739173102227326 7−23Atlantic codP55777(sr:,atlantic cod) (ec:1.9.3.1)
(de:cytochrome c oxidase
polypeptide iii,)
31457961_c2_3947401 731183427798−3Candida albicansCONTIG580GTC ORF with score 474 to:
5(ai:112109)
(or:Saccharomyces cerevisiae)
(de:(ygr112w) (pn:hypothetical
45:similarity to human surf-1 protein)
(gn:g6150) (gtcfc:13.7) (ec:)
< td/>(yg2x_yeast) (keggfc:11.2)
(sgdfc:13.0.0)
< td/>(db:gtc-saccharomyc es cerevisiae))
25494592_c2_397 74117312951316567−55CyanobacteriumS75620(sr:pcc 6803,, pcc 6803)
synechocystis(sr:pcc 6803,)
10011462_c2_403742 17313435144117−6U92288(fn:helicase, helicase-primase
type 6 HHV-6complex) (de:human herpesvirus
6 serotype b putative major
< td/>immediate-earlygenes.) (nt:similar to
hhv6a u86, region ie-b)
23629535_c2_4047431 731414074681082−109SalmonellaP50334(d e:c4-dicarboxylate transport
choleraesuisprotein)
serotype
typhimurium< /td>
17004166_c2_40574417315735244198−16no gb taxonomyAF033674(de:pseudomonas marginalis pv.
matchalfalfae strain lmg2214 unknown
genes.) (nt:orf1)
13019580_c2_406745855284142−6AcanthamoebaAF085185
castellaniimyo sin-ia (mia) gene, complete cds.)
< td/>(nt:myosin-i)
12003958_c2_40717317474157263−23AspergillusS47 523
fumigatus
36525826_c2_41174717318333110617−60Enterococcus CONTIG108GTC ORF with scorc 617 to:
faccalis(ai:7000759017)
(or:Pseudomonas aeruginosa)
509831_c2_422748173191032343244 −21KlebsiellaConti g471AGTC ORF with score 277 to:
pneumoniae(ai:7000775135)
(or: Pseudomonas aeruginosa)
36352000_c2_42417320939312198−14AchromobacterP 52686(sr:atcc 19151,) (de:sds degradation
georgiopolitanumtranscri ptional activation protein)
4307292_c3_4357501732139913296−3P14328(s r:,slime mold) (de:spore coat
discoideumprotein sp96)
33869528_c3_4557511 7322432143131−8S50755
reinhardtii strain
UTEX 1061
35635216_c3_45675217 323600199301−27AF037441(de:edwa rdsiella ictaluri putative
18.8 kda protein (eip19) putative
17.8 kda protein (eip18), putative
54.5 kda protein (eip55), and
putative 19.5 kda protein (eip20)
genes, complete cds.) (nt:eip20;
< td/>possibly antigenic to catfish)
33673588_c3_45775315095021275−130 Edwardsiella ictaluriAF037441(de:edwa rdsiella ictaluri putative
18.8 kda protein (eip19) putative 17.8
kda protein (eip18), putative 54.5 kda
protein (eip55), and putative 19.5 kda
protein (eip20) genes, complete cds.)
< td/>(nt:eip55; antigenic to catfish)
16299016_c3_458754519172210−17Edwardsiella ictaluriAF037441(de:edwa rdsiella ictaluri putative
18.8 kda protein (eip19), putative
17.8 kda protein (eip18), putative
54.5 kda protein (eip55), and
putative 19.5 kda protein (eip20)
genes, complete cds.) (nt:eip18;
< td/>possibly antigenic to catfish)
16254058_c3_460755792263317−29Enterobacter cloacaeCONTIG435GTC ORF with score 317 to:
(ai:7000759066)
(or:Pseudomonas aeruginosa)
31923781_c3_46217327420139117−6equine herpesvirusP28968(sr:ab4p,ehv-1) (de:glycoprotein x
type 1 EVH-1precursor)
36031376_c3_4 6675717328966321116−4pig roundwormA44982(cl:unassigned collagens)
< td/>(sr:, pig roundworm)
10241705_c3_467758 17329435144122−8Homo sapiensI53641(sr:, man) (mp:11p15.5 p15.5)
11039212_c3_470173301953650827−82Escherichia coliAF044503(de:excheric hia coli strain ec11
unknown (498), hep gene,
< td/>complete cds; and rhsg accessory
genetic element vgrg protein, core
component and dsorf-g1 genes,
complete cds.)
5322840_c3_47676017 3312250749978−98AF044503(de:excheric hia coli strain ec11
unknown (498), hep gene,
< td/>complete cds; and rhsg accessory
genetic element vgrg protein, core
component and dsorf-g1 genes,
complete cds.)
26302292_c3_4777611 7332477158
32662627_c3_47 9762173331980659
16151056_c3_48776317334 609202120−5Can is familiarisA45195(cl:guan ylate cyclase catalytic
domain homology) (sr:, dog)
260811_c3_4887641733 51371456178−10blue musselAF029249(sr:blue mussel) (de:mytilus edulis
precollagen d (precol-d) mrna,
< td/>complete cds.)
35586567_c3_4917651 73361335444483−47 RickettsiaAJ235269Rickettsia prowazekii strain
prowazekiiMadrid E, complete genome.
4425963_c3_492766 17337360119202−15 NitrobacterX89566(ec :1.9.3.1) (de:n. winogradskyi dna
winogradskyifor coxa, coxb and coxc genes.)
10026907_c3_49476717338843280296−26Homo sapiensAF044321(fn:controls the formation (possibly
by catalyzing) (sr:human)
< td/>(de:homo sapiens cytochrome c
oxidase assembly protein cox11
< td/>(cox11) mrna, complete cds.)
< td/>(nt:yeast cox11 ortholog;
putative heme a)
22445443_c3_4957681733 9744247114−3< HIL>infectious bovineAJ004801(de:bovine herpesvirus
rhinotracheitis virus1 complete genome.)
14927156_c3_498769801266
25979568_c3 _499770173411188395 143−7equine herpesvirusD88685(sr:equ ine herpesvirus 1 (strain:hh1)
type 1 EVH-1dna) (de:equine herpesvirus 1 dna
for tegument protein, partial cds.)
< td/>(nt:kpn i subfragment of orf24)
10161292_c3_501771 17342771256223−18 PseudomonasP47206(sr :, pseudomonas perfectomarina)
< td/>stutzeri(d e:hypothetical protein in apt
3′ region (fragment))
35417643_c3_50377217343642213359−33 MycobacteriumZ95150 (de:mycobacterium tuberculosis h37rv
< td/>tuberculosiscomplete genome: segment 135/162.)
(nt:rv3095, (mtcy164.06), len: 158.
possible)
29806932_c3_51277 317344864287162 −10StreptomycesP32425< /td>(de:hypothetical transcriptional
< td/>ambofaciensregulator in instable dna locus (orf 1))
16978201_c3_515774173 459123031137−115CONTIG108 GTC ORF with score 1137 to:
faecalis(ai:7000759121)
(or:Pseudomonas aeruginosa)
1307282_c3_51917346819272
15823567_c3_52077617347354419−39Escherichia coliP16680(de:phna protein)
16301416_c3_522777570189361−33KlebiellaContig329AGTC ORF with score 361 to:
pneumoniae(ai:7000759128)
(or: Pseudomonas aeruginosa)
16876327_c3_52417349786261265−23KlebsiellaCont ig471AGTC ORF with score 265 to:
pneumoniae(ai:7000759130)
(or: Pseudomonas aeruginosa)
12900157_c3_526173501317438217−15KlebsiellaCon tig536AGTC ORF with score 238 to:
pneumoniae(ai:7000809741)
(or: Pseudomonas aeruginosa)
26835406_c3_52917351855284267−23KlebsiellaCont ig551AGTC ORF with score 274 to:
pneumoniae(ai:7000809943)
(or: Pseudomonas aeruginosa)
32674193_c3_53017352762253157−12KlebsiellaCont ig551AGTC ORF with score 180 to:
pneumoniae(ai:7000810058)
(or: Pseudomonas aeruginosa)
16616588_f1_17821735320166
2 4323592_f1_2783173542257 4
10003966_f1_1278417355< /td>22597521264−129KlebsiellaContig402AGTC ORF with score 1331 to:
pneumoniae(ai:7000838221)
(or: Enterobacter cloacae)
32680433_f1_197 8517356498165283−25Escherichia coliP31055(ec:4.1.2.25) (de:probable dihydroneopterin
aldolase, (dhna))
7206393_f1_247861 73571491496757−75 Enterobacter cloacaeCONTIG387GTC ORF with score 911 to:
(ai:7501737381)
(or:Klebsiella pneumoniae)
24652167_f1_2517358747248249 −21KlebsiellaConti g500AGTC ORF with score 249 to:
pneumoniae(ai:7000759161)
(or: Pseudomonas aeruginosa)
15822151_f1_261735916175381006−102KlebsiellaCo ntig500AGTC ORF with score 1006 to:
pneumoniae(ai:7000759162) (or:Pseudomonas aeruginosa)
31453283_f1_2717360606201190 −15Enterobacter cloacaeCONTIG287GTC ORF with score 514 to:
(ai:7501763901)
(or:Klebsiella pneumoniae)
14175961_f1_28173611170389265−23KlebsiellaCont ig426AGTC ORF with score 423 to:
pneumoniae(ai:7000825490)
(or: Enterobacter cloacae)
6523961_f1_3179 1173621050349328−30Enterobacter cloacaeCONTIG421GTC ORF with score 328 to:
(ai:7000759167)
(or:Pseudomonas aeruginosa)
25822527_f1_36173631644547
793173642508 835367−32Ralstonia< /i>S41544
solanacearum
12679811_f1_5179417365678225427−40Ralstoni aI40540(cl:response regulator homology)
solanacearum
34164162_f 1_5479517366750249−37ClostridiumContig075HGTC ORF with score 397 to:
acetobutylicum(ai:7000759190)
(or:Pseudomonas aeruginosa)
33489562_f1_6117367672223107 −6Enterobacter cloacaeCONTIG408GTC ORF with score 169 to:
(ai:7000804017)
(or:Pseudomonas aeruginosa)
3335336_f2_7079717368295598499AchromobacterL811 25(sr:pseudomonas sp (strain imt37)
< HIL>georgiopolitanumdna) (de:pseudomonas sp. (strain
imt37) monooxygenase subunit
gene, completecds.)
16033543_f2_7279817369660219609−6 0Enterobacter cloacaeCONTIG490GTC ORF with score 609 to:
(ai:7000759208)
(or:Pseudomonas aeruginosa)
20961456_f2_7317370786261106 −3Schizosaccharomyces(sr:fission yeast) (de:s. pombe
< td/>pombechromosome ii cosmid c3d6.)
(nt:spbc3d6.14c, unknown;
partial; serine rich,)
12128332_f1_758001 73711533510235−19 Enterobacter cloacaeCONTIG490GTC ORF with score 436 to:
(ai:7501727989)
(or:Klebsiella pneumoniae)
7322675_f2_788011737212064011248−127Escherichia coliF65094(cl:o-sialogly coprotein
endopeptidase) (ec:3.4.24.57)
(mp:67 min)
7161628_f2_798021737 3522173107−6< HIL>KlebsiellaContig362AGTC ORF with score 348 to:
pneumoniae(ai:7000838282)
(or: Enterobacter cloacae)
1182256_f2_8080 3173741014337120−7KlebsiellaContig362 AKlebsiella pneumoniae, GTC, rel
pneumoniae1.0, 9812146
16807333_f2_96804 173752889590−5KlebsiellaContig426AGTC ORF with score 90 to:
pneumoniae(ai:7000759232)
(or: Pseudomonas aeruginosa)
14894375_f2_10217376336111
80617377777228−19Streptomyce s griseusP08075(cc:2.7.7.24) (de:enzyme))
16900040_f2_116807173781212403
407187 _f2_117808173791197398713−70ArchaeoglobusG69450(cl:atp-binding cassette homology)
fulgidus
25660031_f2_11 9809173802367788
35802013_f2_12281017381 933310957−96Ps eudomonas putidaP42509(de:hypothetical protein in type 5′
region (fragment))
11800081_f2_1238111738215065012524− 262PseudomonasP20580(ec:4.1.3.27) (de:anthranilate
aeruginosa synthase component i,)
2458252_f3_1308121738 321069
34650842_f3_15181317384318105186< /td>−15KlebsiellaC ontig362AGTC ORF with score 186 to:
pneumoniae(ai:7000759287)
(or: Pseudomonas aeruginosa)
22549066_f3_15217385891296214−17Aquifex acolicusC70315
31535125_ f3_157815173861527508337−31Klebsiella Contig500AGTC ORF with score 337 to:
pneumoniae(ai:7000759293)
(or: Pseudomonas aeruginosa)
16148526_f3_16117387507168142−10KlebsiellaCont ig533AGTC ORF with score 142 to:
pneumoniae(ai:7000759297)
(or: Pseudomonas aeruginosa)
31892931_f3_163173881575524232−19KlebsiellaCon tig426AGTC ORF with score 321 to:
pneumoniae(ai:7000825513)
(or: Enterobacter cloacae)
16289583_f3_165 818173891296431202KlebsiellaContig 426AGTC ORF with score 303 to:
pneumoniae(ai:7000825478)
(or: Enterobacter cloacae)
25594452_f3_172 819173901017338250Chlorobium tepidumU58313(de:chlorob ium tepidum 7.5 kda
chlorosome protein (csmb) gene,
< td/>completecds.) (nt:orft)
12925930_f3_1738201284427217−17< /td>Coxiella burnetiiQ45885(de:dnaj-like protein djla (mucoidy
activation protein mucz))
14664193_f3_181821 173921047348410−38AgrobacteriumU60011 (de:agrobacterium tumefaciens
tumefaciens (TIplasmid pti15955 occr (occr) gene,
PL ASMIDpartial cds; mcl pseudogene,
PTIBO542)complete sequence; trar-like
regulator (trlr), motd (motd), motc
(motc), motb (motb), and mota
(mota) genes, complete cds; and
unknown genes,) (nt . . .
14927093_f3_18482217393 1131376446−42 AgrobacteriumU60011(de:< HIL>agrobacterium tumefaciens
tumefaciens (TIplasmid pti15955 occr (occr) gene,
PL ASMIDpartial cds; mel pseudogene,
PTIBO542)complete sequence; trar-like regulator
(trlr), motd (motd), motc (motc),
motb (motb), and mota (mota) genes,
complete cds; and unknown
genes,) (nt . . .
25494405_f3_18582317394 1362453503−48 AgrobacteriumU60011(de:< HIL>agrobacterium tumefaciens
tumefaciens (TIplasmid pti15955 occr (occr) gene,
PL ASMIDpartial cds; mel pseudogene,
PTIBO542)complete sequence; trar-like regulator
(trlr), motd (motd), motc (motc),
motb (motb), and mota (mota) genes,
complete cds; and unknown
genes,) (nt . . .
31644842_f3_18682417395 936311489−47< HIL>AgrobacteriumU60011(de:agrobacterium tumefaciens
tumefaciens (TIplasmid pti15955 occr (occr) gene,
PL ASMIDpartial cds; mel pseudogene,
PTIBO542)complete sequence; trar-like regulator
(trlr), motd (motd), motc (motc),
motb (motb), and mota (mota) genes,
complete cds; and unknown
genes,) (nt . . .
34375780_f3_18782517396 693230867−87< HIL>Escherichia coliP32661(ec:5.1.3.1) (de:epimerase) (ppe)
< td/>(r5p3e))
14103766_c1_20482 61739727992129⠈’9ClostridiumContig146H GTC ORF with score 129 to:
acetobutylicum(ai:7000759340)
(or:Pseudomonas aeruginosa)
13141382_c1_21917398597198135−7Homo sapiensPN0099(sr:, man)
36148901_c1_22082817 3992577858437−40Contig075H GTC ORF with score 437 to:
acetobutylicum(ai:7000759356)
(or:Pseudomonas aeruginosa)
32695391_c1_22517400639212104−3MycobacteriumP9 6942(ec:3.4.24.—) (de:cell division
tuberculosisprotein ftsh homolog,)
33864213_c1_23283017255741551−15 9Escherichia coliAB011549(sr:escheric hia coli (str:o157:h7,
sub_str:rimd 0509952)
(de:escherichia coli plasmid po157
< td/>dna, complete sequence.) (nt:putative
reverse transcriptase; similar to)
36134583_c1_233831174 022907968503−89P31554(de:organic solvent tolerance protein
precursor)
26272892_c1_23417403702233106 −3KlebisellaContig 501AGTC ORF with score 1258 to:
pneumoniae(ai:1500687053)
(or: Escherichia coli)
< td/>(ec:2.7.4.7) (de:(hmp-p kinase))
30205441_c1_23583314164711002−101 Escherichia coliP19624(de:pyridoxal phosphase biosynthetic
protein pdxa)
13145656_c1_2368341 7405906301130−5L12347(sr:homo sapiens (library: atcc 1136,
< td/>stratagene) (female embryonal cdn)
(de:homo sapiens collagen chain
< td/>mrna, 3′ end.
21735288_c1_23783517 4061260419692−68P05637(ec:3.6.1.41) (de:(diadenosine
tetraphosphatas e))
12972001_c1_2398361740721697222647⠈’275Escherichia coliP77391(de:hypothetical 74.5 kd protein in
gapa-rnd intergenic region)
26345182_c1_24183717408918305113−6 longfin squidS56117(sr:, longfin squid)
2132953_c1 _242838174091821606 1880−194Escherichia coliP29013(de:hypothetical 60.8 kd protein in
fadr-dada
intergenic region)
6453580_c1_243839 17410891296121−4U43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
10969143_c1_256840174 1118606193084−9999PseudomonasP26480(d e:rna polymerase sigma factor rpod
aeruginosa(sigma-70))
12307191_c1_2578411741214 94497
12541430_c1_258842< /td>174132257495−4 Homo sapiensAB014562(sr:homo sapiens adult male brain
< td/>cdna to mrna, clone_:pbluescripti)
< td/>(de:homo sapiens mrna for kiaa0662
protein, partial cds.)
32553806_c1_2598431 7414420139130−7P08640(sr :, baker's yeast) (ec:3.2.1.3)
cerevisiae(de:glucosida se)
(1,4-alpha-d-glucan glucohydrolase))
32245768_c2_27284 417415480159264 −23KlebsiellaContig472 AGTC ORF with score 264 to:
pneumoniae(ai:7000759408)
(or: Pseudomonas aeruginosa)
16015816_c2_27417416229876596−2mice|C57BL/6xCBA/U46463( sr:house mouse) (de:mus musculus
CaJ hybridglutamine repeat protein-1 mrna,
< td/>complete cds.) (nt:grp-1)
12995463_c2_275846 174171266421
20025688 _c2_277847174181215404
12632331_c2_27984817419867288392−37ClostridiumContig075HGTC ORF with score 392 to:
acetobutylicum(ai:7000759415)
(or:Pseudomonas aeruginosa)
31409752_c2_284174201083360111−3DrosophilaK026 21(sr:d. melanogaster dna, clone tm17)
melanogaster(de:d. melanogaster tropomyosin gene
isoform 33 (9c), exon 10c.)
< td/>(nt:tropomyosin isoform 33 (9c))
16928902_c2_2978501 7421411136124−7U43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
23204058_c2_298851174 221344447831−83P21202(ec:5.2.1.8) (de:sura), (ppiase)
(rotamase c))
2474137_c2_3028521742 3873290670−66 Escherichia coliP06992(ec:2.1.1.—)
(de:d imethyltransferase))
15048505_c2_30317424417138352 −32Escherichia coliP05636(de:apag protein)
15666633_c2_306854432143273−24Escherichia coliP09390(de:glpe protein)
31757091_c2_309855429142151−10LitomosoidesU54556 (de:litomosoides sigmodontis
sigmodontismic rofilarial sheath proteins shp3a
< td/>(shp3a) and shp3 (shp3) genes,
complete cds.) (nt:structural protein;
similar to shp3 genes from)
4304193_c2_31185617 42712814261318−134Escherichia coliP76235(de:hypothetical 49.4 kd protein
in gapa-rnd intergenic region)
12363180_c2_31385717428618205
21661006_c2_ 315858174291662553−5StreptomycesAL031124(de:streptomyces coelicolor cosmid
coelicolor1c2.) (nt:sc1c2.25c, unknown,:
1329 aa; contains two)
32525341_c2_31685917 430852283114−3OryctolagusD14157(sr:oryctolagus cuniculus edna
cuniculusto mrna) (de:rabbit mrna for rabbit
brain calcium channel biii,
< td/>complete cds.) (nt:alpha-1 subunit)
31928892_c2_317860471156122−6Homo sapiensAF048977(fn:splicing factor) (sr:human)
< td/>(de:homo sapiens ser/arg-related
< td/>nuclear matrix protein (srm160)
mrna, complete cds.) (nt:160 kda)
3334408_c2_318861174 32408135343−31Pseudomonas putidaAF014397(de:pseudo monas putida
macromolecular synthesis operon:
30s subunitribosomal protein s21
(rpsu), dna primase (dnag), and
sigma-70 (rpod) genes,
complete cds.)
32282155_c2_3198621 7433519172346−31U63641(fn:u nknown) (de:legionella
pneumophilia pneumophila rpod operon lporfx,
lpdnag, and lprpodgenes,
complete cds.)
12704131_c2_3208631 74341788595205−13 Bos primigeniusP23206(sr:, bovine) (de:collagen 1(x)
tauruschain precursor)
35822906_c2_324864 174352970989308−23 Bordetella pertussisP16575(ec:2.7.3.—) (de:virulence sensor
protein bvgs precursor,)
21734502_c2_32586517436101433795−2< /td>AspergillusContig8669GTC ORF with score 124 to:
fumigatus(ai:5500701392)
(or:< HIL>Nephila clavipes) (de:nephila
clavipes minor ampullate silk
protein misp1 mrna, partialcds.
10178250_c2_3318661743726186
24495341_ c3_333867174388552841310−133PseudomonasQ06553(de:transcription rcgulatory protein
aeruginosaprtr (pyosin repressor protein))
22461557_c3_334868375124515−49PseudomonasQ06552 (de:transcription regulatory protein
aeruginosaprtn (pyocin activator protein))
31894767_c3_3368692142713149−10< /td>KlebsiellaContig472A GTC ORF with score 149 to:
pneumoniae(ai:7000759472)
(or: Pseudomonas aeruginosa)
1308530_c3_33717441360119131 −9Enterobacter cloacaeCONTIG487GTC ORF with score 131 to:
(ai:7000759473)
(or:Pseudomonas aeruginosa)
7115706_c3_33817442417138125 −8Enterobacter cloacaeCONTIG399GTC ORF with score 115 to:
(ai:58918) (or:Saccharomyces
< td/>cerevisiae) (mp: 14r)
2000457_c3_344872174 43786261305−27KlebsiellaContig545AGTC ORF with score 598 to:
pneumoniae(ai:7000845728)
(or: Enterobacter cloacae)
30367781_c3_349 87317444417138116PseudomonasS29309< /td>
aeruginosa
26821906_c3_3581744518962
29782330_c3_36787517446279100−6AspergillusContig10074GTC ORF with score 200 to:
fumigatus(ai:58918)
(or:< i>Saccharomyces cerevisiae)
(mp: 14r)
16894580_c3_37487617 447822273115−4equine herpesvirusD88685(sr:equ ine herpesvirus 1 (strain:hh1)
type 1 EVH-1dna) (de:equine herpesvirus 1 dna
for tegument protein, partial cds.)
< td/>(nt:kpn i subfragment of orf24)
9866583_c3_3768771 7448477158133−8U43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
35253277_c3_378878174 49807268114−3 Canadian hardS18733(cl:glutenin) (sr:, common wheat)
w inter wheat
6360418_c3_38087917 450501166168−12U43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
10824040_c3_381880174 51492163117−6 SchizosaccharomycesZ95620(sr:fission yeast) (de:s. pombe
< td/>pombechromosome ii cosmid c3d6.)
(nt:spbc3d6.14c, unknown;
partial: serine rich,)
2400181_c3_3828811 74521374457107−3P20186(de:hypothetical 35.5 kd protein in
transposon tn4556)
10291433_c3_3848821745313654541253−127< /td>Escherichia coliP06961(ec:2.7.7.25) (de:(trna cca-
pyrophosphorylase))
11956437_c3_387< /td>8831745466622136 6−33Escherichia coliP31056(de:hypothetical 22.2 kd protein in
baca-ttda intergenic region (o205))
35258326_c3_388884174551059352339−31Enterobacter cloacaeCONTIG490GTC ORF with score 339 to:
(ai:7000759524)
(or:Pseudomonas aeruginosa)
10286467_c3_3891745620976982442−253Pseudomonas putidaAF014397(de:pseudo monas putida
macromolecular synthesis operon:
30s subunitribosomal protein s21
(rpsu), dna primase (dnag), and
sigma-70 (rpod) genes,
complete cds.)
17082636_c3_3978861 745724728231346−137CyanobacteriumS74707
synechocystis(sr:pcc 6803, pcc 6803)
< td/>(sr:pcc 6803,)
33867338_c3_399887 1745816985651532−157Escherichia coliAB011549(sr:escheric hia coli (str:o157:h7,
sub_str:rimd 0509952)
(de:escherichia coli plasmid po157
< td/>dna, complete sequence.)
< td/>(nt:putative reverse transcriptase:
similar to)
16588533_f1_188817459 1425474148−7< HIL>Homo sapiensAF048977(fn:splicing factor) (sr:human)
< td/>(de:homo sapiens ser/arg-related
< td/>nuclear matrix protein (srm160)
mrna, complete cds.) (nt: 160 kda)
16541303_f1_58891746 01944647585−57BacillusZ93940(de: b. subtilis genomic dna fragment
subtilis/Bacillusfrom yuca to yuch.) (nt:putative)
globigii
12245687_f 1_689017461819272−7Caenorhabditis< /td>Q09456(de:putative cuticle collagen c09g5.5)
elegans
31657277_f1_7891 174622001666638−62BacillusC69830
subtilis< /HIL>/Bacillus
globigii
12157700_f1_889217463125 7418127−4Gallu s gallusA90458(cl:collagen alpha 1(i) chain:fibrillar
domesticuscollagen carboxyl-terminal
homology:von willebrand factor
type c repeat homology)
(sr:, chicken)
12291263_f1_9893 17464678225131−7A60533(sr:, man) (mp:1q21-q24)
14583507_f1_1089417465468155146−9 Homo sapiensO00268(sr:, human) (de:(tafii135) (tafii-130)
32547642_f1_13895 17466363120126−7Homo sapiensAB011108(sr:homo sapiens male brain cdna
to mrna, clone_lib:pbluescriptii s)
(de:homo sapiens mrna for kiaa0536
protein, partial cds.)
13022906_f1_1589617 4671470489108−3I72525(sr:, man)
32134817_f1_18897174 681524507288−25CONTIG447GTC ORF with score 1123 to:
(ai:7501774336)
(or:Klebsiella pneumoniae)
16876080_f1_20174691788595887−89Escherichia coliP39455(de:hypothetical 46.6 kd protein in
phep-nfnb intergenic region)
35828950_f1_22899 17470639212234−20 AcinetobacterCONTIG180
baumanniiC(ai:7000694271)
( or:Bacillus subtilis)
31532091_f1_26 90017471435144175PseudomonasAF0548 68(de:pseudomonas aeruginosa
< td/>aeruginosaautoi nducer synthetase (rhli) gene,
< td/>partialcds; cyclohexadienyl
< td/>dehydratase (phec), hypothetical 0299
protein (yigm),
chloramphenicol-sensitive protein
(rard), and hypotheticalprotein (yafl)
genes, complete . . .
15128326_f1_2890117472< /td>1140379171−9no gb taxonomyU52064(de:kaposi's sarcoma-associated
matchherpes-like virus orf73 homolog
gene, complete cds.)
< td/>(nt:herpesvirus saimiri orf73
< td/>homolog)
22087791_f1_299021747320166
33726083_f1_3490317474471156189−15Enteroba cter cloacaeCONTIG147GTC ORF with score 189 to:
(ai:7000759570)
(or:Pseudomonas aeruginosa)
32681533_f1_39174751254417419−39KlebsiellaCont ig501AGTC ORF with score 820 to:
pneumoniae(ai:7000821374)
(or: Enterobacter cloacae)
24472842_f1_419 0517476447148144−9SaccharomycesP32323 (sr:, baker's yeast) (de:a-agglutinin
cerevisiaeattachmen t subunit precursor)
34267082_f1_43906465154141−10Pyrus communisU14009(sr:pear) (de:pyrus communis
arabinogalactan-protein (agp) mrna,
< td/>complete cds.)
2992951_f1_44907174 78714237102−2 Streptomyces griseusP54742(ec:2.7.1.—) (de:serine/threonine
< td/>protein kinase afsk,)
32520431_f1_459081 7479903300
11745802_f1_47 909174807652545 81−56Escherichia coliF64919
14300833_f1_4 891017481486161 118−6Apspergillus Contig8154GTC ORF with score 433 to:
fumigatus(ai:177837)
(or: Zea mays) (sr:, maize)
25886567_f1_539111 7482870289123−7Contig1817G TC ORF with score 143 to:
fumigatus(gn:wsp1+) (fn:actin patch assembly
and localization) (sr:fission yeast)
(de:schizosaccharomycepombe
wis kott-aldrich syndrome protein
homolog (wsp1+) gene, complete
cds, and btf3/beta-nac gene,
< td/>partialsequence.) (nt:wasp
32605166_f1_54912174831080359409−38Escherichia coliA64920
11727182_f1_5 591317484714237 258−22Rhodobacter S39906
< td>capsulatus
16067806_f1 _57914174851308435
30330168_f1_589151748631510490−3her pes simplex virusZ86099(fn:internal protein of immature
type 2 HSV-2capsids) (de:herpes simplex virus
< td/>type 2 (strain hg52),
complete genome.)
35799156_f1_599161748755818597−4AL123456 (de:mycobacterium tuberculosis
tuberculosish 37rv complete genome; segment
120/162.) (nt:rv2703, (mtcy05a6.24),
len: 528. function: siga.)
6535287_f1_7191717 488420139387−36Contig516AGT C ORF with score 612 to:
pneumoniae(ai:7000845890)
(or: Enterobacter cloacae)
29791287_f1_739 1817489417138415−39Enterobacter cloacaeCONTIG210GTC ORF with score 415 to:
(ai:7000759609)
(or:Pseudomonas aeruginosa)
2227155_f1_80919174902391796
32557837_f1_819201749111971644−169Pseudomon asP72170(ec:3.5.2.3) (de:dihydroorotase,
aeruginosa(dhoas e))
16204833_f1_849211749 2894297384−36 KlebsiellaContig546AGTC ORF with score 716 to:
pneumoniae(ai:7000835662)
(or: Enterobacter cloacae)
25402126_f1_869 2217493549182642−63AzotobacterP22759< /td>(de:bacterioferritin (bfr)
vinelandii(cytochrome b-557.5))
31379791_f1_919232433810
7150708_f1 _9692417495876291−20Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
_conserved spacers r or)
33647337_f1_101925174 96939312117−4 Canis familiarisS33121(cl:homeotic protein cdp:cut repeat
homology:homeobox homology)
(sr:, dog)
12972212_f1_10292617 49755218396−5 KlebsiellaContig547AGTC ORF with score 117 to:
pneumoniae(ai:7000786349)
(or: Pseudomonas aeruginosa)
5103457_f1_10417498783260878 −88Escherichia coliP11288(de:hypothctical 29.6 kd protein in
thrc-talb intergenic region)
30704517_f1_10892817499126041990−1 MycobacteriumQ50589( de:hypothetical protein cy19g5.10
tuberculosis(fragment))
25556931_f1_1119291750012634201397−143PseudomonasL22611(fn:relat ed to alginate biosynthesis.)
aeruginosa(sr:pseudomonas aeruginosa (strain
8830) dna) (de:pseudomonas
aeruginosa alginate synthesis
related protein (alg44)gene, complete
cds, algd and alg8 genes, 3′ end
and 5′ end.)
25881706_f1_1129301 7501456315202808−292PseudomonasU27829 (de:pseudomonas aeruginosa
< td/>aeruginosaalgin ate-c5-mannuronan-epimerase
(algg), algx (algx) and
alginate lyase (algl) genes,
complete cds.)
32300818_f1_1139311 750214374782476−257PseudomonasU27829( de:pseudomonas aeruginosa
< td/>aeruginosaalgin ate-c5- mannuronan-epimerase
< td/>(algg), algx (algx) and
alginate lyase (algl) genes,
complete cds.)
14650332_f1_1149321 750311073681915−198PseudomonasA40610( ec:4.2.2.3)
aeruginosa
32307942_f 1_1169331750427669212028−210PseudomonasU50202(fn:required for alginate
aeruginosao-acctylation) (sr:pseudomonas
aeruginosa strain=frd1)
(de:pseudomonas aeruginosa
< td/>alginate gene cluster algi (algi),
algj (algj) and algf (algf)
genes. complete cds.)
24102192_f1_1179341 75056752241113−113PseudomonasU50202(f n:required for alginate
aeruginosao-acctylation) (sr:pseudomonas
aeruginosa strain=frd1)
(de:pseudomonas aeruginosa alginate
gene cluster algi (algi), algj (algj)
and algf (algf) genes, complete cds.)
12142316_f1_1189351 75061578525115−4B34768
13922006_f1_ 119936175071188395−133Escherichia coliP77690(de:hypothctical 42.9 kd protein in
ais-pmrd intergenic region)
31897681 _f1_1219371750831951064< /td>1002−101SalmonellaAF036677(de:salmonella typhimurium putative
choleraesuisopero n regulated by pmrab, necessary
serotypefor 4-aminoarabinose lipid a
< i>typhimuriummodification and
polymyxinresistance, pmrg (pmrg)
gene, partial cds; pmrf (pmrf) gene
and 6orfs, complete cds: and pmrd
(pmrd) gene . . .
10830408_f1_12593817509 486161174−13< HIL>SalmonellaAF036677(de:salmonella typhimurium putative
choleraesuisopero n regulated by pmrab, necessary
serotypefor 4-aminoarabinose lipid a
< i>typhimuriummodification and
polymyxinresistance, pmrg (pmrg)
gene, partial cds; pmrf (pmrf) gene
and 6orfs, complete cds: and pmrd
(pmrd) gene . . .
13022780_f1_12893917510 573190296−26< HIL>Aquifex aeolicusH70301
16100341_ f1_12994017511861287409−38Enterobacter cloacaeCONTIG266GTC ORF with score 566 to:
(ai:7501773210)
(or:Klebsiella pneumoniae)
1367341_f2_130175121065354409−37Escherichia coliB65005
14706583_f2_1 35942175131890629−34Anabaena flos-aquaeAJ005201(fn:catalyzes atp-dependent formation
(strain IUCC 1444)of) (de:anabaena variabilis cphb and
cpha genes, complete.)
34073967_f2_137943 17514408135155−10< /td>Homo sapiensM74027(sr:homo sapiens (tissue library:
lambda-gem-11 (stratagene)) bloo)
< td/>(de:human mucin-2 gene, partial cds.)
33876633_f2_1399441 75152859952593−55 PseudomonasAF030352( de:pseudomonas aeruginosa two
aeruginosacomponent sensor (lema) gene,
< td/>partialcds.) (nt:protein histidine
kinase; similar to p. fluorescens)
14940632_f2_143 945175161839612331−29MycobacteriumU46844(de:mycobacterium smegmatis
smegmatiscatalas e-peroxidase (katg),
putativearabinosyl transferase (embc,
emba, embb), genes complete
cds andputative propionyl-coa
carboxylase beta chain (pccb) genes,
partialcds.) (nt:orf7; hypothetical
membrane. . .
3135028_f2_14694617517< /td>1092363107−2human herpesvirusU92288(fn:helicase, helicase-primase
type 6 HRV-6complex) (de:human herpesvirus 6
serotype b putative major immediate-
< td/>earlygenes.) (nt:similar to
hhv6a u86, region ie-b)
36067901_f2_1479471 75181230409
10042665_f2_1 5094817519711236304−27Bacillus(de:hypothetical 19.6 kd protein in
subtilis/Bacillusacda 5′ region)
globigii
32241283_f2_152< /td>9491752011433802 27−17KlebsiellaGTC ORF with score 888 to:
pneumoniae(ai:7000830083)
(or: Enterobacter cloacae)
5105408_f2_1539 5017521804267400−37KlebsiellaContig54 3AGTC ORF with score 888 to:
pneumoniae(ai:7000830083)
(or: Enterobacter cloacae)
33711431_f2_154 95117522651216714PseudomonasAF0548 68(de:pseudomonas aeruginosa
< td/>aeruginosaautoi nducer synthetase (rhli) gene,
< td/>partialcds; cyclohexadienyl
< td/>dehydratase (phec), hypothectical
0299 protein (yigm),
chloramphenicol-sensitive protein
(rard), and hypotheticalprotein (yafl)
genes, complete . . .
4554762_f2_17695217523< /td>2115704150−7BurkholderiaU41162(sr:burkholderia cepacia
cepaciastrain=1761 6) (de:burkholderia
cepacia d-serine deaminase (dsd)
< td/>gene, complete cds.)
< td/>(nt:unidentified orf)
12187843_f2_17995317 524630209703−69E64919
14975830_f2_1 8195417525693230141−7Micrococcus luteusJQ0405
14713966_f2 _18495517526504167−5mice|C57BL/6xCBA/Q06666 (sr:, mouse) (de:octapeptide-repeat
CaJ hybridprotein t2)
34507215_f2_185956175 2728293
2448806_f2_186957175281038345690 −68Escherichia coliP76182(de:hypothetical 38.1 kd protein in
add-nth intergenic region)
16917817_f2_18895817529549182383−35HaemophilusQ57020(d e:hypothetical protein hi1688)
influenzae
12554135_f2_19 5959175301515504161−11Klebsiella Contig541AGTC ORF with score 278 to:
pneumoniae(ai:7000809606)
(or: Pseudomonas aeruginosa)
16025652_f2_1971753123778115 −7KlebsiellaContig 541AGTC ORF with score 403 to:
pneumoniae(ai:7000809729)
(or: Pseudomonas aeruginosa)
15815967_f2_20417532732243130−5Escherichia coliD90774(sr:escherichi a coli (strain:k12) dna,
clone_lib:kohara lambda minise)
(de:e. coli genomic dna, kohara
clone #263(30.5-30.9
min.).) (nt:orf_id:o263#22; similar to
(swissprot accession)
12206261_f2_216962 1753325584115−6PseudomonasP72170( ec:3.5.2.3) (de:dihydroorotase,
aeruginosa(dhoas e))
31679208_f2_218963175 34615204110−3 AlphaherpesvirusP11675(s r:indiana-funkhauser/becker, prv)
pse udorabies virus(de:immediate-early protein ie180)
P RV
10163186_f2_2229641753 528293121−8CyanobacteriumS77421(sr:pc c 6803,, pcc 6803)
synechocystis(sr:pcc 6803,)
11133405_f2_223965 175361389462157−8 Trypanosoma cruziA44937(cl:kinetoplast-assoc iated protein)
11844541_f2_23496653717899−5 longfin squidS56117(sr:, longfin squid)
13063506_f2_235967 17538891296127−6S50883(sr:mice macrophage) (de:putative
transcription regulator {clone t2,
repetitive sequence} (mice,
macrophage, mrna, 1263 nt).)
< td/>(nt:method: conceptual translation
supplied by author.)
15797217_f2_237968324107
34416680_f2 _24096917540924307−159Pseudomonas A32013(cl:ornithine
aeruginosacarbamoyltransferase:
aspartate/ornithine
ca rbamoyltransferase homology)
(ec:2.1.3.3)
13095158_f2—241< /td>9701754199933214 5−10Enterobacter cloacaeCONTIG483GTC ORF with score 190 to:
(ai:7000771209)
(or:Pseudomonas aeruginosa)
22158443_f2_2461754213684552236−232Pseudomonas P11759(ec:1.1.1.132) (de:gdp-mannose 6-
aeruginosadehydrogenase, (gmd))
12605031_f2_252972 1754314614862426−252PseudomonasX99206 (de:p. aeruginosa algk gene and
aeruginosapartial alg44 and alge genes.)
16897567_f2_2539731754413354442170−225< /td>PseudomonasP18895(de:alginate production protein alge
aeruginosaprecursor)
26438166_f2_26397417545158 45272734−284Ps eudomonasU50202(fn:required for alginate
aeruginosao-acetylation) (sr:pseudomonas
aeruginosa strain=frd1)
(de:pseudomonas aeruginosa alginate
gene cluster algi (algi), algj (algj)
and algf (algf) genes, complete cds.)
34089530_f2_2649751 7546531176151−10U43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
25433342_f2_266976175 4716535502482−258 PseudomonasP07874(cc :5.3.1.8:2.7.7.22) (de:(gdp),
< td>aeruginosa(gdp-mannose pyrophosphorylase)
(gmp))
14551908_f2_270 97717548311410372396−249Escherichia coliE64996
12755032_f2_2 73978175491692563−8Caenorhabditis< /td>AF067607(de:caenorhabditis elegans cosmid
elegansc18h7.) (nt:similar to cuticular
collagen; c18h7.3)
16220816_f2_274979597198211−17SalmonellaAF036677 (de:salmonella typhimurium putative
choleraesuisopero n regulated by pmrab, necessary
serotypefor 4-aminoarabinose lipid a
typhim uriummodification and
polymyxinresistance, pmrg (pmrg)
gene, partial cds; pmrf (pmrf) gene
and 6orfs, complete cds;
and pmrd (pmrd) gene . . .
13006956_f2_27598017551 1083360149−8< HIL>Homo sapiensAB010962(sr:homo sapiens female uterurs cdna
to mrna, clone_:lamda gt11)
< td/>(de:homo sapiens mrna for mifr-2,
complete cds.) (nt:metalloproteinase
in the female reproductive)
24066668_f2_2779811755226487238−2 0Enterobacter cloacaeCONTIG266GTC ORF with score 238 to:
(ai:7000759813)
(or:Pseudomonas aeruginosa)
26604080_f3_28117553759252228−19PyrococcusAP00 0002(sr:pyrococcus horikoshii (str:ot3)
horikoshiidna) (de:pyrococcus horikoshii ot3
genomic dna, 287001-544000 nt.
position(2/7).) (nt:similar
to :d888021 percent ident:28.708 in)
15798536_f3_286983175 54474157104−3 Epstein-Barr virusP03181(sr:b95-8, human herpesvirus 4)
(de:hypothetical bhlf1 protein)
12771080_f3_28798436312099−4 Homo sapiensAB002322(sr:homo sapiens male brain cdna
to mrna, clone_lib:pbluescriptii s)
(de:human mrna for
kiaa0324 gene, partial cds.)
13088180_f2_2919851 755642013998−3Homo sapiensU47924(sr:human) (de:human chromosome
< td/>12p13 sequence, complete sequence.)
< td/>(nt:human dentatorubral and
pallidoluysian atrophy)
16253957_f3_300986489162225−17RhodobaterAF016236 (de:rhodobacter sphaeroides
sphaeroidesdms o/tmao-sensor kinase (dors),
dmso/tmao-response regulator (dorr),
dmso/tmao-cytochromec-containing
subunit (dorc), dmso-membrane
protein (dorb), and
dmso/tmao-reductase (dora) genes,
complete cds.)
5208211_f3_30198717 55828293119−8 Enterobacter cloacaeCONTIG457GTC ORF with score 318 to:
(ai:112766)
(or:Escherichia coli) nt:o72;
072; alternate orf with good statistics.
16931905_f3_30798817559705234220−17 Enterobacter cloacaeCONTIG488GTC ORF with score 274 to:
(de:mycobacterium smegmatis
catalase-peroxidase (katg),
putativearabinosyl transferase (embc,
emba, embb), genes complete cds and
putative propionyl-coa carboxylase
beta chain (pccb) genes, partialcds.)
(nt:orf7: hypothetical . . .
32525091_f3_31098917560 1521506531−51 BacillusP37514(de:hypoth etical 49.7 kd protein in
subtilis/Bacillustetb-exoa intergenic region)
globigii
11067658_f3_311< /td>9901756114314761 34−6mice|C57BL/6xCBA/P05143(sr:, mouse) (de:proline-rich protein
CaJ hybridmp-3 (fragment)
22074166_f3_314991 1756227691196−16KlebsiellaContig543A
pneumoniae(ai:7000759850)
(or: Pseudomonas aeruginosa)
22665708_f3_31517563618205412−39Enterobacter cloacaeCONTIG452GTC ORF with score 412 to:
(ai:7000759851)
(or:Pseudomonas aeruginosa)
12210455_f3_31917564564187183−13Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
12632191_f3_321994175 65903300284−25Enterobacter cloacaeCONTIG437GTC ORF with score 299 to:
(ai:7501778850)
(or:Klebsiella pneumoniae)
4398533_f3_3341756620436802369−246Escherichia coliP00959(ec:6.1.1.10) (de:(metrs))
32133416_f3_33599617567834277
1080733 0_f3_33799717568489162122−7Rattus norvegicusS24169(sr:, norwawy rat)
32661390_f3_34099817 56946215392−2 HumanP50768(de:regulator y protein e2)
papillomavirus type
22
31822191_f3_34199923317761267−12 9Escherichia coliG64919
32683157_f3_3 44100017571729242−3Araneus diadematusU47856(de:aran eus diadematus fibroin-4
mrna, partial cds.)
16142202_f3_3471001 17572639212798−79 Escherichia coliP20625(ec:4.2.99.18) (de:lyase))
24664753_f3_34810021757319564
30552030 _f3_35110031757442314093−4Pyrococcus< /td>AP000003(su:pyrococcus horikoshii (str:ot3)
horikoshiidna) (de:pyrococcus horikoshii ot3
genomic dna, 544001-777000 nt.
position(3/7).)
12752306_f3_35517575519172140−8Caenorhahditis AF000298(sr:caenorhahditis elegans
elegansstrain=bris tol n2) (de:caenorhabditis
elegans cosmid w03d2.)
(nt:weak similarity to
collagens; glycine- and)
10414581_f3_36210051 757638821293157−7 Homo sapiensAB002322(sr:homo sapiens male brain cdna
to mrna, clone_lib:pbluescriptii s)
(de:human mrna for kiaa0324 gene,
< td/>partial cds.)
35659768_f3_3651006 1757748616199−3P03211(sr:b95-8, human herpesvirus 4)
(de:ebna-1 nuclear protein)
103887_f3_366100717578423140521−50NeisseriaY14298(ec: 4.4.1.5) (de:neisseria meningitidis
meningitidisg loa gene.) (nt:subunit glyoxalase i)
21538563_f3_3721008175 791266421704−69P46232(ec:3.1.13 .—) (de:ribonuclease t,
parahaemolyticus(exoribonuclease t) (rnase t))
9891411_f3_3761009175 801449482131−5SaccharomycesP08640(sr: ,baker's yeast) (ec:3.2.1.3)
cerevisiae(de:glucosida se) (1,4-alpha-d-glucan
glucohydrolase))
3372 6452_f3_3781010175815971 98247−21Enterobacter cloacaeCONTIG313GTC ORF with score 247 to:
(ai:7000759914)
(or:Pseudomonas aeruginosa)
17050806_f3_37917582567188134−6Homo sapiensAB011167(sr:homo sapiens male brain cdna
to mrna, clone_lib:pbluescriptii s)
(de:homo sapiens mrna
for kiaa0595 protein, partial cds.)
2040657_f3_38410121 7583304510143404−9999< /td>PseudomonasAJ007828
< HIL>tolaasii
31927207_f3_38510131758454918212 5−6Canadian hardJN0690(cl:glutenin) (sr:, common wheat)
w inter wheat
13004705_f3_3861014 175851110369680−67CyanobacteriumS75652(cl:unassigned atp-binding cassette
synechocystisproteins:atp binding cassette
homology) (sr:pcc 6803,, pcc
6803) (sr:pcc 6803,)
16585811_f3_39110151758615395122574−267< /td>PseudomonasL22611(fn:related to alginate biosynthesis.)
aeruginosa(sr:pseudomonas aeruginosa
< td/>(strain 8830) dna) (de:pseudomonas
aeruginosa alginate synthesis related
protein (alg44) gene, complete cds,
algd and alg8 genes, 3′ end and
5′ end.) (nt:initiation site unkno . . .
31447681_f3_39210161758 71269422111−3 HumanP06422(de:regulator y protein e2)
papillomavirus type
8
10645692_f3_3931017900299199−12CaenorhabditisU80846
elegans(de:caenorhabditi s elegans cosmid
k06a9.) (nt:partial cds; coded for by
c. elegans cdna yk50c7.5)
12994405_f3_3951018 17589813270114−3SaccharomycesP47179
cerevisiae
1609478 0_f3_3981019175901251416 131−5Homo sapiensO00268(sr:, human) (de:(tafii135) (tafii-130)
22945836_f3_3991020175911647548183− 10OryctolagusP27884 (sr:, rabbit) (de:brain calcium channel
cuniculusbi-2 protein)
32057716_f3_40010211599532
32516292_ f3_403102217593918305152−7Alphaherpesvirus< /HIL>P11675(sr:indiana-funkhauser/becker, prv)
pseudorabies virus(de:immediate-early protein ie180)
P RV
31922708_f3_4051023175 94918305151−7 OryctolagusP27884(sr:, rabbit) (de:brain calcium channel
cuniculusbi-2 protein)
9847802_f3_41010241005334140−7mice|C57BL/6xCBA/C29149(cl:proline-r ich protein)
CaJ hybrid(sr:, house mouse)
876466_f3_41110251 759611913961046−106Escherichia coliP77757(de:hypothetical 36.3 kd protein in
ais-pmrd intergenic region)
13777255_f3_41710261836611329−27SalmonellaAF036677(de:salmonella typhimurium
choleraesuispu tative operon regulated by pmrab,
< HIL>serotypenecessary for 4-aminoarabinose lipid
typhimuriuma modification and
polymyxinresistance, pmrg (pmrg)
gene, partial cds; pmrf (pmrf) gene
and 6orfs, complete cds;
and pmrd (pmrd) gene . . .
25801533_f3_41810271759 81257418836−83SalmonellaAF036677(de:< HIL>salmonella typhimurium putative
choleraesuisopero n regulated by pmrab, necessary
serotypefor 4-aminoarabinose lipid a
< i>typhimuriummodification and
polymyxinresistance, pmrg (pmrg)
gene, partial cds; pmrf (pmrf) gene
and 6orfs, complete cds; and pmrd
(pmrd) gene . . .
20875162_13_42110281759 91629542951−95XanthomonasJC2525(ec:1. 1.99.—)
campestris pv.
campestris
11915830_c1_42310 2917600723240766−76Escherichia coliP20966(ec:2.7.1.69) (de:(ec 2.7.1.69)
(eii-fru))
16538530_c1_428 103017601498165134−8human herpesvirusU92288(fn:helicase, helicase-primase
type 6 HHV-6complex) (de:human herpesvirus 6
serotype b putative major immediate-
< td/>earlygenes.) (nt:similar to
hhv6a u86, region ie-b
16510317_c1_42910311 7602489162135−8U43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
33463505_c1_430103217 6031143380508−49Contig502AG TC ORF with score 508 to:
pneumoniae(ai:7000759966)
(or: Pseudomonas aeruginosa)
12991318_c1_436176041200399589−57KlebsiellaCo ntig502AGTC ORF with score 589 to:
pneumoniae(ai:7000759972)
(or: Pseudomonas aeruginosa)
2520433_c1_439176051491496
1035176061527508
5989791_c1_4451036 176072847948
17034575 _c1_4501037176081497498< /td>
12970033_c1_458103817609< /td>984327248−21Enterobacter cloacaeCONTIG393GTC ORF with score 248 to:
(ai:7000759994)
(or:Pseudomonas aeruginosa)
2205407_c1_470176102178725518−48Mycobacterium AL021928(de:mycobacterium tuberculosis
tuberculosish 37rv complete genome; segment
11/162.) (nt:rv0197, (mtv033.05),
len: 762.unknown but similar)
35336592_c1_4761040615204775−77LegionellaL46863( sr:legionella pneumophila
pneumophila(in dividual_isolate
philadelphia 1) dna) (de:legionella
pneumophila alkylhydrogenperoxide
reductase (tsaa) gene, complete cds
and glutaredoxin-like protein (grla)
gene, completecds.)
11192667_c1_4771041< /td>176122262753310⠈’27Vibrio alginolyticusD84310(sr:v ibrio alginolyticus (strain:138-2)
dna) (de:vibrio alginolyticus gene
for rnase t and moty gene for
component of sodium-driven polar
< td/>flagellar motor, complete cds.)
30566441_c1_4781042 176131239412954−96Rattus norvegicusP09034(sr:, rat) (ec:6.3.4.5) (de:ligase))
31267283_c1_4831043176141059352106∠’2Rattus norvegicusQ63003(sr:, rat) (de:5e5 antigen)
35407666_c1_4841044900299137−7CaenorhabditisP20630
< td>elegans
16511403_c1_48 51045176162505834−63PseudomonasX99514(fn:outer membrane component of
aeruginosamultidrug efflux) (de:p. aeruginosa
< td/>mexe, mexf & oprn genes.)
25955006_c1_487104625283180−14Enterobacter cloacaeCONTIG210GTC ORF with score 94 to:
(ai:4000710458)
(or:Synechocystis)
(gtcf:14.1) (kcggfc:14.2)
(kazusafc:16.1)
25631336_c1 _488104717618336111
14656417_c1_489104817619 1947648160−8< i>Homo sapiensM63730(sr:human, cdna to mrna)
< td/>(de:human bullous pemphigoid
< td/>(bp180) mrna, partial cds.)
36428876_c1_4901049 176201464487744−74SchizosaccharomycesAL02 1815(sr:fission yeast) (de:s. pombe
< td/>pombechromosome i cosmid c8e4.)
(nt:spac8e4.05c, putative cis-
muconate cycloisomerase,)
10647831_c1_49410 50176211449482413MycobacteriumAL02 1925(de:mycobacterium tuberculosis
tuberculosish 37rv complete genome; segment
100/162.) (nt:rv2258c, (mtv022.08c),
len: 353. unknown but)
34234808_c1_49710511 7622537178104−3AJ224893( fn:spore differentiation)
discoidcum(de: dictyostelium discoideum
< td/>srfa gene.)
29823278_c1_498105217623582193110−7 KlebsiellaContig311A GTC ORF with score 164 to:
pneumoniae(ai:7000822108)
(or: Enterobacter cloacae)
13011578_c1_500 105317624504167133Enterobacter cloacaeCONTIG388GTC ORF with score 133 to:
(ai:7000760036)
(or:Pseudomonas aeruginosa)
9813433_c1_50417625480159105−5AspergillusCont ig9493GTC ORF with score 181 to:
fumigatus(ai:5500701468)
(or:< HIL>Equine herpesvirus 4) (fn:very
large tegument protein)
(de:equine herpesvirus 4
strain ns80567, complete genome.)
(nt:counterpart of hsv-1 gene u136
and vzv gene 22)
35261428_c1_507105517 6261131376351−32CONTIG388GTC ORF with score 351 to:
(ai:7000760043)
(or:Pseudomonas aeruginosa)
5163581_c1_5171762711343771223−124Pseudomonas fragiP72190(de:hypothetical 30.2 kd protein in
capb 3′ region)
32557262_c1_520105789729892−4 AspergillusContig6660GTC ORF with score 92 to:
fumigatus(ai:7000760056)
(or:< HIL>Pseudomonas aeruginosa)
15041631_c1_5211762913384452259< /td>−234PseudomonasB53652
aeruginosa
16532941 _c1_5221059176301464487< /td>1046−106Pseudomonas< /HIL>P54291(de:autoinducer synthesis protein rhli)
aeruginosa
16289691_c1_524< /td>1060176312262753 1501−154PseudomonasAF054868(de:pseudomonas aeruginosa
< td/>aeruginosaautoi nducer synthetase (rhli) gene,
< td/>partialcds; cyclohexadienyl
< td/>dehydratase (phcc), hypothetical 0299
protein (yigm), chloramphenicol-
sensitive protein (rard), and
hypotheticalprotein (yafl)
genes, complete . . .
30713192_c1_52710611763 221671167−12< HIL>PseudomonasAF054868(de:pseudomonas aeruginosa
< td/>aeruginosaautoi nducer synthetase (rhli) gene,
< td/>partialcds; cyclohexadienyl
< td/>dehydratase (phcc), hypothctical 0299
protein (yigm), chloramphenicol-
sensitive protein (rard), and
hypotheticalprotein (yafl)
genes, complete . . .
35792777_c1_53210621763 31419472261−22KlebsiellaContig442AGTC ORF with score 261 to:
pneumoniae(ai:7000760068)
(or: Pseudomonas aeruginosa)
26660317_c1_53517634603200128−6rainbow troutAB008374(sr:oncorhynchus mykiss fibroblast
< td/>cell_l:rtt cdna to mrna)
< td/>(de:oncorhynchus mykiss mrna for
alpha 3 type i collagen, partial cds.)
16275167_c1_5361064 176351398465739−73Escherichia coliP21507(de:atp-dependent rna helicase srmb)
29536267_c1_5431065 17636663220125−5U43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
16929005_c1_544106617 637561186143−8Homo sapiensM94131(sr:homo sapiens intestine cdna to
mrna) (de:human mucin 2 (muc2)
mrna, partial cds.)
10676666_c1_5461067 17638627208124−5I49607(cl:collagen alpha 1(i) chain:fibrillar
< td/>CaJ hybridcollagen carboxyl-terminal
homology:von willebrand factor
type c repeat homology) (sr:, house
< td/>mouse)
7089591_c1_5481068< /td>1763938412797− 5Enterobacter cloacaeCONTIG456GTC ORF with score 479 to:
(ai:7501757122)
(or:Klebsiella pneumoniae)
13129158_c1_54917640336111162−12KlebsiellaCon tig535AGTC ORF with score 479 to:
pneumoniae(ai:7000846281)
(or: Enterobacter cloacae)
35290711_c1_552 1070176411362453
24070918_c1_5561071176421368455
29802083_c2_56410721083360107−6AspergillusContig7202 GTC ORF with score 107 to:
fumigatus(ai:7000760100)
(or:< HIL>Pseudomonas aeruginosa)
13022341_c2_5671764431210399−4MolluscumL10127 (sr:molluscum contagiosum virus type
contagiosum virus1 dna) (de:molluscum contagiosum
subtype 1virus type 1 orf1 and orf2 dna.)
< td/>(nt:orf17)
26385791_c2_568 1074176451083360108 −2Petromyzon marinusI51116(sr:, sea lamprey)
24714091_c2_5691075801266389−36KlebsiellaContig502A
pneumoniae(ai:7000760105)
(or: Pseudomonas aeruginosa)
10836406_c2_57117647603200133−6Canis familiarisS33121(cl:homeotic protein cdp:cut repeat
homology:homeobox homology)
(sr:, dog)
14658567_c2_57310771 76481128375116−4Q01042(sr:1 1,) (de:immediate-early protein)
herpesvirus 2
31380433_c2_597107817649639212109−3 Rana catesbeianaAB015440(sr:r ana catesbeiana cdna to mrna)
< td/>(de:rana catesbeiana mrna for alpha
< td/>1 type i collagen, complete cds.)
30719080_c2_6001079 176501596531
15056468_c2_ 6041080176511275424 255−22KlebsiellaContig471AGTC ORF with score 428 to:
pneumoniae(ai:7000787132)
(or: Pseudomonas aeruginosa)
32557843_c2_60717652642213
1082176532649882153−6Chinese oakAF083334(sr:chinese oak silkmoth)
silkmoth(de:antheraea pernyi fibroin gene,
< td/>complete cds.)
4509658_c2_61610831 7654330109421−39P37010(de:hypothetical 12.9 kd protein in
lhr-sodb intergenic region)
4744840_c2_62210841765529497196−16 Enterobacter cloacaeCONTIG474GTC ORF with score 196 to:
(ai:7000760158)
(or:Pseudomonas aeruginosa)
34095667_c2_62817656591196205−17Enterobacter cloacaeCONTIG313GTC ORF with score 205 to:
(ai:7000760164)
(or:Pseudomonas aeruginosa)
36114381_c2_63017657440414673222 −9999Pseudomonas(fn:cytoplasmic membrane
aeruginosacomponent of multidrug)
< td/>(de:p. aeruginosa mexe, mexf &
oprn genes.)
2461086_c2_6361087176581236411
3172017_c2_ 6491088176591026341 361−33Enterobacter cloacaeCONTIG388GTC ORF with score 361 to:
(ai:7000760185)
(or:Pseudomonas aeruginosa)
22916653_c2_653176601974657467−44Enterobacter cloacaeCONTIG388GTC ORF with score 580 to:
(ai:7501733302)
(or:Klebsiella pneumoniae)
31916680_c2_6561766131381045732< /td>−72KlebsiellaC ontig244AGTC ORF with score 1073 to:
pneumoniae(ai:7000821392)
(or: Enterobacter cloacae)
10806888_c2_661 109117662609202690RickettsiaAJ2352 69Rickettsia prowazckii strain
prowazekiiMadrid E, complete genome.
29494376_c2_662109221671
29812576_c2_ 663109317664924307−156Pseudomonas A53652
< td/>aeruginosa
138458 36_c2_667109417665786261 1268−129Pseudomonas P54292(de:rhlr regulatory protein (elastase
aeruginosamodulator))
16285030_c2_669109517666 27089405−38PseudomonasA42325
aeruginosa
35784505_c2_6701096176678402791372−140< HIL>PseudomonasQ01269(ec:4.2. 1.51.4.—.—.—)
aeruginosa(de:(e c4.2.1.51)/arogenate
< td/>dehydratase,))
35839 660_c2_67110971766895431 71428−146PseudomonasAF054868(de:pseudomonas aeruginosa
< td/>aeruginosaautoi nducer synthetase (rhli) gene,
< td/>partialcds; cyclohexadienyl
< td/>dehydratase (phec), hypothetical 0299
protein (yigm),
chloramphenicol-sensitive protein
(rard), and hypotheticalprotein (yafl)
genes, complete. . .
285416_c2_676109817669< /td>1350449
36146025_c2_684109917670100233313 5−6Klebsiella Contig544AGTC ORF with score 422 to:
pneumoniae(ai:7000837913)
(or: Enterobacter cloacae)
30167715_c2_690 1100176711416471160 −8RuminococcusS213 23
flavefaciens
7164581_c2_691110117672106235311 8−3Acanthamoeba(sr:, amoeba) (de:myosin ic heavy
castellaniichain)
36188266_c2_699110217673417< /td>138107−5Drosoph ilaP48608(sr:, fruit fly) (de:diaphanous protein)
melanogaster
885452_c2_7 04110317674849282−10Bacillus(cl:regulatory protein mpra)
subtilis/Bacillus
globigii
22058281_c2_70611041767571423799−2herpes simplex virusZ86099(fn:immediate early protein;
type 2 HSV-2transcriptional) (de:herpes simplex
virus type 2 (strain hg52),
complete genome.)
25671927_c3_70911051068355123−5CaenorhabditisZ81518
elegansf28d9, complete sequence.) (nt:cdna
cst embl:c09269 comes from this
gene; cdna est)
33439517_c3_71711061 7677537178139−9B34768
11211567_c3_ 7181107176782610869 1578−162Klebsiella Contig502AGTC ORF with score 1578 to:
pneumoniae(ai:7000760254)
(or: Pseudomonas aeruginosa)
10978785_c3_72117679816271430−41KlebsiellaCon tig544AGTC ORF with score 488 to:
pneumoniae(ai:7000788018)
(or: Pseudomonas aeruginosa)
21890768_c3_72217680783260258−22Enterobacter cloacaeCONTIG420GTC ORF with score 258 to:
(ai:7000760258)
(or:Pseudomonas aeruginosa)
20837933_c3_726176811695564114−3StrongylocentrotusL34680(sr:strongylocentrotus purpuratus
< td/>purpuratus(libr ary: lambda gt11) gastrula stag)
< td/>(de:strongylocentrotus purpuratus
< td/>calcium-binding protein (endo16)
mrna, complete cds.)
6719507_c3_73311111 76821653550133−4AF030027(fn:very large tegument protein)
type 4 EHV-4(de:equine herpesvirus 4 strain
ns80567, complete genome.)
(nt:counterpart of hsv-1 gene u136
and vzv gene 22)
35751086_c3_734111217 6831458485
14261280_c3_74 211131768422574 119−7Enterobacter cloacaeCONTIG507GTC ORF with score 150 to:
(ai:7000772159)
(or:Pseudomonas aeruginosa)
35597160_c3_74417685768255149−11KlebsiellaCon tig398AGTC ORF with score 469 to:
pneumoniae(ai:7000839977)
(or: Enterobacter cloacae)
1270391_c3_7551 11517686702233123Rhesus Epstein BarrU93909(sr:rhesus epstein barr virus)
virus(de:cercopithccine herpesvirus 15
nuclear antigen ebna-1 gene,
< td/>completecds.)
16844590_c3_756176871443480466−44Rhizobium sp.P55682(sr:ngr234,) (de:hypothetical transport
protein y4wd)
30721030_c3_7641117 1768826186101−6CONTIG507GTC ORF with score 101 to:
(ai:7000760300)
(or:Pseudomonas aeruginosa)
15914591_c3_76717689696231133−7mice|C57BL/6xCBA/E29149 (cl:proline-rich protein)
CaJ hybrid(sr:, house mouse)
35430201_c3_7681119176901176391162−8RuminococcusS21323
flavefaciens
2628581_c3_770112017691458769−76< i>PseudomonasX99514(fn:periplasm ic link protein of
aeruginosamultidrug efflux) (de:p. aeruginosa
< td/>mexc, mexf & oprn genes.)
11039512_c3_7861121657218133−6Epstein-Barr virusP03211(sr:b95-8, human herpesvirus 4)
(de:ebna-1 nuclear protein)
36035888_c3_78711221707568671−66< /td>MethanococcusQ58339
jannaschii< /td>lyase, (adenylosuccinase) (asl))
16895832_c3_792112317694684227130−6 Homo sapiensX07884(sr:human) (de:human mrna for
alpha-1 (i) chain of procollagen
type i.) (nt:preprocollagen
(aa −22 to 450) (1500 is 1st base in)
12626002_c3_795112417 6951533510171−9U37520(de:nephil a clavipes dragline silk
protein spidroin 1 gene, partialcds.)
32697541_c3_796112517696648215157− 9human herpesvirusU92288(fn:helicase, helicase-primase complex)
type 6 HHV-6(de:human herpesvirus 6 serotype b
putative major immediate-
< td/>earlygenes.) (nt:similar to
hhv6a u86, region ie-b)
13026458_c3_7971126 17697543180235−20 KlebsiellaContig311A GTC ORF with score 235 to:
pneumoniae(ai:7000760333)
(or: Pseudomonas aeruginosa)
32672881_c3_80017698777258209−16KlebsiellaCon tig376AGTC ORF with score 550 to:
pneumoniae(ai:7000762499)
(or: Pseudomonas aeruginosa)
14148931_c3_801176991551516
3261593_c3_805112917700648215123−5Brassica napusU59446(sr:rape) (de:brassica napus
< td/>myrosinase-binding protein related
protein mrna, partial cds.)
< td/>(nt:divergently related to myrosinase
< td/>binding protein;)
7086058_c3_8061130723240131−6silkwormS42886(cl:unassigned collagens)
< td/>(sr:, silkworm)
35407691_c3_8101131 17702918305118−4Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
14152041_c3_814113217 703432143144−10Contig534AGT C ORF with score 144 to:
pneumoniae(ai:7000760350)
(or: Pseudomonas aeruginosa)
35284757_c3_81617704120340095−4KlebsiellaCont ig396AGTC ORF with score 222 to:
pneumoniae(ai:7000817275)
(or: Enterobacter cloacae)
31297256_c3_817 11341770517915961813−187Escherichia coliP26616(ec:1.1.1.38) (de:probable malate
oxidoreductase (nad), (malic enzyme))
16147543_c3_8181135489162106−6AspergillusContig9437
fumigatus(ai:7000760354)
(or:< HIL>Pseudomonas aeruginosa)
2525258_c3_826177071446481131−5Homo sapicnsZ74616(sr:human) (de:h. sapiens mrna for
prepro-alpha2 (i) collagen.)
30260276_c3_8271137177081662553331−3 0Enterobacter cloacaeCONTIG488GTC ORF with score 618 to:
(ai:7000806161)
(or:Pseudomonas aeruginosa)
34198890_c3_828177091512503
15839211_c3_8321139177101575< /td>524115−3Mollusc umU60315(de:molluscum contagiosum virus
< td/>contagiosum virussubtype 1, complete genome.)
subtype 1(nt:contains large predicted
non-globular regions and)
16035842_c3_83511401 77111812603
31385056_c3_8 37114117712486161−8OryctolagusP27884(sr:, rabbit) (de:brain calcium channel
cuniculusbi-2 protein)
13089652_c3_84111421467488
30166407_ f1_1114317714861286 177−14Enterobacter cloacaeCONTIG165GTC ORF with score 216 to:
(ai:7000773303)
(or:Pseudomonas aeruginosa)
13086686_f1_4114417715948315871 −87BacillusE69902< /td>
subtilis/Bacillus
globigii
11225693_f1_10114517 716522173104−3SchizosaccharomycesZ98529(sr:fission yeast) (de:s. pombe
< td/>pombechromosome i cosmid c16e8.)
(nt:spac16c8.01, putative
cytoskeleton assembly control)
12994581 _f1_16114617717390129106−5Canis familiarisS33121(cl:homeotic protein cdp:cut repeat
homology:homeobox homology)
(sr:, dog)
25910461_f1_24114717 7182754917
17047702_f1_33 1148177192130709
35255068_f1_36114917720 453150174−12slime moldAF023910(sr:slime mold) (de:physarum
< td/>polycephalum dna topoisomerase i
(top1) mrna, completecds.)
14745381_f1_47115017721444147
340866 37_f1_53115117722333110< /td>
15103955_f1_541152177232022673218−15Enterobacter cloacaeCONTIG370GTC ORF with score 322 to:
(ai:7000807782)
(or:Pseudomonas aeruginosa)
10321041_f1_601772444414796 −2human herpesvirusU92288(fn:helicase, helicase-primase
type 6 HHV-6complex) (de:human herpesvirus
6 serotype b putative major
< td/>immediate-earlygenes.) (nt:similar to
hhv6a u86, region ie-b)
1370950_f1_64115417 725513170155−11Contig560AGT C ORF with score 155 to:
pneumoniae(ai:7000760442)
(or: Pseudomonas aeruginosa)
2613152_f1_761155177261551516315−28RhodobacterAF0 10496(de:rhodobacter capsulatus strain
capsulatussb1003, partial genome.)
12110458_f1_7711561326441699−69BacillusH69784
subtilis/Bacillus
globigii
31922906_f1_86115717728681226101−2Ho mo sapiensP18825(sr:, human) (de:alpha-2c-1
adrenergic receptor (alpha-2c-1
adrenoceptor) (subtype c4))
2864142_f1_881158177 29987328163−8 AlphaherpesvirusP11675(s r:indiana-funkhauser/becker, prv)
pseudorabies virus(de:immediate-early protein ie180)
P RV
32523958_f1_9011591773 01506501182−10mice|C57BL/6xCBA/AF062655(sr:house mouse) (de:mus musculus
CaJ hybridplenty-of-prolines-101 mrna,
< td/>complete cds.) (nt:binds to several
sh3 domain containing proteins)
22157016_f1_9211601980659122−3no gb taxonomyU52064(de:kaposi's sarcoma-associated
matchherpes-like virus orf73 homolog
gene, complete cds.)
< td/>(nt:herpesvirus saimiri orf73
< td/>homolog)
16878341 _f1_97116117732612203148−9Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
33729206_f1_981162177 331089362320−29AF061070(de :pseudomonas stutzeri orf117
stutzeri(orf117), orf86 (orf86) genes,
completecds; and ptxabcde operon,
partial sequence.)
< td/>(nt:putative binding protein
component of)
16255455_f1_100116317 7341746581409−38AF061070(d e:pseudomonas stutzeri orf117
stutzeri(orf117), orf86 (orf86) genes,
completecds; and ptxabcde operon,
partial sequence.)
< td/>(nt:putative inner membrane
component of)
13020313_f1_101116417 735852283
11973916_f1_102 116517736309102
31929168_f1_103116617737 2310769
3161700_f1_105116 7177381011336427−40PseudomonasAJ00782 5(de:pseudomonas aeruginosa mext
aeruginosagene.)
33726441_f1_108116817739 900299
14714580_f1_111116917740435144217 −17MycobacteriumZ7 0283(de:mycobacterium tuberculosis
tuberculosish 37rv complete genome; segment
98/162.) (nt:rv2214c, (mtcy190.25c),:
< td/>592.ephd, equivalent)
4422713_f1_112117017741897298598−58 MycobacteriumAL008967(de:mycobacterium tuberculosis
tuberculosish 37rv complete genome; segment
122/162.) (nt:rv2777c, (mtv002.42c),
len: 356 aa. unknown)
12510938_f1_1131171738245254−22Enterobacter cloacaeCONTIG488GTC ORF with score 254 to:
(ai:7000760491)
(or:Pseudomonas aeruginosa)
16274156_f1_116177431179392189−14KlebsiellaCo ntig554AGTC ORF with score 211 to:
pneumoniae(ai:7000780407)
(or: Pseudomonas aeruginosa)
32598952_f1_11717744930309133−5mice|C57BL/6xCBA/AF062655
CaJ hybridplenty-of-prolines-101 mrna,
< td/>complete cds.) (nt:binds to several
sh3 domain containing proteins)
36189201_f1_1191174 17745876291153−7Homo sapiensAB002322(sr:homo sapiens male brain cdna to
mrna, clone_lib:pbluescriptii s)
(de:human mrna for kiaa0324 gene,
< td/>partial cds.)
6072556_f1_12011751 77461122373109−2JQ0406
32520006_f1 _1241176177472298765194−11Pseudomonas S53999(cl:gramicidin s synthetase i repeat
< HIL>aeruginosahomology:acetate--c oa ligase
homology:acyl carrier protein
homology)
21484761_f1_125117717748846281106 −2Homo sapiensAF065164(sr:human) (de:homo sapiens
hyperpolarization activated channel 1
(ih 1) mrna, partial cds.)
31885430_f1_1301178 17749165054990−3S56117(sr:, longfin squid)
12348841_f1_1331179177501308435653−64StreptomycesU09991 (fn:antibiotic efflux; transmembrane
venezuelaeprotein) (de:streptomyces venezuelae
< td/>isp5230 chloramphenicol resistance
< td/>protein (cmlv) and chloramphenicol
phosphotransfera se genes,
complete cds.)
36035202_f1_1371180 17751825274114−4P14918(sr:, maize) (de:extensin precursor
(proline-rich glycoprotein))
7286461_f1_1421181< /td>177521809602
1301 6086_f1_1461182177531407 468103−2AchromobacterL81125(sr:pseudomonas sp (strain imt37)
< HIL>georgiopolitanumdna) (de:pseudomonas sp. (strain
imt37) monooxygenase subunit gene,
< td/>completecds.)
16302018_f1_14917754678225108−6AspergillusCon tig4037GTC ORF with score 110 to:
fumigatus(ai:187452)
(or: Chironomus tentans)
24478803_f1_158 1184177551230409
16097557_f1_161118517756912 303
5113331_f1_1681186177571605534479−45Escherichia coliP00926(ec:4.2.1.14) (de:d-serine
dehydratase, (d-serine deaminase))
3413533_f1_1691187177581080359363−3 3StreptomycesAL031124(de:streptomyces coelicolor cosmid
coelicolor1c2.) (nt:sc1c2.12c, probable integral
membrane protein,:)
35667691_f2_174118817759993330708−70 Escherichia coliP36767(de:hypothetical 34.0 kd protein in
arom-araj intergenic region)
31882331_f2_176118926186116−7 Enterobacter cloacaeCONTIG432GTC ORF with score 116 to:
(ai:7000760554)
(or:Pseudomonas aeruginosa)
21586443_f2_17717761390129360−33Escherichia coliP30743(de:surge protein)
31297691_f2_1791191169856595−1no gb taxonomyU52064(de:kaposi's sarcoma-associated
matchherpes-like virus orf73 homolog
gene, complete cds.)
< td/>(nt:herpesvirus saimiri orf73
< td/>homolog)
36383291_f2_180< /td>1192177632391796
31911082_f2_185119317764 36871228196−12mice|C57 BL/6xCBA/A28996(cl:proline-rich protein) (sr:,
Ca J hybridhouse mouse)
24098916_f2_18711941776551917296−2Z99161 (sr:fission yeast) (de:s. pombe
< td/>pombechromosome i cosmid c11g7.)
(nt:spac11g7.01, unknown;
serine rich, len:536aa)
10276416_f2_19011951776663020990−2SchizosaccharomycesD8 9103(sr:schizosaccharomycepombe
p ombe(strain:pr745) cdna to mrna)
< td/>(de:schizosaccharomycepombe
mrna , partial cds, clone: sy 0143.)
(nt:unnamed protein product)
10244467_f2_19111961335444107−5PyrococcusAP000006(sr:pyrococcus horikoshii (str:ot3)
horikoshiidna, cl:pyrococcus horikoshi)
< td/>(de:pyrococcus horikoshii ot3
genomic dna, 1166001-1485000
< td/>nt. position(6/7).)
20447153_f2_193119 717768459152
997 8457_f2_1951198177691257 418
33681906_f2_211119917 770414137124−6Bos primigeniusP04258(sr:, bovine) (de:collagen alpha 1
< i>taurus(iii) chain)
32071055_f2_21612001777140213399−5S57948(sr:, garden pea)
12630032_f2_22612011 777285528493−2Indian cornAF001635(de:zea mays physical impedance
induced protein (iig2) mrna,
< td/>completecds.) (nt:contains a nuclear
targeting sequence; the mrna was)
31330391_f2_22712021 77731089362117−7CONTIG431GTC ORF with score 406 to:
(ai:7501741747)
(or:Klebsiella pneumoniae)
30729155_f2_23317774468155120−6Caenorhabditis AF000298(sr:caenorhabditis elegans
elegansstrain=bris tol n2) (de:caenorhabditis
elegans cosmid w03d2.) (nt:weak
similarity to collagens; glycine- and)
35757330_f2_23612041 77751887628638−63 KlebsiellaContig452A GTC ORF with score 975 to:
pneumoniae(ai:7000824092)
(or: Enterobacter cloacae)
25409665_f2_238 1205177761737578594 −58KlebsiellaConti g452AGTC ORF with score 975 to:
pneumoniae(ai:7000824092)
(or: Enterobacter cloacae)
26666637_f2_239 120617777990329184ArchaeoglobusH69 468
fulgidus
17038130_f2_241 120717778327108107−6Ovis orientalis ariesP26372(sr:, sheep) (de:keratin, ultra high-
< td/>sulfur matrix protein (uhs keratin))
13141432_f2_2421208 177791206401207−16 Enterobacter cloacaeCONTIG454GTC ORF with score 229 to:
(ai:7501747091)
(or:Klebsiella pneumoniae)
21978957_f2_243177802100699110−2no gb taxonomyU93872(sr:kaposi's sarcoma-associated
matchherpesvirus-human herpesvirus 8)
(de:kaposi's sarcoma-
associated herpesvirus glycoprotein
m, dna replication protein,
glycoprotein, dna replication protein,
flieeinhibitory protein and
y-cyclin genes . . .
16147688_f2_24812101778 129859941824−188P23852(de:rna polymerase associated protein
(atp-dependent helicase hepa))
14588291_f2_249121117782738245183−14MethanobacteriumA69233
therm oautotrophicum
12791285_f2_25112121778327691106< /td>−6Enterobacter cloacaeCONTIG363GTC ORF with score 130 to:
(ai:7000809218)
(or:Pseudomonas aeruginosa)
16144716_f2_253177841383460
35439566_f2_2541214177851266< /td>421139−9mice|C57BL/6xCB A/M19419(sr:mouse cdna to mrna, clone
Ca J hybridpump4) (de:mouse proline-rich
salivary protein mrna, partial cds.)
< td/>(nt:proline-rich salivary protein)
29975830_f2_2561215573190123−5Homo sapiensAF048977(fn:splicing factor) (sr:human)
< td/>(de:homo sapiens ser/arg-related
< td/>nuclear matrix protein (srm160)
mrna, complete cds.) (nt:160 kda)
25495392_f2_25812161 77871632543333−30 PseudomonasAF061267( de:pseudomonas stutzeri putative
stutzerialpha-ket oglutarate-
dependenthypophosphite
dioxygenase (htxa), binding protein
component htxb (htxb), inner
< td/>membrane component htxc (htxc),
atpase component htxd (htxd), inner
< td/>membrane component htxc ( . . .
32557793_f2_26412171778 8345114164−12 MycobacteriumQ04001(de:m tp40 antigen protein)
tuberculosis
25808576_f2 _2741218177891113370147−7Haloferax sp.P21561(sr:aa 2.2,) (de:hypothetical 50.6 kd
protein in the 5′ region of gyra and
gyrb (orf 3))
19739561_f2_279121917 790685822852109−217BacillusD69681(cl: surfactin synthetase:acetate--coa
subtilis/Bacillus< /i>ligase homology:acyl carrier protein
globigiihomology:gramicidin s synthetase i
repeat homology)
13098967_f2_2801220 177911572523387−36 Bordetella pertussisAF006000(de:bor detella pertussis d-3-
phosphoglycerate dehydrogenase
homolog (scra) and brg1 (brg1)
genes, complete cds.) (nt:orf4; similar
to salicylate hydroxylase)
9876082_f2_2841221177921089362440− 41mice|C57BL/6xCBA/AF013288(fn:ox idizes 9-cis retinol into 9-cis
Ca J hybridretinaldehyde) (sr:house mouse)
(ec:1.1.1.105) (de:mus musculus
9-cis retinol dehydrogenase
(rdh4) mrna, complete cds.)
< td/>(nt:membrane bound enzyme)
25507705_f2_28512221671556243−20BacillusG69795
subtilis/Bacillus
globigii
32525283_f2_286122317794331827−82 StreptomycesAL021409(pn:3-oxoacy l-(acyl-carrier-protein)
coelicolorsynthase) (de:streptomyces coelicolor
< td/>cosmid 3f7.) (nt:sc3f7.08, probable
3-oxoacyl-(acyl-carrier-protein))
1224177951086361371−34Klebsiel laContig555AGTC ORF with score 405 to:
pneumoniae(ai:7000828165)
(or: Enterobacter cloacae)
21485206_f2_291 1225177961257418161 −9mice|C57BL/6xCBA/S19560(c l:proline-rich protein) (sr:, house
Ca J hybridmouse)
12969501_f2_292< /td>1226177971458485 175−10MicrobacteriumX79027(de:m. ammoniaphilum genes mamir
ammoniaphilumand mamim.)
13025702_f2_2931227426141134−7Chinese oakAF083334(sr:chinese oak silkmoth)
silkmoth(de:antheraea pernyi
fibroin gene, complete cds.)
3385441_f2_30412281 77991701566385−36 KlebsiellaContig534A GTC ORF with score 519 to:
pneumoniae(ai:7000828790)
(or: Enterobacter cloacae)
9800706_f2_3111 2291780034511494−4Arabidopsis thalianaAC000098(sr:thale cress) (de:arabidopsis
thaliana chromosome 1 yac yup8h12
complete sequence.) (nt:est gblatts
1136 comes from this gene.)
11855091_f2_327123017801855284977−98Escherichia coliP00926(ec:4.2.1.14) (de:d-serine
dehydratase, (d-serine deaminase))
24713131_f3_3351231178021077358102− 2Epstein-Barr virusP03211(sr:b95-8, human herpesvirus 4)
(de:ebna-1 nuclear protein)
29806652_f3_3361232336111359−33PseudomonasP95459 (de:major cold shock protein cspa)
aeruginosa
24614408_f3_344< /td>1233178044651549 8−2human herpesvirusU92288(fn:helicase, helicase-primase
type 6 HHV-6complex) (de:human herpesvirus 6
serotype b putative major immediate-
< td/>earlygenes.) (nt:similar to
hhv6a u86, region ie-b)
10995332_f3_3461234 1780542661421642−61ParacoccusAJ223460 (de:paracoccus denitrificans flhs,
< td/>denitrificansflhr, abca, abcb, abcc, pqqe genesand
orf's.)
12985053_f3_352 123517806441146127mice|C57BL/6xCBA/AF062655( sr:house mouse) (de:mus musculus
CaJ hyhridplenty-of-prolines-101 mrna,
< td/>complete cds.) (nt:binds to several
sh3 domain containing proteins)
26032708_f3_3541236 17807468155122−6BurkholderiaU41162(sr:burkholderia cepacia
cepaciastrain=1761 6) (de:burkholderia
cepacia d-serine deaminase (dsd)
< td/>gene, complete cds.)
< td/>(nt:unidentified orf)
16036062_f3_36612371 78081452483
11775817_f3_3 771238178091038345−18PseudomonasU59457(de:pseudomonas aeruginosa ankyrin
aeruginosa(ankb) gene, complete cds.)
< td/>(nt:ankyrin)
9785968_f3_3791239178101410469
124017811303100
35659656_f3_3861241504167119−6CaenorhabditisZ82268
elegansf52b11, complete sequence.)
< td/>(nt:predicted using
< td/>genefinder; cdna est embl:d65629)
33458531_f3_3891242178131875624
26666 286_f3_39012431781414914 961740−179Serratia marcescensP19147(ec:3.1.3.1) (de:alkaline phosphatase
precursor, (apase))
24786080_f3_391124441791392403−36 Enterobacter cloacaeCONTIG513GTC ORF with score 445 to:
(ai:7501799795)
(or:Klebsiella pneumoniae)
29969705_f3_40317816630209112−4Homo sapiensI78877(cl:fos/jun dna-binding domain
homology) (sr:, man)
2870331_f3_404124617 8171419472103−2Y14166(sr:chicken) (de:gallus gallus mrna
domesticusfor attachment region binding
protein (arbp).)
29863965_f3_4061247546181146−9Homo sapiensPN0099(sr:, man)
4427162_f3_408124817 819696231158−9Herpes simplex virusAF015297(de:human herpesvirus 6 (strain
(type 6/strainuganda-1102) ie2hom mrna,
Ug anda-1102)complete cds.) (nt:similar to the
immediate-early 2 protein of human)
24729717_f3_415124917820588195119−4 human herpesvirusU13194(fn:transcripti onal regulation)
type 6 HRV-6(de:human herpesvirus 6 replication
origin-binding protein (hdrfo), partial
cds, helicase-primase component
(hdrf1), virion protein (hdlf1),
putative helicase (hdrf2), putative
phosphoprotein(cdrf1), replica . . .
12634542_f3_41812501782 1336111112−7< HIL>AspergillusContig5670GTC ORF with score 206 to:
fumigatus(ai:380588)
(or: Homo sapiens) (sr:homo sapiens
(tissue library: lambda-gem-11
(stratagene)) bloo)
< td/>(de:human mucin-2 gene, partial cds.)
34503292_f3_4251251 178222418805820−82CyanobacteriumS74707(cl:response regulator homology)
synechocystis(sr:pcc 6803,, pcc 6803)
< td/>(sr:pcc 6803,)
25803768_f3_42612521782355818596−3Contig548AG TC ORF with score 443 to:
pneumoniae(ai:7000843295)
(or: Enterobacter cloacae)
2946032_f3_4281 253178241674557370PseudomonasAF061 070(de:pseudomonas stutzeri orf117
stutzeri(orf117), orf86 (orf86) genes,
completecds; and ptxabcde operon,
partial sequence.)
< td/>(nt:putative atpase component of)
32526906_f3_429125417 82533010997−4 mice|C57BL/6xCBA/U95016(sr:house mouse) (de:mus musculus
CaJ hybridmyocyte nuclear factor-beta (mnf-b)
mrna, completecds.) (nt:hnf-3/forkhead,
aminoacids 279..389: winged helix)
29766658_f3_4301255178261257418133−5Epstein-Barr virusP03211(sr:b95-8, human herpesvirus 4)
(de:ebna-1 nuclear protein)
16145283_f3_4321256513170
12969586_f 3_437125717828582193103−2MolluscumU60315(de:molluscum contagiosum virus
< td/>contagiosum virussubtype 1, complete genome.)
subtype 1(nt:contains large predicted
non-globular regions and)
20807131_f3_43812581 78291173390192−13 AchromobacterA61183
georgiopo litanum
32517881_f3_4411 259178302409802842MycobacteriumZ98 741(de:mycobacterium leprae cosmid
lepraeb22.) (nt:mlcb22.16c, possible
oxidoreductase, len: 596 aa;)
12989566_f3_44212601 7831852283111−3AF048977(fn:splicing factor) (sr:human)
< td/>(de:homo sapiens ser/arg-related
< td/>nuclear matrix protein (srm160)
mrna, complete cds.) (nt:160 kda)
35753537_f3_44512611 7832564187132−6P28968(sr:ab4p,chv-1) (de:glycoprotein x
type 1 EVH-1precursor)
31760458_f3_4 57126217833471156−9Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
29870917_f3_458126317 8341260419450−42Z82015(de:b. subtilis yuk(a,b,c,d,e,f),
subtilis/Bacil lusyuk(i,j,k,l,m) and ald genes.)
globigii
26738131_f3_461< /td>1264178358012661 81−12Burkholderia U41162(sr:burkholderia cepacia
cepaciastrain=1761 6) (de:burkholderia
cepacia d-serine deaminase (dsd)
< td/>gene, complete cds.) (nt:unidentified
orf)
30369705_f3_4641265178361314437866 −86SaccharopolysporaP33271(sr:, streptomyces erythracus)
erythraca(ec:1 .14.—.—)
(de:cytochrome p450 107b1,
(p450cviib1))
9895706_f3_46617837615204179−12CaenorhabditisU80846(sr:caenorhabditis elegans
elegansstrain=bris tol n2) (de:caenorhabditis
elegans cosmid k06a9.) (nt:partial
cds; coded for by c. elegans cdna
yk50c7.5)
31308506_f3_468< /td>1267178383061019 7−5Streptomyces(de:streptomyces roseofulvus
roseofulvusfre nolicin biosynthetic gene cluster,
complete sequence.) (nt:frnj;
second acp for putative starter unit)
14713918_f3_4691268 17839870289
6301028_f3_47 01269178401050349−4LycopersiconS14977(sr:, tomato)
esculentum
17046941_f3_47 41270178412280759−44Bacillus
subtilis/Bacillus
glob igii
34239518_f3_4761271 178421035344201 −12equine herpesvirusAF030027(fn:very large tegument protein)
type 4 EHV-4(de:equine herpesvirus 4 strain
ns80567, complete genome.)
(nt:counterpart of hsv-1 gene u136
and vzv gene 22)
6376592_f3_4781272178 43486161110−4 Bos primigeniusP02453(sr:, bovine) (de:collagen alpha 1(i)
tauruschain (fragments))
16532006_f3_4881273178441839612
24314 076_f3_49312741784585228 3220−18Vibrio U51896(sr:vibrio parahaemolyticus
parahacmolyticusstrain=bb22) (de:vibrio
parahaemolyticus lateral flagellar
lafx locus: lfgn gene, partial cds,
lfgm, lfga, lfgb, lfgc, and lfgd genes,
complete cds,and lfge gene, partial
cds.) (nt:potential lateral
flagellar p . . .
32230382_f3_49412751784 6417138
13097632_f3_495127617847486161
12133331_f3_49612771784848 9162107−3Homo sapiensAB002322(sr:homo sapiens male brain cdna to
mrna, clone_lib:pbluescriptii s)
(de:human mrna for
kiaa0324 gene, partial cds.)
14704791_f3_4971278 17849627208163−11 mice|C57BL/6xCBA/A28996(cl:proline-ric h protein) (sr:, house
Ca J hybridmouse)
16281636_f3_499< /td>1279178502571856 95−1DrosophilaQ24266(sr:, fruit fly) (de:transcription factor
< HIL>melanogasterbtd buttonhead))
525033_f3_500128017851573190106−3< /td>Orf virusB34768
34620716_f3_ 502128117852609202−14Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
4859705_f3_5041282178 53483161180−14Plasmid pAH4JC2320
10004030_c1_5061283178541023340164−9Canadian hardP10388(sr:, wheat) (de:glutenin, high
win ter wheatmolecular weight subunit dx5
precursor)
20839205_c1_50812 8417855672223234−20Enterobacter cloacaeCONTIG500GTC ORF with score 234 to:
(ai:7000760886)
(or:Pseudomonas aeruginosa)
4939067_c1_510178562538845199−12Micrococcus luteusJQ0405
6071058_c1_ 512128617857627208−17CyanobacteriumS75866(sr:pcc 6803,, pcc 6803)
synechocystis(sr:pcc 6803,)
36052081_c1_5151287178581092363358−33Helicobacter pyloriJHP17(de:regulatory functi:chemotaxis and
motil:chemotaxis protein)
(sr:strain j99)
31258540_c1_51612881 78591431476
22944215_c1_5 19128917860390129−5Cyanobacterium< /td>S75142(sr:pcc 6803,, pcc 6803)
synechocystis(sr:pcc 6803,)
11774040_c1_52412901786143214396−5Contig512AG TC ORF with score 96 to:
pneumoniae(ai:7000760902)
(or: Pseudomonas aeruginosa)
22735141_c1_528178621218405364−33Cyanobacterium(cl:response regulator homology)
synechocystis(sr:pcc 6803,, pcc 6803)
< td/>(sr:pcc 6803,)
25995826_c1_5291292178631125374398−37AcinetobacterCONTIG134 GTC ORF with score 398 to:
baumanniiC(ai:7000760907)
(or:Pseudomonas aeruginosa)
4892180_c1_53017864465154250−21SalmonellaP406 76(de:transcriptional regulator slya (salmolysin)
choleraesuis(cytolysin slya))
< HIL>serotype
< td/>typhimurium
26211016_c1_5311294178652112 703476−70Pseud omonasU93274(de:pseudomo nas aeruginosa yafe (yafe), lcub
aeruginosa(lcub), asd (asd), fimv (fimv), and hist (hist)
genes, complete cds; trpf (trpf) gene, partial
cds: and unknown gene.)
10047830_c1_53212951786640813594−4PQ0475(sr:, common tobacco)
4504158_c1_534129630099307−28KlebsiellaContig485AGTC ORF with score 307 to:
pneumoniae(ai:7000760912)
or:< HIL>Pseudomonas aeruginosa)
11771077_c1_53817868864287181−14KlebsiellaCon tig374AGTC ORF with score 217 to:
pneumoniae(ai:7000782681)
(or: Pseudomonas aeruginosa)
14978443_c1_548178691869622
16658465_c1_553129917870462153119−6Plasmodi umP08672(sr:berok,) (de:circumsporozoite
< td/>cynomolgiprotei n precursor (cs))
31886583_c1_5541300 17871513170112−4U92288(fn:helicase, helicase-primase
type 6 HHV-6complex) (de:human herpesvirus 6
serotype b putative major immediate-
< td/>earlygenes.) (nt:similar to
hhv6a u86, region ie-b)
12942826_c1_5631301 17872414137105−5P14328(s r:, slime mold) (de:spore coat
discoideumprotein sp96)
104667_c1_566130217 873753250475−45P54414(ec:3. 4.21.92)
denitrificans(de:(endopepti dase clp))
36580141_c1_5791303 1787421787253699−9999< /td>PseudomonasP15713(cc:3.1.4.3) (de:(phosphatidylcholine
aeruginosacholinephosphohydrolase
15636336_c1_5821304178751098365107−2CaenorhabditisZ68011(de:caenorhabditis elegans cosmid
eleganst21b6, complete sequence.)
< td/>(nt:similarity to xenopus f-
spondin precursor (pir acc.)
11849135_c1_5871305 178761032343105−2 PseudomonasZS4213(de :p. aeruginosa algy gene.)
< HIL>aeruginosa
16306892_c1_591 13061787717165711154−117SalmonellaP36555(de:hypothetical 61.6 kd protein in
choleraesuisbass/pmra-adiy intergenic region)
serotype
typhimurium
10036333_c1_598130717878900 299108−3Gallus gallusK02113(sr:chicken) (de:gallus gallus
domesticusv itellogenin gene coding for
phosvitin, exons 23 and 24.)
16145337_c1_60013081 7879375124
35785280_c1_60 7130917880462153
11069581_c1_6101310178811164387188−11Canadian hardS02262(cl:glutenin) (sr:, common wheat)
w inter wheat
1969681_c1_61713111 78821275424161−9P13826(sr:sal-i,) (de:circumsporozoite
protein (cs) (fragment))
32291703_c1_6201312178831215404
322835 18_c1_621131317884108936 2171−10CaenorhabditisAF067607(de:caenorhabditis elegans cosmid
elegansc18h7.) (nt:similar to cuticular
collagen; c18h7.3)
36038191 _c1_622131417885525174267−23KlebsiellaContig462AGTC ORF with score 267 to:
pneumoniae(ai:7000761000)
(or: Pseudomonas aeruginosa)
26041386_c1_6241788630841027465< /td>−41Escherichia coliAF044499(de:escheric hia coli strain ec50 rhse
accessory genetic element
vgreprotein, core protein, and
dsorf-e5 genes, complete cds.)
32710801_c1_6311316 178871431476189−12PlasmodiumP08675(sr :london,) (de:circumsporozoite
cynomolgiprotein precursor (cs))
20594505_c1_6381317 17888336111
24354503_c1_6 39131817889432143−12HaemophilusP44269(de:hypothetical protein hi1601
< HIL>influenzaeprecursor)
34650168_c1_640131917890 873290504−48Ha emophilusP44268(de:hypothetical protein hi1600)
influenzae
2473783_c1_648 13201789115125032296−238PseudomonasP05695(de:porin p precursor (outer
< HIL>aeruginosamembrane protein d1))
17047918_c1_64913211 7892468155
12589126_c1_65 0132217893564187
10728808_c1_652132317894771256
14960140_c1_65613 2417895354011791177 −119StreptomycesAL 031031(de:streptomyces coelicolor cosmid
coelicolor7c7.) (nt:sc7c7.16c, probable atp
dependent dna helicase,)
13130417_c1_658132517896549182131−6< /td>AlphaherpesvirusS04713(cl:herpesvirus immediate-early
< td/>pseudorabies virusprotein ie175)
P RV
35432208_c1_6681326178 97504167181−14Homo sapiensS53363(sr:, man) (mp: 11p15.5-11p15.5)
21580143_c1_67213 27178981719572143Canadian hardP10388(sr:, wheat) (de:glutenin, high
win ter wheatmolecular weight subunit
dx5 precursor)
4775_c1_674132817899972323474−45AeronionasU56832(fn :converts prolyl-imidic bonds from
hydrophilacis to trans) (cc:5.2.1.8)
(de:aeromonas hydrophila
< td/>fk506 binding protein (fkpa) gene,
< td/>completecds in 3.9 kb fragment.)
< td/>(nt:orf3; immunonphilin: peptidyl
prolyl isomerase:)
14949042_c2_67713291790090630197−2< /td>PseudomonasS29309
aerugin osa
14878563_c2_6811330< /td>1790113504491112 −113Pseudomonas putidaM57613(sr:pseudomo nas putida (strain ppg2)
< td/>dna) (de:pseudomonas putida
branched-chain keto acid
dehydrogenase operon (bkda1, bkda1
< td/>and bkda2), transacylase e2 (bkdb),
bkdr andlipoamide dehydrogenase
(lpdv) genes,
complete cds.) (nt:orf)
14111281_c2_68213311467488429−40< /td>Escherichia coliP21503(de:hypothetical 41.4 kd protein in
dmsc-pfla intergenic region (orf y))
7120942_c2_6871332179 039963311395−143AB012768(f n:methyltransferase)
< td/>aeruginosa(sr:< HIL>pseudomonas aeruginosa
< td/>(str:pao1) dna) (de:pseudomonas
aeruginosa gene for cher,
< td/>complete cds.) (nt:cher is involved
in methylation of transducer)
10816306_c2_6891333179041980659236− 18RickettsiaAJ235269Rickettsia prowazekii strain
prowazekiiMadrid E, complete genome.
31379165_c2_6911334471156177−14KlebsiellaContig534A
pneumoniae(ai:7000761069)
(or: Pseudomonas aeruginosa)
1270318_c2_693179061059352157−8Caenorhabditis AF067607(de:caenorhabditis elegans cosmid
elegansc18h7.) (nt:similar to cuticular
collagen; c18h7.3)
2938330_c2_70613362616871123−4Chlamydomonas550754
eugamet os
22135141_c2_708133717908666221102− 2cabbage looperAF000605(sr:cabbage looper) (de:trichoplusia ni
insect intestinal mucin iiml4 mrna,
< td/>complete cds.)
11194792_c2_7151338 179091482493
9875828_c2_7 171339179101230409−4Homo sapiensX63071(sr:human) (de:h. sapiens mrna for
novel dna binding protein.)
36016316_c2_7201340 179111209402
12007052 _c2_7241341179122046796−5Enterobacter cloacaeCONTIG419GTC ORF with score 105 to:
(ai:7000763709)
(or:Pseudomonas aeruginosa)
14961393_c2_72517913543180103−4Persian tobaccoJQ1686(sr:, persian tobacco)
16541392_c2_7261343402133101−5longfin squidS56117(sr:, longfin squid)
36579131_c2_728134417915717238128−5 AlphaherpesvirusS04713
< td/>pseudorabies virusprotein ie175)
P RV
35755125_c2_7341345179 161491496
4401915_c2_736< /td>1346179171167388 150−7Arancus diadematusU47855(de:aran cus diadematus fibroin-3
(adf-3) mrna, partial cds.)
29710936_c2_7391347 17918501166113−7P76111(de:hypothetical 15.9 kd protein in
tchb-ansp intergenic region)
14730290_c2_7401348432143154−10Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
14556881_c2_742134917 92028293106−5 Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
16142208_c2_745135017 921396131108−6Orf virusD34768
17004708_c2_ 746135117922417138−6Rhesus Epstein BarrU93909(sr:rhesus epstein barr virus)
virus(de:cercopithecine herpesvirus 15
nuclear antigen ebna-1 gene,
< td/>completecds.)
16926965_c2_75217923993330469−44Escherichia coliP23523(de:hypothetical 31.0 kd protein in
rnpb-soha intergenic region (orf 2))
2631907_c2_7531353179 242262753110−2Epstein-Barr virusP03211(sr:b95-8, human herpesvirus 4)
(de:ebna-1 nuclear protein)
31742765_c2_77013541098365150−7Nephila clavipesA36068
12507328_ c2_7721355179261311436156−8Nephila clavipesA44112
14708457_ c2_776135617927711237131−6Rhesus Epstein BarrU93909(sr:rhesus epstein barr virus)
virus(de:cercopithecine herpesvirus 15
nuclear antigen ebna-1 gene,
< td/>completecds.)
32243877_c2_77717928849282107−2mice|C57BL/6xCBA/U32107 (sr:house mouse) (de:mus musculus
CaJ hybridtype vii collagen (col7a1) mrna,
< td/>complete cds.)
21688516_c2_7781358 1792979526492−4Contig452AGT C ORF with score 223 to:
pneumoniae(ai:7000824048)
(or: Enterobacter cloacae)
5275458_c2_7791 359179301599532194MicrobacteriumX7 9027(de:m. ammoniaphilum genes mamir
ammoniaphilumand mamim.)
31844561_c2_7801360459152
22082202_c2 _781136117932398113262580−268HaemophilusP45018(de:atp-dependent helicase hrpa
influenzaehomolog)
16277187_c2_7821362179331551 516112−2Mycoba cteriumAL022022(de:mycob acterium tuberculosis
tuberculosish 37rv complete genome; segment
148/162.) (nt:rv3508, (mtv023.15),
len: 1901. member of)
9781311_c2_7831363179 34546181255−22BacillusO07513(de:hit protein)
subtilis/Bacillus
globi gii
31677141_c2_7841364< /td>179351239412248⠈’18Escherichia coliAF044503(de:escheric hia coli strain ec11
unknown (498), hcp gene, complete
cds; and rhsg accessory genetic
element vgrg protein, core
component anddsorf-g1 genes,
complete cds.)
35213556_c2_7851365 17936408135127−7U54556(de :litomosoides sigmodontis
sigmodontismic rofilarial sheath proteins shp3a
< td/>(shp3a) and shp3 (shp3) genes,
complete cds.) (nt:structural protein;
similar to shp3 genes from)
35363176_c2_7911366 179372712903
16017781_c2_ 793136717938549182
26455383_c2_796136817939263103−5 KlebsiellaContig545AGTC ORF with score 138 to:
pneumoniae(ai:7000757952)
(or: Pseudomonas aeruginosa)
6290807_c2_79817940621206225−19Escherichia coliP34086(de:rna polymerase sigma-e factor
(sigma-24))
4145958_c2_8021370179411413470120−4Epstein-Barr virusP03211(sr:b95-8, human herpesvirus 4)
(de:ebna-1 nuclear protein)
12268763_c2_803137115635202303−23 9PseudomonasP32977
< td>aeruginosa
16150642_c2 _807137217943807268 233−18equine herpesvirusD88733(sr:equ ine herpesvirus 1 (strain:hhl)
type 1 EVH-1dna) (de:equine herpesvirus 1 dna
for membrane glcoprotein
complete cds.
22783541_c2_81313731 7944633210101−3P25315(de: nify protein)
brasilense
14113530_c2_8 2213741794534021133 115−3KlebsiellaContig560AGTC ORF with score 189 to:
pneumoniae(ai:7000777501)
(or: Pseudomonas aeruginosa)
35572677_c2_826179461827608
33786581_c2_833137617947891296128−5equine herpesvirusD88733(sr:equ ine herpesvirus 1 (strain:hhl)
type 1 EVH-1dna) (de:equine herpesvirus 1 dna
for membrane glycoprotein
complete cds.)
1424025_c2_83513771 79482468194−5 Enterobacter cloacaeCONTIG305GTC ORF with score 101 to:
(ai:91365) (or:Porphyromonas
< td/>gingivalis ) (sr:, bacteroides
gingivalis ) (ec:3.4.22.—)
< td/>(de:protease prth,)
4861027_c3_8381378 1794920166134−8CONTIG412GTC ORF with score 1554 to:
(ai:7501795377)
(or:Klebsiella pneumoniae)
32672967_c3_840179501158385279−25Enterobacter cloacaeCONTIG500GTC ORF with score 279 to:
(ai:7000761218)
(or:Pseudomonas aeruginosa)
32692207_c3_84717951831276165−9Alphaherpesvirus
pseudorabies virus
< td/>PRV
34474081_c3_854138117952909302130−6 mice|C57BL/6xCBA/S19560(cl:prolin e-rich protein) (sr:, house
Ca J hybridmouse)
35236330_c3_859< /td>1382179532550849 1412−144Escherichia coliM30198(sr:escherichi a coli (strain k-12) dna)
(de:e. coli recq gene complete cds,
and plda gene, 3′ end.)
12992632_c3_8651383 17954468155130−7S33121(cl:homeotic protein cdp:cut repeat
homology:homeobox homology)
(sr:, dog)
31927040_c3_86713841 7955555184147−9P03181(sr:b95-8, human herpesvirus 4)
(de:hypothetical bhlf1 protein)
22355131_c3_87413851119372954−96< /td>NeisseriaL07845( fn:d to 1 conversion of the glycero
gonorrhoeaemoiety of the) (sr:neisseria
gonorrhoeae (strain ms11b2) dna)
(de:neisseria gonorrhoeae ribokinase
< td/>(rbk) gene, 3′ end; adp-1-glycero-d-
mannoheptose epimerase (gme) gene,
< td/>complete cds.)
35676591_c3_8801386 179571917638138−5 Nephila clavipesA36068
33838542_ c3_8831387179581083360158−10KlebsiellaContig487AGTC ORF with score 470 to:
pneumoniae(ai:7000825199)
(or: Enterobacter cloacae)
35806955_c3_887 1388179592484827258 −19KlebsiellaConti g554AGTC ORF with score 483 to:
pneumoniae(ai:7000831452)
(or: Enterobacter cloacae)
1199156_c3_8881 389179601110369237KlebsiellaContig 554AGTC ORF with score 483 to:
pneumoniae(ai:7000831452)
(or: Enterobacter cloacae)
35754151_c3_889 139017961870289171KlebsiellaContig 554AGTC ORF with score 483 to:
pneumoniae(ai:7000831452)
(or: Enterobacter cloacae)
4379155_c3_9001 3911796282227399−5ArchaeoglobusA69334 (cl:transcriptional repressor glnr)
fulgidus
36072892_c3_902139217963573190140 −8Homo sapiensO00268(sr:, human) (de:(tafii135) (tafii-130)
11142881_c3_904139317964783260105−3 Homo sapiensQ15427(sr:, human) (de:spliceosome
< td/>associated protein 49 (sap 49)
(s(3b53))
25519776_c3_905139 4179651401466
12 368955_c3_9241395179661788
34257887_c3_9261396 1796726678881056−107Enterobacter cloacaeCONTIG499GTC ORF with score 1793 to:
(ai:7501787714)
(or:Klebsiella pneumoniae)
36614816_c3_92917968726241359−33Acinetobacter CONTIG211GTC ORF with score 359 to:
baumanniiC(ai:7000761307)
(or:Pseudomonas aeruginosa)
22683385_c3_931179692067688427−38Escherichia coliP46481(de:hypothetical 73.6 kd protein in
argr-cafa intergenic region (f655))
36349152_c3_93213991074357593−58Escherichia coliP46482(de:hypothetical 34.8 kd protein in
argr-cafa intergenic region)
25492077_c3_938140016895621569−161 Escherichia coliP29212(ec:6.2.1.3) (de:synthetase))
9901963_c3_939140 117972170I5661625Escherichia coliP29212(ec:6.2.1.3) (de:synthetase))
29791656_c3_95514 0217973696231124−5Arabidopsis thalianaAC000098(sr:thale cress) (de:arabidopsis
thaliana chromosome 1 yac yup8h12
complete sequence.) (nt:est
gblatts1136 comes from this
gene.)
34650655_c3_9561403< /td>1797426788106− 5LitomosoidesU54556 (de:litomosoides sigmodontis
sigmodontismic rofilarial sheath proteins shp3a
< td/>(shp3a) and shp3 (shp3) genes,
complete cds.) (nt:structural protein;
similar to shp3 genes from)
9895930_c3_95714041 797593030993−1Aeromonas sp.I39540(ec:3.2.1.14)
16 541008_c3_959140517976846281127−5Caenorhabditis U41557(sr:caenorhabditis elegans
elegansstrain=bris tol n2) (de:caenorhabditis
elegans cosmid c50f7.)
(nt:histidine-rich)
21967_c3_961< /td>140617977579192
29807330_c3_9701407179784 2314098−3equin e herpesvirusD88685(sr:equ ine herpesvirus 1 (strain:hhl)
type 1 EVH-1dna) (de:equine herpesvirus 1 dna
for tegument protein, partial cds.)
< td/>(nt:kpn i subfragment of orf24)
35679831_c3_9721408179791476491280−24HaemophilusP43711( ec:2.3.1.41) (de:ketoacyl-acp
influenzaesynthase iii) (kas iii))
15741667_c3_9801409 17980765254
16307956_c3_9 90141017981879292−32Mycobacterium< /td>AL022004(de:mycobacterium tuberculosis
tuberculosish 37rv complete genome; segment
40/162.) (nt:rv0851c, (mtv043.44c),
len: 275. unknown)
2204527_c3_9911411738245206−17BacillusO31553(de: hypothetical 11.9 kd protein in
subtilis/Bacillusacor-glva intergenic region)
globigii
35801028_c3_994< /td>1412179831782593 598−57StreptomycesAL031031(de:streptomyces coelicolor cosmid
coelicolor7c7.) (nt:sc7c7.16c, probable atp
dependent dna helicase,)
22469793_c3_9981413179841662553305−2 4Nephila clavipesAF027735(de:neph ila clavipes minor ampullate
silk protein misp1 mrna, partialcds.)
34636090_c3_10071414< /td>17985609202289∠’25Myxococcus xanthusU81516(de:myxococ cus xanthus abc
transporter homolog gene, partial
cds and rna polymerase
< td/>sigma-54 factor (rpon) gene,
< td/>complete cds.) (nt:orf: 3′ of rpon)
31652042_c3_1008141517986921306479−45StreptomycesAL031031(de:streptomyces coelicolor cosmid
coelicolor7c7.) (nt;sc7c7.17, possible
transcriptional regulatory)
14089028_c3_100914161798722627531264⠈’129Escherichia coliP13036(de:iron (iii) dicitrate transport
protein feca precursor)
469841_c3_10101417 179881890629860−87 RickettsiaAJ235269
prowazekiiMadrid E, complete genome.
10442816_c3_101214181212403141−9Enterobacter cloacaeCONTIG480GTC ORF with score 141 to:
(ai:7000761390)
(or:Pseudomonas aeruginosa)
3260043_f1_5 1419179901056351143 −6Herpes simplex virusAF015297(de:human herpesvirus 6 (strain
(type 6/strainuganda-1102) ie2hom mrna,
Ug anda-1102)complete cds.) (nt:similar to the
immediate-early 2 protein of human)
31492917_f1_191420 179911572523
9791566_f1 _22142117992426141−6Candida albicansCONTIG525GTC ORF with score 106 to:
3(ai:7000761414)
(or:Pseudomonas aeruginosa)
10800066_f1_32179931266421130−6KlebsiellaCont ig275AGTC ORF with score 199 to:
pneumoniae(ai:7000843618)
(or: Enterobacter cloacae)
6308563_f1_4614 231799457319091 −4KlebsiellaContig536A GTC ORF with score 91 to:
pneumoniae(ai:7000761438)
(or: Pseudomonas aeruginosa)
22520300_f1_601799582827599 −5KlebsiellaContig 421AGTC ORF with score 177 to:
pneumoniae(ai:7000821620)
(or: Enterobacter cloacae)
20206455_f1_671 425179961356451163no gb taxonomyU93872(sr:kaposi's sarcoma-associated
matchherpesvirus- human herpesvirus 8)
(de:kaposi's sarcoma-associated
herpesvirus glycoprotein m, dna
replication protein, glycoprotein, dna
replication protein, fliceinhibitory
< td/>protein and y-cyclin genes . . .
26307665_f1_72142617997 510169175−13< HIL>KlebsiellaContig551AGTC ORF with score 217 to:
pneumoniae(ai:7000761465)
(or: Pseudomonas aeruginosa)
413966_f1_73 142717998720239217KlebsiellaContig 551AGTC ORF with score 217 to:
pneumoniae(ai:7000761465)
(or: Pseudomonas aeruginosa)
7160200_f1_751428179991260419
1429180001896 631
16300711_f1_8414301800138431280120−4Sus scrofa domesticaS55316(sr:, domestic pig)
10167086_f1_85143118 002612203100−2DictyosteliumAB009080(s r:dictyostelium discoideum (str:ax2)
discoideumdna) (de:dictyostelium discoideum
< td/>gene for trfa, complete cds.)
22757216_f1_8814321 80031398465
2135841_f1_90 1433180041230409
23837791_f1_93143418005 1437478
34570187_f1_94143 518006831276204 −17KlebsiellaContig550 AGTC ORF with score 714 to:
pneumoniae(ai:7000838996)
(or: Enterobacter cloacae)
12141657_f1_951 43618007990329198KlebsiellaContig5 34AGTC ORF with score 278 to:
pneumoniae(ai:7000828862)
(or: Enterobacter cloacae)
2383561_f1_9614 37180081302433145silkwormS42886(cl:unassigne d collagens)
< td/>(sr:, silkworm)
30332817_f1_1021438 180091770589512−49 KlebsiellaContig218AGTC ORF with score 664 to:
pneumoniae(ai:7000815432)
(or: Enterobacter cloacae)
15832250_f1_103 143918010978325350Escherichia coliF65039
472965_f1_104 1440180111305434840−84Escherichia coliD90888(sr:escherichi a coli (strain:k12)
dna, clone_lib:kohara lambda
minise) (de:e. coli genomic dna,
kohara clone #438(58.9—59.3 min.).)
(nt:similar to (swissprot accession
number p37908))
31925681_f1_1061441636211418−39Escherichia coliP37643(de:region (o440))
31673916_f1_11014421104367543−53KlebsiellaContig460A
pneumoniae(ai:7000761502)
(or: Pseudomonas aeruginosa)
12995755_f1_11118014666221159−11KlebsiellaCon tig560AGTC ORF with score 565 to:
pneumoniae(ai:7000846961)
(or: Enterobacter cloacae)
16507340_f1_113 144418015798265372Arabidopsis thalianaU90439(sr:thale cress) (de:arabidopsis
thaliana chromosome ii bac t06d20
genomic sequence, complete
sequence.) (nt:unknown protein)
21517891_f1_11614451092363102−2Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
16925380_f1_121144618 017405134111−7Enterobacter cloacaeCONTIG424GTC ORF with score 211 to:
(ai:7501777927)
(or:Klebsiella pneumoniae)
17066580_f1_12218018702233138−7Boreogadus saidaU43200(de:boreogadu s saida antifreeze
< td/>glycopeptide afgp polyprotein
precursorgene, complete cds.)
< td/>(nt:cleavage of polyprotein at
conserved spacers r or)
12628805_f1_123144818 019720239432−40P45597(sr:, pvcampestris)
campestris(ec:2.7.3.9: 2.7.1.69)
(de:(eiii-fru)))
10726686_f1_12 41449180202016671−129KlebsiellaP45604(ec:2.7.1.69) (de:enzyme ii, abc
pneumoniaecomponent), (eii-nag)
30120958_f1_1291450 18021582193377−35< /td>KlebsiellaContig350A GTC ORF with score 377 to:
pneumoniae(ai:7000761521)
(or: Pseudomonas aeruginosa)
3322582_f1_1301802241713892 −2Candida albicansP40954(sr:, yeast) (ec:3.2.1.14)
(de:chitinase 3 precursor,)
26019416_f1_131145218023432143317−2 9KlebsiellaContig387AGTC ORF with score 317 to:
pneumoniae(ai:7000761523)
(or: Pseudomonas aeruginosa)
32525308_f1_134180242088695801−80HaemophilusP 44587(de:hypothetical protein hi0232)
influenzae
32689762_f1_13 5145418025402133136−8herpes simplex virusZ86099(fn:immediate early protein;
type 2 HSV-2transciptional) (de:herpes simplex
virus type 2 (strain hg52),
complete genome.)
21960887_f1_13614551617538421−40< /td>KlebsiellaContig397A GTC ORF with score 421 to:
pneumoniae(ai:7000761528)
(or: Pseudomonas aeruginosa)
12628930_f1_139180271494497
29791077_f1_141145718028486161257−22Enteroc occusCONTIG484GTC ORF with score 257 to:
faeciumC(ai:7000761533)
(o r:Pseudomonas aeruginosa)
14978958_f1_14418029447148114−6Homo sapiensA37232(sr:, man)
525042_f1_1451459180 30465154228−19PyrococcusAP000001(sr:< HIL>pyrococcus horikoshii (str:ot3)
horikoshiidna) (de:pyrococcus horikoshii ot3
genomic dna, 1-287000 nt. position
(1/7).) (nt:motif=prokaryotic
membrane lipoprotein lipid)
32438508_f1_1481460180311632543
16536693_f1 _149146118032387128 269−23KlebsiellaContig519AGTC ORF with score 171 to:
pneumoniae(de:pseudomona s fluorescens
hypothetical metabolite transport
protein, positive transcriptional
< td/>regulator (phnr),
phosphonoacetatehydrolase (phna),
2-phosphonopropionate transporter
(phnb), putative
putrescine/spermidine . . .
2910465_f1_150146218033 1893630544−52 ArchaeoglobusO30107(de:h ypothetical protein af0130)
fulgidus
32213566_f1_153< /td>1463180346212061 86−14AchromobacterA61183
georgiopolitanum
1173 8191_f1_1551464180352352 783367−30Strongylocentr otusS23809(cl:collagen alpha 2 (i) chain:fibrillar
< td/>purpuratuscollagen carboxyl-terminal
homology) (sr:, purple urchin)
469841_c3_10101417179881890629860−87Rickettsia prowazekiiAJ235269Ricket tsia prowazekii strain
Madrid E, complete genome.
10442816_c3_101214181212403141−9Enterobacter cloacaeCONTIG480GTC ORF with score 141 to:
(ai:7000761390) (or:
Pseudomonas aeruginosa)
3260043_f1_5 1419179901056351143 −6Herpes simplex virus (typeAF015297(de:human herpesvirus 6
6/stra in Uganda-1102)(strain uganda-1102) ie2hom
mrna, complete cds.) (nt:
similar to the immediate-early
< td/>2 protein of human)
31492917_f1_191420 179911572523
9791566_f1_2 2142117992426141106−6Candida albicansCONTIG525GTC ORF with score 106 to:
3(ai:7000761414) (or:
Pseudomonas aeruginosa)
10800066_f1_32179931266421130−6Klebsiella pneumoniaeContig275AGTC ORF with score 199 to:
(ai:7000843618) (or:
Enterobacter cloacae)
6308563_f1_4614 231799457319091 −4Klebsiella pneumoniaeContig536AGTC ORF with score 91 to:
(ai:7000761438) (or:
Pseudomonas aeruginosa)
22520300_f1_601799582827599 −5Klebsiella pneumoniaeContig421AGTC ORF with score 177 to:
(ai:7000821620) (or:
Enterobacter cloacae)
20206455_f1_671 425179961356451163no gb taxonomy matchU93872(sr:kaposi's sarcoma-
associated herpesvirus -
human herpesvirus 8) (de:
kaposi's sarcoma-associated
herpesvirus glycoprotein m,
dnareplication protein, glyco-
protein, dna replication
protein, fliceinhibitory
< td/>protein and v-cyclin
genes. . . .
26307665_f1_72142617997 510169175−13< HIL>Klebsiella pneumoniaeContig551AGTC ORF with score 217 to:
(ai:7000761465) (or:
Pseudomonas aeruginosa)
413966_f1_73 142717998720239217Klebsiella pneumoniaeContig551AGTC ORF with score 217 to:
(ai:7000761465) (or:
Pseudomonas aeruginosa)
7160200_f1_751428179991260419
1429180001896 631
16300711_f1_8414301800138431280120−4Sus scrofa domesticaS55316(sr:, domestic pig)
10167086_f1_85143118 002612203100−2Dictyostelium discoideumAB009080(sr:di ctyostelium discoideum
< td/>(str:ax2) dna) (de:
dictyostelium discoideum gene
for trfa, complete cds.)
22757216_f1_8814321 80031398465
2135841_f1_90 1433180041230409
23837791_f1_93143418005 1437478
34570187_f1_94143 518006831276204 −17Klebsiella pneumoniaeContig550AGTC ORF with score 714 to:
(ai:7000838996) (or:
Enterobacter cloacae)
12141657_f1_951 43618007990329198Klebsiella pneumoniaeContig534AGTC ORF with score 278 to:
(ai:7000828862) (or:
Enterobacter cloacae)
2383561_f1_9614 37180081302433145silkwormS42886(cl:unassigne d collagens) (sr:,
< td/>silkworm)
30332817_f1_1021 438180091770589512Klebsiella pneumoniaeContig218AGTC ORF with score 664 to:
(ai:7000815432) (or:
Enterobacter cloacae)
15832250_f1_103 143918010978325350Escherichia coliF65039
472965_f1_104 1440180111305434840−84Escherichia coliD90888(sr:escherichi a coli (strain:
k12) dna, clone_lib:kohara
lambda minise) (de:e. coli
genomic dna, kohara clone
< td/>#438(58.9-59.3 min.).) (nt:
similar to (swissprot accession
number p37908))
31925681_f1_1061441636211418−39Escherichia coliP37643(de:region (o440))
31673916_f1_11014421104367543−53Klebsiella pneumoniaeContig460AGTC ORF with score 543 to:
(ai:7000761502) (or:
Pseudomonas aeruginosa)
12995755_f1_11118014666221159−11Klebsiella pneumoniaeContig560AGTC ORF with score 565 to:
(ai:7000846961) (or:
Enterobacter cloacae)
16507340_f1_113 144418015798265372Arabidopsis thalianaU90439(sr:thale cress) (de:
arabidopsis thaliana
chromosome ii bac t06d20
genomic sequence, complete
sequence.) (nt:unknown
protein)
21517891_f1_116 1445180161092363102< /td>−2Boreogadus saidaU43200(de;boreogadu s saida anti-
< td/>freeze glycopeptide afgp poly-
< td/>protein precursorgene,
complete cds.) (nt:cleavage
of polyprotein at conserved
spacers r or)
16925380_f1_121144618 017405134111−7Enterobacter cloacaeCONTIG424GTC ORF with score 211 to:
(ai:7501777927) (or:
Klebsiella pneumoniae)
17066580_f1_12218018702233138−7Boreogadus saidaU43200(de:boreogadu s saida anti-
< td/>freeze glycopeptide afgp poly-
< td/>protein precursorgene,
complete cds.) (nt:cleavage
of polyprotein at conserved
spacers r or)
12628805_f1_123144818 019720239432−40P45597(sr:,pvcampestri s) (ec:
2.7.3.9:2.7.1.69) (de:(eiii-
< td/>fru)))
10726686_f1_12414491802020166711268−129Klebsiella pneumoniaeP45604(ec:2.7.1.69) (de:enzyme ii,
abc component), (eii-nag))
30120958_f1_129145018021582193377−35 Klebsiella pneumoniaeContig350AGTC ORF with score 377 to:
(ai:7000761521) (or:
Pseudomonas aeruginosa)
3322582_f1_1301802241713892 −2Candida albicansP40954(sr:,yeast) (ec:3.2.1.14) (de:
chitinase 3 precursor,)
26019416_f1_131145218023432143317−2 9Klebsiella pneumoniaeContig387AGTC ORF with score 317 to:
(ai:7000761523) (or:
Pseudomonas aeruginosa)
32525308_f1_134180242088695801−80Haemophilus influenzaeP44587(de:hypothetical protein
hi0232)
32689762_f1_1351 45418025402133136herpes simplex virus type 2Z86099(fn:immediate early protein;
HSV-2transcriptional) (de:herpes
< td/>simplex virus type 2 (strain
hg52), complete genome.)
21960887_f1_13614551617538421−40< /td>Klebsiella pneumoniaeContig397AGTC ORF with score 421 to:
(ai:7000761528) (or:
Pseudomonas aeruginosa)
12628930_f1_139180271494497
29791077_f1_141145718028486161257−22Enteroc occus faeciumCONTIG484GTC ORF with score 257 to:
C(ai:7000761533) (or:
Pseudomonas aeruginosa)
14978958_f1_14418029447148114−6Homo sapiensA37232(sr:, man)
525042_f1_1451459180 30465154228−19Pyrococcus horikoshiiAP000001(sr:py rococcus horikoshii (str:
< td/>ot3) dna) (de:pyrococcus
horikoshii ot3 genomic dna,
1-287000 nt. position (1/7).)
(nt:motif=prokaryotic
membrane lipoprotein lipid)
32438508_f1_1481460180311632543
16536693_f1 _149146118032387128 269−23Klebsiella pneumoniaeContig519AGTC ORF with score 171 to:
(de:pseudomonas fluorescens
hypothetical metabolite
< td/>transport protein, positive
transcriptional regulator
(phnr), phosphonoacetate-
hydrolase (phna), 2-phos-
phonopropionate transporter
(phnb), putative putrescine/
spermidine . . .
2910465_f1_150146218033 1893630544−52 Archaeoglobus fulgidusO30107(de:hypothetical protein
af0130)
32213566_f1_1531 46318034621206186AchromobacterA611 83
georgiopolitanum
11738191_f1_1 551464180352352783−30Strongylocentrotus purpuratusS23809(cl:collagen alpha 2(i) chain:
fibrillar collagen carboxyl-
terminal homology) (sr:,
< td/>purple urchin)
15113317_f1_15814651917638707−70Haemophilus influenzaeP44993(de:hypothetical protein
hi1029)
2213876_f1_15914 66180372346781199Bacillus subtilis/BacillusG69776 (cl:hypothetical protein yddq)
globigii
16197532_f1_16114671803853117697< /td>−5Aspergillus fumigatusContig9629GTC ORF with score 116 to:
(ai:7000774232) (or:
Pseudomonas aeruginosa)
32676343_f2_16518039426141137−9Streptomyces fradiaeP20186(de:hypothetical 35.5 kd
protein in transposon tn4556)
14144131_f2_1681469549182
12636305_f2 _171147018041483160 94−2mice|C57BL/6xCBA/CaJS04 336(cl:unassigned ribonucleo-
hybridprotein repeat-containing
proteins:ribonucleoprotein
repeat homology) (sr:, house
< td/>mouse)
21664131_f2_1721471 18042834277133⠈’6Homo sapiensAF048977(fn:splicing factor) (sr:human)
< td/>(de:homo sapiens ser/arg-
related nuclear matrix protein
(srm160) mrna, complete
cds.) (nt:160 kda)
30364216_f2_17314721 8043483160108−4Z81503(de:caenorh abditis elegans
cosmid f14f7, complete
sequence.) (nt:predicted using
< td/>genefinder; similar to
collagen;)
16289767_f2_174147 318044471156127 −8Enterobacter cloacaeCONTIG495GTC ORF with score 167 to:
(ai:7501764598) (or:
Klebsiella pneumoniae)
24882087_f2_17518045444147
147518046801266290−25Pseudomo nas fluorescensL21198(fn:major outer membrane
protein) (sr:pseudomonas
fluorescens dna) (de:
pseudomonas fluorescens
(pgsb 7941) protein f (oprf)
gene, partialcds.) (nt:
putative)
7206258_f2_177147 618047771256196 −16Klebsiella pneumoniaeContig289AGTC ORF with score 271 to:
(ai:7000813428) (or:
Enterobacter cloacae)
35276030_f2_187 1477180482106999−5Enterobacter cloacaeCONTIG493GTC ORF with score 99 to:
(ai:7000761579) (or:
Pseudomonas aeruginosa)
21884577_f2_191180491152383116−5Enterobacter cloacaeCONTIG479GTC ORF with score 196 to:
(ai:7000797019) (or:
Pseudomonas aeruginosa)
26619580_f2_20218050831276100−2Mycobacterium tuberculosisAL021841(de: mycobacterium
tuberculosis h37rv complete
genome; segment 143/162.)
(nt:rv3345c, (mtv004.01c-
mtv016.45c), member of the
m.)
1963506_f2_2141480708235114−5Klebsiella pneumoniaeContig361AGTC ORF with score 105 to:
(ai:69657) (or:Human herpes-
virus 4) (cl:epstein-barr
virus nuclear antigen)
22133441_f2_21914811512503
25792902_ f2_220148218053657218389−36Erwinia amylovoraY09848(fn:activator of exopoly-
saccharide synthesis) (de:
e.amylovora resb gene.)
13151058_f2_221148318054924307
6370468_f2_2 221484180551632543
10441030_f2_223148518056138288−25Neisseria meningitidisP25138(ec:5.2.1.8) (de:(ec 5.2.1.8)
(rotamase))
13087953_f2_226 14861805775925299−2Escherichia coliD90774(sr:escherichi a coli (strain:
k12) dna, clone_lib:kohara
lambda minise) (de:e.coli
genomic dna, kohara clone
< td/>#263(30.5-30.9 min.).)
(nt:orf_id:o263#22;
similar to (swissprot
< td/>accession)
6519702_f2_228 148718058765254137−8Mycobacterium tuberculosisZ92539(de:my cobacterium
tuberculosis h37rv complete
genome; segment 47/162.) (nt:
rv1019, (mtcy10g2.30c), len:
197. probable)
16151956_f2_2311488 18059510169205−17< /td>Aspergillus fumigatusv3x12050.xGTC ORF with score 205 to:
(ai:7000761623) (or:
Pseudomonas aeruginosa)
22146030_f2_235180601386461261−22Enterobacter cloacaeCONTIG493GTC ORF with score 261 to:
(ai:7000761627) (or:
Pseudomonas aeruginosa)
32672915_f2_236180612347791 −5Enterobacter cloacaeCONTIG502GTC ORF with score 141 to:
(ai:1500696760) (or:Homo
sapiens) (sr:homo sapiens
cdna to mrna) (de:homo
sapiens mrna for n-wasp,
complete cds.)
16924166_f2_2431491 18062906301152−7AF015297(de:human herpesvirus 6
6/stra in Uganda-1102)(strain uganda-1102) ie2hom
mrna, complete cds.) (nt:
similar to the immediate-early
< td/>2 protein of human)
36032718_f2_2501492180631740579111−4Klebsiella pneumoniaeContig540AGTC ORF with score 131 to:
(ai:7000804742) (or:
Pseudomonas aeruginosa)
14322133_f2_256180641233410533−51Sphingomonas AF079317(de:sphingomonas
aromatic ivoransaromaticivorans plasmid
pn11, complete plasmid-
sequence.) (nt:putative inner
< td/>membrane protein similar to
b.)
11192930_f2_26614941956651786−78< /td>Cyanobacterium synechocystisS75742(sr:pcc 6803, , pcc 6803)
< td/>(sr:pcc 6803, )
14552041_f2_26714951806 6708235117−5< HIL>Homo sapiensE25372(cl:proline-rich protein) (sr:,
< td/>man) (mp:12p13.2-12p13.2)
31695430_f2_27018067873290220−18Enterobacter cloacaeCONTIG254GTC ORF with score 220 to:
(ai:7000761662) (or:
Pseudomonas aeruginosa)
36412568_f2_271180681089362124−7Pyrococcus horikoshiiAP000006(sr:py rococcus horikoshii (str:
< td/>ot3) dna, (cl:pyrococcus
horikoshi) (de:pyrococcus
horikoshii ot3 genomic dna,
1166001-1485000 nt. position
(6/7).)
26697582_f2_272 149818069459152246Enterococcus faeciumCONTIG143GTC ORF with score 399 to:
C(ai:7000726277) (or:
Streptococcus pneumoniae)
10244830_f2_27818070990329775−77Escherichia coliP37643(de:region (o440))
12635431_f2_2811500567188337−31Klebsiella pneumoniaeContig460AGTC ORF with score 337 to:
(ai:7000761673) (or:
Pseudomonas aeruginosa)
7119031_f2_28418072996331170−10Klebsiella pneumoniaeContig452AGTC ORF with score 473 to:
(ai:7000824041) (or:
Enterobacter cloacae)
22831875_f2_287 150218073795264334Bacillus subtilis/BocillusD70044
globigii
34632206_f2_2881503180742127708355−29Thermus thermophilus/Q56213(sr:,subspthe rmophilus) (cc:
T.aquaticus/T.flavus2.6.1.16) (de:amido-
< td/>transferase) (glucosamine-6-
< td/>phosphate synthase))
22005208_f2_289150418075936311110−3< /td>Klebsiella pneumoniaeP33906(de:atp-dependen t rna helicase
dead)
10682091_f2_29015 05180761038345156Nephila clavipesA44112
22301393_ f2_31015061807744414791−3Enterobacter cloacaeCONTIG337GTC ORF with score 395 to:
(ai:7501739904) (or:
Klebsiella pneumoniae)
9902325_f2_31818078576191114−4Homo sapiensAB002322(sr:homo sapiens male brain
< td/>cdna to mrna, clone_lib:
< td/>pbluescriptii s) (de:human
mrna for kiaa0324 gene,
< td/>partial cds.)
30210400_f2_3211508 180791251416116−3 Klebsiella pneumoniaeU10553(fn:trans-acting regulatory
< td/>protein of aco operon) (de:
klebsiella pneumoniae cg43
aco operon regulatory protein
acok(acok) gene, complete
cds.) (nt:contains a nucleotide
< td/>binding domain and)
12629031_f2_32215091 8080429142148−9U80846(sr:caenorh abditis elegans
strain=bristol n2) (de:
caenorhabditis elegans
cosmid k06a9.) (nt:partial cds;
coded for by c. elegans cdna
yk50c7.5)
5166037_f2_32415101808192730847 5−45Klebsiella pneumoniaeContig519AGTC ORF with score 171 to:
(de:pseudomonas fluorescens
hypothetical metabolite
< td/>transport protein, positive
transcriptional regulator
(phnr), phosphonoacetate-
hydrolase (phna), 2-
phosphonopropionate
transporter (phnb), putative
putrescine/spermidine . . .
24469431_f2_32915111808 2939312392−36 Klebsiella terrigenaP52666(de:bud operon transcriptional
< td/>regulator)
12145205_f2_33 01512180831416471−94Haemophilus influenzaeP43913(ec:3.1.11.6) (de:large
subunit))
5119783_f2_33118084942313285−25Rhizobium leguminosarumU39409(de:r hizobium leguminosarum
bv. trifoliibv. trifolii tfua (tfua), gene,
< td/>completecds.) (nt:orf1; high
similarity to members of the
lysr)
31895841_f2_332151418085801266120−7 Enterobacter cloacaeCONTIG24GTC ORF with score 133 to:
(ai:286830) (or:Ensis minor)
(sr:ensis minor (clone: 1/6)
male adult gonads cdna to
mrna) (de:ensis minor (clone
1/6) nuclear protein mrna,
< td/>complete cds.) (nt:putative)
2598881_f2_33315151808669022999−3 Escherichia coliP37674(de:hypothetical 17.5 kd
protein in avta-selb intergenic
< td/>region (o157a))
29970656_f2_3351516795264268−23Cyanobacterium synechocystisS75903(sr:pcc 6803, , pcc 6803) (sr:
pcc 6803, )
15739656_f2_33915171808 8480159180−13 Boreogadus saidaU43200(de:boreogadu s saida anti-
< td/>freeze glycopeptide afgp poly-
< td/>protein precursorgene,
complete cds.) (nt:cleavage
of polyprotein at conserved
spacers r or)
6300082_f3_3441518180 89885294460−44Klebsiella pneumoniaeContig441AGTC ORF with score 612 to:
(ai:7000835977) (or:
Enterobacter cloacae)
22667086_f3_345 1519180901593530508 −49Klebsiella pneumoniaeContig441AGTC ORF with score 612 to:
(ai:7000835977) (or:
Enterobacter cloacae)
36506450_f3_350 15201809122687551379−141Staphylococcus aureusP37386(ec:3.6.1.—) (de:atpase))
33485005_f3_3571521180921575524
12364 062_f3_35815221809317705 89802−80Klebsiella pneumoniaeContig550AGTC ORF with score 1626 to:
(ai:7000838990) (or:
Enterobacter cloacae)
25832277_f3_359 1523180941026341687 −68Klebsiella pneumoniaeContig550AGTC ORF with score 687 to:
(ai:7000761751) (or:
Pseudomonas aeruginosa)
3366375_f3_360180951284427103−2AchromobacterL 81125(sr:pseudomonas sp (strain
georgiopolitanumimt37) dna) (de:pseudomonas
< td/>sp. (strain imt37) mono-
< td/>oxygenase subunit gene,
< td/>completecds.)
31876058_f3_3631809654618193−2Aspergillus fumigatusContig8154GTC ORF with score 433 to:
(ai:177837) (or:Zea mays)
< td/>(sr:, maize)
17038417_f3_364152618097927308151−7 Saccharomyces cerevisiaeQ04893(sr:,baker's yeast) (de:
hypothetical 113.1 kd protein
in pre5-fet4 intergenic region)
16135453_f3_3691527867288125−7Pyrococcus horikoshiiAP000002(sr:py rococcus horikoshii (str:
< td/>ot3) dna) (de:pyrococcus
horikoshii ot3 genomic dna,
287001-544000 nt. position
(2/7).) (nt:motif=prokaryotic
membrane lipoprotein lipid)
3395438_f3_3711528 18099178559494−1S29795(sr:, evening primrose)
picensis
36019441_f3_37 2152918100726241216−18Klebsiella pneumoniaeContig536AGTC ORF with score 662 to:
(ai:7000758235) (or:
Pseudomonas aeruginosa)
2226081_f3_374181011029342291−26Escherichia coliP45463(de:hypothetical
tra nscriptional regulator in
baca-ttda intergenic region)
15891467_f3_3761531203467799−1Mycobacterium tuberculosisZ83864(de:my cobacterium
tuberculosis h37rv complete
genome; segment 159/162.)
(nt:rv3854c, (mtcy01a6.14),
len: 489. possible)
33597541_f3_3981532 1810354618197−3Gallus gallus domesticusK02113(sr:chicken) (de:gallus gallus
vitellogenin gene coding for
phosvitin, exons 23 and24.)
32283283_f3_410153353417798−5 Klebsiella pneumoniaeContig545AGTC ORF with score 107 to:
(ai:139109) (or:Glycine max)
(cl:human dna-directed rna
polymerase ii largest chain)
(sr:, soybean) (ec:2.7.7.6)
32671908_f3_4111534181051806601163∠’8Dictyostelium discoideumP14328(sr:,slime mold) (de:spore
coat protein sp96)
30210965_f3_4171535 18106474157119−6U43200(de:boreogadu s saida anti-
< td/>freeze glycopeptide afgp poly-
< td/>protein precursorgene,
complete cds.) (nt:cleavage of
polyprotein at conserved
spacers r or)
14661393_f3_421153618 1072025674
16423325_f3_43 1153718108723240136−6Vibrio choleraeL19085(sr:vibrio cholerae (strain
n16961) dna) (de:vibrio
cholerea mannose-sensitive
hemagglutinin d (mshd) gene,
< td/>complete cds and mannose-
sensitive hemagglutinin e
(mshe) gene, partial cds.)
31741633_f3_4391538 18109564187150−9AF030027(fn:very large tegument
EHV-4protein) (de:equine herpes-
virus 4 strain ns80567,
complete genome.) (nt:
counterpart of hsv-1 gene ul36
and vzv gene 22)
31376011_f3_440153918 1101671556153−8P21561(sr:aa 2.2,) (de:hypothetical
50.6 kd protein in the 5′region
< td/>of gyra and gyrb (orf 3))
16026061_f3_444154018 111687228120−4Hemicentrotus pulcherrimusS42731(cl:collagen alpha 2(i) chain:
fibrillar collagen carboxyl-
terminal homology)
31734408_f3_4491541 1811241113699−4Streptomyces coriofaciensL20249(sr:st reptomyces coriofaciens
(library: isp 5485) dna) (de:
streptomyces coriofaciens
betaketoacyl synthase
homologue gene, partial cds.)
< td/>(nt:homologous to
saccharopolyspora erythraca)
24633438_f3_45018113573190
1543181141164387555−53Vibrio furnissiiP96166(ec:3.5.1.25) (de:deacetylase))
12589058_f3_4551 544181152061686732Xanthomonas campestrisP45597(sr:,pvcampestri s) (ec:2.7.3.9:
2.7.1.69) (de:(eiii-fru)))
12994577_f3_45715 4518116456151106−6longfin squidS56117(sr:, longfin squid)
12754501_f3_4591546181171068355154−8Homo sapiensAF048977(fn:splicing factor) (sr:human)
< td/>(de:homo sapiens ser/arg-
related nuclear matrix protein
(srm160) mrna, complete
cds.) (nt:160 kda)
20517261_f3_46015471 8118435914521131−115Enterobacter cloacaeCONTIG396GTC ORF with score 1739 to:
(ai:7501736107) (or:
Klebsiella pneumoniae)
34192941_f3_462181191566521147−10Aspergillus fumigatusContig9612GTC ORF with score 147 to:
(ai:7000761854) (or:
Pseudomonas aeruginosa)
862502_f3_4691549181201005334737−73Klebsiella pneumoniaeContig448AGTC ORF with score 2000 to:
(ai:7000821054) (or:
Enterobacter cloacae)
35176711_f3_470 155018121735244431Klebsiella pneumoniaeContig448AGTC ORF with score 2000 to:
(ai:7000821054) (or:
Enterobacter cloacae)
31508505_f3_471 1551181221512503121 −5Mycobacterium tuberculosisAL123456(de: mycobacterium
tuberculosis h37rv complete
genome; segment 15/162.) (nt:
rv0278c, (mtv035.06c), len:
957. member of m.)
15100753_f3_476155218 1231272423109−3Contig519AGTC ORF with score 171 to:
(de:pseudomonas fluorescens
hypothetical metabolite
< td/>transport protein, positive
transcriptional regulator
(phnr), phosphonoacetate-
hydrolase (phna), 2-
phosphonopropionate
transporter (phnb), putative
putrescine/spermidine . . .
14547707_f3_47815531812 41779592
6355030_f3_48415541812552817512 1−5Boreogadus saidaU43200(de:boreogadu s saida anti-
< td/>freeze glycopeptide afgp poly-
< td/>protein precursorgene,
complete cds.) (nt:cleavage of
polyprotein at conserved
spacers r or)
25972302_f3_487155518 1261158385359−33P37676(de:hypothetical 36.0 kd
protein in avta-selb intergenic
< td/>region precursor)
29928781_f3_4881556181271362453133−6 Streptomyces coelicolorAL031155(de:st reptomyces coelicolor
< td/>cosmid 3a7.) (nt:sc3a7.04,
questionable orf, : 384 aa; this
orf)
9884783_f3_489155718128831276181−11 Boreogadus saidaU43200(de:boreogadu s saida anti-
< td/>freeze glycopeptide afgp poly-
< td/>protein precursorgene,
complete cds.) (nt:cleavage of
polyprotein at conserved
spacers r or)
13127333_f3_491155818 1291554517
23964512_c1_49 7155918130489162167−12Streptomyces lividansP49322(de:hypothetical protein in
cpol 5′region (orf1)
(fragment))
2629788_c1_498156018131501166254 −22Streptomyces aureofaciensU21191(fn:unknown) (de:
streptomyces aureofaciens
glyceraldehyde-3-phosphate
dehydr ogenase(gap) gene,
< td/>complete cds, arac family
transcription activatorhomolog
and delta-5-3-ketosteroid
isomerase homolog (ksi)
< td/>genes, partial cds.) (nt:arac
family . . .
15021033_c1_50015611813 21587528189−12Bordetella pertussisP33445(de:hypothetical 33.8 kd
protein in fhac 3′region (orfa))
477066_c1_5071562 18133858285155−8PN0099(sr:, man)
11968917_c1_51315631 81341080359507−49 Klebsiella pneumoniaeContig519AGTC ORF with score 507 to:
(ai:7000761905) (or:
Pseudomonas aeruginosa)
10989775_c1_5141813546815591−2Caenorhabditis elegansAF000198(sr:caeno rhabditis elegans
strain=bristol n2) (de:
caenorhabditis elegans
cosmid t28f2.) (nt:similar to
cuticular collagen)
36148886_c1_5201565 18136876291134−8longfin squidS56117(sr:, longfin squid)
35728328_c1_5211566181371659552154−7Burkholderia cepaciaU41162(sr:burkhol deria cepacia
strain=17616) (de:
burkholderia cepacia d-serine
deaminase (dsd) gene,
< td/>complete cds.) (nt:unidentified
orf)
17070463_c1_5221567181381536511318 −26Pseudomonas syringaeP12374(sr:,pvtomato) (de:copper
< td/>resistance protein a precursor)
32156260_c1_5251568181391479492223−1 6Escherichia coliP18199(de:tyrosine-specific transport
protein (tyrosine permease))
13133327_c1_5261569181401611536180−1 3Klebsiella pneumoniaeContig387AGTC ORF with score 180 to:
(ai:7000761918) (or:
Pseudomonas aeruginosa)
31886391_c1_53018141759252147−7Burkholderia cepaciaU41162(sr:burkhol deria cepacia
strain=17616) (de:
burkholderia cepacia d-serine
deaminase (dsd) gene,
< td/>complete cds.) (nt:unidentified
orf)
36111661_c1_531157118142486161105< /td>−3Brassica napusU59446(sr:rape) (de:brassica napus
< td/>myrosinase-binding protein
related protein mrna, partial
cds.) (nt:divergently related to
myrosinase binding protein;)
6914581_c1_53215721155384114−3Neisseria gonorrhoeaeS75490(sr:nei sseria gonorrhoeae
ms11) (de:competence region:
iga=iga protease, coma=
< td/>transformation competence
< td/>(neisseria gonorrhoeae,
ms11, genomic, 3 genes,
2664 nt).)
11179081_c1_5341573 181442022673147−6 African clawed frogU85970(sr:african clawed frog) (de:
xenopus laevis middle
molecular weight neuro-
filament proteinnf-m(2) mrna,
< td/>complete cds.) (nt:neuronal
intermediate filament protein;
duplicated)
6540791_c1_53518145654217108−4Enterobacter cloacaeCONTIG481GTC ORF with score 255 to:
(ai:7000808495) (or:
Pseudomonas aeruginosa)
32286003_c1_54818146567188905−91Pseudomonas aeruginosaP40882(de:ferripyochel in binding
protein)
34494683_c1_549 15761814712994321324−135Escherichia coliA54227(cl:phosphoribosylamin o-
imidazole carboxylase carbon
dioxide-fixation chain:
phosphoribosylamino-
imidazole carboxylase carbon
dioxide-fixation chain
< td/>homology) (ec:2.1.2.—)
25885376_c1_5551577 18148477158134⠈’8Mycobacterium tuberculosisZ81368(de:my cobacterium
tuberculosis h37rv complete
genome; segment 106/162.)
(nt:rv2396, (mtcy253.25c),
len: 361. member of pe)
23960200_c1_557157818 149663220253−22Contig064AGTC ORF with score 484 to:
(ai:7000815407) (or:
Enterobacter cloacae)
36510806_c1_558 157918150960319139no gb taxonomy matchU52064(de:kaposi's sarcoma-
associated herpes-like virus
< td/>orf73 homolog gene, complete
cds.) (nt:herpesvirus saimiri
orf73 homolog)
11823793_c1_560158014854941647−16 9Haemophilus influenzaeP44518(de:signal recognition particle
protein (fifty-four homolog))
11751967_c1_5621581 181522558494−5Aspergillus fumigatusContig9333GTC ORF with score 136 to:
(ai:203141) (or:Rattus
norvegicus) (sr:, norway rat)
20210166_c1_57115821 8153951316926−93AF033497(de:pro teus mirabilis site-
< td/>specific recombinase (xerd)
gene, completecds.) (nt:
tyrosine recombinase family)
6509661_c1_5721583181547322431228−125Pseudomonas aeruginosaAF057031(de:ps eudomonas aeruginosa
< td/>putative th:disulfide inter-
change proteinprecursor
(dsbc) gene, complete cds.)
< td/>(nt:dsbc)
2088515_c1_57315 841815514734901875Pseudomonas aeruginosaP29365(ec:1.1.1.3) (de:homoserine
dehydrogenase, (hdh))
4386015_c1_5841585 18156630209188−15 Escherichia coliP39291(de:hypothetical 14.9 kd
protein in vacb-aidb intergenic
< td/>region (o133a))
17070342_c1_5871586717238
13088330_c 1_591158718158669222124−5Canis familiarisA45195(cl:guanylate cyclase catalytic
domain homology) (sr:, dog)
13072958_c1_59215881 81595337177893−1Q56214(sr:,subspthe rmophilus) (de:
T.aquaticus/T.flavusholliday junction dna helicase
ruvb)
26770193_c1_59315 8918160711236179−15Rickettsia prowazekiiAJ235269Ricket tsia prowazekii strain
Madrid E, complete genome.
25916333_c1_597159015455142601−270 Pseudomonas aeruginosaP14756(ec:3.4.24.26) (de:metallo-
proteinase))
6147517_c1_598< /td>1591181621152383 809−80Bacillus subtilis/BacillusP54550 (ec:1.—.—.—) (de:probable
globigiinadh-dependent flavin oxido-
reductase yqjm,)
4956508_c1_6011592 181631308435221−15Streptomyces pristinaespiralisS70171
11927056_c1_615159318164579123−6Alphaherpesvirus pseudo-X79983(de:pseudorabies virus dna.)
ra bies virus PRV(nt:putative assembly protein)
32526043_c1_61815941521506
32667532_ c1_623159518166570189120−6Lyme disease spirocheteD70170(sr:, lyme disease spirochete)
7073432_c1_627159618167696231
15822950 _c1_6291597181681521506< /td>810−79Pseudomonas aeruginosaU79580(de:pseu domonas aeruginosa
< td/>pilk gene, partial cds; and pill,
< td/>chpa, chpb, chpc, chpd, and
chpe genes, complete cds.)
< td/>(nt:cheay homolog)
25567930_c1_63215981005334176−10< /td>Gallus gallus domesticusI50206(cl:collagen alpha 2(i) chain:
fibrillar collagen carboxyl-
terminal homology) (sr:,
< td/>chicken)
11963257_c1_63415 9918170894297141−9Klebsiella pneumoniaeContig429AGTC ORF with score 141 to:
(ai:7000762026) (or:
Pseudomonas aeruginosa)
2214691_c1_63518171813270142−10Klebsiella pneumoniaeContig429AGTC ORF with score 142 to:
(ai:7000762027) (or:
Pseudomonas aeruginosa)
24738900_c1_63618172774257108−6Klebsiella pneumoniaeContig411AGTC ORF with score 108 to:
(ai:7000762028) (or:
Pseudomonas aeruginosa)
25672941_c1_638181731509502
14569568_c1_6401603181741239< /td>41297−2Plasmodi um knowlesiP04922(sr:nuri,) (de:circumsporozoite
< td/>protein precursor (cs))
14975752_c1_6501604 18175801266286−25 Haemophilus influenzaeP45277(de:hypothetical
< td>transcriptional regulator
hi1623)
32714583_c1_652160518176975324139 −7Mycobacterium tuberculosisAL123456(de: mycobacterium
tuberculosis h37rv complete
genome; segment 123/162.)
(nt:rv2839c, (mtcy16b7.03),
len: 900. probable infb.)
2477041_c1_6531606 181771542513126−5 Aspergillus fumigatusContig8665GTC ORF with score 154 to:
(ai:7501004138) (or:Mus
< td/>musculus) (sr:house mouse)
(de:mus musculus plenty-of-
< td/>prolines-101 mrna, complete
cds.) (nt:binds to several sh3
domain containing proteins)
31656291_c2_6571607 181781311436100−3< /td>Klebsiella pneumoniaeContig548AGTC ORF with score 90 to:
(ai:158543) (or:Orf virus) (sr:
orf virus (strain nz2) dna) (de:
orf virus homologue of retro-
viral pseudoprotease gene,
< td/>completecds.) (nt:orf4)
6355205_c2_65916081896290−4 Klebsiella pneumoniaeContig498AGTC ORF with score 90 to:
(ai:7000762051) (or:
Pseudomonas aeruginosa)
30167642_c2_66318180414137110−7Enterobacter cloacaeCONTIG495GTC ORF with score 172 to:
(ai:7000779951) (or:
Pseudomonas aeruginosa)
9769381_c2_66418181462153104−4Enterobacter cloacaeCONTIG495GTC ORF with score 140 to:
(ai:7000779952) (or:
Pseudomonas aeruginosa)
16915933_c2_665181821314437
13016333_c2_668161218183441146151−9Nephila clavipesAF027972(de:neph ila clavipes
flagelliform silk protein (flag)
mrna, partialcds.)
6505168_c2_670161318184522173135−8 Nephila clavipesAF027972(de:neph ila clavipes
flagelliform silk protein (flag)
mrna, partialcds.)
7213517_c2_671161418185813270
3192918 2_c2_675161518186432143< /td>194−16Klebsiella pneumoniaeContig519AGTC ORF with score 194 to:
(ai:7000762067) (or:
Pseudomonas aeruginosa)
20885266_c2_6781818716775581601< /td>−164Acinetobacter calcoaceticusP31002(ec:1.1.1.205 ) (de:
dehydrogenase) (impdh)
(impd))
16839191_c2_6791 6171818815905291982 −205Escherichia coliP04079(ec:6.3.5.2) (de:amido-
< td/>transferase) (gmp synthetase))
11897155_c2_684161818189591196124− 5Nephila clavipesAF027735(de:neph ila clavipes minor
< td/>ampullate silk protein misp1
< td/>mrna, partialcds.)
13147887_c2_688161918190429314304335 −9999Escherichia coliP15254(ec:6.3.5.3) (de:synthase)
(formylglycinamide ribotide
amidotransferase) (fgarat))
2628913_c2_6891620909302141−7Boreogadus saidaU43200(de:boreogadu s saida anti-
< td/>freeze glycopeptide afgp poly-
< td/>protein precursorgene,
complete cds.) (nt:cleavage of
polyprotein at conserved
spacers r or)
11207291_c2_691162118 192420139159−12Contig114AGTC ORF with score 305 to:
(ai:7000815986) (or:
Enterobacter cloacae)
13025331_c2_692 162218193666221105Klebsiella pneumoniaeContig553AGTC ORF with score 138 to:
(ai:7000757300) (or:
Pseudomonas aeruginosa)
31880211_c2_69518194612203117−6Enterobacter cloacaeCONTIG511GTC ORF with score 126 to:
(ai:7000725906) (or:
Streptococcus pneumoniae)
11172542_c2_698181951209402
1433416_c2_702162518196861286124−5Epstein-Barr virusP03181(sr:b95-8, human herpesvirus
4) (de:hypothetical bhlf1
< td/>protein)
5339708_c2_704162 618197900299
151 21081_c2_708162718198228 7590−4Klebsiella pneumoniaeContig389AGTC ORF with score 438 to:
(ai:7000835991) (or:
Enterobacter cloacae)
16838162_c2_709 162818199507168105Nephila clavipesAF027735(de:neph ila clavipes
minor ampullate silk protein
misp1 mrna, partialcds.)
4567088_c2_720162918200360119417−3 9Escherichia coliS07951(cl:escherichi a coli
ribosomal protein 119)
(mp:57 min)
22444181_c2_72116301 8201480159116−7X93605(de:zymomon as mobilis
pdhb & lpd genes & orf's
< td/>4, 5, 6, 7 & 8.)
12000641_c2_725163118 20219564
14704752_c2_733< /td>1632182031467488 2311−240Pseudomonas aeruginosaX65033(ec:4.2.99.2) (de:p.aeruginosa
hom and thrc genes for homo-
< td/>serine dehydrogenase and
threonine synthase.)
30267905_c2_736163318204369122
13175837 _c2_740163418205840279157−8Nephila clavipesU37520(de:nephil a clavipes
dragline silk protein
spidroin 1 gene,
< td/>partialcds.)
15758518_c2_745182061071356248−19Epstein-Barr virusP03211(sr:b95-8, human herpesvirus
4) (de:ebna-1 nuclear protein)
11222955_c2_7471636489162171−12Boreogadus saidaU43200(de:boreogadu s saida anti-
< td/>freeze glycopeptide afgp poly-
< td/>protein precursorgene,
complete cds.) (nt:cleavage of
polyprotein at conserved
spacers r or)
34614767_c2_757163718 208420139122−6Dictyostelium discoideumP14328(sr:,slime mold) (de:spore
coat protein sp96)
4010455_c2_75816381 8209702233275−24O33809(de:hypothetic al 20.8 kd
serot ype typhimuriumprotein in mesj-cutf
intergenic region)
20829756_c2_75916391770589149−7Homo sapiensY13247(sr:human) (de:homo
sapiens fb19 mrna.)
31492711_c2_766164018211372123109−6 Paracentrotus lividusA32249(cl:collagen alpha 2(i)
chain:fibrillar collagen
carboxyl-terminal homology)
(sr:, common urchin)
3335841_c2_771164118212567188131−7 Aspergillus fumigatusContig9870GTC ORF with score 111 to:
(ai:195953) (or:Homo
sapiens) (sr:, man)
20000313_c2_77516421 82131332443
32509761_c2_7 77164318214828275−20Escherichia coliP33218(de:hypothetical 23.7 kd
protein in ptrb-purt
intergenic region (orf153))
2610332_c2_779164417585851026−10 3Pseudomonas oleovoransQ00593(ec:1.1.99.—) (de:alcohol
dehydrogenase (acceptor),)
30277062_c2_7801645182161674557583∠’56Burkholderia cepaciaU29532(fn:4-methyl-o-phth alate/
phthalate permease) (de:
burkholderia cepacia
plasmid pmop pdxa homolog
gene, partial cds, 4-methyl-o-
phthalate reductase (mopa)
and 4-methyl-o-phthalate-
permease (mopb) genes,
complete cds.)
32672191_c2_7821646 18217843280144−7P03211(sr:b95-8, human herpesvirus
4) (de:ebna-1 nuclear
protein)
33791280_c2_784 1647182181479492
29821041_c2_7871648182191194397160−9Achromobac terA61183
georgiopolitanum
13025833_c2_793164918220 1119372471−45 Cyanobacterium synechocystisS75143(cl:response regulator
homology) (sr:pcc 6803, , pcc
6803) (sr:pcc 6803, )
20838342_c2_79416501822 111763911242−126P28353(de:peptide chain release
serotype typhimuriumfactor 2 (rf-2))
14180431_c2_795165115245071663−171 Acinetobacter calcoaceticusQ43990(ec:6.1.1.6) (de:lysyl-
< td/>trna synthetase, (lysine--
trna ligase) (lysrs))
25432091_c2_7961652780259170−13Mycobacterium tuberculosisZ83866(de:my cobacterium
tuberculosis h37rv complete
segment 133/162.) (nt:rv3066,
(mtcy22d7.15c), len: 202.
some similarity)
21745381_c2_797165318224648215
1062609 2_c2_7991654182251986661 305−26Escherichia coliE65030
14859387_c2_8 041655182261293430−15Enterobacter cloacaeCONTIG495GTC ORF with score 195 to:
(ai:7000762196) (or:
Pseudomonas aeruginosa)
22917711_c2_805182271152383212−17Klebsiella pneumoniaeContig502AGTC ORF with score 488 to:
(ai:7000839835) (or:
Enterobacter cloacae)
7057783_c2_8081 6571822824068012287 −237Escherichia coliP00864(ec:4.1.1.31) (de:
phosphoenolpyruvate
carboxylase, (pepcase) (pepc))
34266588_c2_8091658354118323−29Bordetella pertussisS66937(de:orf1 . . . orf3 {transposon-
like sequence} (bordetella
pertussis, genomic, 3 genes,
2300 nt).)
6337692_c3_81016591 8230450149590−57S66937(de:orf1 . . . orf3 {transposon-
like sequence} (bordetella
pertussis, genomic, 3 genes,
2300 nt).)
22947893_c3_8111660 18231573190
30175751_c3_8 12166118232540179−16Aquifex aeolicusC70408
29696043_ c3_8151662182331821606116−3Caenorhabditis elegansAF026211(sr:caeno rhabditis elegans
strain=bristol n2) (de:
caenorhabditis elegans
cosmid t13b5.) (nt:similar to
cuticular collagen)
3251651_c3_82116633429114298−1common tobaccoS34666(cl:phaseolus glycine-rich
protein 1.0) (sr:, common
tobacco)
32520217_c3_8401 664182351350449417Haemophilus influenzaeP44931(de:hyopthetical protein
hi0906)
21043_c3_8441665 18236681226287⠈’25Acinetobacter baumanniiCONTIG192GTC ORF with score 287 to:
C(ai:7000762236) (or:
Pseudomonas aeruginosa)
12223915_c3_85518237420139
166718238483160236−19Mycobact erium lepraeAL023635(de:mycoba cterium laprae
cosmid b1243.) (nt:
mlcb1243.36, unknown, : 385
aa; similar to)
29946958_c3_861166818 239618205138−7Sus scrofa domesticaS55316(sr:, domestic pig)
30722902_c3_86216691 8240609202144−8U92288(fn:helicase, helicase-
type 6 HHV-6primase complex) (de:
human herpesvirus 6 serotype
b putative major immediate-
< td/>carlygenes.) (nt:similar to
hhv6a u86, region ie-b)
31760465_c3_8631670 182411296431200−14Enterobacter cloacaeCONTIG273GTC ORF with score 135 to:
(ai:105942) (or:Equine
< td/>herpesvirus 1) (sr:ab4p,ehv-1)
< td/>(de:glycoprotein x precursor)
29963333_c3_8661671182421200399
1690204 2_c3_868167218243963320< /td>318−28Escherichia coliP43337(de:hypothetical 21.4 kd
protein in pabb-sdaa
intergenic region)
26032943_c3_8691673726241114−6Escherichia coliP76188(de:hypothetical 14.4 kd
protein in sodc-nema
intergenic region precursor)
31649167_c3_8701674182451590529117−3 blue musselAF015539(sr:blue mussel) (de:
mytilus edulis precollagen p
(precol-p) mrna, complete
cds.)
26432881_c3_87216 75182461131376122Saimiriine herpesvirus 2Q01042(sr:11,) (de:immediate-
early protein)
29426030_c3_878167641113691−3Aspergillus fumigatusContig136GTC ORF with score 865 to:
(ai:74529) (or:Petunia x
hybrida) (sr:,petunia) (de:
glycine-rich cell wall
structural protein 1
precursor)
20521905_c3_8821677 18248333110270⠈’23Haemophilus influenzaeP44382(de:30s ribosomal protein s16)
34409783_c3_88316781 8249543180360−33P44568(de:16s rrna processing
< td/>protein rimm)
26597555_c3_8841679 18250765254885−88 Escherichia coliP07020(ec:2.1.1.31) (de:methyl-
transferase))
6522583_c3_885< /td>1680182519393121 50−10Bacillus subtilis/BacillusP49851 (de:hypothetical 20.1 kd
globigiiprotein in hmp 5′region
< td/>(orf1))
6503216_c3_8871681182521248415116−3Dictyostelium discoideumAB009080(sr:di ctyostelium discoideum
< td/>(str:ax2) dna) (de:
dictyostelium discoideum gene
for trfa, complete cds.)
31907212_c3_8901682 18253102033999−2P10163(sr:,human) (de:salivary
proline-rich protein po
precursor (allele s))
1386638_c3_8921683182 5471423792−4l ongfin squidS56117(sr:, longfin squid)
12125781_c3_8981684182551353450
14579528_c3 _9001685182561335444341−31Escherichia coliP39292(de:(o232))
34647563_c3_90216861825716 95564485−46Esc herichia coliA65093
17072930_c3_9 151687182581839612−174Escherichia coliP21893(ec:3.1.—.—) (de:single-
stranded-dna-specific
< td/>exonuclease recj,)
11798140_c3_923168818259405134
12394657_c3_ 925168918260768255
10331656_c3_9271690182611196
31886037_c3_93018262456151103−5Klebsiella pneumoniaeContig275AGTC ORF with score 280 to:
(ai:7000843367) (or:
Enterobacter cloacae)
24472701_c3_938 169218263360119104Homo sapiensI53641(sr:, man) (mp:11p15.5-
11p15.5)
7166430_c3_943 1693182641962653426< /td>−39Bacillus subtilis/F70028
Bacillus globigii
36441592_c3_945 1694182651368455321 −29Rhodospirillum centenumU64519(de:rhodos pirillum centenum
(Rhodocista centenaria)cheay, chew, chey, cheb, and
cher genes, complete cds.)
12712701_c3_9461695 182661566521176−10Treponema pallidumAE001215(de:trep onema pallidum
section 31 of 87 of the
complete genome.) (nt:similar
to gp:1765973 percent ident:
99.75;)
16303966_c3_94716 9618267888295148−7Oryctolagus cuniculusP27884(sr:,rabbit) (de:brain calcium
channel bi-2 protein)
35281418_c3_94916971053350436−41< /td>Archaeoglobus fulgidusA69380
29822931_ c3_953169818289543180195−14Boreogadus saidaU43200(de:boreogadu s saida anti-
< td/>freeze glycopeptide afgp poly-
< td/>protein precursorgene,
complete cds.) (nt:cleavage of
polyprotein at conserved
spacers r or)
14972580_c3_960169918 270456151102−5Aspergillus fumigatusContig3749GTC ORF with score 178 to:
(ai:203144) (or:Rattus
norvegicus) (sr:, norway rat)
12987832_c3_96217001 82712181726337−29 Archaeoglobus fulgidusB69328
2438391_c 3_964170118272828275427−40Ralstonia eutrophaI39569
16490668_ c3_965170218273807268114−5Klebsiella pneumoniaeContig463AGTC ORF with score 544 to:
(ai:7000821222) (or:
Enterobacter cloacae)
32427062_c3_967 170318274681226106Boreogadus saidaU43200(de:boreogadu s saida anti-
< td/>freeze glycopeptide afgp poly-
< td/>protein precursorgene,
complete cds.) (nt:cleavage of
polyprotein at conserved
spacers r or)
24745832_c3_970170418 275555184263−23CONTIG311GTC ORF with score 263 to:
(ai:7000762362) (or:
Pseudomonas aeruginosa)
13161653_c3_9711827644714894−2mice|C57BL/6xCBA/CaJP54728
< td>hybridcomplementing complex
58 kd protein) (p58))
31739068_f1_117061 82771818605407−38 Rhodococcus sp.Q53139(sr:ni86/21,) (ec:1.3.1.54) (de:
precorrin-6x reductase,)
12367292_f1_21707 18278660219213−18< /td>Enterobacter cloacaeCONTIG304GTC ORF with score 213 to:
(ai:7000762372) (or:
Pseudomonas aeruginosa)
13008513_f1_41708182792172723472−42Pseudomonas aeruginosaAF051693(de:ps eudomonas aeruginosa
< td/>hydroxamate-type
ferrisiderophore receptor
(pfua) gene, complete cds.)
< td/>(nt:pfua)
12785282_f1_5170 9182801764587162−7equine herpesvirus typeAF030027(fn:very large tegument
4 EHV-4protein) (de:equine herpes-
virus 4 strain ns80567,
complete genome.) (nt:
counterpart of hsv-1 gene
ul36 and vzv gene 22)
32292830_f1_121710182 811002333120−7Klebsiella pneumoniaeContig508AGTC ORF with score 224 to:
(ai:7000810707) (or:
Pseudomonas aeruginosa)
15875337_f1_1318282915304324−29Norway spruceQ08632(sr:,norway spruce:picea
< td/>excelsa) (ec:1.—.—.—) (de:
short-chain type
dehydrogenase/reductase,)
12635205_f 1_19171218283519172 189−13mice|C57BL/6xCBA/CaJA F062655(sr:house mouse) (de:mus
< td/>hybridmusculus plenty-of-prolines-
101 mrna, complete cds.) (nt:
binds to several sh3 domain
containing proteins)
16835455_f1_2417131071356132−5MicrobacteriumX79027
< td/>ammoniaphilummarnir and mamim.)
11069780_f1_25171418285432143118−6 Dictyostelium discoideumP14328(sr:,slime mold) (de:spore
coat protein sp96)
13129005_f1_2617151 82861587528921−92 Pseudomonas aeruginosaAF012537(de:ps eudomonas aeruginosa
< td/>acetyl-coa synthetase gene,
< td/>partial cds; andarginine and
ornithine binding protein
(aotj), membrane protein
(aotq), membrane protein
(aotm), aoto (aoto), atpase
(aotp), andargr (argr) genes,
complete cds.) ( . . .
10448783_f1_27171618287 1653550
12711631_f1_35171718288173757853 7−52Escherichia coliC64923(cl:hypothetical protein
hi0135)
36113908_f1_3617 1818289981326382−35Cyanobacterium syncchocystisS77308(sr:pcc 6803, , pcc 6803)
< td/>(sr:pcc 6803, )
16147593_f1_37171918290 20467
5081466_f1_4018291726241262−23Acinetobacter baumanniiCONTIG160GTC ORF with score 262 to:
C(ai:7000762410) (or:
Pseudomonas aeruginosa)
178762_f1_42 1721182921755584741 −73Mycobacterium tuberculosisZ84724(de:my cobacterium
tuberculosis h37rv complete
genome; segment 21/162.)
(nt:rv0418, (mtccy22g10.15),
len: 500 aa, lpql,)
33728456_f1_431722 18293522173263−23 Klebsiella pneumoniaeContig560AGTC ORF with score 106 to:
(ai:1500692508) (or:
Boreogadus saida) (de:
boreogadus saida anti-
< td/>freeze glycopeptide afgp poly-
< td/>protein precursorgene,
complete cds.) (nt:cleavage of
polyprotein at conserved
spacers r or)
16805388_f1_441723182 941482493171−9Herpes simplex virusAF015297(de:human herpesvirus 6
(type 6/strain Uganda-1102)(strain uganda-1102) ie2hom
mrna, complete cds.) (nt:
similar to the immediate-
< td/>early 2 protein of human)
16895907_f1_491724 1829521671152−10P00886(ec:4.1.2.15) (de:synthetase)
< td/>(3-deoxy-d-arabino-
heptulosonate 7-phosphate
synthase))
30364458_f1_511725182961404467189 −11mice|C57BL/6xCBA/CaJAF062655 (sr:house mouse) (de:mus
< td/>hybridmusculus plenty-of-prolines-
101 mrna, complete cds.) (nt:
binds to several sh3 domain
containing proteins)
30345457_f1_531726441146110−6Drosophila melanogasterP50887(sr:,fruit fly) (de:60s
ribosomal protein 122)
33706916_f1_57172718 2981281426159−8X89715(sr:baker's yeast) (de:
s.cerevisiae aob567, aof1001,
aoe110, aoe264 and aoe130
genes.)
11042917_f1_58172 81829925584
1068 2080_f1_6117291830070823 5118−4Boreogadus saidaU43200(de:boreogadu s saida anti-
< td/>freeze glycopeptide afgp poly-
< td/>protein precursorgene,
complete cds.) (nt:cleavage of
polyprotein at conserved
spacers r or)
15093917_f1_631730183 01963320486−46Moraxella sp.P24640(ec:3.1.1.3) (de:lipase 3
precursor, (triacylglycerol
lipase))
12708312_f1_64< /td>1731183021287428
19740681_f1_691732183037 77258192−15Asp ergillus fumigatusContig2661GTC ORF with score 192 to:
(ai:7000762439) (or:
Pseudomonas aeruginosa)
34479192_f1_711830413894621554−159Acinetobacter calcoaceticusP94132(ec:1.5.5.1) (de:
dehydrogenase) (electron-
< td/>transferring-flavoprotein
dehydrogenase))
16423780_f1_81173418305 1623540110−3< i>Drosophila melanogasterK02620(sr:d. melanogaster dna,
clone tm17) (de:d.
melanogaster tropomyosin
gene isoform 33 (9b0, exon
10b.) (nt:tropomyosin isoform
34 (9b))
29428816_f1_8617351 83061545514295−26 Klebsiella pneumoniaeContig522AGTC ORF with score 650 to:
(ai:7000788463) (or:
Pseudomonas aeruginosa)
33603317_f1_8918307131443797−4Enterobacter cloacaeCONTIG495GTC ORF with score 105 to:
(ai:7000799777) (or:
Pseudomonas aeruginosa)
12698906_f1_99183081026341150−7Microbacterium X79027(de:m.ammoniaphilum genes
< td/>ammoniaphilummamir and mamim.)
13152011_f1_1001738459152195−16Klebsiella pneumoniaeContig226AGTC ORF with score 195 to:
(ai:7000762470) (or:
Pseudomonas aeruginosa)
26050043_f1_108183101344447370−34Klebsiella pneumoniaeContig538AGTC ORF with score 507 to:
(ai:7000833443) (or:
Enterobacter cloacae)
32628291_f1_110 174018311993330247Klebsiella pneumoniaeContig538AGTC ORF with score 287 to:
(ai:7000833448) (or:
Enterobacter cloacae)
12317786_f1_112 174118312534177103Klebsiella pneumoniaeContig511AGTC ORF with score 93 to:
(ai:343484) (or:Pneumocystis
carinii) (sr:pneumocystis
carinii cdna to mrna) (de:
pneumocystis carinii (clone
gp22) major surface glyco-
protein (msg)mrna, 3′ end.)
< td/>(nt:‘one of multiple genes
< td/>encoding the major surface)
2598338_f1_1131742819272263−22African clawed frogS07498(cl:dermal gland protein apeg:
< td/>trefoil homology) (sr:, african
clawed frog)
15835056_f1_1151743 18314405134132−8A61183
georgiopol itanum
31891416_f1_11617 441831527691106 −5Homo sapiensS16506(sr:, man)
20900325_f1_11817451 8316414137171−13Contig508AGTC ORF with score 465 to:
(ai:7000827259) (or:
Enterobacter cloacae)
33597567_f1_119 174618317630209137Acinetobacter baumanniiCONTIG174GTC ORF with score 146 to:
C(ai:7000763044) (or:
Pseudomonas aeruginosa)
4011541_f1_12018318858285203−17Klebsiella pneumoniaeContig508AGTC ORF with score 203 to:
(ai:7000762490) (or:
Pseudomonas aeruginosa)
4947706_f1_12218319594197131−6Drosophila melanogasterK02620(sr:d. melanogaster dna,
clone tm17) (de:d.
melanogaster tropomyosin
gene isoform 33 (9b0, exon
10b.) (nt:tropomyosin isoform
34 (9b))
5181638_f1_12317491 8320318105
4400203_f1_128 1750183211368455313−28Enterobacter cloacaeCONTIG353GTC ORF with score 423 to:
(ai:7501738653) (or:
Klebsiella pneumoniae)
11845890_f1_12918322882293550−53Klebsiella pneumoniaeContig376AGTC ORF with score 550 to:
(ai:7000762499) (or:
Pseudomonas aeruginosa)
15660425_f1_131183231029342134−6Plasmodium knowlesiP04922(sr:nuri,) (de:circumsporozoite
< td/>protein precursor (cs))
25598556_f1_1321753 18324936311338−31 Enterobacter cloacaeCONTIG353GTC ORF with score 338 to:
(ai:7000762502) (or:
Pseudomonas aeruginosa)
520316_f1_133175418325594197244 −21Klebsiella pneumoniaeContig235AGTC ORF with score 245 to:
(ai:7000819432) (or:
Enterobacter cloacae)
10269817_f1_134 1755183261545514105 −3mice|C57BL/6xCBA/CaJU46463(sr:house mouse) (de:mus
< td/>hybridmusculus glutamine repeat
protein-1 mrna, complete
cds.) (nt:grp-1)
1254706_f1_1351756 18327423140131−9Aspergillus fumigatusContig9906GTC ORF with score 215 to:
(ai:7500759215) (or:
Candida albicans)
13145831_f1_139175718328355811853198−9999Haemophilus influenzaeP45128(de:transcriptio n-repair
coupling factor (tref))
20053193_f1_1401758768255191−14Micrococcus luteusJQ0405
22533330_f1 _1431759183301944647251−21Klebsiella pneumoniaeContig279AGTC ORF with score 441 to:
(ai:7000827322) (or:
Enterobacter cloacae)
24730291_f1_144 1760183317052341036 −104Pseudomonas aeruginosaP37452(ec:3.4.21.88) (de:lexa
repressor.)
16056531_f1_154 176118332873290375−35Enterobacter cloacaeCONTIG509GTC ORF with score 375 to:
(ai:7000762524) (or:
Pseudomonas aeruginosa)
660202_f1_158176218333429142100 −5Aspergillus fumigatusContig4476GTC ORF with score 149 to:
(ai:7000775270) (or:
Pseudomonas aeruginosa)
30572955_f1_15918334936311
1764183351206401
33672783_f1_164176518336447148105−6< /td>Acinetobacter baumanniiCONTIG202GTC ORF with score 116 to:
C(ai:7000762535) (or:
Pseudomonas aeruginosa)
21885205_f1_16518337873290127−5Saccharomyces cerevisiaeS59310(mp:13r)
24710007_f1_1661767183382708992−4Aspe rgillus fumigatusContig10292GTC ORF with score 229 to:
(ai:59485) (or:Saccharomyces
< td/>cerevisiae ) (sr:baker's
yeast) (de:s.cerevisiae
chromosome ix cosmid 9168.)
(nt:mal5, sta1, : 1367, cai:
0.3, amyh yeast p08640)
21586687_f1_168176854318097−2 Homo sapiensQ07283(sr:,human) (de:trichohyalin)
7300028_f1_16917 691834030099174 −13Enterobacter cloacaeCONTIG162GTC ORF with score 174 to:
(ai:7000762539) (or:
Pseudomonas aeruginosa)
31458283_f1_17618341675224104−4Aspergillus fumigatusContig9367GTC ORF with score 182 to:
(ai:99171) (or:Dictyostelium
< td/>discoideum ) (de:dictyostelium
< td/>discoideum sp96 gene for
spore coat protein sp96.)
10828957_f1_1971771183421011336123−5Streptomyces coelicolorAL021529(de:st reptomyces coelicolor
< td/>cosmid 10a5.) (nt:sc10a5.35,
possible ntp pyrophospho-
hydrolase, len:)
14177205_f1_1981772 18343621206124−5D90774(sr:escherichi a coli (strain:
k12) dna, clone_lib:kohara
lambda minise) (de:e.coli
genomic dna, kohara clone
< td/>#263(30.5-30.9 min.).)
(nt:orf_id:o263#22; similar to
(swissprot accession)
5213205_f1_2081773 1834457619197−5Enterobacter cloacaeCONTIG416GTC ORF with score 290 to:
(ai:7501753985) (or:
Klebsiella pneumoniae)
2041390_f1_214183451512503653−64Neisseria gonorrhoeaeAF071224(de:n eisseria gonorrhoeae
penicillin binding protein 3
(pbp3) gene, complete cds.)
< td/>(nt:pbp3)
12288968_f1_2211 7751834622574153−11Escherichia coliX70111(de:e.coli rmf gene for
ribosome modulation factor.)
32147833_f1_22217761272423275−24< /td>Klebsiella pneumoniaeContig450AGTC ORF with score 432 to:
(ai:7000820977) (or:
Enterobacter cloacae)
2526436_f1_2231 77718348429142142Acanthamoeba castellaniiAF085185(de:a canthamoeba castellanii
myosin-ia (mia) gene,
< td/>complete cds.) (nt:myosin-i)
10241641_f1_2241778< /td>1834931111036
358 29128_f1_234177918350420 139114−7longfin squidS56117(sr:, longfin squid)
11213191_f1_26217801835118246071729−178< /td>Escherichia coliP75776(de:hypothetical abc
transporter atp-binding protein
ybhf)
16288330_f1_264178 1183521158385756−75Escherichia coliP75774(de:hypothetical 41.6 kd
protein in moae-rhle
intergenic region)
12195758_f1_26517821122373333−30Pseudomonas aeruginosaX99514(fn:outer membrane
component of multidrug
efflux) (de:p.aeruginosa
mexc, mexf & oprn genes.)
31765807_f1_26617831452483676−66Vibrio choleraeAB012956(sr:vibr io cholerae (str:mo45)
< td/>dna) (de:vibrio cholerae
genes for o-antigen synthesis,
< td/>strain mo45, complete cds.)
< td/>(nt:unknown)
12761316_f1_27418355576191107−6Enterobacter cloacaeCONTIG500GTC ORF with score 107 to:
(ai:7000723201) (or:no gb
taxonomy match) (de:human
papillomavirus type 80 e6,
e7, e1, e2, e4, 12, and 11
genes.) (nt:putative)
29942291_f2_2841785< /td>183561194397681⠈’67Bacillus megateriumAJ000758(fn:involved in cobalamin
synthesis) (de:bacillus
< td/>megaterium 16kb genomic
sequence, cobalamin operon.)
16885780_f2_2871786402133191−15Enterobacter cloacaeCONTIG304GTC ORF with score 270 to:
(ai:7501734140) (or:
Klebsiella pneumoniae)
13089787_f2_28818358639212146−8Boreogadus saidaU43200(de:boreogadu s saida anti-
< td/>freeze glycopeptide afgp poly-
< td/>protein precursorgene,
complete cds.) (nt:cleavage of
polyprotein at conserved
spacers r or)
35276056_f2_289178819 35926186124−7 Boreogadus saidaU43200(de:boreogadu s saida anti-
< td/>freeze glycopeptide afgp poly-
< td/>protein precursorgene,
complete cds.) (nt:cleavage of
polyprotein at conserved
spacers r or)
33698782_f2_291178918 36029497107−5 Homo sapiensM74027(sr:homo sapiens (tissue
library: lambda-gem-11
(stratagene)) bloo) (de:
human mucin-2 gene, partial
cds.)
525656_f2_2921790< /td>18361951316510∠’49Rhodobacter capsulatusAF010496(de:rh odobacter capsulatus
< td/>strain sb1003, partial
genome.)
11175812_f2_294 1791183621152383123 −5Homo sapiensU82987(sr:human) (de:human bcl-2
< td/>binding component 3 (bbc3)
mrna, partial cds.) (nt:bbc3;
approximately 500 base pairs
< td/>missing from 5′
10605140_f2_29717921 8363813270123−4U52064(de:kaposi's sarcoma-
associated herpes-like virus
< td/>orf73 homolog gene, complete
cds.) (nt:herpesvirus saimiri
orf73 homolog)
14929081_f2_30117931224407164−10< /td>Enterobacter cloacaeCONTIG486GTC ORF with score 304 to:
(ai:7000797020) (or:
Pseudomonas aeruginosa)
36510461_f2_3041836530961031668< /td>−65Escherichia coliP39182(de:histidine-binding< /td>
periplasmic protein precursor
(hbp))
31344137_f2_305 1795183661401466829 −83Escherichia coliP20091(de:histidine transport system
permease protein hism)
34614217_f2_3091796 183672076894−5Aspergillus fumigatusContig4048GTC ORF with score 145 to:
(ai:380588) (or:Homo
sapiens) (sr:homo sapiens
(tissue library: lambda-gem-
11 (stratagene)) bloo) (de:
human mucin-2 gene, partial
cds.)
30707291_f2_311179 718368441146117 −6Homo sapiensAB002322(sr:homo sapiens male brain
< td/>cdna to mrna, clone_lib:
< td/>pbluescriptii s) (de:human
mrna for kiaa0324 gene,
< td/>partial cds.)
24506965_f2_3151798 183691020339459−43Pseudomonas aeruginosaS12643
2230706 6_f2_3171799183701164387 1016−102Pseudomonas putidaS64687
35628330_f2 _321180018371687228 103−3Orf virusD34768
36067068_f2_32218372348115
180218373759252109−4upland cottonL17308(sr:gossypium hirsutum
(strain coker 312) fiber cdna
to mrna) (de:gossypium
hirsutum proline-rich cell wall
protein mrna, completecds.)
16126456_f2_3371803< /td>18374987328114∠’4Pseudomonas alcaligenesU84154(de:pse udomonas alcaligenes
insertion sequence is1491
putativetransposase subunit
genes, complete cds.)
16605040_f2_3381804 183752586861
3167278_f2_3 391805183761353450−5Alphaherpesvirus pseudo-P11675(sr:indiana-funkhauser/becker ,
rabies virus PRVprv) (de:immediate-early
protein ie180)
10680166_f2_3451806183771119372677−66Rhodobacter capsulatusAF010496(ec.2.1.1.—) (de:rhodobacter
capsulatus strain sb1003,
partial genome.)
3223437_f2_3521807978325274−24Aspergillus fumigatusContig2004GTC ORF with score 274 to:
(ai:7000762722) (or:
Pseudomonas aeruginosa)
33409713_f2_35618379447148115−7longfin squidS56117(sr:, longfin squid)
16511562_f2_358180918380516171138−8 Saccharomyces cerevisiaeP47179(sr:,baker's yeast) (de:
precursor)
103527_f2_364181 0183811368455388−36MethanobacteriumG6 9051
thermoautotrophicum
4305406_ f2_365181118382837278558−54Pseudomonas aeruginosaL42622(sr:pseu domonas aeruginosa
< td/>(strain pao1) dna) (de:
pseudomonas aeruginosa
< td/>pilz and holb genes, complete
cds.) (nt:orf3)
4428802_f2_368181268422793−2Klebsiella pneumoniaeContig168AGTC ORF with score 93 to:
(ai:7000762738) (or:
Pseudomonas aeruginosa)
3152336_f2_37318384753250138−9Staphylococcus epidermidisCONTIG081GTC ORF with score 138 to:
C(ai:7000762743) (or:
Pseudomonas aeruginosa)
12292681_f2_37718385930309160−8equine herpesvirus type 1D88734(sr:equine herpesvirus 1
EVH-1< /td>(strain:bk343, isolate:3f clone)
dna) (de:equine herpesvirus 1
dna for membrane glyco-
protein, complete cds.)
445818_f2_381181518 386642213119−4Alphaherpesvirus pseudo-S04713(cl:herpesvirus immediate-
< td>rabies virus PRVearly protein ie175)
6103838_f2_3871816 18387322810752003−207< /td>Escherichia coliP21513(ec:3.1.4.—) (de:ribonuclease
c, (rnase c))
26265638_f2_389181718 38899333096−2 Volvox carteriS22697
10260293_f 2_390181818389510169105−3Boreogadus saidaU43200(de:boreogadu s saida anti-
< td/>freeze glycopeptide afgp poly-
< td/>protein precursorgene,
complete cds.) (nt:cleavage of
polyprotein at conserved
spacers r or)
21502067_f2_394181918 390152150699−3Enterobacter cloacaeCONTIG292GTC ORF with score 489 to:
(ai:236122) (or:Escherichia
coli) (sr:escherichia coli dna)
(de:escherichia coli tolqra
gene cluster dna.) (nt:orf 4;
putative)
14348936_f2_3951820 183912112703252 −18Canadian hard winter wheatB30843(cl:glutenin) (sr:, common
wheat)
33618776_f2_400182 1183921989662280−24Klebsiella pneumoniaeContig508AGTC ORF with score 364 to:
(ai:7000763040) (or:
Pseudomonas aeruginosa)
32166456_f2_407183931122373417−39Klebsiella penumoniaeContig376AGTC ORF with score 550 to:
(ai:7000762499) (or:
Pseudomonas aeruginosa)
14978401_f2_4101839427089223−19Klebsiella pneumoniaeContig376AGTC ORF with score 577 to:
(ai:7000819414) (or:
Enterobacter cloacae)
33792208_f2_411 182418395525174138Enterobacter cloacaeCONTIG353GTC ORF with score 138 to:
(ai:7000762781) (or:
Pseudomonas aeruginosa)
21886525_f2_41318396405134113−7Enterobacter cloacaeCONTIG353GTC ORF with score 113 to:
(ai:7000762783) (or:
Pseudomonas aeruginosa)
13800830_f2_42218397450149116−6Boreogadus saidaU43200(de:boreogadu s saida anti-
< td/>freeze glycopeptide afgp poly-
< td/>protein precursorgene,
complete cds.) (nt:cleavage of
polyprotein at conserved
spacers r or)
650256_f2_42518271839 8633210
17031917_f2_43318281839925418467 33−73Klebsiella pneumoniaeContig559AGTC ORF with score 733 to:
(ai:7000762803) (or:
Pseudomonas aeruginosa)
33864417_f2_438184001293430231−19Coxiella burnetiiP45680(de:hypothetical 15.8 kd
protein in fmu-rpmh
intergenic region)
11895933_f2_4441830250583498−4Homo sapiensI39004(sr:, man) (mp:9p21-9p21)
29429068_f2_4491831 18402432143130⠈’8Volvox carteriS22697
31673956_f 2_4501832184031656551
15034706_f2_4591833184041098365328−29Pseudomonas aeruginosaU73506(de:pseu domonas aeruginosa
< td/>ornithine utilization regulatory
< td/>(orur)gene, complete cds.) (nt:
regulatory locus for ornithine
utilization)
32546958_f2_463183418405456151177 −14Klebsiella pneumoniaeContig525AGTC ORF with score 177 to:
(ai:7000762833) (or:
Pseudomonas aeruginosa)
19777080_f2_464184066362111050−106Pseudomonas aeruginosaAF053982(de:ps eudomonas aeruginosa
< td/>putative molybdoterin-guanine
< td/>dinucleotidebiosynthesis
protein a (moba) and
cytochrome c precursor
protein(snr1) genes, complete
cds; and unknown genes.)
1253541_f2_465183618407861286783−78Pseudomonas aeruginosaAF053982(de:ps eudomonas aeruginosa
< td/>putative molybdoterin-guanine
< td/>dinucleotidebiosynthesis
protein a (moba) and
cytochrome c precursor
protein(snr1) genes, complete
cds; and unknown genes.)
32449068_f2_4671837531176107−4Homo sapiensM13228(sr:human la-n-5 neuro-
blastoma cell dna; clone n-
myc1) (de:human n-myc
< td/>oncogene protein mrna.) (nt:
n-myc protein)
11223577_f2_47118381506501
36041375_ f2_4731839184101362453156−7Homo sapiensQ07283(sr:,human) (de:trichohyalin)
3339181_f2_47818 4018411345114155−11Escherichia coliP42617(de:hypothetical 11.1 kd
protein in exur-tdcc
intergenic region)
26056533_f2_4791841384127124−8Escherichia coliP42618(de:hypothetical 15.1 kd
protein in exur-tdcc
intergenic region)
36114383_f2_48118421392463729−72Cyanobacterium synechocystisS76527(sr:pcc 6803, , pcc 6803)
< td/>(sr:pcc 6803, )
33708152_f2_49718431841 4450149139−10 Klebsiella pneumoniaeContig450AGTC ORF with score 139 to:
(ai:7000762867) (or:
Pseudomonas aeruginosa)
16135080_f2_49818415483160109−7Aspergillus fumigatusContig9443GTC ORF with score 109 to:
(ai:7000762868) (or:
Pseudomonas aeruginosa)
12366327_f2_499184162055684197−15Mycobacterium tuberculosisAL123456(de: mycobacterium
tuberculosis h37rv complete
genome; segment 150/162.)
(nt:rv3569c, (mtcy06g11.16c),
len: 291. probable)
31349016_f2_5091846 184171023340102−2< /td>Alphaherpesvirus pseudo-S04713(cl:herpesvirus immediate-
< td>rabies virus PRVearly protein ie175)
6519791_f2_5111847 184181536511109−2 no gb taxonomy matchU52064(de:kaposi's sarcoma-
associated herpes-like virus
< td/>orf73 homolog gene, complete
cds.) (nt:herpesvirus saimiri
orf73 homolog)
26267643_f2_5151848519172129−8Medicago sativaY16672(de:medicago sativa mrna
for putative arginine/serine-
rich splicingfactor.)
11041531_f2_51718 4918420990329157−8Saccharomyces cerevisiaeP08640(sr:,baker's yeast) (ec:3.2.1.3)
(de:glucosidase) (1,4-alpha-d-
glucan glucohydrolase))
32558336_f2_52018 50184211236411
1 4972715_f2_524185118422927157−11Aquifex aeolicusF70487
34244831_ f2_532185218423669222178−14Klebsiella pneumoniaeContig525AGTC ORF with score 178 to:
(ai:7000762902) (or:
Pseudomonas aeruginosa)
9852000_f2_545184241335444174−11Helicobacter pyloriF64606
17039705_f3 _561185418425429142 126−7Pseudomonas aeruginosaP15276(de:algr3))
12391418_f3_562185518426 546181102−3Caenorhabditis elegansZ83107(de:caenorh abditis elegans
cosmid f32a7, complete
sequence.) (nt:predicted using
< td/>genefinder; similar to
claustrin)
4895958_f3_5661856 184272247748132 −5mice|C57BL/6xCBA/CaJAF062655( sr:house mouse) (de:mus
< td/>hybridmusculus plenty-of-prolines-
101 mrna, complete cds.) (nt:
binds to several sh3 domain
containing proteins)
12508506_f3_5691857 18428906301159−8Micrococcus luteusJQ0405
16192931_f3 _5701858184291110369536−51Pseudomonas aeruginosaAF055999(de:ps eudomonas aeruginosa
< td/>hemin uptake locus, hypo-
< td/>thetical proteinphuw (phuw),
atpase component (phuv),
abc-type permease (phuu),
periplasmic binding protein
(phut), hemin degrading factor
(phus), and outer membrane
hemin receptor (ph . . .
13067826_f3_57418591843 050716892−2ra inbow troutQ04617(sr:,rainbow trout:salmo
gairdneri) (de:corticotropin-
lipotropin a precursor (pro-
< td/>opiomelanocortin) (pomc))
12364505_f3_57718601002333101−5longfin squidS56117(sr:, longfin squid)
3260458_f3_5781861 184321200399199−12equine herpesvirus type 1D88733(sr:equine herpesvirus 1
EVH-1< /td>(strain:hh1) dna) (de:equine
< td/>herpesvirus 1 dna for
membrane glycoprotein,
complete cds.)
7227061_f3_58118621 84331290429862−86 Campylobacter jejuniP45493(ec:3.5.1.32) (de:-
< td/>(hippuricase))
31897556_f3_583 186318434417138105−5Cyanobacterium synechocystisS76563(sr:pcc 6803, , pcc 6803)
< td/>(sr:pcc 6803, )
29949057_f3_58418641843 5834277742−73 Escherichia coliP52094(de:histidine transport system
permease protein hisq)
36501018_f3_5851865 18436867288106−3X96713(de:g.palli da mrna for
collagen.) (nt:putative)
15908516_f3_5871866< /td>184371293430116⠈’4Trypanosoma cruziA40215
31382308_f3_ 5881867184381113370 163−9Boreogadus saidaU43200(de:boreogadu s saida anti-
< td/>freeze glycopeptide afgp poly-
< td/>protein precursorgene,
complete cds.) (nt:cleavage of
polyprotein at conserved
spacers r or)
16922580_f3_593186818 4391500499104−5Contig386AGTC ORF with score 104 to:
(ai:7000762963) (or:
Pseudomonas aeruginosa)
2601002_f3_59618440615204
1870184411965654252−18Oryctola gus cuniculusP16230(sr:,rabbit) (de:precursor
(hcp))
30708336_f3_598 18711844227691
187218443765254166−10Haloferax sp.P21561(sr:aa 2.2,) (de:hypothetical
50.6 kd protein in the 5′region
< td/>of gyra and gyrb (orf 3))
1430216_f3_6011873184 441680559196−15Contig347AGTC ORF with score 349 to:
(ai:7000795297) (or:
Pseudomonas aeruginosa)
26303933_f3_60218445483160113−5Boreogadus saidaU43200(de:boreogadu s saida anti-
< td/>freeze glycopeptide afgp poly-
< td/>protein precursorgene,
complete cds.) (nt:cleavage of
polyprotein at conserved
spacers r or)
36458456_f3_603187518 4461506501
12995816_f3_60 41876184471272423−70Escherichia coliP00888(ec:4.1.2.15) (de:synthetase)
< td/>(3-deoxy-d-arabino-
heptulosonate 7-phosphate
synthase))
675808_f3_605 187718448402133111−5human herpesvirus type 6U92288(fn:helicase, helicase-primase
HHV-6complex) (de:human herpes-
virus 6 serotype b putative
major immediate-earlygenes.)
(nt:similar to hhv6a u86,
region ie-b)
33869416_f3_6061878 18449615204117−4AF000298(sr:caeno rhabditis elegans
strain=bristol n2) (de:
caenorhabditis elegans
cosmid w03d2.) (nt:weak
similarity to collagens;
< td/>glycine- and)
16208412_f3_60818791 84502016671134−8CONTIG450GTC ORF with score 220 to:
(ai:7501795304) (or:
Klebsiella pneumoniae)
13089787_f3_609184511107368
14188331_f3_610188118452501166171−13Pseudom onas denitrificansP21635(de:code protein)
3256891_f3_61118821104367112−3Trypanosoma cruziA44937(cl:kinetoplast-assoc iated
< td/>protein)
33478158_f3_61618 83184541005334458Acinetobacter calcoaceticusP94132(ec:1.5.5.1) (de:dehydro-
genase) (electron-transferring-
flavoprotein dehydrogenase))
29822506_f3_624188 418455702233122 −5Boreogadus saidaU43200(de:boreogadu s saida anti-
< td/>freeze glycopeptide afgp poly-
< td/>protein precursorgene,
complete cds.) (nt:cleavage of
polyprotein at conserved
spacers r or)
16538555_f3_628188518 456510169104−4Homo sapiensAF053536(sr:human) (de:homo sapiens
a-kinase anchoring like
protein mrna, complete cds.)
< td/>(nt:akalp)
7042705_f3_6311 886184571629542112human herpesvirus type 6U92288(fn:helicase, helicase-primase
HHV-6complex) (de:human herpes-
virus 6 serotype b putative
major immediate-earlygenes.)
(nt:similar to hhv6a u86,
region ie-b)
3932307_f3_63618871 8458867288190−13AF002133(de:mycobac terium avium
< td/>strain gir10 transcriptional
< td/>regulator (mav81)gene, partial
cds, aconitase (acn), invasin 1
(inv1), invasin 2(inv2),
transcriptional regulator
(moxr), ketoacyl-
reductase(fabg), enoyl-
reductase (inha) and ferro . . .
15103575_f3_64018881845 92016691−4Haemophilus influenzaeU20229(de:haem ophilus influenzae
< td/>bola (bola), glutathione
reductase (gor), phosphatidyl-
serine decarboxylase (psd),
30k protein (rpmf), genes,
complete cds.) (nt:orf121)
1277012_f3_642188918460606201570−55 Escherichia coliP27244(de:hypothetical 23.2 kd
protein in rne-rpmf intergenic
< td/>region (orfy))
11175767_f3_6521890711236254−21Boreogadus saidaU43200(de:boreogadu s saida anti-
< td/>freeze glycopeptide afgp poly-
< td/>protein precursorgene,
complete cds.) (nt:cleavage of
polyprotein at conserved
spacers r or)
12995431_f3_655189118 462636211102−4Aspergillus fumigatusContig9346GTC ORF with score 102 to:
(ai:7000763025) (or:
Pseudomonas aeruginosa)
29317706_f3_6571846347715892−5Enterobacter cloacaeCONTIG470GTC ORF with score 92 to:
(ai:7000763027) (or:
Pseudomonas aeruginosa)
12520635_f3_6611846463621192−1mice|C57BL/6xCBA/CaJAJ001116 (sr:house mouse) (de:mus
< td/>hybridmusculus mrna for uncx4.1
protein.)
30198417_f3_666189418465735244180 −12Herpes simplex virus (type 6/AF015297(de:human herpesvirus 6
strain Uganda-1102)(strain uganda-1102) ie2hom
mrna, complete cds.) (nt:
similar to the immediate-early
< td/>2 protein of human)
16117205_f3_668189518466540179129−6 Homo sapiensAF048977(fn:splicing factor) (sr:human)
< td/>(de:homo sapiens ser/arg-
related nuclear matrix protein
(srm160) mrna, complete cds.)
< td/>(nt:160 kda)
2616660_f3_669189618 467564187152−11S76054(sr:pcc 6803, , pcc 6803)
< td/>(sr:pcc 6803, )
22551030_f3_67018971846 81320439364−34Klebsiella pneumoniaeContig508AGTC ORF with score 364 to:
(ai:7000763040) (or:
Pseudomonas aeruginosa)
22469466_f3_67118469459152207−17Klebsiella pneumoniaeContig508AGTC ORF with score 207 to:
(ai:7000763041) (or:
Pseudomonas aeruginosa)
10944462_f3_67418470717238100−2CollectotrichumL76169(sr:collectotrichum
gloeos porioidesgloeosporioides
(individual_isolate 21808,
strai) (de:colletotrichum
gloeosporioid es reverse
transcriptase gene, 3′ endof
< td/>cds.) (nt:orf2)
33775958_f3_6771900 184711173390268−23 Bacillus subtilis/BacillusP54527 (de:hypothetical 270.0 kd
globigiiprotein in spo0a-mmga
< td/>intergenic region)
2438933_f3_6781901184721779592354−32Klebsiella pneumoniaeContig441AGTC ORF with score 662 to:
(ai:7000835900) (or:
Enterobacter cloacae)
6742816_f3_6791 902184731095364130Enterobacter cloacaeCONTIG343GTC ORF with score 130 to:
(ai:7000763049) (or:
Pseudomonas aeruginosa)
16697812_f3_688184741587528178−12Aspergillus fumigatusContig9906GTC ORF with score 286 to:
(ai:7501763937) (or:
Klebsiella pneumoniae)
6772636_f3_68918475954317179−14Klebsiella pneumoniaeContig508AGTC ORF with score 341 to:
(ai:7000827262) (or:
Enterobacter cloacae)
13072766_f3_690 1905184761056351144 −7Homo sapiensP17600(sr:,human) (de:synapsins ia
and ib (brain protein 4.1))
36070216_f3_6911906 18477116438797−2Q60987(sr:,mouse) (de:transcription
hybridfactor bf-1 (brain factor 1)
(bf1))
32713278_f3_69619071847829196116−7< /td>Klebsiella pneumoniaeContig550AGTC ORF with score 116 to:
(ai:7000763066) (or:
Pseudomonas aeruginosa)
25864503_f3_70018479576191137−9Enterobacter aerogenesP08848(sr:,aero bacter aerogenes)
< td/>(de:cell division inhibitor)
32520308_f3_70919091848035461181973− 98Klebsiella pneumoniaeContig534AGTC ORF with score 973 to:
(ai:7000763079) (or:
Pseudomonas aeruginosa)
2817790_f3_716184812169722193−14Pseudomonas aeruginosaJQ0133
3580652 8_f3_7171911184821932643 813−81Escherichia coliS56616(cl:soluble lytic trans-
glycosylase) (ec:3.2.1.—)
(mp:100 min)
15682058_f3_71819121 84831977658
31891452_f3_7 23191318484471156−6mice|C57BL/6xCBA/CaJAF06 2655(sr:house mouse) (de:mus
< td/>hybridmusculus plenty-of-prolines-
101 mrna, complete cds.) (nt:
binds to several sh3 domain
containing proteins)
31895790_f3_7241914 184851908635
12629456 _f3_7271915184861545514< /td>186−14Enterobacter cloacaeCONTIG461GTC ORF with score 423 to:
(ai:7501780823) (or:
Klebsiella pneumoniae)
2226376_f3_7301848725584367 −34Pseudomonas aeruginosaAF053982(de:ps eudomonas aeruginosa
< td/>putative molybdoterin-guanine
< td/>dinucleotidebiosynthesis
protein a (moba) and
cytochrome c precursor
protein(snr1) genes, complete
cds; and unknown genes.)
15022137_f3_7381917654217417−39Rhizobium leguminosarumQ52828(de:gsta protein)
34646026_f3_739191894531492−3Aspergillus fumigatusContig6884GTC ORF with score 92 to:
(ai:7000763109) (or:
Pseudomonas aeruginosa)
12931277_f3_74018490873290167−10Xanthomonas campestrisU70053(de:xant homonas campestris
< td/>gumn, gumo, and gump
genes, complete cds.)
21724206_f3_7411920 1849113414461071−108Pseudomonas putidaAF029714(de:pseudo monas putida
repressor (phan), regulatory
< td/>protein (pham), enoyl-coa
hydratase i (phaa), enoyl-coa
hydratase ii (phab), 3-
hydroxyacyl-coa dehydro-
genase (phac), ketothiolase
(phad), phenylacetyl-coa
ligase (phae), ring-
< td/>oxidation . . .
29938580_f3_74719211849 2450149
21892961_f3_75219221849369923217 3−13Klebsiella pneumoniaeContig275AGTC ORF with score 199 to:
(ai:7000843618) (or:
Enterobacter cloacae)
7052083_f3_7531 923184941089362173Klebsiella pneumoniaeContig479AGTC ORF with score 325 to:
(ai:7000801275) (or:
Pseudomonas aeruginosa)
16041452_f3_763184952475824284−24Enterobacter cloacaeCONTIG334GTC ORF with score 458 to:
(ai:7501749986) (or:
Klebsiella penumoniae)
24694567_f3_76918496567188
192618497642213132−8Klebsiella pneumoniaeContig529AGTC ORF with score 94 to:
(ai:200446) (or:Mus
< td/>musculus) (sr:, house mouse)
14495841_f3_776192718498900299122−7 Hydra magnipapillataA41132(cl:unassign ed collagens)
10057907_f3_777192818499321106
9792667_ f3_780192918500543180119−8Klebsiella pneumoniaeContig446AGTC ORF with score 134 to:
(ai:5500701468) (or:Equine
< td/>herpesvirus 4) (fn:very large
< td/>tegument protein) (de:equine
< td/>herpesvirus 4 strain ns80567,
complete genome.) (nt:
counterpart of hsv-1 gene ul36
and vzv gene 22)
10547683_f3_805193018 5011065354717−71C64816
20211638_f3_8 111931185021185394−86Escherichia coliP75775(de:hypothetical 42.1 kd
protein in moae-rhlc
intergenic region)
2558257_f3_8131932185031338445314−27Pseudomonas aeruginosaX99514(fn:outer membrane
component of multidrug
efflux) (de:p.aeruginosa
mexe, mexf & oprn genes.)
16531381_f3_8181933954317350−32Aquifex aeolicusD70462
16276087_ f3_827193418505672223
24640756_c1_835193518506954317309−27Bacillus subtilis/BacillusI40425 (ec:3.1.1.1)
globigii
43 82831_c1_836193618507600 199
35572961_c1_855193718 508330109
2082711_c1_856< /td>1938185097622533 02−27Enterobacter cloacaeCONTIG447GTC ORF with score 302 to:
(ai:7000763226) (or:
Pseudomonas aeruginosa)
36222655_c1_8651851030099
1940185111188395714−70Agrobact erium tumefaciensAB006858(sr:a grobacterium tumefaciens
(TI PLASMID PTIBO542)(strain:maff301001) plasmid:
pti-sakur) (de:agrobacterium
< td/>tumefaciens plasmid pti-
sakura trai and trb(b, c, d, e,
j, k, l, f, g, h and i) genes,
complete cds.) (nt:probable
conjugal transfer protein)
14272716_c1_87319411239412
29346031_ c1_8741942185131239412171−11Agrobacterium tumefaciensAB006858(sr:a grobacterium tumefaciens
(TI PLASMID PTIBO542)(strain:maff301001) plasmid:
pti-sakur) (de:agrobacterium
< td/>tumefaciens plasmid pti-
sakura trai and trb(b, c, d, e,
j, k, l, f, g, h and i) genes,
complete cds.) (nt:probable
conjugal transfer protein)
16151716_c1_8751943345114
10838205_c 1_877194418515753250133−6Volvox carteriS22697
36064783_c 1_879194518516561186121−5Nephila clavipesAF027735(de:neph ila clavipes minor
< td/>ampullate silk protein misp1
< td/>mrna, partialcds.)
32292936_c1_8811946185171332443454∠’43Agrobacterium tumefaciensAB006858(sr:a grobacterium tumefaciens
(TI PLASMID PTIBO542)(strain:maff301001) plasmid:
pti-sakur) (de:agrobacterium
< td/>tumefaciens plasmid pti-
sakura trai and trb(b, c, d, e,
j, k, l, f, g, h and i) genes,
complete cds.) (nt:probable
conjugal transfer protein)
33629068_c1_8841947501166
32630406_c 1_889194818519492163114−4Nephila clavipesAF027735(de:neph ila clavipes minor
< td/>ampullate silk protein misp1
< td/>mrna, partialcds.)
14316576_c1_8931949185201746581959∠’96Escherichia coliP05021(ec:1.3.3.1) (de:(dhodchase))
4943765_c1_896195 018521432143154 −11Caenorhabditis elegansU55366(sr:caenorh abditis elegans
strain=bristol n2) (de:
caenorhabditis elegans
cosmid f41f3.) (nt:similar to
cuticle collagen)
11929780_c1_9021951 185221542513
34425628 _c1_904195218523747248617−60Bordetella pertussisP16574(de:virulence factors putative
position transcription
regulator bvga)
32552308_c1_9061953 18524996331236−17 mice|C57BL/6xCBA/CaJS59856(cl:collagen alpha 1(i) chain:
h ybridfibrillar collagen carboxyl-
terminal homology:von
willebrand factor type c
repeat homology) (sr:, house
< td/>mouse)
12300707_c1_9151954 185251395464668 −67Methanococcus jannaschiiL77117(de:meth anococcus jannaschii
< td/>section 116 of 150 of the
complete genome.) (nt:similar
to gb:x75879 sp:p54144 pid:
2299143)
26383586_c1_919195 518526408135104 −5Herpesvirus papioU23857(fn:binds to orip to permit
replication of the) (de:herpes-
virus papio brrf2 homolog
gene, partial cds, ebnal,
bkrf2homolog and bkrf3
< td/>homolog genes, complete cds,
and bkrf4 homologgene,
partial cds.) (nt:similar to
ebnal of epstein-bar v . . .
12236683_c1_92719561852 71299432157−7 Acanthamoeba castellaniiAF085185(de:a canthamoeba castellanii
myosin-ia (mia) gene,
< td/>complete cds.) (nt:myosin-i)
31275968_c1_9321957< /td>185282439812639⠈’62Mycobacterium lepraeP53435(ec:1.1.99.5) (de:glycerol-3-
< td/>phosphate dehydrogenase,)
34104683_c1_938195 818529618205152 −10Epstein-Barr virusP03211(sr:b95-8, human herpesvirus
4) (de:ebna-1 nuclear protein)
36041716_c1_93919591389462144−6Pseudomonas aeruginosaZ54213(de:p.ae ruginosa algy gene.)
13021006_c1_9441960185311701566103−2Pinctada fucataD86074(sr:pinctada fucata cdna to
mrna) (de:pinctada fucata
mrna for insoluble protein,
complete cds.)
31926091_c1_9451961 18532720239
6892666_c1_95 2196218533402133110−6Aspergillus fumigatusContig8029GTC ORF with score 318 to:
(ai:120145) (or:Bos taurus)
(sr:, cattle)
31427037_c1_9541963699232106−3mice|C57BL/6xCBA/CaJA55817(cl:unassig ned ser/thr or tyr-
hyb ridspecific protein kinases:
protein kinase homology) (sr:,
< td/>house mouse)
10680142_c1_958196418535420139106−5 Chlamydomonas reinhardtiiS50755
strain UTEX 1061
14978900_c1_96119651 853624982133−9Klebsiella pneumoniaeContig559AGTC ORF with score 133 to:
(ai:7000763331) (or:
Pseudomonas aeruginosa)
32442881_c1_96218537567188
196718538816271317−29Klebsiel la pneumoniaeContig435AGTC ORF with score 386 to:
(ai:7000841537) (or:
Enterobacter cloacae)
6745642_c1_9661 96818539834277322Vibrio parahaemolyticusPC2359(n:na+/h+ antiporter
< td/>hypothetical 175 protein)
16917080_c1_96719691200399789−78< /td>Escherichia coliP75949(de:hypothetical 37.6 kd
protein in fhue-ndh intergenic
< td/>region)
35443780_c1_97018541966321438−41Enterobacter cloacaeCONTIG434GTC ORF with score 1384 to:
(ai:7501766959) (or:
Klebsiella pneumoniae)
6520831_c1_972185422124707666−66Enterobacter cloacaeCONTIG434GTC ORF with score 666 to:
(ai:7000763342) (or:
Pseudomonas aeruginosa)
22838341_c1_973185431149382138−7miceS50883(sr:mice macrophage) (de:
putative transcription
regulator {clone t2, repetitive
< td/>sequence}(mice, macrophage,
mrna, 1263 nt).) (nt:method:
conceptual translation
supplied by author.)
4765643_c1_974197314194721633−168 Escherichia coliH64733(cl:arginine permease) (mp:
2.6 min)
4506433_c1_975197418 54513474481359−139ActinobacillusU24492(de:actinobacillus pleuro-
pleuropneumoniaepneumoniae 48 kda outer
< td/>membrane protein(aopa) gene,
< td/>complete cds.) (nt:similar to
ngra gene of vibrio
alginolyticus)< /td>
290908_c1_97619751854612154041392−142< HIL>Vibrio alginolyticusAB008030(sr:vibrio alginolyticus dna)
(de:vibrio alginolyticus genes
< td/>for na-translocating nadh-
< td/>quinonereductase complex,
nqr operon, complete
genome.) (nt:hydrophobic
< td/>membrane protein with pi
8.14; ngrb.)
21914680_c1_9771976185471449482759−75Haemophilus influenzaeP43958(de:hypothetical protein
hi0168/169)
17033265_c1_9791854812754241586< /td>−163Haemophilus influenzaeD64052(pn:nadh-quinone reductase
beta chain, na+-translocating)
10652016_c1_981 19781854980726894Haemophilus influenzaeP43960(de:hypothetical protein
hi0173)
5197540_c1_98619 79185501254417385Haemophilus influenzaeP45247(de:hypothetical abc
transporter atp-binding protein
hi1549)
11970183_c1_9871 980185511056351180Klebsiella pneumoniaeContig508AGTC ORF with score 180 to:
(ai:7000763357) (or:
Pseudomonas aeruginosa)
32708527_c1_994185521149382727−72Haemophilus influenzaeP44491(ec:2.7.1.130) (de:tetra-
< td/>acyldisaccharide 4′-kinase,
(lipid a 4′-kinase))
6385456_c1_995198218553987328684− 67Escherichia coliP04951(ec:2.7.7.38) (de:synthetase)
< td/>(cmp-2-keto-3-
deoxyoctulosonic acid
synthetase) (cks))
16578506_c1_99819831855450116696−5CONTIG007GTC ORF with score 207 to:
C(ai:7000811044) (or:
Pseudomonas aeruginosa)
9894758_c1_1000185551071356199−12Herpes simplex virus (type 6/AF015297(de:human herpesvirus 6
strain Uganda-1102)(strain uganda-1102) ie2hom
mrna, complete cds.) (nt:
similar to the immediate-
< td/>early 2 protein of human)
20125952_c1_10061985693230168−12Bacillus subtilis/BacillusG70044
globigii
16831555_c1_10101855718962227 −19Haemophilus influenzaeG64051(cl:esch erichia coli ribosomal
protein 132)
16103155_c1_10111987 185581014337731−72Salmonella choleraesuisAF044668(fn:putative role in fatty acid
ser otype typhimuriumor phospholipid) (de:
salmonella typhimurium
(g30k) gene, partial cds; and
50s ribosomalprotein 132
(rpmf), plsx (plsx), 3-oxoacyl-
< td/>acyl carrier proteinsynthase iii
(fabh), malonyl coa-acyl
carrier prote . . .
35666015_c1_10121988185 5910443471540−158 Pseudomonas aeruginosaU91631(de:pseu domonas aeruginosa
< td/>plsx protein homolog (plsx)
gene, partialcds; and malonyl-
coa:acyl carrier protein
transacylase (fabd), 3-oxo-
acylacyl carrier protein
reductase (fabg), acyl
carrierprotein (acpp), and 3-
oxoacyl-acyl carrier . . .
26370387_c1_10151989185 6012664212119−219 Pseudomonas aeruginosaU91631(de:pseu domonas aeruginosa
< td/>plsx protein homolog (plsx)
gene, partialcds; and malonyl-
coa:acyl carrier protein
transacylase (fabd), 3-oxo-
acylacyl carrier protein
reductase (fabg), acyl
carrierprotein (acpp), and 3-
oxoacyl-acyl carrier . . .
4110452_c1_102019901856 19873281445−148P52024(ec:2.7.7.7) (de:dna poly-
< td/>merase iii, delta subunit,)
25519512_c1_1021199118562390129617−60 Pseudomonas aeruginosaL42622(sr:pseu domonas aeruginosa
< td/>(strain paol) dna) (de:
pseudomonas aeruginosa pilz
and holb genes, complete
cds.) (nt:involved in bio-
genesis of type 4 fimbriae)
11048783_c1_1025199218563441146100−4< /td>Caenorhabditis elegansU88170(sr:caenorh abditis elegans
strain=bristol n2) (de:
caenorhabditis elegans
cosmid c10g11.) (nt:coded for
by c. elegans cdna yk65e4.5;
coded for by)
2223506_c1_1026199318 564654217132−7MethanobacteriumB69008
thermoaut otrophicum
14975792_c1_102818565405134105−4Saccharomyces cerevisiaeP47179(sr:,baker's yeast) (de:
precursor)
26769182_c1_1029 199518566588195235Acinetobacter baumanniiCONTIG230GTC ORF with score 235 to:
C(ai:7000763399) (or:
Pseudomonas aeruginosa)
22838586_c1_1030 1996185671365454132< /td>−8Aspergillus fumigatusContig10032GTC ORF with score 132 to:
(ai:7000763400) (or:
Pseudomonas aeruginosa)
9948887_c1_103118568930309841−84Bradyrhizobium japonicumP53575(de:transfer flavoprotein small
< td/>subunit) (etfss))
6056401_c1_103519981239412888−89< /td>Clostridium acetobutylicumContig243HGTC ORF with score 888 to:
(ai:7000763405) (or:
Pseudomonas aeruginosa)
15754152_c1_1047 199918570621206125−6Haemonchus contortusB44984(cl:unassigned collagens)
35797931_c1_1057200018571462153107−5 Acinetobacter baumanniiCONTIG180GTC ORF with score 159 to:
C(ai:195953) (or:Homo
sapiens) (sr:, man)
4160083_c1_105920011 857225885
11198757_c1_106 0200218573993330144−6mice|C57BL/6xCBA/CaJAF062 655(sr:house mouse) (de:mus
< td/>hybridmusculus plenty-of-prolines-
101 mrna, complete cds.) (nt:
binds to several sh3 domain
containing proteins)
10441627_c1_1061200318574732243173−13 Mycobacterium tuberculosisAL123456(de: mycobacterium
tuberculosis h37rv complete
genome; segment 124/162.)
(nt:rv2850c, (mtcy24a1.07),
possible)
31274191_c1_1063 200418575417138 97−3Acanthamoeba castellaniiP19706(sr:,amoeba) (de:myosin
< td/>heavy chain ib (myosin heavy
< td/>chain il))
2745830_c1_106420051 85761644547198−15 Aspergillus fumigatusContig7828GTC ORF with score 198 to:
(ai:7000763434) (or:
Pseudomonas aeruginosa)
12711442_c1_1066 2006185771470489124< /td>−7Acinetobacter baumanniiCONTIG160GTC ORF with score 124 to:
C(ai:7000763436) (or:
Pseudomonas aeruginosa)
22128143_c1_1070 2007185781767588135< /td>−5Canadian hard winter wheatP10387(sr:,wheat) (de:glutenin,
high molecular weight subunit
dy10 precursor)
33792590_c1_1076200818579513170134−9 Rhodobacter capsulatusP14172(sr:,rho dopseudomonas
capsulata) (de:hypothetical
28.2 kd protein in ampr
5′region)
1276012_c1_1078 2009185801575524
13016455_c1_10792010185811464487
7166638_c1_10802011 1858223176
10444533_c 1_108120121858382227393−1Mycobacterium smegmatisAF027770(de:myc obacterium smegmatis
iron uptake genes, fxba (fxba)
gene, partial cds; and fxta
(fxta), fxtb (fxtb), fxbb (fxbb),
fxbc(fxbc), fxtc (fxtc), fxtd
(fxtd), fxte (fxte), and fxtf
(fxtf)genes, complete cds.)
< td/>(nt:similar to membrane b . . .
36369666_c1_10882013185 84843280296−26Streptomyces coelicolorAL031031(de:st reptomyces coelicolor
< td/>cosmid 7c7.) (nt:sc7c7.17,
possible transcriptional
< td/>regulatory)
24032001_c1_1 0902014185851002333 149−8Oryza sativaD16685(sr:oryza sativa (strain
aichiasaki) (library: lambda
embl3) seedlin) (ec:1.1.1.27)
(de:rice gene for lactate
dehydrogenase, complete
cds.)
34242663_c1_10912 01518586543180138Homo sapiensAB002322(sr:homo sapiens male brain
< td/>cdna to mrna, clone_lib:
< td/>pbluescriptii s) (de:human
mrna for kiaa0324 gene,
< td/>partial cds.)
4504842_c1_10962016 185872241746120−3 Chinese oak silkmothAF083334(sr:chinese oak silkmoth) (de:
antheraca pernyi fibroin gene,
< td/>complete cds.)
35650276_c1_1098201718588594197562−54Escherichia coliP76264(de:hypothetical 22.1 kd
protein in manz-cspc
intergenic region)
26666708_c1_109920181203400177−12< /td>AchromobacterF36145
georg iopolitanum
26032030_c2_1101 20191859010593521391 −142Klebsiella pneumoniaeP13092(ec:2.7.1.87) (de:phospho-
transferase) (sph))
24042150_c2_11052020450149
31541261_c2 _1112202118592540179
16927158_c2_11222022185931338445
29474033_c2_1123202318594438145103< /td>−5Acinetobacter baumanniiCONTIG143GTC ORF with score 104 to:
C(ai:7500720039) (or:
Clostridium acetobutylicum)
22829693_c2_1125< /td>2024185955521831 21−5Dictyostelium discoideumP36417(sr:,slime mold) (de:g-box
binding factor (gbf))
11073452_c2_11272025408135269−23Entamoeba histolyticaY14328(de:ent amoeba histolytica
mrna for 3e1 protein.)
30191531_c2_1128202618597579192343−31 Klebsiella pneumoniaeContig525AGTC ORF with score 497 to:
(ai:7000829287) (or:
Enterobacter cloacae)
25391652_c2_1131202718598840279148 −9Klebsiella pneumoniaeContig525AGTC ORF with score 358 to:
(ai:7000829290) (or:
Enterobacter cloacae)
32445453_c2_11342028185999453141385−141Achromobacter D38633(sr:pseudomonas sp. (strain:
georgiopolitanumkks102) dna) (de:
pseudomonas sp. bphr gene
for regulatory protein,
complete cds.)
15802155_c2_1142202918600525174
16229843_c2_ 114520301860124758241231−125Agrobacterium tumefaciensP54910(de:conjugal transfer protein
(TI PLASMID PTIBO542)trbe precursor)
25675442_c2_11512031186021236411353− 32Plasmid RK2H44020
16288341_c2_1155203218603888295103 −2minor jackknife clamL41834(sr:ensis minor (clone: 1/6)
male adult gonads cdna to
mrna) (de:ensis minor (clone
1/6) nuclear protein mrna,
< td/>complete cds.) (nt:putative)
10054042_c2_11632033 1860429798
16034 388_c2_11662034186052337 7781540−158Haemophilus influenzaeP44524(de:hypothetical protein
hi0116/115)
25601637_c2_1171 203518606618205209−17Escherichia coliF65081
31385207_c2_1 1732036186072295764 1230−125Bordetella parapertussisP40330(ec:2.7.3.— ) (de:virulence
sensor protein bvgs
precursor,)
11880416_c2_11832037186081407468
2038186091515504266−22Strepto myces lividansAF072709(de:stre ptomyces lividans
amplifiable element aud4:
< td/>putativetranscriptional
regulator, putative ferredoxin,
putative cytochromep450
oxidoreductase, and putative
oxidoreductase genes,
completecds; and unknown
genes.) (nt:orf2; similar . . .
3183333_c2_118820391861 028594
31538516_c2_119520401861115455147 57−75Escherichia coliP12281(de:molybdopterin bio-
synthesis moca protein)
16297928_c2_11972041 18612441146125−7Caenorhabditis elegansP17656(de:cuticle collagen 2)
31877307_c2_1199204218 6131629542723−73AL123456(de: mycobacterium
tuberculosis h37rv complete
genome; segment 100/162.)
(nt:rv2251, (mtv022.01), len:
475. unknown but similar)
35625806_c2_12012043 1861430241007615−5 9Escherichia coliP76407(de:hypothetical 32.0 kd
protein in ogrk-gatr
intergenic region)
9782711_c2_12022044837278419−39Escherichia coliB64835
5992666_c2_12 052045186162946981−166Escherichia coliP43672(de:abc transporter atp-
binding protein uup)
24885208_c2_12072046 18617492163106−4Q16676(sr:,human) (de:forkhead-
related transcription factor 4
(freac-4))
21696016_c2_1210204 71861821637203176Pseudomonas fragiP28793(ec:4.2.1.17:5.3.3.8: 1.1.1.35:-
< td/>5.1.2.3) (de:hydroxybutyryl-
coa epimerase,))
25667292_c2_12112048< /td>1861911913961863 −192Pseudomonas fragiJS0624(cl:acetyl-coa acetyl-
transferase)
13023916_c2_1215204918620891296266< /td>−23Klebsiella pneumoniaeContig559AGTC ORF with score 266 to:
(ai:7000763585) (or:
Pseudomonas aeruginosa)
5947942_c2_12181862124681
205118622588195284−25Klebsiell a pneumoniaeContig435AGTC ORF with score 284 to:
(ai:7000763590) (or:
Pseudomonas aeruginosa)
15676066_c2_1227 205218623507168101−4Aspergillus fumigatusv1x1c353.xGTC ORF with score 355 to:
(ai:345414) (or:Arabidopsis
thaliana) (sr:thale cress c24)
(de:glycine-rich protein
{clone atgrp-1} (arabidopsis
< td/>thaliana, c24,mrna partial,
740 nt).) (nt:this sequence
comes from FIG. 3a; atgrp)
13129035_c2_12302053444147408−38Klebsiella pneumoniaeContig508AGTC ORF with score 408 to:
(ai:7000763600) (or:
Pseudomonas aeruginosa)
17036290_c2_1231 20541862522875171−13Enterobacter cloacaeCONTIG355GTC ORF with score 678 to:
(ai:7501784160) (or:
Klebsiella pneumoniae)
13010216_c2_1232 205518626576191149−9human herpesvirus type 6U13194(fn:transcriptional regulation)
HHV-6(de:human herpesvirus 6
replication origin-binding
protein (hdrfo), partial cds,
helicase-primase component
(hdrf1), virion protein(dhlf1),
< td/>putative helicase (hdrf2),
putative phosphoprotein
(edrf1), replica . . .
9824043_c2_123420561862 718366111458−149AF058302(de:s treptomyces roseofulvus
frenolicin biosynthetic gene
cluster, complete sequence.)
< td/>(nt:gapx; probably not the
g3phd isoform used in)
9800955_c2_1240205718 628327108166−12P43957(de:hypothetical protein
hi0167)
24635163_c2_1241 205818629507168411Vibrio alginolyticusS65528(pn:na+-trans locating nadh-
< td/>quinone reductase, chain
< td/>gamma)
13172555_c2_1243205 91863071423796⠈’5Enterobacter cloacaeCONTIG484GTC ORF with score 195 to:
(ai:7000788528) (or:
Pseudomonas aeruginosa)
13776030_c2_1246 206018631426141120−6Homo sapiensAB011167(sr:homo sapiens male brain
< td/>cdna to mrna, clone_lib:
< td/>pbluescriptii s) (de:homo
sapiens mrna for kiaa0595
protein, partial cds.)
10399055_c2_12472061186321086361656−64Haemophilus influenzaeP44550(de:hypothetical lipoprotein
hi0172 precursor)
4553568− c2_12492062186331404467< /td>2118−219Pseudomonas fluorescensU91523(fn:transfers reducing
equivalents between nad and)
(de:pseudomonas fluorescens
soluble pyridine nucleotide-
transhydrogenase (sth) gene,
< td/>complete cds.)
12365751_c2_1257206318634498165137−10Klebsiella pneumoniaeContig508AGTC ORF with score 180 to:
(ai:7000763357) (or:
Pseudomonas aeruginosa)
32160455_c2_1258 206418635342113273−24Escherichia coliP75957(de:hypothetical abc
transporter atp-binding protein
ycfv)
35755466_c2_125920 65186361314437719Escherichia coliP75958(de:hypothetical 45.3 kd
protein in mfd-cobb intergenic
< td/>region)
6251711_c2_1261186371641546319−25Nephila clavipesAF027735(de:neph ila clavipes minor
< td/>ampullate silk protein misp1
< td/>mrna, partialcds.)
32055142_c2_12642067< /td>18638468155140∠’10Neisseria gonorrhocaeU79563(de:nei sseria gonorrhacae
tonb (tonb), exbb (exbb) and
exbd (exbd)genes, complete
cds.)
5352206_c2_126620 6818639354117165−12Escherichia coliP75844(de:hypothetical 6.9 kd
protein in msba-kdsb
intergenic region)
6303751_c2_12672069447148100−6Enterobacter cloacaeCONTIG470GTC ORF with score 100 to:
(ai:7000763637) (or:
Pseudomonas aeruginosa)
35251631_c2_1268 2070186411236411665< /td>−65Escherichia coliL14557(sr:escherichi a coli (strain
rdd012) dna) (de:escherichia
coli udp-n-acetylpyruvoyl-
glucosamine reductase (murb)
gene, complete cds, and biotin
operon repressor/biotin
holoenzyme(bira) gene, 3′
end.)
35286381_c2_12702071< /td>18642759252131∠’6AchromobacterA61183
ge orgiopolitanum
34652153_c2_127320721864323778
31291080_c2_127720731864411 70389391−38Ric kettsia prowazekiiAJ235269Ricket tsia prowazekii strain
Madrid E, complete genome.
13144165_c2_1284207410893621216−12 4Pseudomonas aeruginosaU91631(de:pseu domonas aeruginosa
< td/>plsx protein homolog (plsx)
gene, partialcds; and malonyl-
coa:acyl carrier protein
transacylase (fabd), 3-oxo-
acylacyl carrier protein
reductase (fabg), acyl
carrierprotein (acpp), and 3-
oxoacyl-acyl carrier . . .
4195187_c2_128520751864 624681366−33< HIL>Pseudomonas aeruginosaU91631(de:pseu domonas aeruginosa
< td/>plsx protein homolog (plsx)
gene, partialcds; and malonyl-
coa:acyl carrier protein
transacylase (fabd), 3-oxo-
acylacyl carrier protein
reductase (fabg), acyl
carrierprotein (acpp), and 3-
oxoacyl-acyl carrier . . .
29801081_c2_12892076186 4780126699−2< HIL>Paramecium bursariaU42580(de:parame cium bursaria
Chlorella virus 1chlorella virus 1, complete
genome.) (nt:lys-, pro-rich,
papk (10x); similar to wheat
< td/>pro-,)
12772882_c2_1290207 7186481092363625−61Escherichia coliP28306(de:hypothetical 38.2 kd
protein in pabc-holb
intergenic region)
35282257_c2_12912078828275118−4Nephila clavipesAF027735(de:neph ila clavipes minor
< td/>ampullate silk protein misp1
< td/>mrna, partialcds.)
35667642_c2_12932079< /td>18650861286117∠’4Plasmodium cynomolgiP08674(sr:gombak,) (de:
circumsporozoite protein
precursor (cs))
16881590_c2_12962080186511125374551−53Archaeoglobus fulgidusF69413
20173291_ c2_130220811865244714