
Packaging cells comprising codon-optimized gagpol sequences and lacking lentiviral accessory proteins


No: 6958226

Application no: 09393795

Filed date: 1999-09-10

Issue date: 2005-10-25

Kind: B1

Claims: 22

Drawing sheets: 29

Abstract:


Novel packaging cell lines useful for generating viral accessory protein independent HIV-derived retroviral vector particles, methods of constructing such packaging cell lines and methods of using the viral accessory protein independent HIV-derived retroviral vector particles are disclosed.

Claims:


What is claimed is:

1. A packaging cell line comprising: a) a mammalian cell; b) a first retroviral nucleotide sequence in the cell which comprises a codon optimized coding sequence for a HIV gagpol and lacks coding sequences for HIV accessory proteins, Rev response element and constitutive transport elements; c) a second retroviral nucleotide sequence in the cell which comprises the coding sequence for a heterologous envelope protein; and d) a third retroviral nucleotide sequence in the cell which comprises a DNA sequence of interest and HIV cis-acting sequences required for packaging, reverse transcription and integration, wherein said packaging cell line produces a HIV-derived retroviral vector particle.

2. A packaging cell line of claim 1 wherein the heterologous envelope protein is the G glycoprotein of vesicular stomatitis virus (VSV G).

3. A packaging cell line of claim 1 wherein the heterologous envelope protein is the amphotropic envelope of the Moloney leukemia virus.

4. A packaging cell line comprising: a) a mammalian cell; b) a first retroviral nucleotide sequence in the cell which comprises a codon optimized coding sequence for a HIV gagpol and lacks coding sequences for HIV accessory proteins, Rev response element and constitutive transport elements; and c) a second retroviral nucleotide sequence in the cell which comprises a DNA sequence of interest and HIV cis-acting sequences required for packaging, reverse transcription and integration.

5. A packaging cell line comprising: a) a mammalian cell; b) a first retroviral nucleotide sequence in the cell which comprises a codon optimized coding sequence for a HIV gagpol and lacks coding sequences for HIV accessory proteins, Rev response element and constitutive transport elements; and c) a second retroviral nucleotide sequence in the cell which comprises the coding sequence for a heterologous envelope protein.

6. A method of producing a packaging cell line which produces a HIV-derived retroviral vector particle, comprising co-transfecting mammalian host cells with: a) a first plasmid comprising a codon optimized DNA sequence which encodes HIV gagpol proteins and lacks DNA sequences encoding HIV accessory proteins, Rev response element and constitutive transport elements; b) a second plasmid comprising a DNA sequence which encodes a heterologous envelope protein; and c) a third plasmid comprising a DNA sequence of interest and HIV cis-acting sequences required for packaging, reverse transcription and integration, thereby producing a packaging cell line which produces a HIV-derived retroviral vector particle.

7. A method of claim 6 wherein the heterologous envelope protein is the G glycoprotein of vesicular stomatitis virus (VSV G).

8. A method of claim 6 wherein the heterologous envelope protein is the amphotropic envelope protein of the Moloney leukemia virus.

9. A method of producing a HIV-derived retroviral vector particle comprising the steps of: a) co-transfecting mammalian host cells with: i) a first plasmid comprising a codon optimized DNA sequence which encodes HIV gagpol proteins and lacks DNA sequences encoding HIV accessory proteins, Rev response element and constitutive transport elements; ii) a second plasmid comprising a DNA sequence which encodes a heterologous envelope protein; and iii) a third plasmid comprising a DNA sequence of interest and HIV cis-acting sequences required for packaging, reverse transcription and integration, b) maintaining the transfected cells under conditions suitable for virus particle production; and c) recovering virus particle produced in step b).

10. A method of claim 9 wherein the heterologous envelope protein is the G glycoprotein of vesicular stomatitis virus (VSV G).

11. A method of claim 9 wherein the heterologous envelope protein is the amphotropic envelope of the Moloney leukemia virus.

12. A packaging cell line comprising: a) a mammalian cell; b) a first retroviral nucleotide sequence in the cell which comprises a codon optimized coding sequence for a lentivirus gagpol and lacks coding sequences for lentivirus accessory proteins, Rev response element and constitutive transport elements; c) a second retroviral nucleotide sequence in the cell which comprises the coding sequence for a heterologous envelope protein; and d) a third retroviral nucleotide sequence in the cell which comprises a DNA sequence of interest and lentivirus cis-acting sequences required for packaging, reverse transcription and integration, wherein said packaging cell line produces a lentivirus-derived retroviral vector particle.

13. A packaging cell line of claim 12 wherein the heterologous envelope protein is the G glycoprotein of vesicular stomatitis virus (VSV G).

14. A packaging cell line of claim 12 wherein the heterologous envelope protein is the amphotropic envelope of the Moloney leukemia virus.

15. A packaging cell line comprising: a) a mammalian cell; b) a first retroviral nucleotide sequence in the cell which comprises a codon optimized coding sequence for lentivirus gagpol and lacks coding sequences for lentivirus accessory proteins, Rev response element and constitutive transport elements; and c) a second retroviral nucleotide sequence in the cell which comprises a DNA sequence of interest and lentivirus cis-acting sequences required for packaging, reverse transcription and integration.

16. A packaging cell line comprising: a) a mammalian cell; b) a first retroviral nucleotide sequence in the cell which comprises a codon optimized coding sequence for lentivirus gagpol and lacks coding sequences for lentivirus accessory proteins, Rev response element and constitutive transport elements; and c) a second retroviral nucleotide sequence in the cell which comprises the coding sequence for a heterologous envelope protein.

17. A method of producing a packaging cell line which produces a lentivirus-derived retroviral vector particle, comprising co-transfecting mammalian host cells with: a) a first plasmid comprising a codon optimized DNA sequence which encodes lentivirus gagpol proteins and lacks DNA sequences encoding lentivirus accessory proteins, Rev response element and constitutive transport elements; b) a second plasmid comprising a DNA sequence which encodes a heterologous envelope protein; and c) a third plasmid comprising a DNA sequence of interest and lentivirus cis-acting sequences required for packaging, reverse transcription and integration, thereby producing a packaging cell line which produces a lentivirus-derived retroviral vector particle.

18. A method of claim 17 wherein the heterologous envelope protein is the G glycoprotein of vesicular stomatitis virus (VSV G).

19. A method of claim 17 wherein the heterologous envelope protein is the amphotropic envelope of the Moloney leukemia virus.

20. A method of producing a lentivirus-derived retroviral vector particle comprising the steps of: a) co-transfecting mammalian host cells with: i) a first plasmid comprising a codon optimized DNA sequence which encodes lentivirus gagpol proteins and lacks DNA sequences encoding lentivirus accessory proteins, Rev response element and constitutive transport elements; ii) a second plasmid comprising a DNA sequence which encodes a heterologous envelope protein; and iii) a third plasmid comprising a DNA sequence of interest and lentivirus cis-acting sequences required for packaging, reverse transcription and integration, b) maintaining the transfected cells under conditions suitable for virus particle production; and c) recovering virus particle produced in step b).

21. A method of claim 20 wherein the heterologous envelope protein is the G glycoprotein of vesicular stomatitis virus (VSV G).

22. A method of claim 20 wherein the heterologous envelope protein is the amphotropic envelope of the Moloney leukemia virus.

Text:


BACKGROUND OF THE INVENTION

Retroviral vectors based on lentiviruses, such as human immunodeficiency viruses (HIV), can infect nondividing cells, and integration of proviral DNA occurs without the need for cell division. These properties make lentiviruses attractive for gene transfer into nondividing cells, such as hepatocytes, myofibers, hematopoietic stem cells, and neurons.

However, the use of lentivirus vectors, particularly HIV vectors, particularly for gene therapy, is hampered by concern over their safety. Thus, a need for the development of lentivirus vectors, particularly HIV vectors, with improved safety, particularly for gene therapy, exists.

SUMMARY OF THE INVENTION

The present invention relates to novel packaging cell lines useful for generating viral accessory protein independent lentivirus-derived, particularly HIV-derived, retroviral vector particles, to construction of such cell lines and to methods of using the accessory protein independent lentivirus-derived retroviral vector particles to introduce DNA of interest into cells (e.g., eukaryotic cells such as animal (particularly mammalian), plant or yeast cells or prokaryotic cells such as bacterial cells). In a preferred embodiment, the packaging cell lines of the present invention are stable packaging cell lines.

In one embodiment of the invention, packaging cell lines for producing viral accessory protein independent lentivirus-derived retroviral vector particles comprise (a) a cell (e.g., mammalian cell); and (b) a retroviral nucleotide sequence in the cell which comprises a coding sequence for lentivirus gagpol, wherein said coding sequence has been codon optimized by mutagenesis to improve expression of the lentivirus gagpol proteins.

In a second embodiment of the invention, packaging cell lines for producing viral accessory protein independent lentivirus-derived retroviral vector particles comprise (a) a cell (e.g., mammalian cell); (b) a first retroviral nucleotide sequence in the cell which comprises a coding sequence for lentivirus gagpol, wherein said coding sequence has been codon optimized by mutagenesis to improve expression of the lentivirus gagpol proteins; and (c) a second retroviral nucleotide sequence in the cell which comprises the coding sequence for a heterologous envelope protein.

In a third embodiment of the invention, packaging cell lines for producing viral accessory protein independent lentivirus-derived retroviral vector particles comprise (a) a cell (e.g., mammalian cell); (b) a first retroviral nucleotide sequence in the cell which comprises a coding sequence for lentivirus gagpol, wherein said coding sequence has been codon optimized by mutagenesis to improve expression of the lentivirus gagpol proteins; (c) a second retroviral nucleotide sequence in the cell which comprises the coding sequence for a heterologous envelope protein; and (d) a third retroviral nucleotide sequence which comprises a DNA sequence of interest and lentivirus cis-acting sequences required for packaging, reverse transcription and integration.

In a fourth embodiment of the invention, packaging cell lines for producing viral accessory protein independent lentivirus-derived retroviral vector particles comprise (a) a cell (e.g., mammalian cell); (b) a retroviral nucleotide sequence in the cell which comprises a coding sequence for lentivirus gagpol, wherein said coding sequence has been codon optimized by mutagenesis to improve expression of the lentivirus gagpol proteins; and (c) a retroviral nucleotide sequence which comprises a DNA sequence of interest and lentivirus cis-acting sequences required for packaging, reverse transcription and integration.

In a fifth embodiment of the invention, packaging cell lines for producing viral accessory protein independent HIV-derived retroviral vector particles comprise (a) a cell (e.g., mammalian cell); and (b) a retroviral nucleotide sequence in the cell which comprises a coding sequence for HIV gagpol, wherein said coding sequence has been codon optimized by mutagenesis to improve expression of the HIV gagpol proteins.

In a sixth embodiment of the invention, packaging cell lines for producing viral accessory protein independent HIV-derived retroviral vector particles comprise (a) a cell (e.g., mammalian cell); (b) a first retroviral nucleotide sequence in the cell which comprises a coding sequence for HIV gagpol, wherein said coding sequence has been codon optimized by mutagenesis to improve expression of the HIV gagpol proteins; and (c) a second retroviral nucleotide sequence in the cell which comprises the coding sequence for a heterologous envelope protein.

In a seventh embodiment of the invention, packaging cell lines for producing viral accessory protein independent HIV-derived retroviral vector particles comprise (a) a cell (e.g., mammalian cell); (b) a first retroviral nucleotide sequence in the cell which comprises a coding sequence for HIV gagpol, wherein said coding sequence has been codon optimized by mutagenesis to improve expression of the HIV gagpol proteins; (c) a second retroviral nucleotide sequence in the cell which comprises the coding sequence for a heterologous envelope protein; and (d) a third retroviral nucleotide sequence which comprises a DNA sequence of interest and HIV cis-acting sequences required for packaging, reverse transcription and integration.

In an eighth embodiment of the invention, packaging cell lines for producing viral accessory protein independent HIV-derived retroviral vector particles comprise (a) a cell (e.g., mammalian cell); (b) a retroviral nucleotide sequence in the cell which comprises a coding sequence for HIV gagpol, wherein said coding sequence has been codon optimized by mutagenesis to improve expression of the HIV gagpol proteins; and (c) a retroviral nucleotide sequence which comprises a DNA sequence of interest and HIV cis-acting sequences required for packaging, reverse transcription and integration.

Alternatively, each of the packaging cell lines described herein can be produced using (1) a retroviral nucleotide sequence which comprises a codon optimized gag coding sequence and (2) a retroviral nucleotide sequence which comprises a codon optimized pol coding sequence, in place of the retroviral nucleotide sequence which comprises a codon optimized gagpol coding sequence.

In a particular embodiment, the heterologous envelope protein is the G glycoprotein of vesicular stomatitis virus (VSV G). In another embodiment, the heterologous envelope protein is the amphotropic envelope of the Moloney leukemia virus (MLV).

Cell lines for producing viral accessory protein independent lentivirus-derived retroviral vector particles are produced by transfecting host cells (e.g., mammalian host cells) with a plasmid comprising a DNA sequence which encodes lentivirus gagpol proteins, wherein said DNA sequence has been codon optimized by mutagenesis to improve expression of the lentivirus gagpol proteins. Depending upon the particular cell line being produced, the host cells are also co-transfected with a plasmid comprising a DNA sequence which encodes a heterologous envelope protein, or a plasmid comprising a DNA sequence of interest and lentivirus cis-acting sequences required for packaging, reverse transcription and integration, or both of these plasmids. Alternatively, host cells are transfected with a plasmid comprising a codon optimized DNA sequence encoding a lentivirus gag protein and a plasmid comprising a codon optimized DNA sequence encoding a lentivirus pol protein, in place of the plasmid comprising a codon optimized DNA sequence encoding both lentivirus gagpol proteins.

Cell lines for producing viral accessory protein independent HIV-derived retroviral vector particles are produced by co-transfecting host cells (e.g., mammalian host cells) with a plasmid comprising a DNA sequence which encodes HIV gagpol proteins, wherein said DNA sequence has been codon optimized by mutagenesis to improve expression of the HIV gagpol proteins. Depending upon the particular cell line being produced, the host cells are also co-transfected with a plasmid comprising a DNA sequence which encodes a heterologous envelope protein, or a plasmid comprising a DNA sequence of interest and HIV cis-acting sequences required for packaging, reverse transcription and integration, or both of these plasmids. Alternatively, host cells are transfected with a plasmid comprising a codon optimized DNA sequence encoding a HIV gag protein and a plasmid comprising a codon optimized DNA sequence encoding a HIV pol protein, in place of the plasmid comprising a codon optimized DNA sequence encoding both HIV gagpol proteins.

The present invention also relates to methods of producing viral accessory protein independent lentivirus-derived retroviral vector particles, comprising co-transfecting host cells (e.g., mammalian host cells) with (a) a first plasmid comprising a DNA sequence which encodes lentivirus gagpol proteins, wherein said DNA sequence has been codon optimized by mutagenesis to improve expression of the lentivirus gagpol proteins; (b) a second plasmid comprising a DNA sequence which encodes a heterologous envelope protein; and (c) a third plasmid comprising a DNA sequence of interest and lentivirus cis-acting sequences required for packaging, reverse transcription and integration. Alternatively, host cells are transfected with a plasmid comprising a codon optimized DNA sequence encoding a lentivirus gag protein and a plasmid comprising a codon optimized DNA sequence encoding a lentivirus pol protein, in place of the first plasmid comprising a codon optimized DNA sequence encoding both lentivirus gagpol proteins.

In a particular embodiment, the invention relates to methods of producing viral accessory protein independent HIV-derived retroviral vector particles, comprising co-transfecting host cells (e.g., mammalian host cells) with (a) a first plasmid comprising a DNA sequence which encodes HIV gagpol proteins, wherein said DNA sequence has been codon optimized by mutagenesis to improve expression of the HIV gagpol proteins; (b) a second plasmid comprising a DNA sequence which encodes a heterologous envelope protein; and (c) a third plasmid comprising a DNA sequence of interest and HIV cis-acting sequences required for packaging, reverse transcription and integration. Alternatively, host cells are transfected with a plasmid comprising a codon optimized DNA sequence encoding a HIV gag protein and a plasmid comprising a codon optimized DNA sequence encoding a HIV pol protein, in place of the first plasmid comprising a codon optimized DNA sequence encoding both HIV gagpol proteins.

The present invention also relates to viral accessory protein-independent retroviral particles produced by or obtainable by (obtained by) the methods described herein.

The present invention further relates to isolated DNA encoding a codon optimized lentivirus gagpol, isolated DNA encoding the gag coding region of a codon optimized lentivirus gagpol, and isolated DNA encoding the pol coding region of a codon optimized lentivirus gagpol. In a particular embodiment, the present invention relates to isolated DNA encoding a codon optimized HIV gagpol, isolated DNA encoding the gag coding region of a codon optimized HIV gagpol, and isolated DNA encoding the pol coding region of a codon optimized HIV gagpol.

The packaging cell lines and viral particles of the present invention can be used for gene therapy or gene replacement with improved safety. The packaging cell lines and viral particles of the present invention can also be used in development and production of vaccines, and in production of biochemical reagents. Gene therapy vectors produced with the cell lines of the present invention are expected to be valuable medical therapeutics.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic diagram of an expression cassette containing the codon optimized gagpol genes. The DNA was constructed in multiple segments, which are indicated at the top as 1/3, 2/3, 3/3 (A, B, C and D) and HIN. Restriction sites used to assemble the cloned segments are indicated above the kilobasepair (Kb) ruler. Below the ruler are multiple features showing the location of the human cytomegalovirus (CMV) promoter, human betaglobin sequences (Bglobin), mRNA sequences (thinner line represents intronic sequence), the gag and pol open reading frames, the individual proteolytic fragment coding sequences (p17_MA, p24_CA, p7, p6, PR, p51_RT, RNaseH and integrase (IN)) and each synthetic oligonucleotide used in the assembly process (multiple adjacent open arrows).

FIG. 2 is a table which depicts codon usage frequencies in genes which are highly expressed and in the codon optimized gagpol open reading frame of the HIV packaging construct described herein.

FIG. 3 is a schematic representation of the HIV provirus and a prior art three-plasmid expression system used for generating a pseudotyped HIV-based vector by transient transfection as described in Naldini et al., Science, 272:263-267 (1996).

FIG. 4 is a list of some characteristics relating to the HIV Rev protein.

FIG. 5 is a list of some points relating to codon optimization of HIV gagpol.

FIG. 6 is a partial DNA sequence of HIV gag (SEQ ID NO: 1), showing inactivation of inhibitory sequences as described in Schwartz, S. et al., J. Virol., 66(12):7176-7182 (1992).

FIG. 7 is a plot of the % (G+C) content of wildtype HIV gagpol sequences and theoretically codon optimized HIV gagpol sequences. The percent of bases, either G or C, was calculated for a 30 nucleotide moving window for the entire length of the gagpol gene, and the value plotted versus nucleotide position. Diamonds=HIV gagpol sequences; squares=full optimal back-translation for gag open reading frame; triangles=full optimal back-translation for pol open reading frame; CO=codon optimized.
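The moving-window calculation described for FIG. 7 can be sketched as follows. This is only an illustration under stated assumptions: the function name `gc_percent_windows` is hypothetical, and reporting each window by its 1-based start position is a choice made here, since the text does not say whether the plotted position is the start or the center of the window.

```python
def gc_percent_windows(seq, window=30):
    """Return a list of (position, percent_gc) pairs for a moving window.

    position is the 1-based start of each full window (an assumption;
    the source does not specify start vs. center).
    """
    if window <= 0:
        raise ValueError("window must be positive")
    seq = seq.upper()
    results = []
    # Slide one nucleotide at a time over every full-length window.
    for start in range(0, len(seq) - window + 1):
        chunk = seq[start:start + window]
        gc_count = sum(1 for base in chunk if base in "GC")
        results.append((start + 1, 100.0 * gc_count / window))
    return results


if __name__ == "__main__":
    # Toy sequence and a short window, purely for demonstration.
    for position, percent in gc_percent_windows("GGCCAATT", window=4):
        print(position, percent)
```

Plotting the second element against the first for a wildtype sequence and for its optimized counterpart would give curves of the kind the figure describes.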

FIGS. 8A-8E depict the alignment of the nucleotide sequences and predicted amino acid sequences for the gag coding region of a wildtype HIV gagpol and a codon optimized HIV gagpol. "NL4-3 genbank.SEQ" indicates the nucleotide sequence (SEQ ID NO:2) and predicted amino acid sequence (SEQ ID NO:3) for the gag coding region of a wildtype HIV gagpol. "pHDMHgpm2.seq" indicates the nucleotide sequence (SEQ ID NO:4) and predicted amino acid sequence (SEQ ID NO:5) for the gag coding region of a codon optimized HIV gagpol. The "NL4-3 genbank.SEQ" sequences are publicly available at the NIH GenBank sequence repository (Accession No. M19921).

FIGS. 9A-9L depict the alignment of the nucleotide sequences and predicted amino acid sequences for the pol coding region of a wildtype HIV gagpol and a codon optimized HIV gagpol. "NL4-3 genbank.SEQ" indicates a nucleotide sequence (SEQ ID NO:6) and a predicted amino acid sequence (SEQ ID NO:7) for the pol coding region of a wildtype HIV gagpol available in the NIH GenBank sequence repository (Accession No. M19921). The nucleotide and amino acid sequences for the pol coding region available in the GenBank sequence repository contain two sequence errors, which are indicated in FIGS. 9A-9L with shading. "pNL4-3.seq" indicates the correct nucleotide sequence (SEQ ID NO:8) and predicted amino acid sequence (SEQ ID NO:9) for the pol coding region of a wildtype HIV gagpol. "pHDMHgpm2.seq" indicates the nucleotide sequence (SEQ ID NO: 10) and predicted amino acid sequence (SEQ ID NO: 11) for the pol coding region of a codon optimized HIV gagpol.

FIGS. 10A-10D depict the DNA sequence (SEQ ID NO:12) for pHDMHgpm2. The CMV enhancer/promoter is at nucleotides 97 to 679, human betaglobin sequences (Bglobin) are at nucleotides 761 to 864, 865 to 1303 and 5710 to 6469 (end of Bglobin is at nucleotides 6445 to 6469), mRNA sequences are at nucleotides 680 to 778 and 1255 to 5921, SV40 origin of replication is at nucleotides 8796 to 8908, beta-lactamase (bla) coding region is at nucleotides 6709 to 7569, intron sequences are at nucleotides 779 to 1254, the codon optimized gag coding region is at nucleotides 1318 to 2820, the codon optimized pol coding region is at nucleotides 2619 to 5624 and the poly A site is at nucleotides 5897 to 5921.
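The coordinates listed for pHDMHgpm2 lend themselves to a simple consistency check. The sketch below is illustrative only: the dictionary keys and helper names (`FEATURES`, `span_length`, `overlap_length`) are hypothetical, coordinates are treated as 1-based and inclusive (an assumption about the numbering convention), and the non-optimized region (nucleotides 2583 to 2819) is taken from the detailed description of the codon optimized sequence.

```python
# Feature coordinates on SEQ ID NO:12, copied from the text (1-based, inclusive).
FEATURES = {
    "CMV enhancer/promoter": (97, 679),
    "intron": (779, 1254),
    "gag coding region": (1318, 2820),
    "pol coding region": (2619, 5624),
    "non-optimized region": (2583, 2819),
    "poly A site": (5897, 5921),
    "bla coding region": (6709, 7569),
    "SV40 origin": (8796, 8908),
}


def span_length(span):
    """Length in nucleotides of an inclusive (start, end) span."""
    start, end = span
    return end - start + 1


def overlap_length(a, b):
    """Number of nucleotides shared by two inclusive spans (0 if disjoint)."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


if __name__ == "__main__":
    gag = FEATURES["gag coding region"]
    pol = FEATURES["pol coding region"]
    print(span_length(gag))                                # 1503
    print(span_length(pol))                                # 3006
    print(overlap_length(gag, pol))                        # 202
    print(span_length(FEATURES["non-optimized region"]))   # 237
```

Under these assumptions both coding regions are whole multiples of three nucleotides, the gag and pol regions share 202 nucleotides, and the non-optimized stretch comes out at the 237 bp stated in the text.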

FIG. 11 is a circular map of plasmid pHDMHgpm2.

DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to novel packaging cell lines useful for generating viral accessory protein independent lentivirus-derived, particularly HIV-derived, retroviral vector particles, to construction of such cell lines and to methods of using the accessory protein independent lentivirus-derived retroviral vector particles to introduce DNA of interest into cells (e.g., eukaryotic cells such as animal (particularly mammalian), plant or yeast cells or prokaryotic cells such as bacterial cells). In a particular embodiment, the packaging cell lines of the present invention are stable packaging cell lines.

The cell lines are engineered to express the lentivirus proteins necessary for virus particle formation (gagpol proteins), without containing DNA sequences from lentivirus accessory proteins (tat, vif, vpr, vpu, nef and rev proteins and Rev response element (RRE)). Additionally, no viral sequences (such as cis-acting elements termed constitutive transport elements (CTEs)) will be expressed as RNA of any kind. DNA sequences for lentivirus gagpol are codon optimized by extensively mutagenizing the sequences to improve expression and to reduce the risk of recombination between transfer vector sequences and gagpol messenger RNA. This greatly improves the safety of virus preparations generated from these cell lines. In a particular embodiment, the DNA sequences for lentivirus gagpol are not codon optimized in the overlap region between the gag and pol sequences and in cis-acting signals necessary for translation of pol.

Examples of lentiviruses include human immunodeficiency viruses (e.g., HIV-1, HIV-2, HIV-3), bovine lentiviruses (e.g., bovine immunodeficiency viruses, bovine immunodeficiency-like viruses, Jembrana disease viruses), equine lentiviruses (e.g., equine infectious anemia viruses), feline lentiviruses (e.g., feline immunodeficiency viruses, panther lentiviruses, puma lentiviruses), ovine/caprine lentiviruses (e.g., Brazilian caprine lentiviruses, caprine arthritis-encephalitis viruses, Maedi-Visna viruses, Maedi-Visna-like viruses, Maedi-Visna-related viruses, ovine lentiviruses, Visna lentiviruses), Simian AIDS retroviruses (e.g., human T-cell lymphotropic virus type 4), simian immunodeficiency viruses, simian-human immunodeficiency viruses, human lymphotrophic viruses (e.g., type III) and simian T-cell lymphotrophic viruses.

In another embodiment, cell lines are engineered to express the HIV proteins necessary for virus particle formation (gagpol proteins), without containing DNA sequences from HIV accessory proteins (tat, vif, vpr, vpu, nef and rev proteins and Rev response element (RRE)). Additionally, no viral sequences (such as cis-acting elements termed constitutive transport elements (CTEs)) will be expressed as RNA of any kind. DNA sequences for a HIV gagpol are codon optimized by mutagenesis to improve expression and to reduce the risk of recombination between transfer vector sequences and gagpol messenger RNA. In a particular embodiment, the DNA sequences for HIV gagpol are not codon optimized in the overlap region between the gag and pol sequences and in cis-acting signals necessary for translation of pol.

Alternatively, each of the packaging cell lines described herein can be produced using (1) a nucleotide sequence which comprises a codon optimized gag coding sequence and (2) a nucleotide sequence which comprises a codon optimized pol coding sequence, in place of the nucleotide sequence which comprises a codon optimized gagpol coding sequence. In this embodiment, the gag and pol coding sequences can be completely codon optimized.

Benefits of the present invention include the removal of potentially harmful lentivirus accessory proteins and other viral sequences, and the reduction of the risk of recombination to produce replication competent virus.

Packaging cell lines for producing viral accessory protein independent lentivirus-derived retroviral vector particles comprise a mammalian cell and a retroviral nucleotide sequence comprising a coding sequence for a lentivirus gagpol which has been codon optimized. In a particular embodiment, the packaging cell lines further comprise a retroviral nucleotide sequence comprising a coding sequence for a heterologous envelope protein. In a second embodiment, the packaging cell lines further comprise a retroviral nucleotide sequence comprising a coding sequence for a heterologous envelope protein and a retroviral nucleotide sequence which comprises a DNA sequence of interest and HIV cis-acting sequences required for packaging, reverse transcription and integration. In a third embodiment, the packaging cell lines further comprise a retroviral nucleotide sequence which comprises a DNA sequence of interest and HIV cis-acting sequences required for packaging, reverse transcription and integration. Alternatively, the packaging cell lines of the present invention comprise (1) a retroviral nucleotide sequence which comprises a codon optimized gag coding sequence and (2) a retroviral nucleotide sequence which comprises a codon optimized pol coding sequence, in place of the retroviral nucleotide sequence which comprises a codon optimized gagpol coding sequence.

Codon optimization of the coding sequence(s) for lentivirus gagpol results in improved expression of the lentivirus gagpol proteins and reduces the risk of recombination between the transfer vector and gagpol messenger RNA. Codon optimization of the coding sequence(s) for lentivirus gagpol was obtained by mutagenizing, for each particular amino acid residue, specific nucleic acid bases in a codon for that amino acid residue to a nucleic acid base which is present in a codon that occurs at a high frequency, for the same amino acid residue, in genes which are highly expressed. In a particular embodiment, the resulting optimized codon also does not cause introduction of mRNA splicing signals into the codon optimized sequence. Thus, in a particular embodiment, codon optimization of the coding sequence(s) for lentivirus gagpol is obtained by mutagenizing, for each particular amino acid residue, specific nucleic acid bases in a codon for that amino acid residue to a nucleic acid base that is present in a codon which (1) occurs at a high frequency, for the same amino acid residue, in genes which are highly expressed and (2) does not cause introduction of mRNA splicing signals into the codon optimized sequence. Codon optimization typically results in the removal of nucleic acid base A-rich instability elements.
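The codon replacement step described above can be sketched in a few lines. This is a minimal illustration and not the procedure actually used for the construct: the `PREFERRED` table is an assumed, illustrative choice of one frequent codon per amino acid (the frequencies actually relied on are those of FIG. 2, which is not reproduced here), the function names are hypothetical, the optional `protected` range stands in for a region left unoptimized, and the check for newly introduced splicing signals is not modeled.

```python
from itertools import product

# Standard genetic code, built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# ASSUMPTION: illustrative preferred codons, not the table from FIG. 2.
PREFERRED = {
    "A": "GCC", "R": "CGC", "N": "AAC", "D": "GAC", "C": "TGC",
    "Q": "CAG", "E": "GAG", "G": "GGC", "H": "CAC", "I": "ATC",
    "L": "CTG", "K": "AAG", "M": "ATG", "F": "TTC", "P": "CCC",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAC", "V": "GTG",
}


def translate(seq):
    """Translate a DNA sequence whose length is a multiple of three."""
    seq = seq.upper()
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def codon_optimize(seq, protected=None):
    """Swap each codon for the preferred synonymous codon.

    protected is an optional 0-based, half-open (start, end) nucleotide
    range; any codon overlapping it is left unchanged.
    """
    seq = seq.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    out = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        in_protected = (
            protected is not None and i < protected[1] and i + 3 > protected[0]
        )
        if in_protected or amino_acid == "*":
            out.append(codon)  # leave protected codons and stop codons as-is
        else:
            out.append(PREFERRED[amino_acid])
    optimized = "".join(out)
    # The encoded protein must be unchanged by the substitutions.
    if translate(optimized) != translate(seq):
        raise AssertionError("optimization altered the protein sequence")
    return optimized


if __name__ == "__main__":
    example = "ATGGGTGCGAGAGCGTCA"  # short made-up sequence, for illustration
    print(codon_optimize(example))
    print(codon_optimize(example, protected=(3, 6)))
```

The final comparison of translations mirrors the property stated in the text that the optimized sequence differs at the nucleotide level while encoding the same amino acids.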

In a particular embodiment, the coding sequence for a HIV gagpol (pNL4-3; available through the AIDS repository, NIH; Adachi et al., J Virol., 59:284-291 (1986)) has been codon optimized to improve translational efficiency of the HIV gagpol proteins and reduce the risk of recombination between the transfer vector and HIV gagpol messenger RNA. Two hundred thirty-seven base pairs (237 bp) consisting of the gag pol overlap and cis-acting signals necessary for translation of pol (nucleotides 2583 to 2819 of SEQ ID NO: 12) were not optimized. The HIV gagpol sequence obtained using the codon optimization process does not differ at the amino acid level from the wildtype HIV gagpol sequence, but differs at the nucleotide level from the wildtype HIV gagpol sequence. A codon optimized HIV gag sequence is shown in FIGS. 8A-8E (pHDMHgpm2.seq) (SEQ ID NO:4). A codon optimized HIV pol sequence is shown in FIGS. 9A-9L (pHDMHgpm2.seq) (SEQ ID NO: 10).

A plasmid comprising DNA sequences which encode codon optimized lentivirus gagpol proteins is also referred to herein as a packaging construct. This plasmid includes a promoter which drives the expression of the gagpol proteins, such as the human cytomegalovirus (hCMV) immediate early promoter. This plasmid is defective for the production of the viral envelope and accessory proteins tat, vif, vpr, vpu, nef and rev and the Rev response element (RRE). The packaging construct also does not contain viral sequences which are transcribed into mRNA, such as constitutive transport elements (CTEs).

A packaging construct comprising a codon optimized HIV gagpol is depicted in FIG. 1 and in FIG. 11. FIGS. 10A-10D depict the DNA sequence (SEQ ID NO:12) for the packaging construct pHDMHgpm2. This packaging construct (pHDMHgpm2) was constructed as follows: Plasmid pMDA.HIVgp mam was generated by chemical synthesis and PCR assembly (which is described in, for example, Stemmer et al., Gene, 164:49-53 (1995)) of 215 different oligonucleotides. The DNA sequence for pMDA.HIVgp mam is the same as the DNA sequence for pMDA.HIVgp jtg except for 4.3 kb which was codon optimized using the DNAStar program (LaserGene, Madison, Wis.). Two hundred thirty-seven base pairs (237 bp) consisting of the gag pol overlap and cis-acting signals necessary for translation of pol (nucleotides 2583 to 2819 of SEQ ID NO: 12) were not optimized due to dual reading frame constraints. A NsiI site 5' of IN was preserved to aid fusion with wildtype sequences. Several single or double base pair silent mutations were introduced either to prevent potential splice donors and acceptors, or by the synthesis process. pMDA.HIVgp jtg was derived from HIV-1 strain NL4-3. The protease mutation that is present in the NL4-3 NIH GenBank sequence was then repaired (FIG. 9B), changing the nucleotide present at position 2948 of SEQ ID NO:12 from a "G" to a "C", thereby producing the codon present at nucleotide positions 2948 to 2950 of SEQ ID NO: 12 which encodes an arginine instead of the glycine present in the NL4-3 GenBank amino acid sequence. The resulting plasmid was named pMDHgpmam. The EcoRI-HindIII fragment of pMDHgpmam was inserted into pHDM2b, a high copy version of the pMD vector (Ory, D. et al., Proc. Natl. Acad. Sci. USA, 93(21):11400-11406 (1996)), to produce plasmid pHDMHgpm. The sequencing mutation that is present in the RNase domain of the NL4-3 NIH GenBank sequence was repaired (FIG. 9H), changing the codon present at nucleotide positions 4724 to 4726 of SEQ ID NO:12 from "GGG" to "AAG", thereby producing a codon encoding a lysine instead of the glycine present in the NL4-3 GenBank amino acid sequence. The resulting plasmid was named pHDMHgpm2. Codon usage frequencies in the codon optimized gagpol open reading frame of the packaging construct pHDMHgpm2 are shown in FIG. 2.
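The arithmetic and codon changes stated in the construction steps above can be verified with the standard genetic code. The following is a small sketch under stated assumptions: the helper names are hypothetical, and because the text gives only the first base of the repaired protease codon, the check covers every possible third base of a glycine codon (GGN).

```python
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code: codon -> one-letter amino acid ("*" marks a stop codon).
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def span_length(first, last):
    """Length of an inclusive nucleotide range such as 2583 to 2819."""
    return last - first + 1


def codon_change(before, after):
    """Amino acids encoded before and after a codon substitution."""
    return CODON_TABLE[before.upper()], CODON_TABLE[after.upper()]


def first_base_g_to_c():
    """Outcomes of changing the first base of any glycine codon GGN to C."""
    return {codon_change("GG" + n, "CG" + n) for n in BASES}
```

Here `span_length(2583, 2819)` gives 237, matching the stated size of the non-optimized region. `first_base_g_to_c()` yields only the glycine-to-arginine outcome, and `codon_change("GGG", "AAG")` yields glycine to lysine, consistent with the two repairs described.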

As used herein, a heterologous envelope protein permits pseudotyping of particles generated by the packaging construct and includes the G glycoprotein of vesicular stomatitis virus (VSV G) and the amphotropic envelope of the Moloney leukemia virus (MLV). A plasmid comprising a DNA sequence which encodes a heterologous envelope protein is also referred to herein as an envelope coding plasmid.

The terms "mammal" and "mammalian", as used herein, refer to any vertebrate animal, including monotremes, marsupials and placental mammals, that suckle their young and either give birth to living young (eutherian or placental mammals) or are egg-laying (metatherian or nonplacental mammals). Examples of mammalian species include humans and other primates (e.g., monkeys, chimpanzees), rodents (e.g., rats, mice, guinea pigs) and ruminants (e.g., cows, pigs, horses).

Examples of mammalian cells include human (such as HeLa cells, 293T cells, NIH 3T3 cells), bovine, ovine, porcine, murine (such as embryonic stem cells), rabbit and monkey (such as COS1 cells) cells. The cell may be a non-dividing cell (including hepatocytes, myofibers, hematopoietic stem cells, neurons) or a dividing cell. The cell may be an embryonic cell, bone marrow stem cell or other progenitor cell. Where the cell is a somatic cell, the cell can be, for example, an epithelial cell, fibroblast, smooth muscle cell, blood cell (including a hematopoietic cell, red blood cell, T-cell, B-cell, etc.), tumor cell, cardiac muscle cell, macrophage, dendritic cell, neuronal cell (e.g., a glial cell or astrocyte), or pathogen-infected cell (e.g., those infected by bacteria, viruses, virusoids, parasites, or prions).

Typically, cells isolated from a specific tissue (such as epithelium, fibroblast or hematopoietic cells) are categorized as a "cell-type." The cells can be obtained commercially or from a depository or obtained directly from an animal, such as by biopsy. Alternatively, the cell need not be isolated at all from the animal where, for example, it is desirable to deliver the virus to the animal in gene therapy.

To produce the cell lines of the present invention for producing viral accessory protein independent lentivirus-derived retroviral vector particles, mammalian host cells are co-transfected with (a) a first plasmid comprising a DNA sequence which encodes lentivirus gagpol proteins, wherein said DNA sequence has been codon optimized by mutagenesis, as described above, to improve expression of the lentivirus gagpol proteins; and (b) a second plasmid comprising a DNA sequence which encodes a heterologous envelope protein, or a retroviral nucleotide sequence which comprises a DNA sequence of interest and lentivirus cis-acting sequences required for packaging, reverse transcription and integration, or both, under conditions appropriate for transfection of the cells.

In a particular embodiment, to produce the cell lines of the present invention for producing viral accessory protein independent HIV-derived retroviral vector particles, mammalian host cells were co-transfected with (a) a first plasmid comprising a DNA sequence which encodes HIV gagpol proteins, wherein said DNA sequence has been codon optimized by mutagenesis, as described above, to improve expression of the HIV gagpol proteins; and (b) a second plasmid comprising a DNA sequence which encodes a heterologous envelope protein, or a retroviral nucleotide sequence which comprises a DNA sequence of interest and HIV cis-acting sequences required for packaging, reverse transcription and integration, or both, under conditions appropriate for transfection of the cells.

Virus stocks consisting of viral accessory protein independent lentivirus-derived, particularly HIV-derived, retroviral vector particles of the present invention are produced by maintaining the transfected cells under conditions suitable for virus production (e.g., in an appropriate growth media and for an appropriate period of time). Such conditions, which are not critical to the invention, are generally known in the art. See, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor University Press, New York (1989); Ausubel et al., Current Protocols in Molecular Biology, John Wiley & Sons, New York (1998); U.S. Pat. No. 5,449,614; and U.S. Pat. No. 5,460,959, the teachings of which are incorporated herein by reference.

To generate viral accessory protein independent lentivirus-derived retroviral vector particles, mammalian host cells can be co-transfected with (a) a first plasmid comprising a DNA sequence which encodes lentivirus gagpol proteins, wherein said DNA sequence has been codon optimized by mutagenesis, as described above, to improve expression of the lentivirus gagpol proteins; (b) a second plasmid comprising a DNA sequence which encodes a heterologous envelope protein; and (c) a third plasmid comprising a DNA sequence of interest and lentivirus cis-acting sequences required for packaging, reverse transcription and integration. Alternatively, mammalian host cells are transfected with a plasmid comprising a codon optimized DNA sequence encoding a lentivirus gag protein and a plasmid comprising a codon optimized DNA sequence encoding a lentivirus pol protein, in place of the first plasmid comprising a codon optimized DNA sequence encoding both lentivirus gagpol proteins.

In a particular embodiment, the invention relates to methods of producing viral accessory protein independent HIV-derived retroviral vector particles, comprising co-transfecting mammalian host cells with (a) a first plasmid comprising a DNA sequence which encodes HIV gagpol proteins, wherein said DNA sequence has been codon optimized by mutagenesis, as described above, to improve expression of the HIV gagpol proteins; (b) a second plasmid containing a DNA sequence which encodes a heterologous envelope protein; and (c) a third plasmid comprising a DNA sequence of interest and HIV cis-acting sequences required for packaging, reverse transcription and integration. Alternatively, mammalian host cells are transfected with a plasmid comprising a codon optimized DNA sequence encoding a HIV gag protein and a plasmid comprising a codon optimized DNA sequence encoding a HIV pol protein, in place of the first plasmid comprising a codon optimized DNA sequence encoding both HIV gagpol proteins.
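The plasmid combinations described above can be summarized as a simple data model. The sketch below is purely illustrative and assumes a hypothetical labeling scheme: the element names (`gag_opt`, `pol_opt`, `het_env`, `dna_of_interest`, `cis_acting`), the `Plasmid` class and the `check_transfection_set` function are not part of the disclosure. It checks that a set of plasmids supplies codon optimized gag and pol (on one plasmid or two), a heterologous envelope and a transfer vector, and that no plasmid carrying gag or pol also carries the elements the packaging construct is stated to lack.

```python
from dataclasses import dataclass

# Elements the packaging construct is described as lacking (labels are assumptions).
EXCLUDED_FROM_PACKAGING = frozenset(
    {"HIV_env", "tat", "vif", "vpr", "vpu", "nef", "rev", "RRE", "CTE"}
)
# Elements that must be supplied by the co-transfected plasmids taken together.
REQUIRED = frozenset({"gag_opt", "pol_opt", "het_env", "dna_of_interest", "cis_acting"})


@dataclass(frozen=True)
class Plasmid:
    name: str
    elements: frozenset


def check_transfection_set(plasmids):
    """Return a list of problems; an empty list means the set is consistent."""
    problems = []
    supplied = set()
    for p in plasmids:
        supplied |= p.elements
        # Any plasmid carrying gag or pol acts as (part of) the packaging construct.
        if p.elements & {"gag_opt", "pol_opt"}:
            forbidden = sorted(p.elements & EXCLUDED_FROM_PACKAGING)
            if forbidden:
                problems.append(
                    f"{p.name}: packaging construct carries {', '.join(forbidden)}"
                )
    for needed in sorted(REQUIRED):
        if needed not in supplied:
            problems.append(f"missing element: {needed}")
    return problems
```

A three-plasmid set (packaging, envelope, transfer) and the four-plasmid alternative with gag and pol on separate plasmids both pass this check, while a packaging plasmid labeled with `rev` or a set without an envelope plasmid is reported.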

Virus particles produced by the methods described herein, using a codon optimized HIV packaging construct produced as described herein, were compared by Western analysis with virus particles produced as described in Naldini et al., Science, 272:263-267 (1996), using the packaging construct plasmid pCMVΔR8.2. Both the immunological reactivity and the proteolytic processing were confirmed to be indistinguishable.

A plasmid comprising a DNA sequence of interest and HIV cis-acting sequences required for packaging, reverse transcription and integration is also referred to herein as a transfer vector. A transfer vector, as used herein, refers to a vehicle which is used to introduce a DNA of interest into a eukaryotic cell, particularly a mammalian cell. FIG. 3 depicts an example of a transfer vector.

DNA sequence of interest, as used herein, includes all or a portion of a gene or genes encoding a nucleic acid product whose expression in a cell or a mammal is desired. In a particular embodiment, the nucleic acid product is a heterologous therapeutic protein. Examples of therapeutic proteins include antigens or immunogens, such as a polyvalent vaccine, cytokines, tumor necrosis factor, interferons, interleukins, adenosine deaminase, insulin, T-cell receptors, soluble CD4, growth factors (such as epidermal growth factor, human growth factor, insulin-like growth factors, fibroblast growth factors), blood factors, such as Factor VIII, Factor IX, cytochrome b, glucocerebrosidase, ApoE, ApoC, ApoAI, the LDL receptor, negative selection markers or "suicide proteins", such as thymidine kinase (including the HSV, CMV, VZV TK), anti-angiogenic factors, Fc receptors, plasminogen activators, such as t-PA, u-PA and streptokinase, dopamine, MHC, tumor suppressor genes such as p53 and Rb, monoclonal antibodies or antigen binding fragments thereof, drug resistance genes, ion channels, such as a calcium channel or a potassium channel, adrenergic receptors, hormones (including growth hormones) and anti-cancer agents. In another embodiment, the nucleic acid product is a gene product to be expressed in a cell or a mammal and which product is otherwise defective or absent in the cell or mammal. For example, the nucleic acid product can be a functional gene(s) which is defective or absent in the cell or mammal.

DNA sequence of interest includes DNA sequences (control sequences) which are necessary to drive the expression of the gene or genes. The control sequences are operably linked to the gene. The term "operably linked", as used herein, is defined to mean that the gene is linked to control sequences in a manner which allows expression of the gene (or the nucleic acid sequence). Generally, operably linked means contiguous. Control sequences include a transcriptional promoter, an optional operator sequence to control transcription, a sequence encoding suitable mRNA ribosomal binding sites and sequences which control termination of transcription and translation. In a particular embodiment, a recombinant gene encoding a desired nucleic acid product can be placed under the regulatory control of a promoter which can be induced or repressed, thereby offering a greater degree of control with respect to the level of the product produced.

As used herein, the term "promoter" refers to a sequence of DNA, usually upstream (5') of the coding region of a structural gene, which controls the expression of the coding region by providing recognition and binding sites for RNA polymerase and other factors which may be required for initiation of transcription. Suitable promoters are well known in the art. Exemplary promoters include the SV40, CMV and human elongation factor (EF1) promoters. Other suitable promoters are readily available in the art (see, e.g., Ausubel et al., Current Protocols in Molecular Biology, John Wiley & Sons, Inc., New York (1998); Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd edition, Cold Spring Harbor University Press, New York (1989); and U.S. Pat. No. 5,681,735).

A DNA sequence of interest can be isolated from nature, modified from native sequences or manufactured de novo, as described in, for example, Ausubel et al., Current Protocols in Molecular Biology, John Wiley & Sons, New York (1998); and Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd edition, Cold Spring Harbor University Press, New York (1989). DNA sequences can be isolated and fused together by methods known in the art, such as exploiting and manufacturing compatible cloning or restriction sites.

The packaging cell lines and viral particles of the present invention can be used, in vitro, in vivo and ex vivo, to introduce DNA of interest into a eukaryotic cell (e.g., a mammalian cell) or a mammal (e.g., a human or other mammal or vertebrate). The cells can be obtained commercially or from a depository or obtained directly from a mammal, such as by biopsy. The cells can be obtained from a mammal to whom they will be returned or from another/different mammal of the same or different species. For example, using the packaging cell lines or viral particles of the present invention, DNA of interest can be introduced into nonhuman cells, such as pig cells, which are then introduced into a human. Alternatively, the cell need not be isolated from the mammal where, for example, it is desirable to deliver viral particles of the present invention to the mammal in gene therapy.

Ex vivo therapy has been described, for example, in Kasid et al., Proc. Natl. Acad. Sci. USA, 87:473 (1990); Rosenberg et al., N. Engl. J. Med., 323:570 (1990); Williams et al., Nature, 310:476 (1984); Dick et al., Cell, 42:71 (1985); Keller et al., Nature, 318:149 (1985); and Anderson et al., U.S. Pat. No. 5,399,346.

Methods for administering (introducing) viral particles directly to a mammal are generally known to those practiced in the art. For example, modes of administration include parenteral, injection, mucosal, systemic, implant, intraperitoneal, oral, intradermal, transdermal (e.g., in slow release polymers), intramuscular, intravenous including infusion and/or bolus injection, subcutaneous, topical, epidural, etc. Viral particles of the present invention can, preferably, be administered in a pharmaceutically acceptable carrier, such as saline, sterile water, Ringer's solution, and isotonic sodium chloride solution.

The dosage of a viral particle of the present invention administered to a mammal, including frequency of administration, will vary depending upon a variety of factors, including mode and route of administration; size, age, sex, health, body weight and diet of the recipient mammal; nature and extent of symptoms of the disease or disorder being treated; kind of concurrent treatment, frequency of treatment, and the effect desired.

The teachings of all the articles, patents, patent applications and GenBank sequences cited herein are incorporated by reference in their entirety.

While this invention has been particularly shown and described with references to preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.

SEQUENCE LISTING <100> GENERAL INFORMATION: <160> NUMBER OF SEQ ID NOS: 12 <200> SEQUENCE CHARACTERISTICS: <210> SEQ ID NO 1 <211> LENGTH: 396 <212> TYPE: DNA <213> ORGANISM: Homo sapien <400> SEQUENCE: 1 atgggtgcga gagcgtcagt attaagcggg ggagaattag atcgatggga aaaaattcgg 60 ttaaggccag ggggaaagaa aaaatataaa ttaaaacata tagtatgggc aagcagggag 120 ctagaacgat tcgcagttaa tcctggcctg ttagaaacat cagaaggctg tagacaaata 180 ctgggacagc tacaaccatc ccttcagaca ggatcagaag aacttagatc attatataat 240 acagtagcaa ccctctattg tgtgcatcaa aggatagaga taaaagacac caaggaagct 300 ttagacaaga tagaggaaga gcaaaacaaa agtaagaaaa aagcacagca agcagcagct 360 gacacaggac acagcaatca ggtcagccaa aattac 396 <200> SEQUENCE CHARACTERISTICS: <210> SEQ ID NO 2 <211> LENGTH: 1503 <212> TYPE: DNA <213> ORGANISM: Homo sapien <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (1)...(1503) <400> SEQUENCE: 2 atg ggt gcg aga gcg tcg gta tta agc ggg gga gaa tta gat aaa tgg 48 Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Lys Trp 1 5 10 15 gaa aaa att cgg tta agg cca ggg gga aag aaa caa tat aaa cta aaa 96 Glu Lys Ile Arg Leu Arg Pro Gly Gly Lys Lys Gln Tyr Lys Leu Lys 20 25 30 cat ata gta tgg gca agc agg gag cta gaa cga ttc gca gtt aat cct 144 His Ile Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro 35 40 45 ggc ctt tta gag aca tca gaa ggc tgt aga caa ata ctg gga cag cta 192 Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gln Ile Leu Gly Gln Leu 50 55 60 caa cca tcc ctt cag aca gga tca gaa gaa ctt aga tca tta tat aat 240 Gln Pro Ser Leu Gln Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn 65 70 75 80 aca ata gca gtc ctc tat tgt gtg cat caa agg ata gat gta aaa gac 288 Thr Ile Ala Val Leu Tyr Cys Val His Gln Arg Ile Asp Val Lys Asp 85 90 95 acc aag gaa gcc tta gat aag ata gag gaa gag caa aac aaa agt aag 336 Thr Lys Glu Ala Leu Asp Lys Ile Glu Glu Glu Gln Asn Lys Ser Lys 100 105 110 aaa aag gca cag caa gca gca gct gac aca gga aac aac agc cag gtc 384 Lys Lys Ala Gln Gln Ala Ala Ala Asp Thr Gly Asn Asn Ser Gln Val 115 120 125 
agc caa aat tac cct ata gtg cag aac ctc cag ggg caa atg gta cat 432 Ser Gln Asn Tyr Pro Ile Val Gln Asn Leu Gln Gly Gln Met Val His 130 135 140 cag gcc ata tca cct aga act tta aat gca tgg gta aaa gta gta gaa 480 Gln Ala Ile Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu 145 150 155 160 gag aag gct ttc agc cca gaa gta ata ccc atg ttt tca gca tta tca 528 Glu Lys Ala Phe Ser Pro Glu Val Ile Pro Met Phe Ser Ala Leu Ser 165 170 175 gaa gga gcc acc cca caa gat tta aat acc atg cta aac aca gtg ggg 576 Glu Gly Ala Thr Pro Gln Asp Leu Asn Thr Met Leu Asn Thr Val Gly 180 185 190 gga cat caa gca gcc atg caa atg tta aaa gag acc atc aat gag gaa 624 Gly His Gln Ala Ala Met Gln Met Leu Lys Glu Thr Ile Asn Glu Glu 195 200 205 gct gca gaa tgg gat aga ttg cat cca gtg cat gca ggg cct att gca 672 Ala Ala Glu Trp Asp Arg Leu His Pro Val His Ala Gly Pro Ile Ala 210 215 220 cca ggc cag atg aga gaa cca agg gga agt gac ata gca gga act act 720 Pro Gly Gln Met Arg Glu Pro Arg Gly Ser Asp Ile Ala Gly Thr Thr 225 230 235 240 agt acc ctt cag gaa caa ata gga tgg atg aca cat aat cca cct atc 768 Ser Thr Leu Gln Glu Gln Ile Gly Trp Met Thr His Asn Pro Pro Ile 245 250 255 cca gta gga gaa atc tat aaa aga tgg ata atc ctg gga tta aat aaa 816 Pro Val Gly Glu Ile Tyr Lys Arg Trp Ile Ile Leu Gly Leu Asn Lys 260 265 270 ata gta aga atg tat agc cct acc agc att ctg gac ata aga caa gga 864 Ile Val Arg Met Tyr Ser Pro Thr Ser Ile Leu Asp Ile Arg Gln Gly 275 280 285 cca aag gaa ccc ttt aga gac tat gta gac cga ttc tat aaa act cta 912 Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu 290 295 300 aga gcc gag caa gct tca caa gag gta aaa aat tgg atg aca gaa acc 960 Arg Ala Glu Gln Ala Ser Gln Glu Val Lys Asn Trp Met Thr Glu Thr 305 310 315 320 ttg ttg gtc caa aat gcg aac cca gat tgt aag act att tta aaa gca 1008 Leu Leu Val Gln Asn Ala Asn Pro Asp Cys Lys Thr Ile Leu Lys Ala 325 330 335 ttg gga cca gga gcg aca cta gaa gaa atg atg aca gca tgt cag gga 1056 Leu Gly Pro Gly Ala Thr Leu Glu Glu Met Met 
Thr Ala Cys Gln Gly 340 345 350 gtg ggg gga ccc ggc cat aaa gca aga gtt ttg gct gaa gca atg agc 1104 Val Gly Gly Pro Gly His Lys Ala Arg Val Leu Ala Glu Ala Met Ser 355 360 365 caa gta aca aat cca gct acc ata atg ata cag aaa ggc aat ttt agg 1152 Gln Val Thr Asn Pro Ala Thr Ile Met Ile Gln Lys Gly Asn Phe Arg 370 375 380 aac caa aga aag act gtt aag tgt ttc aat tgt ggc aaa gaa ggg cac 1200 Asn Gln Arg Lys Thr Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His 385 390 395 400 ata gcc aaa aat tgc agg gcc cct agg aaa aag ggc tgt tgg aaa tgt 1248 Ile Ala Lys Asn Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys 405 410 415 gga aag gaa gga cac caa atg aaa gat tgt act gag aga cag gct aat 1296 Gly Lys Glu Gly His Gln Met Lys Asp Cys Thr Glu Arg Gln Ala Asn 420 425 430 ttt tta ggg aag atc tgg cct tcc cac aag gga agg cca ggg aat ttt 1344 Phe Leu Gly Lys Ile Trp Pro Ser His Lys Gly Arg Pro Gly Asn Phe 435 440 445 ctt cag agc aga cca gag cca aca gcc cca cca gaa gag agc ttc agg 1392 Leu Gln Ser Arg Pro Glu Pro Thr Ala Pro Pro Glu Glu Ser Phe Arg 450 455 460 ttt ggg gaa gag aca aca act ccc tct cag aag cag gag ccg ata gac 1440 Phe Gly Glu Glu Thr Thr Thr Pro Ser Gln Lys Gln Glu Pro Ile Asp 465 470 475 480 aag gaa ctg tat cct tta gct tcc ctc aga tca ctc ttt ggc agc gac 1488 Lys Glu Leu Tyr Pro Leu Ala Ser Leu Arg Ser Leu Phe Gly Ser Asp 485 490 495 ccc tcg tca caa taa 1503 Pro Ser Ser Gln 500 <200> SEQUENCE CHARACTERISTICS: <210> SEQ ID NO 3 <211> LENGTH: 500 <212> TYPE: PRT <213> ORGANISM: Homo sapien <400> SEQUENCE: 3 Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Lys Trp 1 5 10 15 Glu Lys Ile Arg Leu Arg Pro Gly Gly Lys Lys Gln Tyr Lys Leu Lys 20 25 30 His Ile Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro 35 40 45 Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gln Ile Leu Gly Gln Leu 50 55 60 Gln Pro Ser Leu Gln Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn 65 70 75 80 Thr Ile Ala Val Leu Tyr Cys Val His Gln Arg Ile Asp Val Lys Asp 85 90 95 Thr Lys Glu Ala Leu Asp Lys Ile Glu 
Glu Glu Gln Asn Lys Ser Lys 100 105 110 Lys Lys Ala Gln Gln Ala Ala Ala Asp Thr Gly Asn Asn Ser Gln Val 115 120 125 Ser Gln Asn Tyr Pro Ile Val Gln Asn Leu Gln Gly Gln Met Val His 130 135 140 Gln Ala Ile Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu 145 150 155 160 Glu Lys Ala Phe Ser Pro Glu Val Ile Pro Met Phe Ser Ala Leu Ser 165 170 175 Glu Gly Ala Thr Pro Gln Asp Leu Asn Thr Met Leu Asn Thr Val Gly 180 185 190 Gly His Gln Ala Ala Met Gln Met Leu Lys Glu Thr Ile Asn Glu Glu 195 200 205 Ala Ala Glu Trp Asp Arg Leu His Pro Val His Ala Gly Pro Ile Ala 210 215 220 Pro Gly Gln Met Arg Glu Pro Arg Gly Ser Asp Ile Ala Gly Thr Thr 225 230 235 240 Ser Thr Leu Gln Glu Gln Ile Gly Trp Met Thr His Asn Pro Pro Ile 245 250 255 Pro Val Gly Glu Ile Tyr Lys Arg Trp Ile Ile Leu Gly Leu Asn Lys 260 265 270 Ile Val Arg Met Tyr Ser Pro Thr Ser Ile Leu Asp Ile Arg Gln Gly 275 280 285 Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu 290 295 300 Arg Ala Glu Gln Ala Ser Gln Glu Val Lys Asn Trp Met Thr Glu Thr 305 310 315 320 Leu Leu Val Gln Asn Ala Asn Pro Asp Cys Lys Thr Ile Leu Lys Ala 325 330 335 Leu Gly Pro Gly Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gln Gly 340 345 350 Val Gly Gly Pro Gly His Lys Ala Arg Val Leu Ala Glu Ala Met Ser 355 360 365 Gln Val Thr Asn Pro Ala Thr Ile Met Ile Gln Lys Gly Asn Phe Arg 370 375 380 Asn Gln Arg Lys Thr Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His 385 390 395 400 Ile Ala Lys Asn Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys 405 410 415 Gly Lys Glu Gly His Gln Met Lys Asp Cys Thr Glu Arg Gln Ala Asn 420 425 430 Phe Leu Gly Lys Ile Trp Pro Ser His Lys Gly Arg Pro Gly Asn Phe 435 440 445 Leu Gln Ser Arg Pro Glu Pro Thr Ala Pro Pro Glu Glu Ser Phe Arg 450 455 460 Phe Gly Glu Glu Thr Thr Thr Pro Ser Gln Lys Gln Glu Pro Ile Asp 465 470 475 480 Lys Glu Leu Tyr Pro Leu Ala Ser Leu Arg Ser Leu Phe Gly Ser Asp 485 490 495 Pro Ser Ser Gln 500 <200> SEQUENCE CHARACTERISTICS: <210> SEQ ID NO 4 <211> LENGTH: 1503 <212> TYPE: DNA <213> 
ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Codon optimized form of HIV gag coding region <400> SEQUENCE: 4 atg ggc gcc cgc gcc tcc gtg ctg tcc ggc ggc gag ctg gac aag tgg 48 Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Lys Trp 1 5 10 15 gag aag atc cgc ctg cgc ccc ggc ggc aag aag cag tac aag ctg aag 96 Glu Lys Ile Arg Leu Arg Pro Gly Gly Lys Lys Gln Tyr Lys Leu Lys 20 25 30 cac atc gtg tgg gcc tcc cgc gag ctg gag cgc ttc gcc gtg aac ccc 144 His Ile Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro 35 40 45 ggc ctg ctg gag acc tcc gag ggc tgc cgc cag atc ctg ggc cag ctg 192 Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gln Ile Leu Gly Gln Leu 50 55 60 cag ccc tcc ctg caa acc ggc tcc gag gag ctg cgc tcc ctg tac aac 240 Gln Pro Ser Leu Gln Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn 65 70 75 80 acc atc gcc gtg ctg tac tgc gtg cac cag cgc atc gac gtg aag gac 288 Thr Ile Ala Val Leu Tyr Cys Val His Gln Arg Ile Asp Val Lys Asp 85 90 95 acc aag gag gcc ctg gac aag atc gag gag gag cag aac aag tcc aag 336 Thr Lys Glu Ala Leu Asp Lys Ile Glu Glu Glu Gln Asn Lys Ser Lys 100 105 110 aag aag gcc cag cag gcc gcc gcc gac acc ggc aac aac tcc cag gtg 384 Lys Lys Ala Gln Gln Ala Ala Ala Asp Thr Gly Asn Asn Ser Gln Val 115 120 125 tcc cag aac tac ccc atc gtg cag aac ctg cag ggc cag atg gtg cac 432 Ser Gln Asn Tyr Pro Ile Val Gln Asn Leu Gln Gly Gln Met Val His 130 135 140 cag gcc atc tcc ccc cgc acc ctg aac gcc tgg gtg aag gtg gtg gag 480 Gln Ala Ile Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu 145 150 155 160 gag aag gcc ttc tcc ccc gaa gtc atc ccc atg ttc tcc gcc ctg tcc 528 Glu Lys Ala Phe Ser Pro Glu Val Ile Pro Met Phe Ser Ala Leu Ser 165 170 175 gag ggc gcc acc ccc cag gac ctg aac acc atg ctg aac acc gtg ggc 576 Glu Gly Ala Thr Pro Gln Asp Leu Asn Thr Met Leu Asn Thr Val Gly 180 185 190 ggc cac cag gcc gcc atg cag atg ctg aag gag acc atc aac gag gag 624 Gly His Gln Ala Ala Met Gln Met Leu Lys Glu Thr Ile Asn Glu Glu 195 200 205 gcc 
gcc gag tgg gac cgc ctg cac ccc gtg cac gcc ggc ccc atc gcc 672 Ala Ala Glu Trp Asp Arg Leu His Pro Val His Ala Gly Pro Ile Ala 210 215 220 ccc ggc cag atg cgc gag ccc cgc ggc tcc gac atc gcc ggc acc acc 720 Pro Gly Gln Met Arg Glu Pro Arg Gly Ser Asp Ile Ala Gly Thr Thr 225 230 235 240 tcc acc ctg caa gag cag atc ggc tgg atg acc cac aac ccc ccc atc 768 Ser Thr Leu Gln Glu Gln Ile Gly Trp Met Thr His Asn Pro Pro Ile 245 250 255 ccc gtg ggc gag atc tac aag cgc tgg atc atc ctg ggc ctg aac aag 816 Pro Val Gly Glu Ile Tyr Lys Arg Trp Ile Ile Leu Gly Leu Asn Lys 260 265 270

atc gtg cgc atg tac tcc ccc acc tcc atc ctg gac atc cgc cag ggc 864 Ile Val Arg Met Tyr Ser Pro Thr Ser Ile Leu Asp Ile Arg Gln Gly 275 280 285 ccc aag gag ccc ttc cgc gac tac gtg gac cgc ttc tac aag acc ctg 912 Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu 290 295 300 cgc gcc gag cag gcc tcc cag gag gta aag aac tgg atg acc gag acc 960 Arg Ala Glu Gln Ala Ser Gln Glu Val Lys Asn Trp Met Thr Glu Thr 305 310 315 320 ctg ctg gtg cag aac gcc aac ccc gac tgc aag acc atc ctg aag gcc 1008 Leu Leu Val Gln Asn Ala Asn Pro Asp Cys Lys Thr Ile Leu Lys Ala 325 330 335 ctg ggc ccc ggc gcc acc ctg gag gag atg atg acc gcc tgc cag ggc 1056 Leu Gly Pro Gly Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gln Gly 340 345 350 gtg ggc ggc ccc ggc cac aag gcc cgc gtg ctg gcc gag gcc atg tcc 1104 Val Gly Gly Pro Gly His Lys Ala Arg Val Leu Ala Glu Ala Met Ser 355 360 365 caa gtc acc aac ccc gcc acc atc atg atc cag aag ggc aac ttc cgc 1152 Gln Val Thr Asn Pro Ala Thr Ile Met Ile Gln Lys Gly Asn Phe Arg 370 375 380 aac cag cgc aag acc gtg aag tgc ttc aac tgc ggc aag gag ggc cac 1200 Asn Gln Arg Lys Thr Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His 385 390 395 400 atc gcc aag aac tgc cgc gcc ccc cgc aag aag ggc tgc tgg aag tgc 1248 Ile Ala Lys Asn Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys 405 410 415 ggc aag gag ggc cac cag atg aaa gat tgt act gag aga cag gct aat 1296 Gly Lys Glu Gly His Gln Met Lys Asp Cys Thr Glu Arg Gln Ala Asn 420 425 430 ttt tta ggg aag atc tgg cct tcc cac aag gga agg cca ggg aat ttt 1344 Phe Leu Gly Lys Ile Trp Pro Ser His Lys Gly Arg Pro Gly Asn Phe 435 440 445 ctt cag agc aga cca gag cca aca gcc cca cca gaa gag agc ttc agg 1392 Leu Gln Ser Arg Pro Glu Pro Thr Ala Pro Pro Glu Glu Ser Phe Arg 450 455 460 ttt ggg gaa gag aca aca act ccc tct cag aag cag gag ccg ata gac 1440 Phe Gly Glu Glu Thr Thr Thr Pro Ser Gln Lys Gln Glu Pro Ile Asp 465 470 475 480 aag gaa ctg tat cct tta gct tcc ctc aga tca ctc ttt ggc agc gac 1488 Lys Glu Leu Tyr Pro Leu Ala Ser Leu 
Arg Ser Leu Phe Gly Ser Asp 485 490 495 ccc tcg tca caa taa 1503 Pro Ser Ser Gln 500 <200> SEQUENCE CHARACTERISTICS: <210> SEQ ID NO 5 <211> LENGTH: 500 <212> TYPE: PRT <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Codon optimized form of HIV gag coding region <400> SEQUENCE: 5 Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Lys Trp 1 5 10 15 Glu Lys Ile Arg Leu Arg Pro Gly Gly Lys Lys Gln Tyr Lys Leu Lys 20 25 30 His Ile Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro 35 40 45 Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gln Ile Leu Gly Gln Leu 50 55 60 Gln Pro Ser Leu Gln Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn 65 70 75 80 Thr Ile Ala Val Leu Tyr Cys Val His Gln Arg Ile Asp Val Lys Asp 85 90 95 Thr Lys Glu Ala Leu Asp Lys Ile Glu Glu Glu Gln Asn Lys Ser Lys 100 105 110 Lys Lys Ala Gln Gln Ala Ala Ala Asp Thr Gly Asn Asn Ser Gln Val 115 120 125 Ser Gln Asn Tyr Pro Ile Val Gln Asn Leu Gln Gly Gln Met Val His 130 135 140 Gln Ala Ile Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu 145 150 155 160 Glu Lys Ala Phe Ser Pro Glu Val Ile Pro Met Phe Ser Ala Leu Ser 165 170 175 Glu Gly Ala Thr Pro Gln Asp Leu Asn Thr Met Leu Asn Thr Val Gly 180 185 190 Gly His Gln Ala Ala Met Gln Met Leu Lys Glu Thr Ile Asn Glu Glu 195 200 205 Ala Ala Glu Trp Asp Arg Leu His Pro Val His Ala Gly Pro Ile Ala 210 215 220 Pro Gly Gln Met Arg Glu Pro Arg Gly Ser Asp Ile Ala Gly Thr Thr 225 230 235 240 Ser Thr Leu Gln Glu Gln Ile Gly Trp Met Thr His Asn Pro Pro Ile 245 250 255 Pro Val Gly Glu Ile Tyr Lys Arg Trp Ile Ile Leu Gly Leu Asn Lys 260 265 270 Ile Val Arg Met Tyr Ser Pro Thr Ser Ile Leu Asp Ile Arg Gln Gly 275 280 285 Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu 290 295 300 Arg Ala Glu Gln Ala Ser Gln Glu Val Lys Asn Trp Met Thr Glu Thr 305 310 315 320 Leu Leu Val Gln Asn Ala Asn Pro Asp Cys Lys Thr Ile Leu Lys Ala 325 330 335 Leu Gly Pro Gly Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gln Gly 340 345 350 Val Gly Gly Pro Gly 
His Lys Ala Arg Val Leu Ala Glu Ala Met Ser 355 360 365 Gln Val Thr Asn Pro Ala Thr Ile Met Ile Gln Lys Gly Asn Phe Arg 370 375 380 Asn Gln Arg Lys Thr Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His 385 390 395 400 Ile Ala Lys Asn Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys 405 410 415 Gly Lys Glu Gly His Gln Met Lys Asp Cys Thr Glu Arg Gln Ala Asn 420 425 430 Phe Leu Gly Lys Ile Trp Pro Ser His Lys Gly Arg Pro Gly Asn Phe 435 440 445 Leu Gln Ser Arg Pro Glu Pro Thr Ala Pro Pro Glu Glu Ser Phe Arg 450 455 460 Phe Gly Glu Glu Thr Thr Thr Pro Ser Gln Lys Gln Glu Pro Ile Asp 465 470 475 480 Lys Glu Leu Tyr Pro Leu Ala Ser Leu Arg Ser Leu Phe Gly Ser Asp 485 490 495 Pro Ser Ser Gln 500 <200> SEQUENCE CHARACTERISTICS: <210> SEQ ID NO 6 <211> LENGTH: 3012 <212> TYPE: DNA <213> ORGANISM: Homo sapien <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (1)...(3012) <400> SEQUENCE: 6 ttt ttt agg gaa gat ctg gcc ttc cca caa ggg aag gcc agg gaa ttt 48 Phe Phe Arg Glu Asp Leu Ala Phe Pro Gln Gly Lys Ala Arg Glu Phe 1 5 10 15 tct tca gag cag acc aga gcc aac agc ccc acc aga aga gag ctt cag 96 Ser Ser Glu Gln Thr Arg Ala Asn Ser Pro Thr Arg Arg Glu Leu Gln 20 25 30 gtt tgg gga aga gac aac aac tcc ctc tca gaa gca gga gcc gat aga 144 Val Trp Gly Arg Asp Asn Asn Ser Leu Ser Glu Ala Gly Ala Asp Arg 35 40 45 caa gga act gta tcc ttt agc ttc cct cag atc act ctt tgg cag cga 192 Gln Gly Thr Val Ser Phe Ser Phe Pro Gln Ile Thr Leu Trp Gln Arg 50 55 60 ccc ctc gtc aca ata aag ata ggg ggg caa tta aag gaa gct cta tta 240 Pro Leu Val Thr Ile Lys Ile Gly Gly Gln Leu Lys Glu Ala Leu Leu 65 70 75 80 gat aca gga gca gat gat aca gta tta gaa gaa atg aat ttg cca gga 288 Asp Thr Gly Ala Asp Asp Thr Val Leu Glu Glu Met Asn Leu Pro Gly 85 90 95 aga tgg aaa cca aaa atg ata ggg gga att gga ggt ttt atc aaa gta 336 Arg Trp Lys Pro Lys Met Ile Gly Gly Ile Gly Gly Phe Ile Lys Val 100 105 110 gga cag tat gat cag ata ctc ata gaa atc tgc gga cat aaa gct ata 384 Gly Gln Tyr Asp Gln Ile Leu Ile Glu Ile Cys Gly His 
Lys Ala Ile 115 120 125 ggt aca gta tta gta gga cct aca cct gtc aac ata att gga aga aat 432 Gly Thr Val Leu Val Gly Pro Thr Pro Val Asn Ile Ile Gly Arg Asn 130 135 140 ctg ttg act cag att ggc tgc act tta aat ttt ccc att agt cct att 480 Leu Leu Thr Gln Ile Gly Cys Thr Leu Asn Phe Pro Ile Ser Pro Ile 145 150 155 160 gag act gta cca gta aaa tta aag cca gga atg gat ggc cca aaa gtt 528 Glu Thr Val Pro Val Lys Leu Lys Pro Gly Met Asp Gly Pro Lys Val 165 170 175 aaa caa tgg cca ttg aca gaa gaa aaa ata aaa gca tta gta gaa att 576 Lys Gln Trp Pro Leu Thr Glu Glu Lys Ile Lys Ala Leu Val Glu Ile 180 185 190 tgt aca gaa atg gaa aag gaa gga aaa att tca aaa att ggg cct gaa 624 Cys Thr Glu Met Glu Lys Glu Gly Lys Ile Ser Lys Ile Gly Pro Glu 195 200 205 aat cca tac aat act cca gta ttt gcc ata aag aaa aaa gac agt act 672 Asn Pro Tyr Asn Thr Pro Val Phe Ala Ile Lys Lys Lys Asp Ser Thr 210 215 220 aaa tgg aga aaa tta gta gat ttc aga gaa ctt aat aag aga act caa 720 Lys Trp Arg Lys Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gln 225 230 235 240 gat ttc tgg gaa gtt caa tta gga ata cca cat cct gca ggg tta aaa 768 Asp Phe Trp Glu Val Gln Leu Gly Ile Pro His Pro Ala Gly Leu Lys 245 250 255 cag aaa aaa tca gta aca gta ctg gat gtg ggc gat gca tat ttt tca 816 Gln Lys Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser 260 265 270 gtt ccc tta gat aaa gac ttc agg aag tat act gca ttt acc ata cct 864 Val Pro Leu Asp Lys Asp Phe Arg Lys Tyr Thr Ala Phe Thr Ile Pro 275 280 285 agt ata aac aat gag aca cca ggg att aga tat cag tac aat gtg ctt 912 Ser Ile Asn Asn Glu Thr Pro Gly Ile Arg Tyr Gln Tyr Asn Val Leu 290 295 300 cca cag gga tgg aaa gga tca cca gca ata ttc cag tgt agc atg aca 960 Pro Gln Gly Trp Lys Gly Ser Pro Ala Ile Phe Gln Cys Ser Met Thr 305 310 315 320 aaa atc tta gag cct ttt aga aaa caa aat cca gac ata gtc atc tat 1008 Lys Ile Leu Glu Pro Phe Arg Lys Gln Asn Pro Asp Ile Val Ile Tyr 325 330 335 caa tac atg gat gat ttg tat gta gga tct gac tta gaa ata ggg cag 1056 Gln Tyr Met Asp Asp 
Leu Tyr Val Gly Ser Asp Leu Glu Ile Gly Gln 340 345 350 cat aga aca aaa ata gag gaa ctg aga caa cat ctg ttg agg tgg gga 1104 His Arg Thr Lys Ile Glu Glu Leu Arg Gln His Leu Leu Arg Trp Gly 355 360 365 ttt acc aca cca gac aaa aaa cat cag aaa gaa cct cca ttc ctt tgg 1152 Phe Thr Thr Pro Asp Lys Lys His Gln Lys Glu Pro Pro Phe Leu Trp 370 375 380 atg ggt tat gaa ctc cat cct gat aaa tgg aca gta cag cct ata gtg 1200 Met Gly Tyr Glu Leu His Pro Asp Lys Trp Thr Val Gln Pro Ile Val 385 390 395 400 ctg cca gaa aag gac agc tgg act gtc aat gac ata cag aaa tta gtg 1248 Leu Pro Glu Lys Asp Ser Trp Thr Val Asn Asp Ile Gln Lys Leu Val 405 410 415 gga aaa ttg aat tgg gca agt cag att tat gca ggg att aaa gta agg 1296 Gly Lys Leu Asn Trp Ala Ser Gln Ile Tyr Ala Gly Ile Lys Val Arg 420 425 430 caa tta tgt aaa ctt ctt agg gga acc aaa gca cta aca gaa gta gta 1344 Gln Leu Cys Lys Leu Leu Arg Gly Thr Lys Ala Leu Thr Glu Val Val 435 440 445 cca cta aca gaa gaa gca gag cta gaa ctg gca gaa aac agg gag att 1392 Pro Leu Thr Glu Glu Ala Glu Leu Glu Leu Ala Glu Asn Arg Glu Ile 450 455 460 cta aaa gaa ccg gta cat gga gtg tat tat gac cca tca aaa gac tta 1440 Leu Lys Glu Pro Val His Gly Val Tyr Tyr Asp Pro Ser Lys Asp Leu 465 470 475 480 ata gca gaa ata cag aag cag ggg caa ggc caa tgg aca tat caa att 1488 Ile Ala Glu Ile Gln Lys Gln Gly Gln Gly Gln Trp Thr Tyr Gln Ile 485 490 495 tat caa gag cca ttt aaa aat ctg aaa aca gga aaa tat gca aga atg 1536 Tyr Gln Glu Pro Phe Lys Asn Leu Lys Thr Gly Lys Tyr Ala Arg Met 500 505 510 aag ggt gcc cac act aat gat gtg aaa caa tta aca gag gca gta caa 1584 Lys Gly Ala His Thr Asn Asp Val Lys Gln Leu Thr Glu Ala Val Gln 515 520 525 aaa ata gcc aca gaa agc ata gta ata tgg gga aag act cct aaa ttt 1632 Lys Ile Ala Thr Glu Ser Ile Val Ile Trp Gly Lys Thr Pro Lys Phe 530 535 540 aaa tta ccc ata caa aag gaa aca tgg gaa gca tgg tgg aca gag tat 1680 Lys Leu Pro Ile Gln Lys Glu Thr Trp Glu Ala Trp Trp Thr Glu Tyr 545 550 555 560 tgg caa gcc acc tgg att cct gag tgg gag ttt 
gtc aat acc cct ccc 1728 Trp Gln Ala Thr Trp Ile Pro Glu Trp Glu Phe Val Asn Thr Pro Pro 565 570 575 tta gtg aag tta tgg tac cag tta gag aaa gaa ccc ata ata gga gca 1776 Leu Val Lys Leu Trp Tyr Gln Leu Glu Lys Glu Pro Ile Ile Gly Ala 580 585 590 gaa act ttc tat gta gat ggg gca gcc aat agg gaa act aaa tta gga 1824 Glu Thr Phe Tyr Val Asp Gly Ala Ala Asn Arg Glu Thr Lys Leu Gly 595 600 605 aaa gca gga tat gta act gac aga gga aga caa aaa gtt gtc ccc cta 1872 Lys Ala Gly Tyr Val Thr Asp Arg Gly Arg Gln Lys Val Val Pro Leu 610 615 620 acg gac aca aca aat cag aag act gag tta caa gca att cat cta gct 1920 Thr Asp Thr Thr Asn Gln Lys Thr Glu Leu Gln Ala Ile His Leu Ala 625 630 635 640 ttg cag gat tcg gga tta gaa gta aac ata gtg aca gac tca caa tat 1968 Leu Gln Asp Ser Gly Leu Glu Val Asn Ile Val Thr Asp Ser Gln Tyr 645 650 655 gca ttg gga atc att caa gca caa cca gat aag agt gaa tca gag tta 2016
Ala Leu Gly Ile Ile Gln Ala Gln Pro Asp Lys Ser Glu Ser Glu Leu 660 665 670 gtc agt caa ata ata gag cag tta ata aaa aag gaa aaa gtc tac ctg 2064 Val Ser Gln Ile Ile Glu Gln Leu Ile Lys Lys Glu Lys Val Tyr Leu 675 680 685 gca tgg gta cca gca cac aaa gga att gga gga aat gaa caa gta gat 2112 Ala Trp Val Pro Ala His Lys Gly Ile Gly Gly Asn Glu Gln Val Asp 690 695 700 ggg ttg gtc agt gct gga atc agg aaa gta cta ttt tta gat gga ata 2160 Gly Leu Val Ser Ala Gly Ile Arg Lys Val Leu Phe Leu Asp Gly Ile 705 710 715 720 gat aag gcc caa gaa gaa cat gag aaa tat cac agt aat tgg aga gca 2208 Asp Lys Ala Gln Glu Glu His Glu Lys Tyr His Ser Asn Trp Arg Ala 725 730 735 atg gct agt gat ttt aac cta cca cct gta gta gca aaa gaa ata gta 2256 Met Ala Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu Ile Val 740 745 750 gcc agc tgt gat aaa tgt cag cta aaa ggg gaa gcc atg cat gga caa 2304 Ala Ser Cys Asp Lys Cys Gln Leu Lys Gly Glu Ala Met His Gly Gln 755 760 765 gta gac tgt agc cca gga ata tgg cag cta gat tgt aca cat tta gaa 2352 Val Asp Cys Ser Pro Gly Ile Trp Gln Leu Asp Cys Thr His Leu Glu 770 775 780 gga aaa gtt atc ttg gta gca gtt cat gta gcc agt gga tat ata gaa 2400 Gly Lys Val Ile Leu Val Ala Val His Val Ala Ser Gly Tyr Ile Glu 785 790 795 800 gca gaa gta att cca gca gag aca ggg caa gaa aca gca tac ttc ctc 2448 Ala Glu Val Ile Pro Ala Glu Thr Gly Gln Glu Thr Ala Tyr Phe Leu 805 810 815 tta aaa tta gca gga aga tgg cca gta aaa aca gta cat aca gac aat 2496 Leu Lys Leu Ala Gly Arg Trp Pro Val Lys Thr Val His Thr Asp Asn 820 825 830 ggc agc aat ttc acc agt act aca gtt aag gcc gcc tgt tgg tgg gcg 2544 Gly Ser Asn Phe Thr Ser Thr Thr Val Lys Ala Ala Cys Trp Trp Ala 835 840 845 ggg atc aag cag gaa ttt ggc att ccc tac aat ccc caa agt caa gga 2592 Gly Ile Lys Gln Glu Phe Gly Ile Pro Tyr Asn Pro Gln Ser Gln Gly 850 855 860 gta ata gaa tct atg aat aaa gaa tta aag aaa att ata gga cag gta 2640 Val Ile Glu Ser Met Asn Lys Glu Leu Lys Lys Ile Ile Gly Gln Val 865 870 875 880 aga gat cag gct gaa cat 
ctt aag aca gca gta caa atg gca gta ttc 2688 Arg Asp Gln Ala Glu His Leu Lys Thr Ala Val Gln Met Ala Val Phe 885 890 895 atc cac aat ttt aaa aga aaa ggg ggg att ggg ggg tac agt gca ggg 2736 Ile His Asn Phe Lys Arg Lys Gly Gly Ile Gly Gly Tyr Ser Ala Gly 900 905 910 gaa aga ata gta gac ata ata gca aca gac ata caa act aaa gaa tta 2784 Glu Arg Ile Val Asp Ile Ile Ala Thr Asp Ile Gln Thr Lys Glu Leu 915 920 925 caa aaa caa att aca aaa att caa aat ttt cgg gtt tat tac agg gac 2832 Gln Lys Gln Ile Thr Lys Ile Gln Asn Phe Arg Val Tyr Tyr Arg Asp 930 935 940 agc aga gat cca gtt tgg aaa gga cca gca aag ctc ctc tgg aaa ggt 2880 Ser Arg Asp Pro Val Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly 945 950 955 960 gaa ggg gca gta gta ata caa gat aat agt gac ata aaa gta gtg cca 2928 Glu Gly Ala Val Val Ile Gln Asp Asn Ser Asp Ile Lys Val Val Pro 965 970 975 aga aga aaa gca aag atc atc agg gat tat gga aaa cag atg gca ggt 2976 Arg Arg Lys Ala Lys Ile Ile Arg Asp Tyr Gly Lys Gln Met Ala Gly 980 985 990 gat gat tgt gtg gca agt aga cag gat gag gat taa 3012 Asp Asp Cys Val Ala Ser Arg Gln Asp Glu Asp 995 1000 <200> SEQUENCE CHARACTERISTICS: <210> SEQ ID NO 7 <211> LENGTH: 1003 <212> TYPE: PRT <213> ORGANISM: Homo sapien <400> SEQUENCE: 7 Phe Phe Arg Glu Asp Leu Ala Phe Pro Gln Gly Lys Ala Arg Glu Phe 1 5 10 15 Ser Ser Glu Gln Thr Arg Ala Asn Ser Pro Thr Arg Arg Glu Leu Gln 20 25 30 Val Trp Gly Arg Asp Asn Asn Ser Leu Ser Glu Ala Gly Ala Asp Arg 35 40 45 Gln Gly Thr Val Ser Phe Ser Phe Pro Gln Ile Thr Leu Trp Gln Arg 50 55 60 Pro Leu Val Thr Ile Lys Ile Gly Gly Gln Leu Lys Glu Ala Leu Leu 65 70 75 80 Asp Thr Gly Ala Asp Asp Thr Val Leu Glu Glu Met Asn Leu Pro Gly 85 90 95 Arg Trp Lys Pro Lys Met Ile Gly Gly Ile Gly Gly Phe Ile Lys Val 100 105 110 Gly Gln Tyr Asp Gln Ile Leu Ile Glu Ile Cys Gly His Lys Ala Ile 115 120 125 Gly Thr Val Leu Val Gly Pro Thr Pro Val Asn Ile Ile Gly Arg Asn 130 135 140 Leu Leu Thr Gln Ile Gly Cys Thr Leu Asn Phe Pro Ile Ser Pro Ile 145 150 155 160 Glu Thr Val Pro 
Val Lys Leu Lys Pro Gly Met Asp Gly Pro Lys Val 165 170 175 Lys Gln Trp Pro Leu Thr Glu Glu Lys Ile Lys Ala Leu Val Glu Ile 180 185 190 Cys Thr Glu Met Glu Lys Glu Gly Lys Ile Ser Lys Ile Gly Pro Glu 195 200 205 Asn Pro Tyr Asn Thr Pro Val Phe Ala Ile Lys Lys Lys Asp Ser Thr 210 215 220 Lys Trp Arg Lys Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gln 225 230 235 240 Asp Phe Trp Glu Val Gln Leu Gly Ile Pro His Pro Ala Gly Leu Lys 245 250 255 Gln Lys Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser 260 265 270 Val Pro Leu Asp Lys Asp Phe Arg Lys Tyr Thr Ala Phe Thr Ile Pro 275 280 285 Ser Ile Asn Asn Glu Thr Pro Gly Ile Arg Tyr Gln Tyr Asn Val Leu 290 295 300 Pro Gln Gly Trp Lys Gly Ser Pro Ala Ile Phe Gln Cys Ser Met Thr 305 310 315 320 Lys Ile Leu Glu Pro Phe Arg Lys Gln Asn Pro Asp Ile Val Ile Tyr 325 330 335 Gln Tyr Met Asp Asp Leu Tyr Val Gly Ser Asp Leu Glu Ile Gly Gln 340 345 350 His Arg Thr Lys Ile Glu Glu Leu Arg Gln His Leu Leu Arg Trp Gly 355 360 365 Phe Thr Thr Pro Asp Lys Lys His Gln Lys Glu Pro Pro Phe Leu Trp 370 375 380 Met Gly Tyr Glu Leu His Pro Asp Lys Trp Thr Val Gln Pro Ile Val 385 390 395 400 Leu Pro Glu Lys Asp Ser Trp Thr Val Asn Asp Ile Gln Lys Leu Val 405 410 415 Gly Lys Leu Asn Trp Ala Ser Gln Ile Tyr Ala Gly Ile Lys Val Arg 420 425 430 Gln Leu Cys Lys Leu Leu Arg Gly Thr Lys Ala Leu Thr Glu Val Val 435 440 445 Pro Leu Thr Glu Glu Ala Glu Leu Glu Leu Ala Glu Asn Arg Glu Ile 450 455 460 Leu Lys Glu Pro Val His Gly Val Tyr Tyr Asp Pro Ser Lys Asp Leu 465 470 475 480 Ile Ala Glu Ile Gln Lys Gln Gly Gln Gly Gln Trp Thr Tyr Gln Ile 485 490 495 Tyr Gln Glu Pro Phe Lys Asn Leu Lys Thr Gly Lys Tyr Ala Arg Met 500 505 510 Lys Gly Ala His Thr Asn Asp Val Lys Gln Leu Thr Glu Ala Val Gln 515 520 525 Lys Ile Ala Thr Glu Ser Ile Val Ile Trp Gly Lys Thr Pro Lys Phe 530 535 540 Lys Leu Pro Ile Gln Lys Glu Thr Trp Glu Ala Trp Trp Thr Glu Tyr 545 550 555 560 Trp Gln Ala Thr Trp Ile Pro Glu Trp Glu Phe Val Asn Thr Pro Pro 565 570 575 Leu Val Lys Leu Trp 
Tyr Gln Leu Glu Lys Glu Pro Ile Ile Gly Ala 580 585 590 Glu Thr Phe Tyr Val Asp Gly Ala Ala Asn Arg Glu Thr Lys Leu Gly 595 600 605 Lys Ala Gly Tyr Val Thr Asp Arg Gly Arg Gln Lys Val Val Pro Leu 610 615 620 Thr Asp Thr Thr Asn Gln Lys Thr Glu Leu Gln Ala Ile His Leu Ala 625 630 635 640 Leu Gln Asp Ser Gly Leu Glu Val Asn Ile Val Thr Asp Ser Gln Tyr 645 650 655 Ala Leu Gly Ile Ile Gln Ala Gln Pro Asp Lys Ser Glu Ser Glu Leu 660 665 670 Val Ser Gln Ile Ile Glu Gln Leu Ile Lys Lys Glu Lys Val Tyr Leu 675 680 685 Ala Trp Val Pro Ala His Lys Gly Ile Gly Gly Asn Glu Gln Val Asp 690 695 700 Gly Leu Val Ser Ala Gly Ile Arg Lys Val Leu Phe Leu Asp Gly Ile 705 710 715 720 Asp Lys Ala Gln Glu Glu His Glu Lys Tyr His Ser Asn Trp Arg Ala 725 730 735 Met Ala Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu Ile Val 740 745 750 Ala Ser Cys Asp Lys Cys Gln Leu Lys Gly Glu Ala Met His Gly Gln 755 760 765 Val Asp Cys Ser Pro Gly Ile Trp Gln Leu Asp Cys Thr His Leu Glu 770 775 780 Gly Lys Val Ile Leu Val Ala Val His Val Ala Ser Gly Tyr Ile Glu 785 790 795 800 Ala Glu Val Ile Pro Ala Glu Thr Gly Gln Glu Thr Ala Tyr Phe Leu 805 810 815 Leu Lys Leu Ala Gly Arg Trp Pro Val Lys Thr Val His Thr Asp Asn 820 825 830 Gly Ser Asn Phe Thr Ser Thr Thr Val Lys Ala Ala Cys Trp Trp Ala 835 840 845 Gly Ile Lys Gln Glu Phe Gly Ile Pro Tyr Asn Pro Gln Ser Gln Gly 850 855 860 Val Ile Glu Ser Met Asn Lys Glu Leu Lys Lys Ile Ile Gly Gln Val 865 870 875 880 Arg Asp Gln Ala Glu His Leu Lys Thr Ala Val Gln Met Ala Val Phe 885 890 895 Ile His Asn Phe Lys Arg Lys Gly Gly Ile Gly Gly Tyr Ser Ala Gly 900 905 910 Glu Arg Ile Val Asp Ile Ile Ala Thr Asp Ile Gln Thr Lys Glu Leu 915 920 925 Gln Lys Gln Ile Thr Lys Ile Gln Asn Phe Arg Val Tyr Tyr Arg Asp 930 935 940 Ser Arg Asp Pro Val Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly 945 950 955 960 Glu Gly Ala Val Val Ile Gln Asp Asn Ser Asp Ile Lys Val Val Pro 965 970 975 Arg Arg Lys Ala Lys Ile Ile Arg Asp Tyr Gly Lys Gln Met Ala Gly 980 985 990 Asp Asp Cys Val Ala Ser 
Arg Gln Asp Glu Asp 995 1000 <200> SEQUENCE CHARACTERISTICS: <210> SEQ ID NO 8 <211> LENGTH: 3012 <212> TYPE: DNA <213> ORGANISM: Homo sapien <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (1)...(3012) <400> SEQUENCE: 8 ttt ttt agg gaa gat ctg gcc ttc cca caa ggg aag gcc agg gaa ttt 48 Phe Phe Arg Glu Asp Leu Ala Phe Pro Gln Gly Lys Ala Arg Glu Phe 1 5 10 15 tct tca gag cag acc aga gcc aac agc ccc acc aga aga gag ctt cag 96 Ser Ser Glu Gln Thr Arg Ala Asn Ser Pro Thr Arg Arg Glu Leu Gln 20 25 30 gtt tgg gga aga gac aac aac tcc ctc tca gaa gca gga gcc gat aga 144 Val Trp Gly Arg Asp Asn Asn Ser Leu Ser Glu Ala Gly Ala Asp Arg 35 40 45 caa gga act gta tcc ttt agc ttc cct cag atc act ctt tgg cag cga 192 Gln Gly Thr Val Ser Phe Ser Phe Pro Gln Ile Thr Leu Trp Gln Arg 50 55 60 ccc ctc gtc aca ata aag ata ggg ggg caa tta aag gaa gct cta tta 240 Pro Leu Val Thr Ile Lys Ile Gly Gly Gln Leu Lys Glu Ala Leu Leu 65 70 75 80 gat aca gga gca gat gat aca gta tta gaa gaa atg aat ttg cca gga 288 Asp Thr Gly Ala Asp Asp Thr Val Leu Glu Glu Met Asn Leu Pro Gly 85 90 95 aga tgg aaa cca aaa atg ata ggg gga att gga ggt ttt atc aaa gta 336 Arg Trp Lys Pro Lys Met Ile Gly Gly Ile Gly Gly Phe Ile Lys Val 100 105 110 aga cag tat gat cag ata ctc ata gaa atc tgc gga cat aaa gct ata 384 Arg Gln Tyr Asp Gln Ile Leu Ile Glu Ile Cys Gly His Lys Ala Ile 115 120 125 ggt aca gta tta gta gga cct aca cct gtc aac ata att gga aga aat 432 Gly Thr Val Leu Val Gly Pro Thr Pro Val Asn Ile Ile Gly Arg Asn 130 135 140 ctg ttg act cag att ggc tgc act tta aat ttt ccc att agt cct att 480 Leu Leu Thr Gln Ile Gly Cys Thr Leu Asn Phe Pro Ile Ser Pro Ile 145 150 155 160 gag act gta cca gta aaa tta aag cca gga atg gat ggc cca aaa gtt 528 Glu Thr Val Pro Val Lys Leu Lys Pro Gly Met Asp Gly Pro Lys Val 165 170 175 aaa caa tgg cca ttg aca gaa gaa aaa ata aaa gca tta gta gaa att 576 Lys Gln Trp Pro Leu Thr Glu Glu Lys Ile Lys Ala Leu Val Glu Ile 180 185 190 tgt aca gaa atg gaa aag gaa gga aaa att tca aaa att ggg cct 
gaa 624 Cys Thr Glu Met Glu Lys Glu Gly Lys Ile Ser Lys Ile Gly Pro Glu 195 200 205 aat cca tac aat act cca gta ttt gcc ata aag aaa aaa gac agt act 672 Asn Pro Tyr Asn Thr Pro Val Phe Ala Ile Lys Lys Lys Asp Ser Thr 210 215 220 aaa tgg aga aaa tta gta gat ttc aga gaa ctt aat aag aga act caa 720 Lys Trp Arg Lys Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gln 225 230 235 240
gat ttc tgg gaa gtt caa tta gga ata cca cat cct gca ggg tta aaa 768 Asp Phe Trp Glu Val Gln Leu Gly Ile Pro His Pro Ala Gly Leu Lys 245 250 255 cag aaa aaa tca gta aca gta ctg gat gtg ggc gat gca tat ttt tca 816 Gln Lys Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser 260 265 270 gtt ccc tta gat aaa gac ttc agg aag tat act gca ttt acc ata cct 864 Val Pro Leu Asp Lys Asp Phe Arg Lys Tyr Thr Ala Phe Thr Ile Pro 275 280 285 agt ata aac aat gag aca cca ggg att aga tat cag tac aat gtg ctt 912 Ser Ile Asn Asn Glu Thr Pro Gly Ile Arg Tyr Gln Tyr Asn Val Leu 290 295 300 cca cag gga tgg aaa gga tca cca gca ata ttc cag tgt agc atg aca 960 Pro Gln Gly Trp Lys Gly Ser Pro Ala Ile Phe Gln Cys Ser Met Thr 305 310 315 320 aaa atc tta gag cct ttt aga aaa caa aat cca gac ata gtc atc tat 1008 Lys Ile Leu Glu Pro Phe Arg Lys Gln Asn Pro Asp Ile Val Ile Tyr 325 330 335 caa tac atg gat gat ttg tat gta gga tct gac tta gaa ata ggg cag 1056 Gln Tyr Met Asp Asp Leu Tyr Val Gly Ser Asp Leu Glu Ile Gly Gln 340 345 350 cat aga aca aaa ata gag gaa ctg aga caa cat ctg ttg agg tgg gga 1104 His Arg Thr Lys Ile Glu Glu Leu Arg Gln His Leu Leu Arg Trp Gly 355 360 365 ttt acc aca cca gac aaa aaa cat cag aaa gaa cct cca ttc ctt tgg 1152 Phe Thr Thr Pro Asp Lys Lys His Gln Lys Glu Pro Pro Phe Leu Trp 370 375 380 atg ggt tat gaa ctc cat cct gat aaa tgg aca gta cag cct ata gtg 1200 Met Gly Tyr Glu Leu His Pro Asp Lys Trp Thr Val Gln Pro Ile Val 385 390 395 400 ctg cca gaa aag gac agc tgg act gtc aat gac ata cag aaa tta gtg 1248 Leu Pro Glu Lys Asp Ser Trp Thr Val Asn Asp Ile Gln Lys Leu Val 405 410 415 gga aaa ttg aat tgg gca agt cag att tat gca ggg att aaa gta agg 1296 Gly Lys Leu Asn Trp Ala Ser Gln Ile Tyr Ala Gly Ile Lys Val Arg 420 425 430 caa tta tgt aaa ctt ctt agg gga acc aaa gca cta aca gaa gta gta 1344 Gln Leu Cys Lys Leu Leu Arg Gly Thr Lys Ala Leu Thr Glu Val Val 435 440 445 cca cta aca gaa gaa gca gag cta gaa ctg gca gaa aac agg gag att 1392 Pro Leu Thr Glu Glu Ala Glu Leu Glu Leu 
Ala Glu Asn Arg Glu Ile 450 455 460 cta aaa gaa ccg gta cat gga gtg tat tat gac cca tca aaa gac tta 1440 Leu Lys Glu Pro Val His Gly Val Tyr Tyr Asp Pro Ser Lys Asp Leu 465 470 475 480 ata gca gaa ata cag aag cag ggg caa ggc caa tgg aca tat caa att 1488 Ile Ala Glu Ile Gln Lys Gln Gly Gln Gly Gln Trp Thr Tyr Gln Ile 485 490 495 tat caa gag cca ttt aaa aat ctg aaa aca gga aaa tat gca aga atg 1536 Tyr Gln Glu Pro Phe Lys Asn Leu Lys Thr Gly Lys Tyr Ala Arg Met 500 505 510 aag ggt gcc cac act aat gat gtg aaa caa tta aca gag gca gta caa 1584 Lys Gly Ala His Thr Asn Asp Val Lys Gln Leu Thr Glu Ala Val Gln 515 520 525 aaa ata gcc aca gaa agc ata gta ata tgg gga aag act cct aaa ttt 1632 Lys Ile Ala Thr Glu Ser Ile Val Ile Trp Gly Lys Thr Pro Lys Phe 530 535 540 aaa tta ccc ata caa aag gaa aca tgg gaa gca tgg tgg aca gag tat 1680 Lys Leu Pro Ile Gln Lys Glu Thr Trp Glu Ala Trp Trp Thr Glu Tyr 545 550 555 560 tgg caa gcc acc tgg att cct gag tgg gag ttt gtc aat acc cct ccc 1728 Trp Gln Ala Thr Trp Ile Pro Glu Trp Glu Phe Val Asn Thr Pro Pro 565 570 575 tta gtg aag tta tgg tac cag tta gag aaa gaa ccc ata ata gga gca 1776 Leu Val Lys Leu Trp Tyr Gln Leu Glu Lys Glu Pro Ile Ile Gly Ala 580 585 590 gaa act ttc tat gta gat ggg gca gcc aat agg gaa act aaa tta gga 1824 Glu Thr Phe Tyr Val Asp Gly Ala Ala Asn Arg Glu Thr Lys Leu Gly 595 600 605 aaa gca gga tat gta act gac aga gga aga caa aaa gtt gtc ccc cta 1872 Lys Ala Gly Tyr Val Thr Asp Arg Gly Arg Gln Lys Val Val Pro Leu 610 615 620 acg gac aca aca aat cag aag act gag tta caa gca att cat cta gct 1920 Thr Asp Thr Thr Asn Gln Lys Thr Glu Leu Gln Ala Ile His Leu Ala 625 630 635 640 ttg cag gat tcg gga tta gaa gta aac ata gtg aca gac tca caa tat 1968 Leu Gln Asp Ser Gly Leu Glu Val Asn Ile Val Thr Asp Ser Gln Tyr 645 650 655 gca ttg gga atc att caa gca caa cca gat aag agt gaa tca gag tta 2016 Ala Leu Gly Ile Ile Gln Ala Gln Pro Asp Lys Ser Glu Ser Glu Leu 660 665 670 gtc agt caa ata ata gag cag tta ata aaa aag gaa aaa gtc tac ctg 
2064 Val Ser Gln Ile Ile Glu Gln Leu Ile Lys Lys Glu Lys Val Tyr Leu 675 680 685 gca tgg gta cca gca cac aaa gga att gga gga aat gaa caa gta gat 2112 Ala Trp Val Pro Ala His Lys Gly Ile Gly Gly Asn Glu Gln Val Asp 690 695 700 aag ttg gtc agt gct gga atc agg aaa gta cta ttt tta gat gga ata 2160 Lys Leu Val Ser Ala Gly Ile Arg Lys Val Leu Phe Leu Asp Gly Ile 705 710 715 720 gat aag gcc caa gaa gaa cat gag aaa tat cac agt aat tgg aga gca 2208 Asp Lys Ala Gln Glu Glu His Glu Lys Tyr His Ser Asn Trp Arg Ala 725 730 735 atg gct agt gat ttt aac cta cca cct gta gta gca aaa gaa ata gta 2256 Met Ala Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu Ile Val 740 745 750 gcc agc tgt gat aaa tgt cag cta aaa ggg gaa gcc atg cat gga caa 2304 Ala Ser Cys Asp Lys Cys Gln Leu Lys Gly Glu Ala Met His Gly Gln 755 760 765 gta gac tgt agc cca gga ata tgg cag cta gat tgt aca cat tta gaa 2352 Val Asp Cys Ser Pro Gly Ile Trp Gln Leu Asp Cys Thr His Leu Glu 770 775 780 gga aaa gtt atc ttg gta gca gtt cat gta gcc agt gga tat ata gaa 2400 Gly Lys Val Ile Leu Val Ala Val His Val Ala Ser Gly Tyr Ile Glu 785 790 795 800 gca gaa gta att cca gca gag aca ggg caa gaa aca gca tac ttc ctc 2448 Ala Glu Val Ile Pro Ala Glu Thr Gly Gln Glu Thr Ala Tyr Phe Leu 805 810 815 tta aaa tta gca gga aga tgg cca gta aaa aca gta cat aca gac aat 2496 Leu Lys Leu Ala Gly Arg Trp Pro Val Lys Thr Val His Thr Asp Asn 820 825 830 ggc agc aat ttc acc agt act aca gtt aag gcc gcc tgt tgg tgg gcg 2544 Gly Ser Asn Phe Thr Ser Thr Thr Val Lys Ala Ala Cys Trp Trp Ala 835 840 845 ggg atc aag cag gaa ttt ggc att ccc tac aat ccc caa agt caa gga 2592 Gly Ile Lys Gln Glu Phe Gly Ile Pro Tyr Asn Pro Gln Ser Gln Gly 850 855 860 gta ata gaa tct atg aat aaa gaa tta aag aaa att ata gga cag gta 2640 Val Ile Glu Ser Met Asn Lys Glu Leu Lys Lys Ile Ile Gly Gln Val 865 870 875 880 aga gat cag gct gaa cat ctt aag aca gca gta caa atg gca gta ttc 2688 Arg Asp Gln Ala Glu His Leu Lys Thr Ala Val Gln Met Ala Val Phe 885 890 895 atc cac aat ttt aaa 
aga aaa ggg ggg att ggg ggg tac agt gca ggg 2736 Ile His Asn Phe Lys Arg Lys Gly Gly Ile Gly Gly Tyr Ser Ala Gly 900 905 910 gaa aga ata gta gac ata ata gca aca gac ata caa act aaa gaa tta 2784 Glu Arg Ile Val Asp Ile Ile Ala Thr Asp Ile Gln Thr Lys Glu Leu 915 920 925 caa aaa caa att aca aaa att caa aat ttt cgg gtt tat tac agg gac 2832 Gln Lys Gln Ile Thr Lys Ile Gln Asn Phe Arg Val Tyr Tyr Arg Asp 930 935 940 agc aga gat cca gtt tgg aaa gga cca gca aag ctc ctc tgg aaa ggt 2880 Ser Arg Asp Pro Val Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly 945 950 955 960 gaa ggg gca gta gta ata caa gat aat agt gac ata aaa gta gtg cca 2928 Glu Gly Ala Val Val Ile Gln Asp Asn Ser Asp Ile Lys Val Val Pro 965 970 975 aga aga aaa gca aag atc atc agg gat tat gga aaa cag atg gca ggt 2976 Arg Arg Lys Ala Lys Ile Ile Arg Asp Tyr Gly Lys Gln Met Ala Gly 980 985 990 gat gat tgt gtg gca agt aga cag gat gag gat taa 3012 Asp Asp Cys Val Ala Ser Arg Gln Asp Glu Asp 995 1000 <200> SEQUENCE CHARACTERISTICS: <210> SEQ ID NO 9 <211> LENGTH: 1003 <212> TYPE: PRT <213> ORGANISM: Homo sapien <400> SEQUENCE: 9 Phe Phe Arg Glu Asp Leu Ala Phe Pro Gln Gly Lys Ala Arg Glu Phe 1 5 10 15 Ser Ser Glu Gln Thr Arg Ala Asn Ser Pro Thr Arg Arg Glu Leu Gln 20 25 30 Val Trp Gly Arg Asp Asn Asn Ser Leu Ser Glu Ala Gly Ala Asp Arg 35 40 45 Gln Gly Thr Val Ser Phe Ser Phe Pro Gln Ile Thr Leu Trp Gln Arg 50 55 60 Pro Leu Val Thr Ile Lys Ile Gly Gly Gln Leu Lys Glu Ala Leu Leu 65 70 75 80 Asp Thr Gly Ala Asp Asp Thr Val Leu Glu Glu Met Asn Leu Pro Gly 85 90 95 Arg Trp Lys Pro Lys Met Ile Gly Gly Ile Gly Gly Phe Ile Lys Val 100 105 110 Arg Gln Tyr Asp Gln Ile Leu Ile Glu Ile Cys Gly His Lys Ala Ile 115 120 125 Gly Thr Val Leu Val Gly Pro Thr Pro Val Asn Ile Ile Gly Arg Asn 130 135 140 Leu Leu Thr Gln Ile Gly Cys Thr Leu Asn Phe Pro Ile Ser Pro Ile 145 150 155 160 Glu Thr Val Pro Val Lys Leu Lys Pro Gly Met Asp Gly Pro Lys Val 165 170 175 Lys Gln Trp Pro Leu Thr Glu Glu Lys Ile Lys Ala Leu Val Glu Ile 180 185 190 Cys 
Thr Glu Met Glu Lys Glu Gly Lys Ile Ser Lys Ile Gly Pro Glu 195 200 205 Asn Pro Tyr Asn Thr Pro Val Phe Ala Ile Lys Lys Lys Asp Ser Thr 210 215 220 Lys Trp Arg Lys Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gln 225 230 235 240 Asp Phe Trp Glu Val Gln Leu Gly Ile Pro His Pro Ala Gly Leu Lys 245 250 255 Gln Lys Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser 260 265 270 Val Pro Leu Asp Lys Asp Phe Arg Lys Tyr Thr Ala Phe Thr Ile Pro 275 280 285 Ser Ile Asn Asn Glu Thr Pro Gly Ile Arg Tyr Gln Tyr Asn Val Leu 290 295 300 Pro Gln Gly Trp Lys Gly Ser Pro Ala Ile Phe Gln Cys Ser Met Thr 305 310 315 320 Lys Ile Leu Glu Pro Phe Arg Lys Gln Asn Pro Asp Ile Val Ile Tyr 325 330 335 Gln Tyr Met Asp Asp Leu Tyr Val Gly Ser Asp Leu Glu Ile Gly Gln 340 345 350 His Arg Thr Lys Ile Glu Glu Leu Arg Gln His Leu Leu Arg Trp Gly 355 360 365 Phe Thr Thr Pro Asp Lys Lys His Gln Lys Glu Pro Pro Phe Leu Trp 370 375 380 Met Gly Tyr Glu Leu His Pro Asp Lys Trp Thr Val Gln Pro Ile Val 385 390 395 400 Leu Pro Glu Lys Asp Ser Trp Thr Val Asn Asp Ile Gln Lys Leu Val 405 410 415 Gly Lys Leu Asn Trp Ala Ser Gln Ile Tyr Ala Gly Ile Lys Val Arg 420 425 430 Gln Leu Cys Lys Leu Leu Arg Gly Thr Lys Ala Leu Thr Glu Val Val 435 440 445 Pro Leu Thr Glu Glu Ala Glu Leu Glu Leu Ala Glu Asn Arg Glu Ile 450 455 460 Leu Lys Glu Pro Val His Gly Val Tyr Tyr Asp Pro Ser Lys Asp Leu 465 470 475 480 Ile Ala Glu Ile Gln Lys Gln Gly Gln Gly Gln Trp Thr Tyr Gln Ile 485 490 495 Tyr Gln Glu Pro Phe Lys Asn Leu Lys Thr Gly Lys Tyr Ala Arg Met 500 505 510 Lys Gly Ala His Thr Asn Asp Val Lys Gln Leu Thr Glu Ala Val Gln 515 520 525 Lys Ile Ala Thr Glu Ser Ile Val Ile Trp Gly Lys Thr Pro Lys Phe 530 535 540 Lys Leu Pro Ile Gln Lys Glu Thr Trp Glu Ala Trp Trp Thr Glu Tyr 545 550 555 560 Trp Gln Ala Thr Trp Ile Pro Glu Trp Glu Phe Val Asn Thr Pro Pro 565 570 575 Leu Val Lys Leu Trp Tyr Gln Leu Glu Lys Glu Pro Ile Ile Gly Ala 580 585 590 Glu Thr Phe Tyr Val Asp Gly Ala Ala Asn Arg Glu Thr Lys Leu Gly 595 600 605 Lys Ala 
Gly Tyr Val Thr Asp Arg Gly Arg Gln Lys Val Val Pro Leu 610 615 620 Thr Asp Thr Thr Asn Gln Lys Thr Glu Leu Gln Ala Ile His Leu Ala 625 630 635 640 Leu Gln Asp Ser Gly Leu Glu Val Asn Ile Val Thr Asp Ser Gln Tyr 645 650 655 Ala Leu Gly Ile Ile Gln Ala Gln Pro Asp Lys Ser Glu Ser Glu Leu 660 665 670 Val Ser Gln Ile Ile Glu Gln Leu Ile Lys Lys Glu Lys Val Tyr Leu 675 680 685 Ala Trp Val Pro Ala His Lys Gly Ile Gly Gly Asn Glu Gln Val Asp 690 695 700 Lys Leu Val Ser Ala Gly Ile Arg Lys Val Leu Phe Leu Asp Gly Ile 705 710 715 720 Asp Lys Ala Gln Glu Glu His Glu Lys Tyr His Ser Asn Trp Arg Ala 725 730 735 Met Ala Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu Ile Val 740 745 750 Ala Ser Cys Asp Lys Cys Gln Leu Lys Gly Glu Ala Met His Gly Gln 755 760 765 Val Asp Cys Ser Pro Gly Ile Trp Gln Leu Asp Cys Thr His Leu Glu 770 775 780 Gly Lys Val Ile Leu Val Ala Val His Val Ala Ser Gly Tyr Ile Glu 785 790 795 800 Ala Glu Val Ile Pro Ala Glu Thr Gly Gln Glu Thr Ala Tyr Phe Leu
805 810 815 Leu Lys Leu Ala Gly Arg Trp Pro Val Lys Thr Val His Thr Asp Asn 820 825 830 Gly Ser Asn Phe Thr Ser Thr Thr Val Lys Ala Ala Cys Trp Trp Ala 835 840 845 Gly Ile Lys Gln Glu Phe Gly Ile Pro Tyr Asn Pro Gln Ser Gln Gly 850 855 860 Val Ile Glu Ser Met Asn Lys Glu Leu Lys Lys Ile Ile Gly Gln Val 865 870 875 880 Arg Asp Gln Ala Glu His Leu Lys Thr Ala Val Gln Met Ala Val Phe 885 890 895 Ile His Asn Phe Lys Arg Lys Gly Gly Ile Gly Gly Tyr Ser Ala Gly 900 905 910 Glu Arg Ile Val Asp Ile Ile Ala Thr Asp Ile Gln Thr Lys Glu Leu 915 920 925 Gln Lys Gln Ile Thr Lys Ile Gln Asn Phe Arg Val Tyr Tyr Arg Asp 930 935 940 Ser Arg Asp Pro Val Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly 945 950 955 960 Glu Gly Ala Val Val Ile Gln Asp Asn Ser Asp Ile Lys Val Val Pro 965 970 975 Arg Arg Lys Ala Lys Ile Ile Arg Asp Tyr Gly Lys Gln Met Ala Gly 980 985 990 Asp Asp Cys Val Ala Ser Arg Gln Asp Glu Asp 995 1000 <200> SEQUENCE CHARACTERISTICS: <210> SEQ ID NO 10 <211> LENGTH: 3012 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Codon optimized form of HIV pol coding region <400> SEQUENCE: 10 ttt ttt agg gaa gat ctg gcc ttc cca caa ggg aag gcc agg gaa ttt 48 Phe Phe Arg Glu Asp Leu Ala Phe Pro Gln Gly Lys Ala Arg Glu Phe 1 5 10 15 tct tca gag cag acc aga gcc aac agc ccc acc aga aga gag ctt cag 96 Ser Ser Glu Gln Thr Arg Ala Asn Ser Pro Thr Arg Arg Glu Leu Gln 20 25 30 gtt tgg gga aga gac aac aac tcc ctc tca gaa gca gga gcc gat aga 144 Val Trp Gly Arg Asp Asn Asn Ser Leu Ser Glu Ala Gly Ala Asp Arg 35 40 45 caa gga act gta tcc ttt agc ttc cct cag atc act ctt tgg cag cga 192 Gln Gly Thr Val Ser Phe Ser Phe Pro Gln Ile Thr Leu Trp Gln Arg 50 55 60 ccc ctc gtc aca ata aag atc ggt ggc cag ctg aag gag gcc ctg ctg 240 Pro Leu Val Thr Ile Lys Ile Gly Gly Gln Leu Lys Glu Ala Leu Leu 65 70 75 80 gac acc ggc gcc gac gac acc gtg ctg gag gag atg aac ctg ccc ggc 288 Asp Thr Gly Ala Asp Asp Thr Val Leu Glu Glu Met Asn Leu Pro Gly 85 90 95 cgc tgg aag ccc 
aag atg atc ggc ggc atc ggc ggc ttc atc aaa gtc 336 Arg Trp Lys Pro Lys Met Ile Gly Gly Ile Gly Gly Phe Ile Lys Val 100 105 110 cgc cag tac gac cag atc ctg atc gag atc tgc ggc cac aag gcc atc 384 Arg Gln Tyr Asp Gln Ile Leu Ile Glu Ile Cys Gly His Lys Ala Ile 115 120 125 ggc acc gtg ctg gtg ggc ccc acc ccc gtg aac atc atc ggc cgc aac 432 Gly Thr Val Leu Val Gly Pro Thr Pro Val Asn Ile Ile Gly Arg Asn 130 135 140 ctg ctg acc cag atc ggc tgc acc ctg aac ttc ccc atc tcc ccc atc 480 Leu Leu Thr Gln Ile Gly Cys Thr Leu Asn Phe Pro Ile Ser Pro Ile 145 150 155 160 gag acc gtg ccc gtg aag ctg aag ccc ggc atg gac ggc ccc aaa gtc 528 Glu Thr Val Pro Val Lys Leu Lys Pro Gly Met Asp Gly Pro Lys Val 165 170 175 aag cag tgg ccc ctg acc gag gag aag atc aag gcc ctg gtg gag atc 576 Lys Gln Trp Pro Leu Thr Glu Glu Lys Ile Lys Ala Leu Val Glu Ile 180 185 190 tgc acc gag atg gag aag gag ggc aag atc tcc aag atc ggc ccc gag 624 Cys Thr Glu Met Glu Lys Glu Gly Lys Ile Ser Lys Ile Gly Pro Glu 195 200 205 aac ccc tac aac acc ccc gtg ttc gcc atc aag aag aag gac tcc acc 672 Asn Pro Tyr Asn Thr Pro Val Phe Ala Ile Lys Lys Lys Asp Ser Thr 210 215 220 aag tgg cgc aag ctg gtg gac ttc cgc gag ctg aac aag cgc acc cag 720 Lys Trp Arg Lys Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gln 225 230 235 240 gac ttc tgg gag gtg cag ctg ggc atc ccc cac ccc gcc ggc ctg aag 768 Asp Phe Trp Glu Val Gln Leu Gly Ile Pro His Pro Ala Gly Leu Lys 245 250 255 cag aag aag tcc gtg acc gtg ctg gac gtg ggc gac gcc tac ttc tcc 816 Gln Lys Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser 260 265 270 gtg ccc ctg gac aag gac ttc cgc aag tac acc gcc ttc acc atc ccc 864 Val Pro Leu Asp Lys Asp Phe Arg Lys Tyr Thr Ala Phe Thr Ile Pro 275 280 285 tcc atc aac aac gag acc ccc ggc atc cgc tac cag tac aac gtg ctg 912 Ser Ile Asn Asn Glu Thr Pro Gly Ile Arg Tyr Gln Tyr Asn Val Leu 290 295 300 ccc cag ggc tgg aag ggc tcc ccc gcc atc ttc cag tgc tcc atg acc 960 Pro Gln Gly Trp Lys Gly Ser Pro Ala Ile Phe Gln Cys Ser Met Thr 305 
310 315 320 aag atc ctg gag ccc ttc cgc aag cag aac ccc gac atc gtg atc tac 1008 Lys Ile Leu Glu Pro Phe Arg Lys Gln Asn Pro Asp Ile Val Ile Tyr 325 330 335 cag tac atg gac gac ctg tac gtg ggc tcc gac ctg gag atc ggc cag 1056 Gln Tyr Met Asp Asp Leu Tyr Val Gly Ser Asp Leu Glu Ile Gly Gln 340 345 350 cac cgc acc aag atc gag gag ctg cgc cag cac ctg ctg cgc tgg ggc 1104 His Arg Thr Lys Ile Glu Glu Leu Arg Gln His Leu Leu Arg Trp Gly 355 360 365 ttc acc acc ccc gac aag aag cac cag aag gag ccc ccc ttc ctg tgg 1152 Phe Thr Thr Pro Asp Lys Lys His Gln Lys Glu Pro Pro Phe Leu Trp 370 375 380 atg ggc tac gag ctg cac ccc gac aag tgg acc gtg cag ccc atc gtg 1200 Met Gly Tyr Glu Leu His Pro Asp Lys Trp Thr Val Gln Pro Ile Val 385 390 395 400 ctg ccc gag aag gac tcc tgg acc gtg aac gac atc cag aag ctg gtg 1248 Leu Pro Glu Lys Asp Ser Trp Thr Val Asn Asp Ile Gln Lys Leu Val 405 410 415 ggc aag ctg aac tgg gcc tcc cag atc tac gcc ggc atc aaa gtc cgc 1296 Gly Lys Leu Asn Trp Ala Ser Gln Ile Tyr Ala Gly Ile Lys Val Arg 420 425 430 cag ctg tgc aag ctg ctg cgc ggc acc aag gcc ctg acc gag gtg gtg 1344 Gln Leu Cys Lys Leu Leu Arg Gly Thr Lys Ala Leu Thr Glu Val Val 435 440 445 ccc ctg acc gag gag gcc gag ctg gag ctg gcc gag aac cgc gag atc 1392 Pro Leu Thr Glu Glu Ala Glu Leu Glu Leu Ala Glu Asn Arg Glu Ile 450 455 460 ctg aag gag ccc gtg cac ggc gtg tac tac gac ccc tcc aag gac ctg 1440 Leu Lys Glu Pro Val His Gly Val Tyr Tyr Asp Pro Ser Lys Asp Leu 465 470 475 480 atc gcc gag atc cag aag cag ggc cag ggc cag tgg acc tac cag atc 1488 Ile Ala Glu Ile Gln Lys Gln Gly Gln Gly Gln Trp Thr Tyr Gln Ile 485 490 495 tac cag gag ccc ttc aag aac ctg aag acc ggc aaa tac gcc cgc atg 1536 Tyr Gln Glu Pro Phe Lys Asn Leu Lys Thr Gly Lys Tyr Ala Arg Met 500 505 510 aag ggc gcc cac acc aac gac gtg aag cag ctg acc gag gcc gtg cag 1584 Lys Gly Ala His Thr Asn Asp Val Lys Gln Leu Thr Glu Ala Val Gln 515 520 525 aag atc gcc acc gag tcc atc gtg atc tgg ggc aag act ccc aag ttc 1632 Lys Ile Ala Thr Glu Ser 
Ile Val Ile Trp Gly Lys Thr Pro Lys Phe 530 535 540 aag ctg ccc atc cag aag gag acc tgg gag gcc tgg tgg acc gag tac 1680 Lys Leu Pro Ile Gln Lys Glu Thr Trp Glu Ala Trp Trp Thr Glu Tyr 545 550 555 560 tgg cag gcc acc tgg atc ccc gag tgg gag ttc gtg aac acc ccc ccc 1728 Trp Gln Ala Thr Trp Ile Pro Glu Trp Glu Phe Val Asn Thr Pro Pro 565 570 575 ctg gtg aag ctg tgg tac cag ctg gag aag gag ccc atc atc ggc gcc 1776 Leu Val Lys Leu Trp Tyr Gln Leu Glu Lys Glu Pro Ile Ile Gly Ala 580 585 590 gag acc ttc tac gtg gac ggc gcc gcc aac cgc gag acc aag ctg ggc 1824 Glu Thr Phe Tyr Val Asp Gly Ala Ala Asn Arg Glu Thr Lys Leu Gly 595 600 605 aag gcc ggc tac gtg acc gac cgc ggc cgc cag aag gtg gtg ccc ctg 1872 Lys Ala Gly Tyr Val Thr Asp Arg Gly Arg Gln Lys Val Val Pro Leu 610 615 620 acc gac acc acc aac cag aag acc gag ctg cag gcc atc cac ctg gcc 1920 Thr Asp Thr Thr Asn Gln Lys Thr Glu Leu Gln Ala Ile His Leu Ala 625 630 635 640 ctg caa gac tcc ggc ctg gag gtg aac atc gtg acc gac tcc cag tat 1968 Leu Gln Asp Ser Gly Leu Glu Val Asn Ile Val Thr Asp Ser Gln Tyr 645 650 655 gca ttg ggc atc atc cag gcc cag ccc gac aag tcc gag tcc gag ctg 2016 Ala Leu Gly Ile Ile Gln Ala Gln Pro Asp Lys Ser Glu Ser Glu Leu 660 665 670 gtg tcc cag atc atc gag cag ctg atc aag aag gag aag gtg tac ctg 2064 Val Ser Gln Ile Ile Glu Gln Leu Ile Lys Lys Glu Lys Val Tyr Leu 675 680 685 gcc tgg gtg ccc gcc cac aag ggc atc ggc ggc aac gag cag gtg gac 2112 Ala Trp Val Pro Ala His Lys Gly Ile Gly Gly Asn Glu Gln Val Asp 690 695 700 aag ctg gtg tcc gcc ggc atc cgc aag gtg ctg ttc ctg gac ggc atc 2160 Lys Leu Val Ser Ala Gly Ile Arg Lys Val Leu Phe Leu Asp Gly Ile 705 710 715 720 gac aag gcc cag gag gag cac gag aag tac cac tcc aac tgg cgc gcc 2208 Asp Lys Ala Gln Glu Glu His Glu Lys Tyr His Ser Asn Trp Arg Ala 725 730 735 atg gcc tcc gac ttc aac ctg ccc ccc gtg gtg gcc aag gag atc gtg 2256 Met Ala Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu Ile Val 740 745 750 gcc tcc tgc gac aag tgc cag ctg aag ggc gag gcc 
atg cac ggc cag 2304 Ala Ser Cys Asp Lys Cys Gln Leu Lys Gly Glu Ala Met His Gly Gln 755 760 765 gtg gac tgc tcc ccc ggc atc tgg cag ctg gac tgc acc cac ctg gag 2352 Val Asp Cys Ser Pro Gly Ile Trp Gln Leu Asp Cys Thr His Leu Glu 770 775 780 ggc aag gtg atc ctg gtg gcc gtg cac gtg gcc tcc ggc tac atc gag 2400 Gly Lys Val Ile Leu Val Ala Val His Val Ala Ser Gly Tyr Ile Glu 785 790 795 800 gcc gag gtg atc ccc gcc gag acc ggc cag gag acc gcc tac ttc ctg 2448 Ala Glu Val Ile Pro Ala Glu Thr Gly Gln Glu Thr Ala Tyr Phe Leu 805 810 815 ctg aag ctg gcc ggc cgc tgg ccc gtg aag acc gtg cac acc gac aac 2496 Leu Lys Leu Ala Gly Arg Trp Pro Val Lys Thr Val His Thr Asp Asn 820 825 830 ggc tcc aac ttc acc tcc acc acc gtg aag gcc gcc tgc tgg tgg gcc 2544 Gly Ser Asn Phe Thr Ser Thr Thr Val Lys Ala Ala Cys Trp Trp Ala 835 840 845 ggc atc aag cag gag ttc ggc atc ccc tac aac ccc cag tcc cag ggc 2592 Gly Ile Lys Gln Glu Phe Gly Ile Pro Tyr Asn Pro Gln Ser Gln Gly 850 855 860 gtg atc gag tcc atg aac aag gag ctg aag aag atc atc ggc caa gtc 2640 Val Ile Glu Ser Met Asn Lys Glu Leu Lys Lys Ile Ile Gly Gln Val 865 870 875 880 cgc gac cag gcc gag cac ctg aag acc gcc gtg cag atg gcc gtg ttc 2688 Arg Asp Gln Ala Glu His Leu Lys Thr Ala Val Gln Met Ala Val Phe 885 890 895 atc cac aac ttc aag cgc aag ggc ggc atc ggc ggc tac tcc gcc ggc 2736 Ile His Asn Phe Lys Arg Lys Gly Gly Ile Gly Gly Tyr Ser Ala Gly 900 905 910 gag cgc atc gtg gac atc atc gcc acc gac atc cag acc aag gag ctg 2784 Glu Arg Ile Val Asp Ile Ile Ala Thr Asp Ile Gln Thr Lys Glu Leu 915 920 925 cag aag cag atc acc aag atc cag aac ttc cgc gtg tac tac cgc gac 2832 Gln Lys Gln Ile Thr Lys Ile Gln Asn Phe Arg Val Tyr Tyr Arg Asp 930 935 940 tcc cgc gac ccc gtg tgg aag ggc ccc gcc aag ctg ctg tgg aag ggc 2880 Ser Arg Asp Pro Val Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly 945 950 955 960 gag ggc gcc gtg gtg atc cag gac aac tcc gac atc aag gtg gtg ccc 2928 Glu Gly Ala Val Val Ile Gln Asp Asn Ser Asp Ile Lys Val Val Pro 965 970 975 cgc 
cgc aag gcc aag atc atc cgc gac tac ggc aag cag atg gcc ggc 2976 Arg Arg Lys Ala Lys Ile Ile Arg Asp Tyr Gly Lys Gln Met Ala Gly 980 985 990 gac gac tgc gtg gcc tcc cgc cag gac gag gac taa 3012 Asp Asp Cys Val Ala Ser Arg Gln Asp Glu Asp 995 1000 <200> SEQUENCE CHARACTERISTICS: <210> SEQ ID NO 11 <211> LENGTH: 1003 <212> TYPE: PRT <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Codon optimized form of HIV pol coding region <400> SEQUENCE: 11 Phe Phe Arg Glu Asp Leu Ala Phe Pro Gln Gly Lys Ala Arg Glu Phe 1 5 10 15 Ser Ser Glu Gln Thr Arg Ala Asn Ser Pro Thr Arg Arg Glu Leu Gln 20 25 30 Val Trp Gly Arg Asp Asn Asn Ser Leu Ser Glu Ala Gly Ala Asp Arg 35 40 45 Gln Gly Thr Val Ser Phe Ser Phe Pro Gln Ile Thr Leu Trp Gln Arg 50 55 60 Pro Leu Val Thr Ile Lys Ile Gly Gly Gln Leu Lys Glu Ala Leu Leu 65 70 75 80 Asp Thr Gly Ala Asp Asp Thr Val Leu Glu Glu Met Asn Leu Pro Gly 85 90 95 Arg Trp Lys Pro Lys Met Ile Gly Gly Ile Gly Gly Phe Ile Lys Val 100 105 110 Arg Gln Tyr Asp Gln Ile Leu Ile Glu Ile Cys Gly His Lys Ala Ile 115 120 125 Gly Thr Val Leu Val Gly Pro Thr Pro Val Asn Ile Ile Gly Arg Asn 130 135 140 Leu Leu Thr Gln Ile Gly Cys Thr Leu Asn Phe Pro Ile Ser Pro Ile

145 150 155 160 Glu Thr Val Pro Val Lys Leu Lys Pro Gly Met Asp Gly Pro Lys Val 165 170 175 Lys Gln Trp Pro Leu Thr Glu Glu Lys Ile Lys Ala Leu Val Glu Ile 180 185 190 Cys Thr Glu Met Glu Lys Glu Gly Lys Ile Ser Lys Ile Gly Pro Glu 195 200 205 Asn Pro Tyr Asn Thr Pro Val Phe Ala Ile Lys Lys Lys Asp Ser Thr 210 215 220 Lys Trp Arg Lys Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gln 225 230 235 240 Asp Phe Trp Glu Val Gln Leu Gly Ile Pro His Pro Ala Gly Leu Lys 245 250 255 Gln Lys Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser 260 265 270 Val Pro Leu Asp Lys Asp Phe Arg Lys Tyr Thr Ala Phe Thr Ile Pro 275 280 285 Ser Ile Asn Asn Glu Thr Pro Gly Ile Arg Tyr Gln Tyr Asn Val Leu 290 295 300 Pro Gln Gly Trp Lys Gly Ser Pro Ala Ile Phe Gln Cys Ser Met Thr 305 310 315 320 Lys Ile Leu Glu Pro Phe Arg Lys Gln Asn Pro Asp Ile Val Ile Tyr 325 330 335 Gln Tyr Met Asp Asp Leu Tyr Val Gly Ser Asp Leu Glu Ile Gly Gln 340 345 350 His Arg Thr Lys Ile Glu Glu Leu Arg Gln His Leu Leu Arg Trp Gly 355 360 365 Phe Thr Thr Pro Asp Lys Lys His Gln Lys Glu Pro Pro Phe Leu Trp 370 375 380 Met Gly Tyr Glu Leu His Pro Asp Lys Trp Thr Val Gln Pro Ile Val 385 390 395 400 Leu Pro Glu Lys Asp Ser Trp Thr Val Asn Asp Ile Gln Lys Leu Val 405 410 415 Gly Lys Leu Asn Trp Ala Ser Gln Ile Tyr Ala Gly Ile Lys Val Arg 420 425 430 Gln Leu Cys Lys Leu Leu Arg Gly Thr Lys Ala Leu Thr Glu Val Val 435 440 445 Pro Leu Thr Glu Glu Ala Glu Leu Glu Leu Ala Glu Asn Arg Glu Ile 450 455 460 Leu Lys Glu Pro Val His Gly Val Tyr Tyr Asp Pro Ser Lys Asp Leu 465 470 475 480 Ile Ala Glu Ile Gln Lys Gln Gly Gln Gly Gln Trp Thr Tyr Gln Ile 485 490 495 Tyr Gln Glu Pro Phe Lys Asn Leu Lys Thr Gly Lys Tyr Ala Arg Met 500 505 510 Lys Gly Ala His Thr Asn Asp Val Lys Gln Leu Thr Glu Ala Val Gln 515 520 525 Lys Ile Ala Thr Glu Ser Ile Val Ile Trp Gly Lys Thr Pro Lys Phe 530 535 540 Lys Leu Pro Ile Gln Lys Glu Thr Trp Glu Ala Trp Trp Thr Glu Tyr 545 550 555 560 Trp Gln Ala Thr Trp Ile Pro Glu Trp Glu Phe Val Asn Thr Pro Pro 
565 570 575 Leu Val Lys Leu Trp Tyr Gln Leu Glu Lys Glu Pro Ile Ile Gly Ala 580 585 590 Glu Thr Phe Tyr Val Asp Gly Ala Ala Asn Arg Glu Thr Lys Leu Gly 595 600 605 Lys Ala Gly Tyr Val Thr Asp Arg Gly Arg Gln Lys Val Val Pro Leu 610 615 620 Thr Asp Thr Thr Asn Gln Lys Thr Glu Leu Gln Ala Ile His Leu Ala 625 630 635 640 Leu Gln Asp Ser Gly Leu Glu Val Asn Ile Val Thr Asp Ser Gln Tyr 645 650 655 Ala Leu Gly Ile Ile Gln Ala Gln Pro Asp Lys Ser Glu Ser Glu Leu 660 665 670 Val Ser Gln Ile Ile Glu Gln Leu Ile Lys Lys Glu Lys Val Tyr Leu 675 680 685 Ala Trp Val Pro Ala His Lys Gly Ile Gly Gly Asn Glu Gln Val Asp 690 695 700 Lys Leu Val Ser Ala Gly Ile Arg Lys Val Leu Phe Leu Asp Gly Ile 705 710 715 720 Asp Lys Ala Gln Glu Glu His Glu Lys Tyr His Ser Asn Trp Arg Ala 725 730 735 Met Ala Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu Ile Val 740 745 750 Ala Ser Cys Asp Lys Cys Gln Leu Lys Gly Glu Ala Met His Gly Gln 755 760 765 Val Asp Cys Ser Pro Gly Ile Trp Gln Leu Asp Cys Thr His Leu Glu 770 775 780 Gly Lys Val Ile Leu Val Ala Val His Val Ala Ser Gly Tyr Ile Glu 785 790 795 800 Ala Glu Val Ile Pro Ala Glu Thr Gly Gln Glu Thr Ala Tyr Phe Leu 805 810 815 Leu Lys Leu Ala Gly Arg Trp Pro Val Lys Thr Val His Thr Asp Asn 820 825 830 Gly Ser Asn Phe Thr Ser Thr Thr Val Lys Ala Ala Cys Trp Trp Ala 835 840 845 Gly Ile Lys Gln Glu Phe Gly Ile Pro Tyr Asn Pro Gln Ser Gln Gly 850 855 860 Val Ile Glu Ser Met Asn Lys Glu Leu Lys Lys Ile Ile Gly Gln Val 865 870 875 880 Arg Asp Gln Ala Glu His Leu Lys Thr Ala Val Gln Met Ala Val Phe 885 890 895 Ile His Asn Phe Lys Arg Lys Gly Gly Ile Gly Gly Tyr Ser Ala Gly 900 905 910 Glu Arg Ile Val Asp Ile Ile Ala Thr Asp Ile Gln Thr Lys Glu Leu 915 920 925 Gln Lys Gln Ile Thr Lys Ile Gln Asn Phe Arg Val Tyr Tyr Arg Asp 930 935 940 Ser Arg Asp Pro Val Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly 945 950 955 960 Glu Gly Ala Val Val Ile Gln Asp Asn Ser Asp Ile Lys Val Val Pro 965 970 975 Arg Arg Lys Ala Lys Ile Ile Arg Asp Tyr Gly Lys Gln Met Ala Gly 980 
985 990 Asp Asp Cys Val Ala Ser Arg Gln Asp Glu Asp 995 1000 <200> SEQUENCE CHARACTERISTICS: <210> SEQ ID NO 12 <211> LENGTH: 8908 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION: Packaging construct pHDmHgpm2 comprising a codon optimized form of HIV gag pol region <400> SEQUENCE: 12 agcttggccc attgcatacg ttgtatccat atcataatat gtacatttat attggctcat 60 gtccaacatt accgccatgt tgacattgat tattgactag ttattaatag taatcaatta 120 cggggtcatt agttcatagc ccatatatgg agttccgcgt tacataactt acggtaaatg 180 gcccgcctgg ctgaccgccc aacgaccccc gcccattgac gtcaataatg acgtatgttc 240 ccatagtaac gccaataggg actttccatt gacgtcaatg ggtggagtat ttacggtaaa 300 ctgcccactt ggcagtacat caagtgtatc atatgccaag tacgccccct attgacgtca 360 atgacggtaa atggcccgcc tggcattatg cccagtacat gaccttatgg gactttccta 420 cttggcagta catctacgta ttagtcatcg ctattaccat ggtgatgcgg ttttggcagt 480 acatcaatgg gcgtggatag cggtttgact cacggggatt tccaagtctc caccccattg 540 acgtcaatgg gagtttgttt tggcaccaaa atcaacggga ctttccaaaa tgtcgtaaca 600 actccgcccc attgacgcaa atgggcggta ggcgtgtacg gtgggaggtc tatataagca 660 gagctcgttt agtgaaccgt cagatcgcct ggagacgcca tccacgctgt tttgacctcc 720 atagaagaca ccgggaccga tccagcctcc cctcgaagct gatcctgaga acttcagggt 780 gagtctatgg gacccttgat gttttctttc cccttctttt ctatggttaa gttcatgtca 840 taggaagggg agaagtaaca gggtacacat attgaccaaa tcagggtaat tttgcatttg 900 taattttaaa aaatgctttc ttcttttaat atactttttt gtttatctta tttctaatac 960 tttccctaat ctctttcttt cagggcaata atgatacaat gtatcatgcc tctttgcacc 1020 attctaaaga ataacagtga taatttctgg gttaaggcaa tagcaatatt tctgcatata 1080 aatatttctg catataaatt gtaactgatg taagaggttt catattgcta atagcagcta 1140 caatccagct accattctgc ttttatttta tggttgggat aaggctggat tattctgagt 1200 ccaagctagg cccttttgct aatcatgttc atacctctta tcttcctccc acagctcctg 1260 ggcaacgtgc tggtctgtgt gctggcccat cactttggca aagaattcta gactgccatg 1320 ggcgcccgcg cctccgtgct gtccggcggc gagctggaca agtgggagaa gatccgcctg 1380 cgccccggcg gcaagaagca gtacaagctg aagcacatcg tgtgggcctc 
ccgcgagctg 1440 gagcgcttcg ccgtgaaccc cggcctgctg gagacctccg agggctgccg ccagatcctg 1500 ggccagctgc agccctccct gcaaaccggc tccgaggagc tgcgctccct gtacaacacc 1560 atcgccgtgc tgtactgcgt gcaccagcgc atcgacgtga aggacaccaa ggaggccctg 1620 gacaagatcg aggaggagca gaacaagtcc aagaagaagg cccagcaggc cgccgccgac 1680 accggcaaca actcccaggt gtcccagaac taccccatcg tgcagaacct gcagggccag 1740 atggtgcacc aggccatctc cccccgcacc ctgaacgcct gggtgaaggt ggtggaggag 1800 aaggccttct cccccgaagt catccccatg ttctccgccc tgtccgaggg cgccaccccc 1860 caggacctga acaccatgct gaacaccgtg ggcggccacc aggccgccat gcagatgctg 1920 aaggagacca tcaacgagga ggccgccgag tgggaccgcc tgcaccccgt gcacgccggc 1980 cccatcgccc ccggccagat gcgcgagccc cgcggctccg acatcgccgg caccacctcc 2040 accctgcaag agcagatcgg ctggatgacc cacaaccccc ccatccccgt gggcgagatc 2100 tacaagcgct ggatcatcct gggcctgaac aagatcgtgc gcatgtactc ccccacctcc 2160 atcctggaca tccgccaggg ccccaaggag cccttccgcg actacgtgga ccgcttctac 2220 aagaccctgc gcgccgagca ggcctcccag gaggtaaaga actggatgac cgagaccctg 2280 ctggtgcaga acgccaaccc cgactgcaag accatcctga aggccctggg ccccggcgcc 2340 accctggagg agatgatgac cgcctgccag ggcgtgggcg gccccggcca caaggcccgc 2400 gtgctggccg aggccatgtc ccaagtcacc aaccccgcca ccatcatgat ccagaagggc 2460 aacttccgca accagcgcaa gaccgtgaag tgcttcaact gcggcaagga gggccacatc 2520 gccaagaact gccgcgcccc ccgcaagaag ggctgctgga agtgcggcaa ggagggccac 2580 cagatgaaag attgtactga gagacaggct aattttttag ggaagatctg gccttcccac 2640 aagggaaggc cagggaattt tcttcagagc agaccagagc caacagcccc accagaagag 2700 agcttcaggt ttggggaaga gacaacaact ccctctcaga agcaggagcc gatagacaag 2760 gaactgtatc ctttagcttc cctcagatca ctctttggca gcgacccctc gtcacaataa 2820 agatcggtgg ccagctgaag gaggccctgc tggacaccgg cgccgacgac accgtgctgg 2880 aggagatgaa cctgcccggc cgctggaagc ccaagatgat cggcggcatc ggcggcttca 2940 tcaaagtccg ccagtacgac cagatcctga tcgagatctg cggccacaag gccatcggca 3000 ccgtgctggt gggccccacc cccgtgaaca tcatcggccg caacctgctg acccagatcg 3060 gctgcaccct gaacttcccc atctccccca tcgagaccgt gcccgtgaag ctgaagcccg 
3120 gcatggacgg ccccaaagtc aagcagtggc ccctgaccga ggagaagatc aaggccctgg 3180 tggagatctg caccgagatg gagaaggagg gcaagatctc caagatcggc cccgagaacc 3240 cctacaacac ccccgtgttc gccatcaaga agaaggactc caccaagtgg cgcaagctgg 3300 tggacttccg cgagctgaac aagcgcaccc aggacttctg ggaggtgcag ctgggcatcc 3360 cccaccccgc cggcctgaag cagaagaagt ccgtgaccgt gctggacgtg ggcgacgcct 3420 acttctccgt gcccctggac aaggacttcc gcaagtacac cgccttcacc atcccctcca 3480 tcaacaacga gacccccggc atccgctacc agtacaacgt gctgccccag ggctggaagg 3540 gctcccccgc catcttccag tgctccatga ccaagatcct ggagcccttc cgcaagcaga 3600 accccgacat cgtgatctac cagtacatgg acgacctgta cgtgggctcc gacctggaga 3660 tcggccagca ccgcaccaag atcgaggagc tgcgccagca cctgctgcgc tggggcttca 3720 ccacccccga caagaagcac cagaaggagc cccccttcct gtggatgggc tacgagctgc 3780 accccgacaa gtggaccgtg cagcccatcg tgctgcccga gaaggactcc tggaccgtga 3840 acgacatcca gaagctggtg ggcaagctga actgggcctc ccagatctac gccggcatca 3900 aagtccgcca gctgtgcaag ctgctgcgcg gcaccaaggc cctgaccgag gtggtgcccc 3960 tgaccgagga ggccgagctg gagctggccg agaaccgcga gatcctgaag gagcccgtgc 4020 acggcgtgta ctacgacccc tccaaggacc tgatcgccga gatccagaag cagggccagg 4080 gccagtggac ctaccagatc taccaggagc ccttcaagaa cctgaagacc ggcaaatacg 4140 cccgcatgaa gggcgcccac accaacgacg tgaagcagct gaccgaggcc gtgcagaaga 4200 tcgccaccga gtccatcgtg atctggggca agactcccaa gttcaagctg cccatccaga 4260 aggagacctg ggaggcctgg tggaccgagt actggcaggc cacctggatc cccgagtggg 4320 agttcgtgaa cacccccccc ctggtgaagc tgtggtacca gctggagaag gagcccatca 4380 tcggcgccga gaccttctac gtggacggcg ccgccaaccg cgagaccaag ctgggcaagg 4440 ccggctacgt gaccgaccgc ggccgccaga aggtggtgcc cctgaccgac accaccaacc 4500 agaagaccga gctgcaggcc atccacctgg ccctgcaaga ctccggcctg gaggtgaaca 4560 tcgtgaccga ctcccagtat gcattgggca tcatccaggc ccagcccgac aagtccgagt 4620 ccgagctggt gtcccagatc atcgagcagc tgatcaagaa ggagaaggtg tacctggcct 4680 gggtgcccgc ccacaagggc atcggcggca acgagcaggt ggacaagctg gtgtccgccg 4740 gcatccgcaa ggtgctgttc ctggacggca tcgacaaggc ccaggaggag cacgagaagt 4800 
accactccaa ctggcgcgcc atggcctccg acttcaacct gccccccgtg gtggccaagg 4860 agatcgtggc ctcctgcgac aagtgccagc tgaagggcga ggccatgcac ggccaggtgg 4920 actgctcccc cggcatctgg cagctggact gcacccacct ggagggcaag gtgatcctgg 4980 tggccgtgca cgtggcctcc ggctacatcg aggccgaggt gatccccgcc gagaccggcc 5040 aggagaccgc ctacttcctg ctgaagctgg ccggccgctg gcccgtgaag accgtgcaca 5100 ccgacaacgg ctccaacttc acctccacca ccgtgaaggc cgcctgctgg tgggccggca 5160 tcaagcagga gttcggcatc ccctacaacc cccagtccca gggcgtgatc gagtccatga 5220 acaaggagct gaagaagatc atcggccaag tccgcgacca ggccgagcac ctgaagaccg 5280 ccgtgcagat ggccgtgttc atccacaact tcaagcgcaa gggcggcatc ggcggctact 5340 ccgccggcga gcgcatcgtg gacatcatcg ccaccgacat ccagaccaag gagctgcaga 5400 agcagatcac caagatccag aacttccgcg tgtactaccg cgactcccgc gaccccgtgt 5460 ggaagggccc cgccaagctg ctgtggaagg gcgagggcgc cgtggtgatc caggacaact 5520 ccgacatcaa ggtggtgccc cgccgcaagg ccaagatcat ccgcgactac ggcaagcaga 5580 tggccggcga cgactgcgtg gcctcccgcc aggacgagga ctaacacatg gaaaagatta 5640 gtaaaacacc ataggccgct ctagaggatc caagcttatc gataccgtcg acctcgaggg 5700 cccagatcta attcacccca ccagtgcagg ctgcctatca gaaagtggtg gctggtgtgg 5760 ctaatgccct ggcccacaag tatcactaag ctcgctttct tgctgtccaa tttctattaa 5820 aggttccttt gttccctaag tccaactact aaactggggg atattatgaa gggccttgag 5880 catctggatt ctgcctaata aaaaacattt attttcattg caatgatgta tttaaattat 5940 ttctgaatat tttactaaaa agggaatgtg ggaggtcagt gcatttaaaa cataaagaaa 6000 tgaagagcta gttcaaacct tgggaaaata cactatatct taaactccat gaaagaaggt 6060 gaggctgcaa acagctaatg cacattggca acagcccctg atgcctatgc cttattcatc 6120 cctcagaaaa ggattcaagt agaggcttga tttggaggtt aaagttttgc tatgctgtat 6180 tttacattac ttattgtttt agctgtcctc atgaatgtct tttcactacc catttgctta 6240 tcctgcatct ctcagccttg actccactca gttctcttgc ttagagatac cacctttccc 6300 ctgaagtgtt ccttccatgt tttacggcga gatggtttct cctcgcctgg ccactcagcc 6360 ttagttgtct ctgttgtctt atagaggtct acttgaagaa ggaaaaacag ggggcatggt 6420 ttgactgtcc tgtgagccct tcttccctgc ctcccccact cacagtgacc cggaatccct 6480 cgacatggca 
gtctagatca ttcttgaaga cgaaagggcc tcgtgatacg cctattttta 6540 taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat 6600 gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg 6660 agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa 6720 catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac 6780 ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac 6840 atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt 6900 ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tattgacgcc 6960 gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca 7020 ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc 7080 ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag 7140 gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa 7200 ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgtagcaatg 7260 gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa 7320 ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg 7380 gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt 7440 gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt 7500 caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag 7560 cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat 7620 ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct 7680 taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct 7740 tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca 7800 gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc 7860 agcagagcgc agataccaaa tactgttctt ctagtgtagc cgtagttagg ccaccacttc 7920 aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct 7980 gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag 8040 gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc 8100

tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg 8160 agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag 8220 cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt 8280 gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac 8340 ggatgcgccg cgtgcggctg ctggagatgg cggacgcgat ggatatgttc tgccaagggt 8400 tggtttgcgc attcacagtt ctccgcaaga attgattggc tccaattctt ggagtggtga 8460 atccgttagc gaggtgccgc cggcttccat tcaggtcgag gtggcccggc tccatgcacc 8520 gcgacgcaac gcggggaggc agacaaggta tagggcggcg cctacaatcc atgccaaccc 8580 gttccatgtg ctcgccgagg cggcataaat ccccgtgacg atcagcggtc caatgatcga 8640 agttaggctg gtaagagccg cgagcgatcc ttgaagctgt ccctgatggt cgtcatctac 8700 ctgcctggac agcatggcct gcaacgcggg catcccgatg ccgccggaag cgagaagaat 8760 cataatgggg aaggccatcc agcctcgcgt cggggagctt tttgcaaaag cctaggcctc 8820 caaaaaagcc tcctcactac ttctggaata gctcagaggc cgaggcggcc tcggcctctg 8880 cataaataaa aaaaattagt cagccatg 8908
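
The listing above is flattened: blocks of ten bases are interleaved with running position counters (60, 120, ...), and line breaks fall at arbitrary points. As an illustrative aid only, and not part of the patent text, the following minimal Python sketch shows one way a reader might rebuild a contiguous sequence from such a listing and compare its length with the declared <211> LENGTH value. The function names are hypothetical, and the sketch assumes lowercase nucleotide blocks separated by whitespace.

```python
import re


def parse_flat_nucleotide_listing(text: str) -> str:
    """Rebuild a contiguous sequence from a flattened nucleotide listing.

    Assumption: bases appear as lowercase whitespace-separated blocks and
    the running position counters are purely numeric tokens.
    """
    tokens = text.split()
    # Keep only tokens made entirely of nucleotide letters; numeric
    # position counters such as "60" or "120" are dropped.
    bases = [t for t in tokens if re.fullmatch(r"[acgtn]+", t)]
    return "".join(bases)


def check_declared_length(text: str, declared: int) -> bool:
    """Return True if the rebuilt sequence has the declared length."""
    return len(parse_flat_nucleotide_listing(text)) == declared


# First two rows of SEQ ID NO 12 as they appear in the listing.
excerpt = (
    "agcttggccc attgcatacg ttgtatccat atcataatat gtacatttat attggctcat 60 "
    "gtccaacatt accgccatgt tgacattgat tattgactag ttattaatag taatcaatta 120"
)
sequence = parse_flat_nucleotide_listing(excerpt)
```

Applied to the full text of SEQ ID NO 12, the same check could be run against the declared length of 8908; because tokens are split on any whitespace, counters displaced across line breaks do not affect the result.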

* * * * *
