TABLE 2

Annotation of predicted proteins in 41C

2R WGS3 centromere extension (11 genes)
    CG40278 (=CG18001), 70 aa, ribosomal protein L38.
    Closest human hit: NP_000990.1, ribosomal protein L38 (P(n) = 5e-14).
    CG40293, 333 aa, Ste-20-like protein kinase.
    Has close Drosophila relatives including Frayed (P(n) = 8e-21), CG5169.
    Closest human hit: AAG48269.1, breast cancer antigen NY-BR-96 (P(n) = 4e-41).
    p120ctn (CG17484), 781 aa, p120 family, adherens junction component.
    Closest human hit: AAB97957, Arm-repeat protein NPRAP (P(n) = e-128).
    CG17486, 564 aa, possible asparagine synthetase.
    Closest human hit: NP_061921.1, hypothetical protein (P(n) = 8e-73).
    Closest hit with known function: NP_578800.1, asparagine synthetase [Pyrococcus furiosus] (P(n) = 1e-12).
    CG17883, 312 aa, TBC domain protein.
    Closest human hit: NP_653229.1, chromosome 20 open reading frame 140 (P(n) = 1e-62).
    Nipped B (CG17704), 2053 aa, chromosomal adherin family member.
    Closest human hit: NP_597677, IDN3 protein (P(n) = 0.0).
    CG40282, 128 aa, and CG40287 (=CG17706), 77 aa. Both closely related to Drosophila NonA (e.g., P(n) = 1e-40), but transcribed in opposite directions.
    Rearranged and diverged; pseudogene?
    CG17082, 629 aa, Rho GAP.
    Closest human hit: NP_277050.1, MacGAP protein (P(n) = 4e-22).
    CG12547, 717 aa, N-terminal thioredoxin domain, C-terminal NHL repeat.
    Closest human hit: XP_089702.1, hypothetical protein (P(n) = e-103).
    CG17528, 560 aa, doublecortin kinase-like.
    Closest human hit: NP_004725.1, doublecortin and CaM kinase-like 1 (P(n) = 2e-66).
    CG40285 (=CG14464), 127 aa. Has human relative of unknown function.
    Closest human hit: NP_689529.1, hypothetical protein (P(n) = 9e-14).
AE003788 (one gene)
    TpnC41C (CG2981), 154 aa, troponin C.
    Closest relatives are other Drosophila proteins, e.g., CG7930 (P(n) = 7e-40).
    Closest human hit: AAH08437, calmodulin 2 (P(n) = 8e-25).
AE003787 (11 genes)
    CG3107, 1112 aa, metalloprotease.
    Closest human hit: NP_055704.1, metalloprotease 1 (pitrilysin family) (P(n) = 0.0).
    CG2944, 349 aa, SSB1 homolog.
    Closest human hit: XP_045247.2, SPRY domain-containing SOCS box protein SSB-1 (P(n) = e-115).
    CG3136, 739 aa, bZIP family transcription factor.
    Closest human hit: NP_004372.2, cAMP-responsive element-binding protein-like 1 (P(n) = 3e-13).
    CG2905, 3435 aa, Tra1/TRRAP, part of SAGA acetyltransferase/transcriptional adaptor complex.
    Closest human hit: NP_003487, transformation/transcription domain-associated protein (P(n) = 0.0).
    d4 (CG2682), 495 aa, homolog of requiem, PHD finger.
    Closest human hit: NP_006259.1, requiem; apoptosis response zinc-finger protein (P(n) = 2e-48).
    Ogt (CG10392), 1059 aa, O-glycosyltransferase.
    Closest human hit: NP_858059, O-linked GlcNAc transferase isoform 2 (P(n) = 0.0).
    CG10465, 301 aa, BTB domain protein.
    Closest human hit: Q13829, TNF-α-induced protein, B12 BTB domain homolog (P(n) = 5e-90).
    CG10395, 281 aa, PAP-1-binding protein.
    Closest human hit: NP_112578.1, PAP-1-binding protein (P(n) = 3e-14).
    CG30441, 126 aa, intraflagellar transport protein 20.
    Closest human hit: AAH02640, intraflagellar transport protein IFT20 (P(n) = 1e-9).
    CG10396, 162 aa, cytochrome C oxidase polypeptide IV.
    Closest human hit: P13073, cytochrome C oxidase polypeptide IV (P(n) = 5e-23).
    CG10417, 662 aa, protein phosphatase 2C gamma.
    Closest human hit: O15355, protein phosphatase 2C gamma (P(n) = 8e-87).
AE003786 (8 genes)
    CG30437, 733 aa, laccase.
    Several others in fly genome [e.g., CG7871 (P(n) = 1e-57), CG5959].
    Hit in another insect: CAD20461, laccase, venom protein, parasitic wasp (P(n) = e-117).
    Matches in fungi, plants, nematodes, but not in mammals.
    CG32838, 733 aa, another laccase, 70% identical to CG30437.
    CG30440, 1057 aa, Ost/trio-like rho GEF.
    Closest human hit: AAA52172.1, DBL-transforming protein (P(n) = 6e-74).
    CG30438, 413 aa, putative UDP-glycosyltransferase.
    >5 relatives in flies [e.g., CG6658, CG6644, UGT35A (P(n) = 1e-59)].
    Closest human hit: AAA83406, UDP-glucuronosyltransferase (P(n) = 2e-56).
    CG12408, 140 aa, troponin C relative.
    Best matches are in Drosophila, including TpnC41C (P(n) = 1e-41).
    CG17510, 98 aa, related to Fis1.
    Closest human hit: AF151893, human Fis1 (role in mitochondrial fission) (P(n) = 3e-6).
    CG17508, 321 aa, has human homolog of unknown function.
    Closest relative: Drosophila CG15403 (P(n) = 5e-39).
    Closest human hit: NP_543011.1, chromosome 20, open reading frame 108 (P(n) = 6e-38).
    CG11665, 442 aa, monocarboxylate transporter.
    Closest relatives are several similar proteins in Drosophila, e.g., CG8271 (P(n) = 3e-12).
    Closest human hit: O60669, monocarboxylate transporter 2.
AE003785 (7 genes up to and including Vulcan)
    CG1344, 641 aa, Arm/HEAT repeat, N-terminal kinase-like domain.
    Closest human hit: NP_065156.1, hypothetical protein LOC57147 (P(n) = 2e-51).
    CG8426, 948 aa, Cdc39/NOT3 transcriptional repressor homolog.
    Closest human hit: NP_055331.1, CCR4-NOT transcription complex, subunit 3 (P(n) = 2e-74).
    CG8245, 343 aa, has human homolog of unknown function.
    Closest human hit: NP_078863.1, hypothetical protein (P(n) = 5e-35).
    CG1298, 264 aa.
    Related to Sinuous = CG10624 (P(n) = 1e-22).
    No close human hits.
    CG11066, 655 aa, serine protease, prophenoloxidase activating factor?
    Closest relative is Drosophila CG40160 (P(n) = 7e-23).
    Insect relative with known function: CAC12665.1, prophenoloxidase activating factor (P(n) = 6e-20).
    Closest human hit (likely not an ortholog): NP_000883.1, plasma kallikrein B1 precursor (P(n) = 9e-16).
    CG17337, 462 aa, glutamate carboxypeptidase.
    Closest human hit: CAC69883.1, glutamate carboxypeptidase (P(n) = e-172).
    Vulcan (CG8390), 605 aa.
    Closest human hit: T03306, PSD-95/SAP90-associated protein-2 (P(n) = 9e-14).
aa, amino acid.