[No authors listed]
We have developed an in silico method for selecting human full-length cDNAs encoding secretion or membrane proteins from oligo-capped cDNA libraries. Fullness rates were increased to about 80% by combining the oligo-capping method with ATGpr, software that predicts the translation start point and coding potential. Then, using 5'-end single-pass sequences, cDNAs containing a signal sequence were selected with PSORT ('signal sequence trap'). For cDNAs that could not be selected by PSORT, we also applied a 'secretion or membrane protein-related keyword trap' based on the results of a BLAST search against the SWISS-PROT database. Using these procedures, 789 cDNAs were initially selected and subjected to full-length sequencing, and 334 of these were finally selected as novel. Most of these cDNAs (295 cDNAs; 88.3%) were predicted to encode secretion or membrane proteins. In particular, 165 (80.5%) of the 205 cDNAs selected by PSORT were predicted to have signal sequences, while 70 (54.2%) of the 129 cDNAs selected by the 'keyword trap' retained the secretion or membrane protein-related keywords. Many important cDNAs were obtained, including those encoding transporters, receptors, and ligands involved in significant cellular functions. Thus, an efficient method for selecting secretion or membrane protein-encoding cDNAs was developed by combining the above four procedures.
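The two-stage selection described in the abstract (signal sequence trap first, keyword trap as a fallback) can be illustrated with a minimal sketch. This is not the authors' implementation: the `Clone` record, its fields, the `select` function, and the keyword list are all hypothetical stand-ins. In particular, `has_signal_sequence` stands in for a PSORT prediction and `top_hit_description` for the annotation of a SWISS-PROT BLAST hit. The abstract does not give the actual keywords used.

```python
from dataclasses import dataclass

# Hypothetical keyword list; the abstract does not state the real one.
KEYWORDS = ("membrane", "secreted", "receptor", "transporter")


@dataclass
class Clone:
    clone_id: str
    has_signal_sequence: bool      # assumed stand-in for a PSORT prediction
    top_hit_description: str = ""  # assumed stand-in for a BLAST hit annotation


def select(clones):
    """Return {clone_id: trap_name} for clones that pass either trap."""
    selected = {}
    for clone in clones:
        if clone.has_signal_sequence:
            # Stage 1: signal sequence trap
            selected[clone.clone_id] = "signal sequence trap"
        elif any(k in clone.top_hit_description.lower() for k in KEYWORDS):
            # Stage 2: keyword trap, applied only to clones missed by stage 1
            selected[clone.clone_id] = "keyword trap"
    return selected


clones = [
    Clone("c1", True),
    Clone("c2", False, "Putative sodium transporter"),
    Clone("c3", False, "Ribosomal protein"),
]
result = select(clones)
print(result)
```

In this toy input, `c1` passes the first stage, `c2` is recovered only by the keyword fallback, and `c3` is dropped. This mirrors the order of the two traps in the abstract.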
NAALAD2, LOC100132356, TMED7-TICAM2, TSPAN3, ATP6AP2, CHST4, CNIH1, LOC101928215, PSME3, ZMPSTE24, RTN3, LAMC3, B3GNT3, TUBB4A, CPQ, SPON2, SPON1, SEMA4F, FBLN5, ATG7, ARL6IP5, SLC34A2, POMT1, PDPN, DLL3, LOC107133515, PLAC1, GJB6, HPSE, RUVBL2, LYVE1, BLCAP, ADCY3, B4GAT1, SPACA9, CACFD1, PRSS23, EMILIN1, SLC2A6, GALNT6, SEC63, PRRT2, MAN1B1, NRM, RDH13, GLMP, CHRM3, SCAMP4, CHST14, VSIG4, SLC52A3, FKBP9, TEX261, SMIM12, PIK3IP1, C1QTNF1, C1QTNF3, TMEM123, LRRC42, FAM210B, LYPD1, MRGPRF, UBE2J2, CLPTM1, CANT1, SLC43A2, GGT6, PUSL1, DRAM2, PIGU, MBOAT2, TMEM178A, TRAM1L1, TMEM139, HGSNAT, CRH, CS, FGFBP3, LDLRAD3, TMEM86A, ISM2, C16orf89, SLC25A10, LRRC37BP1, C19orf25, CTSB, CTSL, XXYLT1, GLIPR2, SLC38A9, RNF145, ATP6V0E2, SLC16A11, DEDD2, APCDD1L, UGT3A2, DLX3, DNASE2, TOR1A, APLNR, ARID2, ACSF3, ELN, KCTD6, HACD2, STT3B, C4orf46, TAPT1, HTRA4, FBLN1, GJD4, CYB561A3, ADGRF5, PI16, LEMD2, FCGRT, VWDE, DAGLB, TMED4, CHSY1, SCAP, TBC1D9B, EMC1, ERP44, SYT11, TBC1D1, NUP210, MAN2B2, ANGPTL2, LEPROTL1, CCNDBP1, BACE1, RUSC1, HSPBP1, PLXNB2, PLA2G15, TMEFF2, CADM1, FLRT3, PANX1, CNPY4, RNASEH1, ALPP, FUCA2, TMEM9, MFSD8, TINCR, TMEM184B, SUMF2, TMEM87A, EGFL6, TSKU, GGA1, ABHD12, SERBP1, SEZ6L2, SLC17A5, PCOLCE2, ATP2C1, TSPAN13, LYPD3, KCNH5, SLC39A1, SRPX2, TOR2A, GPR1, AQP11, SLC26A11, GPR19, TMEM145, SUMF1, RELL2, GPR34, OSTM1, SLC43A3, SCG3, PDIA3, UBIAD1, SEC61A1, SERTAD1, EFEMP2, HSBP1, UBAC2, TMEM119, SLC35B2, IGF2, CLEC18A, IGFBP3, IGHA1, DRAXIN, FIBIN, NOTCH2NL, FADS1, FAM174B, LRP3, ERVFRD-1, MATN1, MATN2, MGAT1, MGAT2, MMP11, CD200, BAIAP2-AS1, RERE, NT5E, OGN, P4HB, PAX6, F11R, PCDH1, TMED7, TXNDC12, TMX2, SIDT2, SCCPDH, APH1A, METTL9, RDH11, ANGPTL4, SDF4, HSD17B11, EMC4, CLEC1A, GOLM1, ARMCX1, TLR8, SLC25A37, THEM6, MS4A4A, HACD3, LSR, PIGT, ERGIC3, TMBIM4, SARAF, GPRC5B, CYB5R1, SELT, ATP5G2, DNAJB11, GALNT7, CECR1, SLC25A3, NUDT9, CLIC5, S1PR5, ATP6AP1, MIS18A, NLGN3, DNAJC10, CPVL, LEPROT, SYTL2, PIGG, RETSAT, SLC35F6, CLN6, SPTLC3, LAPTM4B, PPT1, 
CAND1, DBNDD2, GPRC5C, ALG1, PCDHA12, TMX4, LRRC8A, RPRM, CLDND1, TMEM9B, PSG1, SLAMF8, TM9SF3, OLFML3, MRPS22, C5orf15, RGMA, POGLUT1, CHPT1, CCDC47, COQ9, TWSG1, TMEM159, SLC44A2, ADGRG6, PTGDS, PTGFRN, HEG1, SEMA6A, GRAMD1A, WFDC1, MS4A7, SRPRB, SLC25A19, ELOVL5, MRPS35, FKBP10, SDC2, SDHD, SEL1L, PERP, TMBIM1, TINAGL1, P3H1, SFRP2, SIL1, MFSD14A, FNDC3B, SLC2A3, CYP4F12, CHID1, SLC2A11, SPP1, SSR1, LEFTY2, TIMP1, TSPAN6, TPBG, WNT6, WNT11, RELL1, IFRD2, PCYOX1L, ALG8, PRRG3, RNF26, FAM134A, TMEM43, KREMEN2, ATP13A3, TTC13, COLGALT1, TFPI2, SLC35E1, WLS, UXS1, CUBN, CD276, ITIH5, NDFIP1, SPX, CALU, HM13, SLC38A1, TMX1, GDPD5, VOPP1, LMAN2L, PLA2G12A, ITM2C, TSPAN14, SFXN3, ABHD17A, ADPGK, GSG1, JAM3, INHBE, ARMC10, RAB34, TM2D2, MAGT1, MED10, C2orf88, FAM213A, GHDC, JAGN1, TPST2, MFSD9, PLPP7, SFT2D3, ABHD14B, CCDC142, TMEM25, MFSD2A, ZDHHC12, SPPL2A, SLC7A3, FAM73B, ATAD1, TBRG1, SLC35B4, ABHD13, UBASH3B, IGSF21, MFSD5, ALG2, FCN3, SCIN, STC2, B3GALT4, ADAM15, MATN4, CREG1, WISP2, PROM1, SLC5A6, ENDOU, CACNA1H, PLOD3, STBD1, GLB1L2, MIDN, TSPAN18, HS6ST2, MPZL1, SYS1, TP53I13, ANGPTL1, CLDN6, CLDN2, ESAM, MYADM, RRP7BP, ADGRG1, PIGM, PGAP3, STOML1, PIGS, ORMDL1, ORMDL3, FADS2, TMEM59, CXCL14, ENTPD6, SLC25A44, RNF40, TOMM70, SV2A, KLHL21, RBM8A