|
Doron Lipson's Abstracts |
|
D.T. Ting*, D. Lipson*, S. Paul, B.W. Brannigan, S. Akhavanfard, E.J. Coffman, G. Contino, V. Deshpande, A.J. Iafrate, S. Letovsky,
M.N. Rivera, N. Bardeesy, S. Maheswaran, D.A. Haber
Aberrant Overexpression of Satellite Repeats in Pancreatic and Other Epithelial Cancers |
Satellite repeats in heterochromatin are transcribed into noncoding RNAs that have been linked to gene silencing and maintenance of chromosomal integrity. Using digital gene expression analysis, we show that these transcripts are greatly overexpressed in mouse and human epithelial cancers. In 8 of 10 mouse pancreatic ductal adenocarcinomas (PDAC), pericentromeric satellites accounted for a mean 12% (range 1 to 50%) of all cellular transcripts, a mean 40-fold increase over normal tissue. In 15/15 human PDACs, alpha satellite transcripts were most abundant and HSATII transcripts were highly specific for cancer. Similar patterns were observed in cancers of lung, kidney, ovary, colon, and prostate. Derepression of satellite transcripts correlated with overexpression of the LINE-1 retrotransposon and with aberrant expression of neuroendocrine-associated genes proximal to LINE-1 insertions. The overexpression of satellite transcripts in cancer may reflect global alterations in heterochromatin silencing and could potentially be useful as a biomarker for cancer detection. Science, published online 13 January 2011. [pdf] |
|
P. Kapranov, F. Ozsolak , S.W. Kim, S. Foissac, D. Lipson, C. Hart, S. Roels, C. Borel, S.E. Antonarakis,
A.P. Monaghan, B. John, P.M. Milos
New class of gene-termini-associated human RNAs suggests a novel RNA copying mechanism |
Small (< 200 nucleotide) RNA (sRNA) profiling of human cells using various technologies demonstrates unexpected complexity of sRNAs with hundreds of thousands of sRNA species present. Genetic and in vitro studies show that these RNAs are not merely degradation products of longer transcripts but could indeed have a function. Furthermore, profiling of RNAs, including the sRNAs, can reveal not only novel transcripts, but also make clear predictions about the existence and properties of novel biochemical pathways operating in a cell. For example, sRNA profiling in human cells indicated the existence of an unknown capping mechanism operating on cleaved RNA2, a biochemical component of which was later identified. Here we show that human cells contain a novel type of sRNA that has non-genomically encoded 5' poly(U) tails. The presence of these RNAs at the termini of genes, specifically at the very 3' ends of known mRNAs, strongly argues for the presence of a yet uncharacterized endogenous biochemical pathway in cells that can copy RNA. We show that this pathway can operate on multiple genes, with specific enrichment towards transcript-encoding components of the translational machinery. Finally, we show that genes are also flanked by sense, 3' polyadenylated sRNAs that are likely to be capped. Nature, 466(7306):642-6, 2010. [pdf] |
|
Eldar Giladi, John Healy, Gene Myers, Chris Hart, Phillip Kapranov, Doron Lipson, Steven Roels, Edward Thayer, Stan Letovsky
Error tolerant indexing and alignment of short reads with covering template families |
|
The rapid adoption of high-throughput next generation sequence data in biological research is presenting a major challenge for sequence alignment tools - specifically, the efficient alignment of vast amounts of short reads to large references in the presence of differences arising from sequencing errors and biological sequence variations. To address this challenge, we developed a short read aligner for high-throughput sequencer data that is tolerant of errors or mutations of all types - namely, substitutions, deletions, and insertions. The aligner utilizes a multi-stage approach in which template-based indexing is used to identify candidate regions for alignment with dynamic programming. A template is a pair of gapped seeds, with one used with the read and one used with the reference. In this article, we focus on the development of template families that yield error-tolerant indexing up to a given error-budget. A general algorithm for finding those families is presented, and a recursive construction that creates families with higher error tolerance from ones with a lower error tolerance is developed. J. Computational Biology, 17(10):1279-1293, 2010. [pdf] |
|
Y. Buganim, I. Goldstein, D. Lipson, M. Milyavsky, S. Polak-Charcon, C. Mardoukh, H. Solomon, E. Kalo, S. Madar,
R. Brosh, M. Perelman, R. Navon, N. Goldfinger, I. Barshack, Z. Yakhini, V. Rotter A Novel Translocation Breakpoint within the BPTF Gene Is Associated with a Pre-Malignant Phenotype |
Partial gain of chromosome arm 17q is an abundant aberrancy in various cancer types such as lung and prostate cancer with a prominent occurrence and prognostic significance in neuroblastoma - one of the most common embryonic tumors. The specific genetic element/s in 17q responsible for the cancer-promoting effect of these aberrancies is yet to be defined although many genes located in 17q have been proposed to play a role in malignancy. We report here the characterization of a naturally-occurring, non-reciprocal translocation der(X)t(X;17) in human lung embryonal-derived cells following continuous culturing. This aberrancy was strongly correlated with an increased proliferative capacity and with an acquired ability to form colonies in vitro. The breakpoint region was mapped by fluorescence in situ hybridization (FISH) to the 17q24.3 locus. Further characterization by a custom-made comparative genome hybridization array (CGH) localized the breakpoint within the Bromodomain PHD finger Transcription Factor gene (BPTF), a gene involved in transcriptional regulation and chromatin remodeling. Interestingly, this translocation led to elevation in the mRNA levels of the endogenous BPTF. Knock-down of BPTF restricted proliferation suggesting a role for BPTF in promoting cellular growth. Furthermore, the BPTF chromosomal region was found to be amplified in various human tumors, especially in neuroblastomas and lung cancers in which 55% and 27% of the samples showed gain of 17q24.3, respectively. Additionally, 42% percent of the cancer cell lines comprising the NCI-60 had an abnormal BPTF locus copy number. We suggest that deregulation of BPTF resulting from the translocation may confer the cells with the observed cancer-promoting phenotype and that our cellular model can serve to establish causality between 17q aberrations and carcinogenesis. PLoS ONE, 5(3): e9657, 2010 [pdf] |
|
Doron Lipson*, Tal Raz*, Alix Kieu, Daniel R. Jones, Eldar Giladi, Edward Thayer, John F. Thompson, Stan Letovsky, Patrice Milos, Marie Causey Quantification of the Yeast Transcriptome by Single-molecule Sequencing |
We present single-molecule sequencing digital gene expression (smsDGE), a high-throughput, amplification-free method for accurate quantification of the full range of cellular polyadenylated RNA transcripts using a Helicos Genetic Analysis system. smsDGE involves a reverse-transcription and polyA-tailing sample preparation procedure followed by sequencing that generates a single read per transcript. We applied smsDGE to the transcriptome of Saccharomyces cerevisiae strain DBY746, using 6 of the available 50 channels in a single sequencing run, yielding on average 12 million aligned reads per channel. Using spiked-in RNA, accurate quantitative measurements were obtained over four orders of magnitude. High correlation was demonstrated across independent flow-cell channels, instrument runs and sample preparations. Transcript counting in smsDGE is highly efficient due to the representation of each transcript molecule by a single read. This efficiency, coupled with the high throughput enabled by the single-molecule sequencing platform, provides an alternative method for expression profiling. Nature Biotechnology, 27(7):652-8, 2009 [pdf] |
|
Jayson Bowers, Judith Mitchell, Eric Beer, Philip R Buzby, Marie Causey, J William Efcavitch, Mirna Jarosz, Edyta Krzymanska-Olejnik, Li Kung,
Doron Lipson, Geoffrey M Lowman, Subramanian Marappan, Peter McInerney, Adam Platt, Atanu Roy, Suhaib M Siddiqi, Kathleen Steinmann and John F Thompson Virtual terminator nucleotides for next-generation DNA sequencing |
|
We synthesized reversible terminators with tethered inhibitors for next-generation sequencing. These were efficiently incorporated with high fidelity while preventing incorporation of additional nucleotides, and we used them to sequence canine bacterial artificial chromosomes in a single-molecule system that provided even coverage for over 99% of the region sequenced. This single-molecule approach generated high-quality sequence data without the need for target amplification and thus avoided concomitant biases. Nature Methods, 6, 593-595, 2009 |
|
Margaret Taub, Doron Lipson and Terence P. Speed Methods for Allocating Ambiguous Short-reads |
With the rise in prominence of biological research using new short-read DNA sequencing technologies comes the need for new techniques for aligning and assigning these reads to their genomic location of origin. Until now, methods for allocating reads which align with equal or similar fidelity to multiple genomic locations have not been model-based, and have tended to ignore potentially informative data. Here, we demonstrate that existing methods for assigning ambiguous reads can produce biased results. We then present new methods for allocating ambiguous reads to the genome, developed within a framework of statistical modeling, which show promise in alleviating these biases, both in simulated and real data. Communications in Information and Systems, 10(2):69-82, 2010 [pdf] |
|
Eran Eden, Roy Navon, Israel Steinfeld, Doron Lipson and Zohar Yakhini GOrilla: a Tool for Discovery and Visualization of Enriched GO Terms in Ranked Gene Lists |
Since the inception of the GO annotation project, a variety of tools have been developed that support exploring and searching the GO database. In particular, a variety of tools that perform GO enrichment analysis are currently available. Most of these tools require as input a target set of genes and a background set and seek enrichment in the target set compared to the background set. A few tools also exist that support analyzing ranked lists. The latter typically rely on simulations or on union-bound correction for assigning statistical significance to the results.
|
|
S. Farkash-Amar, D. Lipson, A. Polten, A. Goren, C. Helmstetter,
Z. Yakhini and I. Simon Global Organization of Replication Time Zones of the Mouse Genome |
The division of genomes into distinct replication time zones has long been established. However, an in-depth understanding of their organization and their relationship to transcription is incomplete. Taking advantage of a novel synchronization method ("baby machine") and of genomic DNA microarrays, we have, for the first time, mapped replication times of the entire mouse genome at a high temporal resolution. Our data revealed that although most of the genome has a distinct time of replication either early, middle, or late S phase, a significant portion of the genome is replicated asynchronously. Analysis of the replication map revealed the genomic scale organization of the replication time zones. We found that the genomic regions between early and late replication time zones often consist of extremely large replicons. Analysis of the relationship between replication and transcription revealed that early replication is frequently correlated with the transcription potential of a gene and not necessarily with its actual transcriptional activity. These findings, along with the strong conservation found between replication timing in human and mouse genomes, emphasize the importance of replication timing in transcription regulation.
|
| Amir
Ben-Dor, Doron Lipson, Anya Tsalenko, Mark Reimers, Lars O. Baumbusch, Michael
T. Barrett, John N. Weinstein, Anne-Lise Borresen-Dale, Zohar Yakhini Framework for Identifying Common Aberrations in DNA Copy Number Data |
Motivation. High-resolution
array comparative genomic hybridization (aCGH) provides exon-level mapping
of
DNA aberrations in cells or tissues. Such aberrations are central to
carcinogenesis and, in many cases, central to
targeted therapy of the cancers. Some of the aberrations are sporadic,
one-of-a-kind changes in particular tumor
samples; others occur frequently and reflect common themes in cancer biology
that have interpretable, causal ramifications.
Methods and Results. In this paper we present an efficient computational
framework for identification and
statistical characterization of genomic aberrations that are common to
multiple cancer samples in a CGH data set.
We present and compare three different algorithmic approaches within the
context of that framework. Finally, we
apply our methods to two datasets – a collection of 20 breast cancer samples
and a panel of 60 diverse human tumor
cell lines (the NCI-60). Those analyses identified both known and novel
common aberrations containing cancer related
genes. The potential impact of the analytical methods is well demonstrated
by new insights into the patterns
of deletion of CDKN2A (p16), a tumor suppressor gene crucial for the genesis
of many types of cancer.
|
| Eran Eden,
Doron Lipson, Sivan Yogev, Zohar Yakhini Discovering Motifs in Ranked Lists of DNA Sequences |
![]()
|
| E. Dehan, A.
Ben-Dor, W. Liao, D. Lipson, H. Frimer, S. Rienstein, D. Simansky, M. Krupsky,
P. Yaron, E. Friedman, G. Rechavi, M. Perlman, A. Aviram-Goldring, S. Izraeli,
M. Bittner, Z. Yakhini, N. Kaminski Chromosomal aberrations and gene expression profiles in non-small cell lung cancer |
|
|
| Doron
Lipson, Zohar Yakhini, Yonatan Aumann Optimization of Probe Coverage for High-Resolution Oligonucleotide aCGH |
|
Results. In this paper we describe a well-defined optimization problem associated with the problem of probe selection for high-resolution aCGH arrays. We propose the whenever possible e-cover as a formulation that faithfully captures the requirement of probe selection problem, and provide a fast randomized algorithm that solves the optimization problem in O(n log n) time, as well as a deterministic algorithm with the same asymptotic performance. We apply the method in a typical high-definition array design scenario and demonstrate its superiority with respect to alternative approaches.. Availability.
Address requests to the authors.
|
|
Ilya Baskin, Stav Zaitsev, Doron Lipson, Rachel Gilad,
Kinneret Keren, Gidi Ben-Yoseph, Uri Sivan A Molecular Shift Register and its Utilization as an Autonomous DNA Synthesizer |
|
We demonstrate a novel algorithmic approach to autonomous synthesis of fairly long DNA molecules with well-defined, non-recurring sequences. The scheme exploits chemical embodiment of shift registers to execute algorithms similar to those used to generate pseudo-random numbers on a computer. A collection of single stranded “rule” DNA molecules is added to a tube together with a single stranded “seed” molecule and polymerase. The “rule” molecules guide seed elongation according to the algorithm to produce the desired DNA molecule. The synthesis effort is exponentially small compared with all present strategies. The reduced effort is facilitated by the sliding reading frame of shift registers, namely, the utilization of a previously synthesized sequence for the synthesis of the next bases. A redundancy based error reduction scheme, similar to those used in communication, is utilized to systematically suppress synthesis errors.
|
| Doron Lipson, Yonatan Aumann,
Amir Ben-Dor, Nathan Linial, Zohar Yakhini Efficient Calculation of Interval Scores for DNA Copy Number Data Analysis |
|
Methods.
We develop a statistical framework that enables the casting of several DNA
copy number data analysis questions as optimization problems over real valued
vectors of signals. The simplest form of the optimization problem seeks to
maximize
Examples.
We benchmark our algorithms on synthetic as well as publicly available DNA
copy number data. We demonstrate the use of these methods for identifying
aberrations in single samples as well as common alterations in fixed sets and
subsets of breast cancer samples.
Journal of Computational Biology, Vol. 13, No. 2: 215-228, 2006. |
| Michael T.
Barrett, Alicia Scheffer, Amir Ben-Dor, Nick Sampas, Doron Lipson, Robert
Kincaid, Peter Tsang, Bo Curry, Kristin Baird, Paul S. Meltzer, Zohar Yakhini,
Laurakay Bruhn, Stephen Laderman Comparative Genomic Hybridization using Oligonucleotide Microarrays and Total Genomic DNA |
|
|
| Doron Lipson,
Amir Ben-Dor, Elinor Dehan, Zohar Yakhini Joint Analysis of DNA Copy Numbers and Gene Expression Levels |
|
|
| Doron Lipson, Peter Webb,
Zohar Yakhini Designing Specific Oligonucleotide Probes for the Entire S. cerevisiae Transcriptome |
|
|
|
Brian F. Volkman, Doron Lipson, David E. Wemmer, Dorothee Kern Two-State Allosteric Behavior in a Single-Domain Signaling Protein |
|
|
|
Nicola J.
Turton, David J. Judah, Joan Riley, Reginald Davies, Doron Lipson, Jerry A. Styles,
Andrew G. Smith, Timothy W. Gant Gene expression and amplification in breast carcinoma cells with intrinsic and acquired doxorubicin resistance |
|
The multidrug resistance (MDR)
phenotype is a major cause of cancer treatment failure. Here the expressions of
4224 genes were analysed for association with intrinsic or acquired doxorubicin
(DOX) resistance. A cluster of overexpressed genes related to DOX resistance was
observed. Included in this cluster was ABCB1 the P-glycoprotein transporter
protein gene and MMP1 (Matrix Metalloproteinase 1), indicative of the invasive
nature of resistant cells, and the oxytocin receptor (OXTR), a potential new
therapeutic target. Overexpression of genes associated with xenobiotic
transformation, cell transformation, cell signalling and lymphocyte activation
was also associated with DOX resistance as was estrogen receptor negativity. In
all carcinoma cells, compared with HBL100 a putatively normal breast epithelial
cell line, a cluster of overexpressed genes was identified which included
several keratins, in particular keratins 8 and 18 which are regulated through
the ras signalling pathway. Analysis of genomic amplifications and deletions
revealed specific genetic alterations common to both intrinsic and acquired DOX
resistance including ABCB1, PGY3 (ABCB4) and BAK. The findings shown here
indicate new possibilities for the diagnosis of DOX resistance using gene
expression, and potential novel therapeutic targets for pharmacological
intervention.
|
|
Yoav Soen, Netta Cohen, Doron Lipson, and Erez Braun Emergence of spontaneous rhythm disorders in self-assembled networks of heart cells |
|
Physical Review Letters, Vol. 82, Num. 17 , 26 April 1999, pp. 3556-3559. [pdf]
|
| Back to Doron's homepage |