<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns="http://www.w3.org/2005/Atom">
<title>Center for Advanced Genomic Technology Papers</title>
<link href="http://hdl.handle.net/2144/2439" rel="alternate"/>
<subtitle/>
<id>http://hdl.handle.net/2144/2439</id>
<updated>2013-05-24T23:07:20Z</updated>
<dc:date>2013-05-24T23:07:20Z</dc:date>
<entry>
<title>VisANT 3.0: New Modules for Pathway Visualization, Editing, Prediction and Construction</title>
<link href="http://hdl.handle.net/2144/3373" rel="alternate"/>
<author>
<name>Hu, Zhenjun</name>
</author>
<author>
<name>Ng, David M.</name>
</author>
<author>
<name>Yamada, Takuji</name>
</author>
<author>
<name>Chen, Chunnuan</name>
</author>
<author>
<name>Kawashima, Shuichi</name>
</author>
<author>
<name>Mellor, Joe</name>
</author>
<author>
<name>Linghu, Bolan</name>
</author>
<author>
<name>Kanehisa, Minoru</name>
</author>
<author>
<name>Stuart, Joshua M.</name>
</author>
<author>
<name>DeLisi, Charles</name>
</author>
<id>http://hdl.handle.net/2144/3373</id>
<updated>2012-01-13T07:00:59Z</updated>
<published>2007-07-01T00:00:00Z</published>
<summary type="text">VisANT 3.0: New Modules for Pathway Visualization, Editing, Prediction and Construction
Hu, Zhenjun; Ng, David M.; Yamada, Takuji; Chen, Chunnuan; Kawashima, Shuichi; Mellor, Joe; Linghu, Bolan; Kanehisa, Minoru; Stuart, Joshua M.; DeLisi, Charles
With the integration of the KEGG and Predictome databases as well as two search engines for coexpressed genes/proteins using data sets obtained from the Stanford Microarray Database (SMD) and Gene Expression Omnibus (GEO) database, VisANT 3.0 supports exploratory pathway analysis, which includes multi-scale visualization of multiple pathways, editing and annotating pathways using a KEGG compatible visual notation and visualization of expression data in the context of pathways. Expression levels are represented either by color intensity or by nodes with an embedded expression profile. Multiple experiments can be navigated or animated. Known KEGG pathways can be enriched by querying either coexpressed components of known pathway members or proteins with known physical interactions. Predicted pathways for genes/proteins with unknown functions can be inferred from coexpression or physical interaction data. Pathways produced in VisANT can be saved as computer-readable XML format (VisML), graphic images or high-resolution Scalable Vector Graphics (SVG). Pathways in the format of VisML can be securely shared within an interested group or published online using a simple Web link. VisANT is freely available at http://visant.bu.edu.
</summary>
<dc:date>2007-07-01T00:00:00Z</dc:date>
</entry>
<entry>
<title>High-Precision High-Coverage Functional Inference from Integrated Data Sources</title>
<link href="http://hdl.handle.net/2144/3195" rel="alternate"/>
<author>
<name>Linghu, Bolan</name>
</author>
<author>
<name>Snitkin, Evan S</name>
</author>
<author>
<name>Holloway, Dustin T</name>
</author>
<author>
<name>Gustafson, Adam M</name>
</author>
<author>
<name>Xia, Yu</name>
</author>
<author>
<name>DeLisi, Charles</name>
</author>
<id>http://hdl.handle.net/2144/3195</id>
<updated>2012-01-12T07:01:47Z</updated>
<published>2008-02-25T00:00:00Z</published>
<summary type="text">High-Precision High-Coverage Functional Inference from Integrated Data Sources
Linghu, Bolan; Snitkin, Evan S; Holloway, Dustin T; Gustafson, Adam M; Xia, Yu; DeLisi, Charles
BACKGROUND. Information obtained from diverse data sources can be combined in a principled manner using various machine learning methods to increase the reliability and range of knowledge about protein function. The result is a weighted functional linkage network (FLN) in which linked neighbors share at least one function with high probability. Precision is, however, low. Aiming to provide precise functional annotation for as many proteins as possible, we explore and propose a two-step framework for functional annotation (1) construction of a high-coverage and reliable FLN via machine learning techniques (2) development of a decision rule for the constructed FLN to optimize functional annotation. RESULTS. We first apply this framework to Saccharomyces cerevisiae. In the first step, we demonstrate that four commonly used machine learning methods, Linear SVM, Linear Discriminant Analysis, Naïve Bayes, and Neural Network, all combine heterogeneous data to produce reliable and high-coverage FLNs, in which the linkage weight more accurately estimates functional coupling of linked proteins than use individual data sources alone. In the second step, empirical tuning of an adjustable decision rule on the constructed FLN reveals that basing annotation on maximum edge weight results in the most precise annotation at high coverages. In particular at low coverage all rules evaluated perform comparably. At coverage above approximately 50%, however, they diverge rapidly. At full coverage, the maximum weight decision rule still has a precision of approximately 70%, whereas for other methods, precision ranges from a high of slightly more than 30%, down to 3%. In addition, a scoring scheme to estimate the precisions of individual predictions is also provided. Finally, tests of the robustness of the framework indicate that our framework can be successfully applied to less studied organisms. CONCLUSION. We provide a general two-step function-annotation framework, and show that high coverage, high precision annotations can be achieved by constructing a high-coverage and reliable FLN via data integration followed by applying a maximum weight decision rule.
</summary>
<dc:date>2008-02-25T00:00:00Z</dc:date>
</entry>
<entry>
<title>Biological Process Linkage Networks</title>
<link href="http://hdl.handle.net/2144/3055" rel="alternate"/>
<author>
<name>Dotan-Cohen, Dikla</name>
</author>
<author>
<name>Letovsky, Stan</name>
</author>
<author>
<name>Melkman, Avraham A.</name>
</author>
<author>
<name>Kasif, Simon</name>
</author>
<id>http://hdl.handle.net/2144/3055</id>
<updated>2012-01-11T07:00:38Z</updated>
<published>2009-04-23T00:00:00Z</published>
<summary type="text">Biological Process Linkage Networks
Dotan-Cohen, Dikla; Letovsky, Stan; Melkman, Avraham A.; Kasif, Simon
BACKGROUND. The traditional approach to studying complex biological networks is based on the identification of interactions between internal components of signaling or metabolic pathways. By comparison, little is known about interactions between higher order biological systems, such as biological pathways and processes. We propose a methodology for gleaning patterns of interactions between biological processes by analyzing protein-protein interactions, transcriptional co-expression and genetic interactions. At the heart of the methodology are the concept of Linked Processes and the resultant network of biological processes, the Process Linkage Network (PLN). RESULTS. We construct, catalogue, and analyze different types of PLNs derived from different data sources and different species. When applied to the Gene Ontology, many of the resulting links connect processes that are distant from each other in the hierarchy, even though the connection makes eminent sense biologically. Some others, however, carry an element of surprise and may reflect mechanisms that are unique to the organism under investigation. In this aspect our method complements the link structure between processes inherent in the Gene Ontology, which by its very nature is species-independent. As a practical application of the linkage of processes we demonstrate that it can be effectively used in protein function prediction, having the power to increase both the coverage and the accuracy of predictions, when carefully integrated into prediction methods. CONCLUSIONS. Our approach constitutes a promising new direction towards understanding the higher levels of organization of the cell as a system which should help current efforts to re-engineer ontologies and improve our ability to predict which proteins are involved in specific biological processes.
</summary>
<dc:date>2009-04-23T00:00:00Z</dc:date>
</entry>
<entry>
<title>A Predictive Phosphorylation Signature of Lung Cancer</title>
<link href="http://hdl.handle.net/2144/3056" rel="alternate"/>
<author>
<name>Wu, Chang-Jiun</name>
</author>
<author>
<name>Cai, Tianxi</name>
</author>
<author>
<name>Rikova, Klarisa</name>
</author>
<author>
<name>Merberg, David</name>
</author>
<author>
<name>Kasif, Simon</name>
</author>
<author>
<name>Steffen, Martin</name>
</author>
<id>http://hdl.handle.net/2144/3056</id>
<updated>2012-01-11T07:00:38Z</updated>
<published>2009-11-25T00:00:00Z</published>
<summary type="text">A Predictive Phosphorylation Signature of Lung Cancer
Wu, Chang-Jiun; Cai, Tianxi; Rikova, Klarisa; Merberg, David; Kasif, Simon; Steffen, Martin
BACKGROUND. Aberrant activation of signaling pathways drives many of the fundamental biological processes that accompany tumor initiation and progression. Inappropriate phosphorylation of intermediates in these signaling pathways are a frequently observed molecular lesion that accompanies the undesirable activation or repression of pro- and anti-oncogenic pathways. Therefore, methods which directly query signaling pathway activation via phosphorylation assays in individual cancer biopsies are expected to provide important insights into the molecular "logic" that distinguishes cancer and normal tissue on one hand, and enables personalized intervention strategies on the other. RESULTS. We first document the largest available set of tyrosine phosphorylation sites that are, individually, differentially phosphorylated in lung cancer, thus providing an immediate set of drug targets. Next, we develop a novel computational methodology to identify pathways whose phosphorylation activity is strongly correlated with the lung cancer phenotype. Finally, we demonstrate the feasibility of classifying lung cancers based on multi-variate phosphorylation signatures. CONCLUSIONS. Highly predictive and biologically transparent phosphorylation signatures of lung cancer provide evidence for the existence of a robust set of phosphorylation mechanisms (captured by the signatures) present in the majority of lung cancers, and that reliably distinguish each lung cancer from normal. This approach should improve our understanding of cancer and help guide its treatment, since the phosphorylation signatures highlight proteins and pathways whose phosphorylation should be inhibited in order to prevent unregulated proliferation.
</summary>
<dc:date>2009-11-25T00:00:00Z</dc:date>
</entry>
</feed>
