<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns="http://www.w3.org/2005/Atom">
<title>CAS: Computer Science: Scholarly Papers</title>
<link href="http://hdl.handle.net/2144/1240" rel="alternate"/>
<subtitle/>
<id>http://hdl.handle.net/2144/1240</id>
<updated>2013-05-21T07:29:06Z</updated>
<dc:date>2013-05-21T07:29:06Z</dc:date>
<entry>
<title>Spectral affinity in protein networks</title>
<link href="http://hdl.handle.net/2144/3198" rel="alternate"/>
<author>
<name>Voevodski, Konstantin</name>
</author>
<author>
<name>Teng, Shang-Hua</name>
</author>
<author>
<name>Xia, Yu</name>
</author>
<id>http://hdl.handle.net/2144/3198</id>
<updated>2012-01-12T07:01:49Z</updated>
<published>2009-11-29T00:00:00Z</published>
<summary type="text">Spectral affinity in protein networks
Voevodski, Konstantin; Teng, Shang-Hua; Xia, Yu
BACKGROUND. Protein-protein interaction (PPI) networks enable us to better understand the functional organization of the proteome. We can learn a lot about a particular protein by querying its neighborhood in a PPI network to find proteins with similar function. A spectral approach that considers random walks between nodes of interest is particularly useful in evaluating closeness in PPI networks. Spectral measures of closeness are more robust to noise in the data and are more precise than simpler methods based on edge density and shortest path length. RESULTS. We develop a novel affinity measure for pairs of proteins in PPI networks, which uses personalized PageRank, a random walk based method used in context-sensitive search on the Web. Our measure of closeness, which we call PageRank Affinity, is proportional to the number of times the smaller-degree protein is visited in a random walk that restarts at the larger-degree protein. PageRank considers paths of all lengths in a network, therefore PageRank Affinity is a precise measure that is robust to noise in the data. PageRank Affinity is also provably related to cluster co-membership, making it a meaningful measure. In our experiments on protein networks we find that our measure is better at predicting co-complex membership and finding functionally related proteins than other commonly used measures of closeness. Moreover, our experiments indicate that PageRank Affinity is very resilient to noise in the network. In addition, based on our method we build a tool that quickly finds nodes closest to a queried protein in any protein network, and easily scales to much larger biological networks. CONCLUSION. We define a meaningful way to assess the closeness of two proteins in a PPI network, and show that our closeness measure is more biologically significant than other commonly used methods. We also develop a tool, accessible at http://xialab.bu.edu/resources/pnns, that allows the user to quickly find nodes closest to a queried vertex in any protein network available from BioGRID or specified by the user.
</summary>
<dc:date>2009-11-29T00:00:00Z</dc:date>
</entry>
<entry>
<title>Finding Local Communities in Protein Networks</title>
<link href="http://hdl.handle.net/2144/3190" rel="alternate"/>
<author>
<name>Voevodski, Konstantin</name>
</author>
<author>
<name>Teng, Shang-Hua</name>
</author>
<author>
<name>Xia, Yu</name>
</author>
<id>http://hdl.handle.net/2144/3190</id>
<updated>2012-01-12T07:01:46Z</updated>
<published>2009-09-18T00:00:00Z</published>
<summary type="text">Finding Local Communities in Protein Networks
Voevodski, Konstantin; Teng, Shang-Hua; Xia, Yu
BACKGROUND. Protein-protein interactions (PPIs) play fundamental roles in nearly all biological processes, and provide major insights into the inner workings of cells. A vast amount of PPI data for various organisms is available from BioGRID and other sources. The identification of communities in PPI networks is of great interest because they often reveal previously unknown functional ties between proteins. A large number of global clustering algorithms have been applied to protein networks, where the entire network is partitioned into clusters. Here we take a different approach by looking for local communities in PPI networks. RESULTS. We develop a tool, named Local Protein Community Finder, which quickly finds a community close to a queried protein in any network available from BioGRID or specified by the user. Our tool uses two new local clustering algorithms Nibble and PageRank-Nibble, which look for a good cluster among the most popular destinations of a short random walk from the queried vertex. The quality of a cluster is determined by proportion of outgoing edges, known as conductance, which is a relative measure particularly useful in undersampled networks. We show that the two local clustering algorithms find communities that not only form excellent clusters, but are also likely to be biologically relevant functional components. We compare the performance of Nibble and PageRank-Nibble to other popular and effective graph partitioning algorithms, and show that they find better clusters in the graph. Moreover, Nibble and PageRank-Nibble find communities that are more functionally coherent. CONCLUSION. The Local Protein Community Finder, accessible at , allows the user to quickly find a high-quality community close to a queried protein in any network available from BioGRID or specified by the user. We show that the communities found by our tool form good clusters and are functionally coherent, making our application useful for biologists who wish to investigate functional modules that a particular protein is a part of.
</summary>
<dc:date>2009-09-18T00:00:00Z</dc:date>
</entry>
<entry>
<title>Evolutionary History of Mammalian Transposons Determined by Genome-Wide Defragmentation</title>
<link href="http://hdl.handle.net/2144/3130" rel="alternate"/>
<author>
<name>Giordano, Joti</name>
</author>
<author>
<name>Ge, Yongchao</name>
</author>
<author>
<name>Gelfand, Yevgeniy</name>
</author>
<author>
<name>Abrusán, György</name>
</author>
<author>
<name>Benson, Gary</name>
</author>
<author>
<name>Warburton, Peter E</name>
</author>
<id>http://hdl.handle.net/2144/3130</id>
<updated>2012-01-12T07:00:25Z</updated>
<published>2007-07-13T00:00:00Z</published>
<summary type="text">Evolutionary History of Mammalian Transposons Determined by Genome-Wide Defragmentation
Giordano, Joti; Ge, Yongchao; Gelfand, Yevgeniy; Abrusán, György; Benson, Gary; Warburton, Peter E
The constant bombardment of mammalian genomes by transposable elements (TEs) has resulted in TEs comprising at least 45% of the human genome. Because of their great age and abundance, TEs are important in comparative phylogenomics. However, estimates of TE age were previously based on divergence from derived consensus sequences or phylogenetic analysis, which can be unreliable, especially for older more diverged elements. Therefore, a novel genome-wide analysis of TE organization and fragmentation was performed to estimate TE age independently of sequence composition and divergence or the assumption of a constant molecular clock. Analysis of TEs in the human genome revealed ∼600,000 examples where TEs have transposed into and fragmented other TEs, covering &gt;40% of all TEs or ∼542 Mbp of genomic sequence. The relative age of these TEs over evolutionary time is implicit in their organization, because newer TEs have necessarily transposed into older TEs that were already present. A matrix of the number of times that each TE has transposed into every other TE was constructed, and a novel objective function was developed that derived the chronological order and relative ages of human TEs spanning &gt;100 million years. This method has been used to infer the relative ages across all four major TE classes, including the oldest, most diverged elements. Analysis of DNA transposons over the history of the human genome has revealed the early activity of some MER2 transposons, and the relatively recent activity of MER1 transposons during primate lineages. The TEs from six additional mammalian genomes were defragmented and analyzed. Pairwise comparison of the independent chronological orders of TEs in these mammalian genomes revealed species phylogeny, the fact that transposons shared between genomes are older than species-specific transposons, and a subset of TEs that were potentially active	during periods of speciation. 

Author Summary. 

Transposable elements (TEs) are interspersed repetitive DNA families that are capable of copying themselves from place to place; they have literally infested our genome over evolutionary time, and now comprise as much as 45% of our total DNA. Because of their great age and abundance, TEs are important in evolutionary genomics. However, estimates of their age based on DNA sequence composition have been unreliable, especially for older more diverged elements. Therefore, a novel method to estimate the age of TEs was developed based on the fact that as TEs spread throughout the genome, they inserted into and fragmented older TEs that were already present. Therefore, the age of TEs can be revealed by how often they have been fragmented over evolutionary time. We performed a genome-wide defragmention of TEs, and developed a novel objective function to derive the chronological order of TEs spanning &lt;100 million years. This method has been used to infer the relative ages of TEs from seven sequenced mammalian genomes across all four major TE classes, including the oldest, most diverged elements. This age estimate is independent of TE sequence composition or divergence and does not rely on the assumption of a constant molecular clock. This study provides a novel analysis of the evolutionary history of some of the most abundant and ancient repetitive DNA elements in mammalian genomes, which is important for understanding the dynamic forces that shape our genomes during evolution.
</summary>
<dc:date>2007-07-13T00:00:00Z</dc:date>
</entry>
<entry>
<title>TRDB—The Tandem Repeats Database</title>
<link href="http://hdl.handle.net/2144/3128" rel="alternate"/>
<author>
<name>Gelfand, Yevgeniy</name>
</author>
<author>
<name>Rodriguez, Alfredo</name>
</author>
<author>
<name>Benson, Gary</name>
</author>
<id>http://hdl.handle.net/2144/3128</id>
<updated>2012-01-12T07:01:45Z</updated>
<published>2006-12-14T00:00:00Z</published>
<summary type="text">TRDB—The Tandem Repeats Database
Gelfand, Yevgeniy; Rodriguez, Alfredo; Benson, Gary
Tandem repeats in DNA have been under intensive study for many years, first, as a consequence of their usefulness as genomic markers and DNA fingerprints and more recently as their role in human disease and regulatory processes has become apparent. The Tandem Repeats Database (TRDB) is a public repository of information on tandem repeats in genomic DNA. It contains a variety of tools for repeat analysis, including the Tandem Repeats Finder program, query and filtering capabilities, repeat clustering, polymorphism prediction, PCR primer selection, data visualization and data download in a variety of formats. In addition, TRDB serves as a centralized research workbench. It provides user storage space and permits collaborators to privately share their data and analysis. TRDB is available at https://tandem. bu.edu/cgi-bin/trdb/trdb.exe.
</summary>
<dc:date>2006-12-14T00:00:00Z</dc:date>
</entry>
</feed>
