<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-8549809657959855223</id><updated>2012-01-23T16:44:01.667+01:00</updated><category term='string'/><category term='outages'/><category term='api'/><category term='documentation'/><category term='stitch'/><category term='announcements'/><title type='text'>News about the STRING and STITCH databases</title><subtitle type='html'></subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Michael Kuhn</name><uri>https://profiles.google.com/116824032690427957728</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh5.googleusercontent.com/-f9TdFiMLaUM/AAAAAAAAAAI/AAAAAAAAAKA/5gB2YoCPGl8/s512-c/photo.jpg'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>23</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-8744750055450963111</id><published>2010-03-16T13:43:00.001+01:00</published><updated>2010-03-16T13:49:45.919+01:00</updated><title type='text'>Combining scores the right way</title><content type='html'>&lt;div style="font-family: inherit;"&gt;With STRING version 8, we began to add prior correction to the scores. That means that we consider how close scores are to the background rate of about 6% for protein–protein interactions, and we remove the random expectation from the scores before we combine them.&amp;nbsp; (At the end, we put the random contribution back in to make it consistent with the case that only a single evidence channel contributes to the score.)&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="font-family: inherit;"&gt;For some applications, you might want to remove certain evidence types from the STRING download files. To compute a new combined score, you can now use the same prior correction we've been using internally. We have started a new &lt;a href="http://bitbucket.org/mkuhn/stringtools/"&gt;BitBucket repository,&lt;/a&gt; and added a &lt;a href="http://bitbucket.org/mkuhn/stringtools/src/tip/prior_correction/"&gt;script to discard channels&lt;/a&gt; from the full STRING download files.&amp;nbsp; &lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-8744750055450963111?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/8744750055450963111/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2010/03/combining-scores-right-way.html#comment-form' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/8744750055450963111'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/8744750055450963111'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2010/03/combining-scores-right-way.html' title='Combining scores the right way'/><author><name>Michael Kuhn</name><uri>https://profiles.google.com/116824032690427957728</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh5.googleusercontent.com/-f9TdFiMLaUM/AAAAAAAAAAI/AAAAAAAAAKA/5gB2YoCPGl8/s512-c/photo.jpg'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-3568984280151408335</id><published>2009-11-09T14:37:00.005+01:00</published><updated>2009-11-09T16:18:53.314+01:00</updated><title type='text'>STRING 8.2 and STITCH 2.0 released</title><content type='html'>We have released new versions of STRING and STITCH!&lt;br /&gt;&lt;br /&gt;Here are some of the changes:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;protein-protein actions like &lt;span style="font-style: italic;"&gt;binding&lt;/span&gt; or &lt;span style="font-style: italic;"&gt;post-translational modifications&lt;/span&gt; are now included&lt;/li&gt;&lt;li&gt;most download files are now under Creative Commons licenses, so academics usually don't need to fill license agreements anymore&lt;/li&gt;&lt;li&gt;bugfix: version 8.1 had accidentally omitted homology transfer in the textmining channel, this is now fixed.&lt;/li&gt;&lt;li&gt;the Flash interactive network viewer has further matured; a simple, lean version is now enabled by default. The full-featured version can be activated by clicking on the "advanced" button below the network&lt;br /&gt;&lt;/li&gt;&lt;li&gt;STITCH 2 uses the protein universe of STRING 8&lt;/li&gt;&lt;li&gt;the number of interactions has further increased&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-3568984280151408335?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/3568984280151408335/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2009/11/string-82-and-stitch-20-released.html#comment-form' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/3568984280151408335'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/3568984280151408335'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2009/11/string-82-and-stitch-20-released.html' title='STRING 8.2 and STITCH 2.0 released'/><author><name>Michael Kuhn</name><uri>https://profiles.google.com/116824032690427957728</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh5.googleusercontent.com/-f9TdFiMLaUM/AAAAAAAAAAI/AAAAAAAAAKA/5gB2YoCPGl8/s512-c/photo.jpg'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-5961922572691469462</id><published>2009-09-01T17:20:00.003+02:00</published><updated>2009-09-01T17:24:10.552+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='api'/><category scheme='http://www.blogger.com/atom/ns#' term='string'/><title type='text'>API for STRING 8.0 still accessible</title><content type='html'>After the release of STRING 8.1, the previous version, 8.0, can still be accessed: &lt;a href="http://string80.embl.de"&gt;http://string80.embl.de&lt;/a&gt;. Under this URL, the API continues to work, e.g.:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://string80.embl.de/api/tsv/interactors?identifier=DRD1_HUMAN"&gt;http://string80.embl.de/api/tsv/interactors?identifier=DRD1_HUMAN&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;This pattern will continue for upcoming releases (you could also use "string81" as host name now to stay on this version even if we update to a new version).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-5961922572691469462?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/5961922572691469462/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2009/09/api-for-string-80-still-accessible.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/5961922572691469462'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/5961922572691469462'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2009/09/api-for-string-80-still-accessible.html' title='API for STRING 8.0 still accessible'/><author><name>Michael Kuhn</name><uri>https://profiles.google.com/116824032690427957728</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh5.googleusercontent.com/-f9TdFiMLaUM/AAAAAAAAAAI/AAAAAAAAAKA/5gB2YoCPGl8/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-5059414188852837770</id><published>2009-07-23T15:39:00.003+02:00</published><updated>2009-07-23T15:46:46.344+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='stitch'/><title type='text'>Quick poll for next STITCH version concerning chemical names</title><content type='html'>I'm conducting a &lt;a href="http://friendfeed.com/the-life-scientists/ea894896/quick-poll-for-next-stitch-version-concerning"&gt;small poll over at FriendFeed&lt;/a&gt; about abbreviations of chemical names. If you have a opinion about which abbreviation is better in the following cases, please leave your feedback (either on FriendFeed or in the comments here):&lt;br /&gt;&lt;ul&gt;&lt;li&gt;10-formyltetrahydrofolate: "10-formyltetra." or "10-formyl-THF"?&lt;/li&gt;&lt;li&gt;acetyl-L-carnitine: "acetyl-L-carni." or "ALCAR"?&lt;br /&gt;&lt;/li&gt;&lt;li&gt;dihydrotestosterone: "androstanolone" or "dihydrotestost."?&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-5059414188852837770?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/5059414188852837770/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2009/07/quick-poll-for-next-stitch-version.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/5059414188852837770'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/5059414188852837770'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2009/07/quick-poll-for-next-stitch-version.html' title='Quick poll for next STITCH version concerning chemical names'/><author><name>Michael Kuhn</name><uri>https://profiles.google.com/116824032690427957728</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh5.googleusercontent.com/-f9TdFiMLaUM/AAAAAAAAAAI/AAAAAAAAAKA/5gB2YoCPGl8/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-4502180363031279497</id><published>2009-07-21T14:00:00.001+02:00</published><updated>2009-07-21T14:33:14.816+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='announcements'/><category scheme='http://www.blogger.com/atom/ns#' term='string'/><title type='text'>STRING Cytoscape plugin</title><content type='html'>Users of Cytoscape can now natively retrieve interaction networks from STRING !&lt;br /&gt;&lt;br /&gt;During a recent &lt;a href="http://www.psidev.info/index.php?q=esf-2008"&gt;workshop&lt;/a&gt; at the &lt;a href="http://www.ebi.ac.uk/"&gt;EBI&lt;/a&gt;, a common web service API to query interaction databases (called &lt;a href="http://hackathon.dbcls.jp/wiki/PSICQUIC"&gt;PSICQUIC&lt;/a&gt;) was finalized. Once all interaction databases have implemented this interface, it will be possible to use a single client (a Cytoscape plugin for example) to interact with all of them. We are committed to this initiative, and look forward to the implementations.&lt;br /&gt;&lt;br /&gt;In the interim, we have decided to also release a small, custom-made &lt;a href="http://cytoscape.org/plugins/index.php"&gt;plugin&lt;/a&gt; for &lt;a href="http://www.cytoscape.org/"&gt;Cytoscape&lt;/a&gt; called &lt;span style="font-style: italic;"&gt;StringWSClient&lt;/span&gt;, which interacts only with the &lt;a href="http://string-db.org/"&gt;STRING database&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;This allows us to offer users the full range of features that the &lt;a href="http://string-stitch.blogspot.com/2008/02/we-have-api.html"&gt;STRING API&lt;/a&gt; allows (e.g. to show all available species, or to resolve ambiguous inputs). Version 1.0 (1.1) supports only the import of interaction networks; upcoming versions will be able to extend existing networks, filter them using STRING specific criteria, etc. The 1.0 version works only with Cytoscape 2.6.1, and 1.1 was released to support the whole 2.6.x branch.&lt;br /&gt;&lt;br /&gt;To install it, fire up Cytoscape, open &lt;span style="font-style: italic;"&gt;Plugins/M&lt;/span&gt;&lt;span style="font-style: italic;"&gt;anage Plugins&lt;/span&gt; dialog and pick &lt;span style="font-weight: bold;"&gt;StringWSClient v1.0&lt;/span&gt; from "&lt;span style="font-style: italic;"&gt;Network and Attribute I/O&lt;/span&gt;" section. You may have to restart Cytoscape to load the plugin. See Cytoscape &lt;a href="http://cytoscape.org/manual/Cytoscape2_6Manual.html#Plugins%20and%20the%20Plugin%20Manager"&gt;documentation&lt;/a&gt; for details.&lt;br /&gt;&lt;br /&gt;Once you have the plugin installed, open &lt;span style="font-style: italic;"&gt;File/Import/Network from web services...&lt;/span&gt; dialog and&lt;br /&gt;pick the String plugin.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_Jzv9awBIdbU/Sa0xIJxaeXI/AAAAAAAAAD4/BDFNqE0mKPw/s1600-h/img1.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 320px; height: 262px;" src="http://2.bp.blogspot.com/_Jzv9awBIdbU/Sa0xIJxaeXI/AAAAAAAAAD4/BDFNqE0mKPw/s320/img1.png" alt="" id="BLOGGER_PHOTO_ID_5308953551871572338" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;The plugin resembles STRING's web user interface: a field to type queries and the organisms selector. In the background, the query is sent to the STRING database and the resulting interaction network is fetched and displayed.&lt;br /&gt;&lt;br /&gt;We're looking forward to your feedback !&lt;br /&gt;Milan Simonovic and the STRING team.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-4502180363031279497?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/4502180363031279497/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2009/03/string-cytoscape-plugin.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/4502180363031279497'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/4502180363031279497'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2009/03/string-cytoscape-plugin.html' title='STRING Cytoscape plugin'/><author><name>Milan Simonovic</name><uri>http://www.blogger.com/profile/02468380693061457881</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_Jzv9awBIdbU/Sa0xIJxaeXI/AAAAAAAAAD4/BDFNqE0mKPw/s72-c/img1.png' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-6373468272925959641</id><published>2009-07-01T09:46:00.004+02:00</published><updated>2009-07-01T10:00:25.505+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='api'/><category scheme='http://www.blogger.com/atom/ns#' term='string'/><title type='text'>100 API accesses per minute: not a good idea</title><content type='html'>STRING was unreachable for some time this morning while it was busy processing ~10,000 API requests in about 1.5 hours. 100 API accesses per minute is &lt;span style="font-weight: bold;"&gt;not&lt;/span&gt; a good idea, as this overloads our server, making STRING (and STITCH) unavailable for everyone. Don't run scripts that access the API in parallel.&lt;br /&gt;&lt;br /&gt;If you want to do large-scale analysis on STRING data, you can make your and our life much easier by signing an academic license agreement and downloading the full dataset.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-6373468272925959641?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/6373468272925959641/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2009/07/100-api-accesses-per-minute-not-good.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/6373468272925959641'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/6373468272925959641'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2009/07/100-api-accesses-per-minute-not-good.html' title='100 API accesses per minute: not a good idea'/><author><name>Michael Kuhn</name><uri>https://profiles.google.com/116824032690427957728</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh5.googleusercontent.com/-f9TdFiMLaUM/AAAAAAAAAAI/AAAAAAAAAKA/5gB2YoCPGl8/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-7922280525660563379</id><published>2009-06-24T09:38:00.013+02:00</published><updated>2009-06-24T12:24:30.398+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='announcements'/><category scheme='http://www.blogger.com/atom/ns#' term='string'/><title type='text'>New Release of STRING: Version 8.1</title><content type='html'>&lt;span style="font-family:arial;"&gt;We are happy to announce that STRING 8.1 has just been released. We have updated the interaction data, fixed a number of bugs and greatly improved the web interface.  As always, we keep older versions around to guarantee reproducibility of earlier work (see &lt;a href="http://string.embl.de/server_versions.html"&gt;here&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;The interactive network viewer has been re-implemented (it's now based on Adobe Flash version 10), and it gives users the new opportunity to "play around" with the network while keeping the same look and feel as previously.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:arial;"&gt; The proteins in the network can now also be clustered, 'live', via two different methods: &lt;a href="http://bonsai.ims.u-tokyo.ac.jp/%7Emdehoon/software/cluster/software.htm"&gt;k-means&lt;/a&gt; &lt;/span&gt;&lt;span style="font-family:arial;"&gt;and &lt;a href="http://www.micans.org/mcl/"&gt;markov chain clustering&lt;/a&gt;&lt;/span&gt;&lt;span style="font-family:arial;"&gt;.  The topology can be relaxed after clustering, or in real time by turning on "relaxation" and "cooling".  The graph layouting is done by force-directed placement (see &lt;a href="http://www.cs.ubc.ca/local/reading/proceedings/spe91-95/spe/vol21/issue11/spe060tf.pdf"&gt;here&lt;/a&gt;),&lt;/span&gt;&lt;span style="font-family:arial;"&gt; and the cooling is done by gradually lowering the relaxation variable hat determines the strength of node movements, to zero.&lt;/span&gt; &lt;span style="font-family:arial;"&gt;Of course, this is merely a first version of the interactive viewer and future versions may well have additional features (in particular in case our users have specific requests).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:arial;"&gt;We have also extended the protein structure previews. These are now not only based on PDB, but we also incorporate homology models from the &lt;a href="http://swissmodel.expasy.org/repository/"&gt;SWISS-MODEL Repository&lt;/a&gt;. &lt;/span&gt;&lt;span style="font-family:arial;"&gt;And, structure previews are finally shown in the proper context of the protein's domain architecture.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:arial;"&gt;Lastly, we fixed several '&lt;a href="http://string-stitch.blogspot.com/2009/02/known-issues-in-string-version-80.html"&gt;known issues&lt;/a&gt;' (i.e., bugs)&lt;/span&gt;&lt;span style="font-family:arial;"&gt; - such as missing minor chromosomes for some species, and viewer problems with the textmining predictions.&lt;/span&gt;&lt;span style="font-family:arial;"&gt;&lt;br /&gt;&lt;br /&gt;To start playing with the new release, visit &lt;a href="http://string.embl.de/version_8_1"&gt;STRING 8.1&lt;/a&gt;  -  and p&lt;/span&gt;&lt;span style="font-family:arial;"&gt;lease don't hesitate to send us any feedback, criticism or suggestions.&lt;/span&gt;..&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:arial;"&gt; Have Fun !&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:arial;"&gt;&lt;br /&gt;The STRING team&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-7922280525660563379?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/7922280525660563379/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2009/06/new-release-of-string-81.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/7922280525660563379'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/7922280525660563379'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2009/06/new-release-of-string-81.html' title='New Release of STRING: Version 8.1'/><author><name>Christian von Mering</name><uri>http://www.blogger.com/profile/14734418426791670530</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-5182441739175821040</id><published>2009-02-19T18:01:00.011+01:00</published><updated>2009-03-03T11:17:21.853+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='announcements'/><category scheme='http://www.blogger.com/atom/ns#' term='string'/><title type='text'>Known Issues in STRING version 8.0</title><content type='html'>For each STRING version so far, only when we released it to the users did we find the last remaining bugs. Users often email us with their problems, and sometimes we are indeed to blame because there is an error. This is good (we think), because each bug found is a bug fixed - albeit only in the next release, usually.&lt;br /&gt;&lt;br /&gt;So far, this is what we have found in release 8.0:&lt;br /&gt;&lt;br /&gt;a) Some of our text-mining links do not show up in the corresponding evidence viewer. They are still correct, but the underlying text cannot be recovered and shown, for technical reasons. This happens because we developed a new feature that recognizes generic 'family' names for gene groups (like 'WNTs' for the various, homologous Wnt proteins). Within reasonable limits, such ambiguous names are now expanded to the individual protein members. However, we forgot to  update the code of the text-viewer to reflect this ... we will do so in the next version.&lt;br /&gt;&lt;br /&gt;b) Unfortunately, some of the prokaryotic genomes in this release are incomplete - in 43 cases we're missing a second (or third) minor chromosome. This was caused by a misunderstanding when parsing files from the RefSeq database: RefSeq provides an overview file that only lists one chromosome for each prokaryote, and we mistook that file for the full listing. Again, this will be fixed in the next release of STRING (on which we are already working). Obviously, we're now writing a new entry in our test suite that will prevent this type of error in the future - we will be  checking the final gene counts of all organisms for consistency and also compare these counts against an external reference. Below is a list of affected organisms in the current release; if you're working with any of these, we recommend you continue using version 7.1 of STRING for now.&lt;br /&gt;&lt;br /&gt;Luckily, no major model organisms are affected !!&lt;br /&gt;&lt;br /&gt;Agrobacterium tumefaciens str. C58&lt;br /&gt;Brucella abortus biovar 1 str. 9-941&lt;br /&gt;Brucella melitensis 16M&lt;br /&gt;Brucella melitensis biovar Abortus 2308&lt;br /&gt;Brucella ovis ATCC 25840&lt;br /&gt;Brucella suis 1330&lt;br /&gt;Burkholderia ambifaria AMMD&lt;br /&gt;Burkholderia cenocepacia AU 1054&lt;br /&gt;Burkholderia cenocepacia HI2424&lt;br /&gt;Burkholderia mallei ATCC 23344&lt;br /&gt;Burkholderia mallei NCTC 10229&lt;br /&gt;Burkholderia mallei NCTC 10247&lt;br /&gt;Burkholderia mallei SAVP1&lt;br /&gt;Burkholderia pseudomallei 1106a&lt;br /&gt;Burkholderia pseudomallei 1710b&lt;br /&gt;Burkholderia pseudomallei 668&lt;br /&gt;Burkholderia pseudomallei K96243&lt;br /&gt;Burkholderia sp. 383&lt;br /&gt;Burkholderia thailandensis E264&lt;br /&gt;Burkholderia vietnamiensis G4&lt;br /&gt;Burkholderia xenovorans LB400&lt;br /&gt;Deinococcus radiodurans R1&lt;br /&gt;Haloarcula marismortui ATCC 43049&lt;br /&gt;Leptospira borgpetersenii serovar Hardjo-bovis JB197&lt;br /&gt;Leptospira borgpetersenii serovar Hardjo-bovis L550&lt;br /&gt;Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130&lt;br /&gt;Leptospira interrogans serovar Lai str. 56601&lt;br /&gt;Ochrobactrum anthropi ATCC 49188&lt;br /&gt;Paracoccus denitrificans PD1222&lt;br /&gt;Photobacterium profundum SS9&lt;br /&gt;Pseudoalteromonas haloplanktis TAC125&lt;br /&gt;Ralstonia eutropha H16&lt;br /&gt;Ralstonia eutropha JMP134&lt;br /&gt;Ralstonia metallidurans CH34&lt;br /&gt;Rhodobacter sphaeroides 2.4.1&lt;br /&gt;Rhodobacter sphaeroides ATCC 17029&lt;br /&gt;Vibrio cholerae O1 biovar eltor str. N16961&lt;br /&gt;Vibrio cholerae O395&lt;br /&gt;Vibrio fischeri ES114&lt;br /&gt;Vibrio harveyi ATCC BAA-1116&lt;br /&gt;Vibrio parahaemolyticus RIMD 2210633&lt;br /&gt;Vibrio vulnificus CMCP6&lt;br /&gt;Vibrio vulnificus YJ016&lt;br /&gt;&lt;br /&gt;That's it for known issues so far. But, do keep those emails coming - the feedback is very valuable !!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-5182441739175821040?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/5182441739175821040/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2009/02/known-issues-in-string-version-80.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/5182441739175821040'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/5182441739175821040'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2009/02/known-issues-in-string-version-80.html' title='Known Issues in STRING version 8.0'/><author><name>Christian von Mering</name><uri>http://www.blogger.com/profile/14734418426791670530</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-4115346898763955084</id><published>2009-01-29T11:30:00.001+01:00</published><updated>2009-01-29T13:17:42.243+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='documentation'/><category scheme='http://www.blogger.com/atom/ns#' term='string'/><title type='text'>Homology correction of co-occurrence and text-mining scores (updated)</title><content type='html'>&lt;span&gt;&lt;span&gt;[&lt;/span&gt;&lt;span style="font-weight: bold;"&gt;Update 29/01/2009: &lt;/span&gt;The homology correction is also applied to the text-mining channel starting with STRING&amp;nbsp;8.&lt;/span&gt;&lt;span&gt;]&lt;/span&gt;&lt;span style="font-style: italic;"&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;In order to avoid that gene duplications lead spurious functional associations, homologous proteins are down-weighed in the co-occurrence and text-mining channels. You will notice this on the score summary page of a link and if you have our SQL dumps.&lt;br /&gt;&lt;br /&gt;Here's an example: The co-occurrence view looks fine for this pair of proteins.&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://bp3.blogger.com/_xxDT5IJAZWo/SG3whfQzjiI/AAAAAAAAABk/XICzk5Oj6lU/s1600-h/Picture+12.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://bp3.blogger.com/_xxDT5IJAZWo/SG3whfQzjiI/AAAAAAAAABk/XICzk5Oj6lU/s320/Picture+12.png" alt="" id="BLOGGER_PHOTO_ID_5219092001310608930" border="0" /&gt;&lt;/a&gt;However, the total score of 0.204 is less than the co-occurrence score:&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://bp1.blogger.com/_xxDT5IJAZWo/SG3whQGhm5I/AAAAAAAAABc/kXHW9sK8Lic/s1600-h/Picture+11.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://bp1.blogger.com/_xxDT5IJAZWo/SG3whQGhm5I/AAAAAAAAABc/kXHW9sK8Lic/s320/Picture+11.png" alt="" id="BLOGGER_PHOTO_ID_5219091997240957842" border="0" /&gt;&lt;/a&gt;The reason for this is that the proteins  have some sequence similarity and are therefore down-weighted according to this formula:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-style: italic;"&gt; effective co-occurrence score = co-occurrence score * (1 - homology score)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;(The homology score is calculated from the bit score of the alignment.) In this case:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-style: italic;"&gt;0.204 = 0.478 * ( 1 - 0.572 )&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span style="font-style: italic;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-4115346898763955084?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/4115346898763955084/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2008/07/homology-and-co-occurrence.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/4115346898763955084'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/4115346898763955084'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2008/07/homology-and-co-occurrence.html' title='Homology correction of co-occurrence and text-mining scores (updated)'/><author><name>Michael Kuhn</name><uri>https://profiles.google.com/116824032690427957728</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh5.googleusercontent.com/-f9TdFiMLaUM/AAAAAAAAAAI/AAAAAAAAAKA/5gB2YoCPGl8/s512-c/photo.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://bp3.blogger.com/_xxDT5IJAZWo/SG3whfQzjiI/AAAAAAAAABk/XICzk5Oj6lU/s72-c/Picture+12.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-4819887896403014330</id><published>2009-01-15T17:30:00.000+01:00</published><updated>2009-01-15T17:34:58.273+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='stitch'/><category scheme='http://www.blogger.com/atom/ns#' term='string'/><title type='text'>Linking to individual networks (updated)</title><content type='html'>If you want to link to STRING or STITCH from your website, you can use the following URLs for simple queries:&lt;br /&gt;&lt;br /&gt;For STITCH, you can use names of chemicals:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://stitch.embl.de/interactions/aspirin?species=9606"&gt;http://stitch.embl.de/interactions/aspirin?species=9606&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;You can also use identifiers, e.g. SwissProt or ATC codes:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://string.embl.de/interactions/DRD1_HUMAN"&gt;http://string.embl.de/interactions/DRD1_HUMAN&lt;/a&gt;&lt;br /&gt;&lt;a href="http://stitch.embl.de/interactions/A01AD05?species=9606"&gt;http://stitch.embl.de/interactions/A01AD05?species=9606&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;(The 9606 specifies that you want human interactions, see &lt;a href="http://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?id=9606"&gt;NCBI taxonomy&lt;/a&gt;.)&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Update&lt;/span&gt;: You can also link to networks with multiple items. As STRING saves the user's preference for proteins/COG mode, it's better to specify the target mode.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://string.embl.de/interactionsList/zgc:73075%0Dzgc:136854?targetmode=proteins"&gt;http://string.embl.de/interactionsList/zgc:73075%0Dzgc:136854?targetmode=proteins&lt;/a&gt;&lt;br /&gt;&lt;a href="http://string.embl.de/interactionsList/zgc:73075%0Dzgc:136854?targetmode=cogs"&gt;http://string.embl.de/interactionsList/zgc:73075%0Dzgc:136854?targetmode=cogs&lt;/a&gt;&lt;br /&gt;&lt;a href="http://string.embl.de/interactionsList/KOG0044%0DKOG3656?targetmode=cogs"&gt;http://string.embl.de/interactionsList/KOG0044%0DKOG3656?targetmode=cogs&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://stitch.embl.de/interactionsList/DRD1_HUMAN%0Dpergolide?species=9606"&gt;http://stitch.embl.de/interactionsList/DRD1_HUMAN%0Dpergolide?species=9606&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;You construct the URL by concatenating the protein names with "%0D" or "%0A" (an encoded carriage return / newline character).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-4819887896403014330?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/4819887896403014330/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2008/05/linking-to-individual-networks.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/4819887896403014330'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/4819887896403014330'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2008/05/linking-to-individual-networks.html' title='Linking to individual networks (updated)'/><author><name>Michael Kuhn</name><uri>https://profiles.google.com/116824032690427957728</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh5.googleusercontent.com/-f9TdFiMLaUM/AAAAAAAAAAI/AAAAAAAAAKA/5gB2YoCPGl8/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-5681649837598847347</id><published>2009-01-14T09:44:00.009+01:00</published><updated>2009-03-03T11:17:21.853+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='announcements'/><category scheme='http://www.blogger.com/atom/ns#' term='string'/><title type='text'>New Year - New Major Release</title><content type='html'>Looks like 2009 will bring a lot of changes for both STRING and STITCH - and to lead the way, STRING has now been upgraded to version 8.0 !&lt;br /&gt;&lt;br /&gt;This has been a major upgrade, and it has been some time in the making. We have almost doubled the number of organisms (again), and re-imported all the various pathways, protein-complexes and text-collections. We've also worked a lot behind the scenes, solidifying the API, further automating our data import and updating the way we display orthologous groups, to name just a few examples. All this has been possible only, really, because of our new sponsor - the Swiss Institute of Bioinformatics (&lt;a href="http://www.isb-sib.ch/"&gt;SIB&lt;/a&gt;). Thanks guys !&lt;br /&gt;&lt;br /&gt;More info about this new release is also available from &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/18940858"&gt;here&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-5681649837598847347?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/5681649837598847347/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2009/01/new-year-new-major-release.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/5681649837598847347'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/5681649837598847347'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2009/01/new-year-new-major-release.html' title='New Year - New Major Release'/><author><name>Christian von Mering</name><uri>http://www.blogger.com/profile/14734418426791670530</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-6229022052268809899</id><published>2008-07-21T20:03:00.003+02:00</published><updated>2008-07-21T20:10:13.594+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='stitch'/><category scheme='http://www.blogger.com/atom/ns#' term='announcements'/><category scheme='http://www.blogger.com/atom/ns#' term='string'/><title type='text'>High-resolution images</title><content type='html'>We recently implemented a way to export high-res images (300 dpi, click the image below to see an example). This feature will go public with STRING 8 / STITCH 2, but if you're now using STRING or STITCH and want to prepare an image for publication, please get in touch with us (mkuhn embl de) and we can send you the image.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://bp2.blogger.com/_xxDT5IJAZWo/SITQBiACT7I/AAAAAAAAABs/uA0mzKmvjI4/s1600-h/Picture+10.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://bp2.blogger.com/_xxDT5IJAZWo/SITQBiACT7I/AAAAAAAAABs/uA0mzKmvjI4/s400/Picture+10.png" alt="" id="BLOGGER_PHOTO_ID_5225530192382021554" border="0" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-6229022052268809899?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/6229022052268809899/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2008/07/high-resolution-images.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/6229022052268809899'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/6229022052268809899'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2008/07/high-resolution-images.html' title='High-resolution images'/><author><name>Michael Kuhn</name><uri>https://profiles.google.com/116824032690427957728</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh5.googleusercontent.com/-f9TdFiMLaUM/AAAAAAAAAAI/AAAAAAAAAKA/5gB2YoCPGl8/s512-c/photo.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://bp2.blogger.com/_xxDT5IJAZWo/SITQBiACT7I/AAAAAAAAABs/uA0mzKmvjI4/s72-c/Picture+10.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-4718276036721055662</id><published>2008-06-26T17:23:00.007+02:00</published><updated>2008-06-27T09:02:11.678+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='documentation'/><category scheme='http://www.blogger.com/atom/ns#' term='string'/><title type='text'></title><content type='html'>&lt;span style="font-size:130%;"&gt;&lt;span style="font-weight: bold;font-family:arial;" &gt;How we compute scores (Part 1: experiment channel)&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:arial;"&gt; This is in response to a a question that we get quite frequently.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:arial;"&gt; Sorry, it's a bit long - but this way it should contain sufficient&lt;/span&gt;&lt;span style="font-family:arial;"&gt; detail to roughly understand how our scores come about (for&lt;/span&gt;&lt;span style="font-family:arial;"&gt; the 'experiments channel' at least, and limited to protein mode). Have fun reading !&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:arial;"&gt; Christian von Mering (and Lars Jensen).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family:arial;"&gt; &lt;span style="font-style: italic;"&gt;Procedure to compute experimental scores&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-family:arial;"&gt; first, we import information about which proteins have been shown to&lt;/span&gt;&lt;span style="font-family:arial;"&gt; interact experimentally, from the following databases: INTACT, MINT,&lt;/span&gt;&lt;span style="font-family:arial;"&gt; GRID, BIND, and DIP. To a small extent, this also includes&lt;/span&gt;&lt;span style="font-family:arial;"&gt; experimental data that is not necessarily indicative of a direct&lt;/span&gt;&lt;span style="font-family:arial;"&gt; physical interaction, such as genetic interaction data. Most, however,&lt;/span&gt;&lt;span style="font-family:arial;"&gt; are from more-or-less direct, physical detection methods.&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt; &lt;ul&gt;&lt;li&gt;&lt;span style="font-family:arial;"&gt; then, we map the proteins mentioned in these database onto the&lt;/span&gt;&lt;span style="font-family:arial;"&gt; proteins in the STRING database - using identifiers, or (if needed)&lt;/span&gt;&lt;span style="font-family:arial;"&gt; sequences.&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt; &lt;ul&gt;&lt;li&gt;&lt;span style="font-family:arial;"&gt; next, we group all interactions by their supporting publication&lt;/span&gt;&lt;span style="font-family:arial;"&gt; (PMID), and make them non-redundant (they might be reported under the &lt;/span&gt;&lt;span style="font-family:arial;"&gt;same PMID from several databases). We also expand pulldowns of entire &lt;/span&gt;&lt;span style="font-family:arial;"&gt;protein complexes using the 'spoke' model (i.e. assuming binary &lt;/span&gt;&lt;span style="font-family:arial;"&gt;interactions from the tagged/immunoprecipitated protein to all of its&lt;/span&gt;&lt;span style="font-family:arial;"&gt; co-purified partners).&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt; &lt;ul&gt;&lt;li&gt;&lt;span style="font-family:arial;"&gt; then, we subdivide all interactions into 'small-scale, medium-scale,&lt;/span&gt;&lt;span style="font-family:arial;"&gt; and high-throughput', based on the number of interactions reported by &lt;/span&gt;&lt;span style="font-family:arial;"&gt;a single publication. These three classes are delineated by the extent &lt;/span&gt;&lt;span style="font-family:arial;"&gt;of overlap&lt;/span&gt;&lt;span style="font-family:arial;"&gt; with benchmark information, see below.&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt; &lt;ul&gt;&lt;li&gt;&lt;span style="font-family:arial;"&gt;next, for each of these classes, we determine their 'reliability', &lt;/span&gt;&lt;span style="font-family:arial;"&gt;by comparing them to our KEGG-benchmark. Briefly, an interaction &lt;/span&gt;&lt;span style="font-family:arial;"&gt;between two proteins is counted as 'correct', when they are both &lt;/span&gt;&lt;span style="font-family:arial;"&gt;annotated together in at least one 'KEGG-map', i.e. in at least one&lt;/span&gt; &lt;span style="font-family:arial;"&gt;functional process / pathway. It is counted as 'incorrect', when the &lt;/span&gt;&lt;span style="font-family:arial;"&gt;two proteins are annotated in KEGG, but never in the same pathway. &lt;/span&gt;&lt;span style="font-family:arial;"&gt;Note that proteins that are not annotated at all in KEGG are not &lt;/span&gt;&lt;span style="font-family:arial;"&gt;considered here).&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt; &lt;ul&gt;&lt;li&gt;&lt;span style="font-family:arial;"&gt; for the small-scale experiments, which are only very few (per paper), &lt;/span&gt;&lt;span style="font-family:arial;"&gt;we cannot benchmark each paper separately. Therefor, all such papers &lt;/span&gt;&lt;span style="font-family:arial;"&gt;are lumped, benchmarked together, and we usually find them to be of &lt;/span&gt;&lt;span style="font-family:arial;"&gt;quite high quality. As a result, we fix their score to some high &lt;/span&gt;&lt;span style="font-family:arial;"&gt;number, for example 0.900 in the case of STRING version 7.1&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt; &lt;ul&gt;&lt;li&gt;&lt;span style="font-family:arial;"&gt; for the medium-scale experiments, a separate score is computed for &lt;/span&gt;&lt;span style="font-family:arial;"&gt;each publication, in a similar manner (some publications are found to &lt;/span&gt;&lt;span style="font-family:arial;"&gt;report data of better reliability, other of lower reliability).&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt; &lt;ul&gt;&lt;li&gt;&lt;span style="font-family:arial;"&gt; for the high-throughput experiments (there are less than 20 of these &lt;/span&gt;&lt;span style="font-family:arial;"&gt;currently), we have enough information to be even a bit more specific: &lt;/span&gt;&lt;span style="font-family:arial;"&gt;for each interaction in these sets, we can compute a 'raw score' from &lt;/span&gt;&lt;span style="font-family:arial;"&gt;the data, because there are so many measurements done. Usually, this &lt;/span&gt;&lt;span style="font-family:arial;"&gt;would be a score that describes how often a measurement has been&lt;/span&gt; &lt;span style="font-family:arial;"&gt;confirmed, or how specific a particular interaction is, given the &lt;/span&gt;&lt;span style="font-family:arial;"&gt;occurence of the two protein elsewhere throughout the dataset. These &lt;/span&gt;&lt;span style="font-family:arial;"&gt;'raw scores' are then binned, and each bin benchmarked separately, to &lt;/span&gt;&lt;span style="font-family:arial;"&gt;arrive at a 'calibration curve', again using the KEGG pathways as a &lt;/span&gt;&lt;span style="font-family:arial;"&gt;benchmark as described above. Thus, for these large sets, some&lt;/span&gt; &lt;span style="font-family:arial;"&gt;interactions get a higher score, and others a lower score, depending &lt;/span&gt;&lt;span style="font-family:arial;"&gt;on the information in the entire dataset.&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt; &lt;ul&gt;&lt;li&gt;&lt;span style="font-family:arial;"&gt; this brings us to the cutoffs that determine whether something is &lt;/span&gt;&lt;span style="font-family:arial;"&gt;small-scale, medium or high-throughput. This is defined on how &lt;/span&gt;&lt;span style="font-family:arial;"&gt;many 'true-positives' are in the dataset: To be a large-scale dataset, &lt;/span&gt;&lt;span style="font-family:arial;"&gt;we require at least 50 true positive interactions to enable the&lt;/span&gt; &lt;span style="font-family:arial;"&gt;benchmarking. Otherwise, more than 20 true positive interactions &lt;/span&gt;&lt;span style="font-family:arial;"&gt;will make it a medium-scale dataset, and the rest is small-scale. &lt;/span&gt;&lt;span style="font-family:arial;"&gt;("true positives" are defined as interactions where both proteins &lt;/span&gt;&lt;span style="font-family:arial;"&gt;are in KEGG, and are sharing at least one KEGG map).&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt; &lt;ul&gt;&lt;li&gt;&lt;span style="font-family:arial;"&gt; then, we have to deal with interactions supported by more than one &lt;/span&gt;&lt;span style="font-family:arial;"&gt;independent dataset (i.e. by more than one publication). For those, &lt;/span&gt;&lt;span style="font-family:arial;"&gt;the scores are 'added up'. Of course, they are not literally added up, &lt;/span&gt;&lt;span style="font-family:arial;"&gt;but rather in a probabilistic integration, like so:&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;span style="font-family:arial;"&gt;new_score = 1 -&lt;/span&gt;&lt;span style="font-family:arial;"&gt; (1 - score_a) * (1 - score_b).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-family:arial;"&gt; and finally, whe have to deal with interactions that are reported in &lt;/span&gt;&lt;span style="font-family:arial;"&gt;multiple organisms, or in an organism other than the one of interest. &lt;/span&gt;&lt;span style="font-family:arial;"&gt;This is called 'interaction transfer', and is a very important step to &lt;/span&gt;&lt;span style="font-family:arial;"&gt;increase coverage. It is described in the 2005 STRING paper.&lt;/span&gt; &lt;span style="font-family:arial;"&gt;Essentially, the better the orthology situation can be delineated &lt;/span&gt;&lt;span style="font-family:arial;"&gt;(i.e. clear orthologs for both interacting partners can be &lt;/span&gt;&lt;span style="font-family:arial;"&gt;identified), the bigger the score fraction that is transferred.&lt;/span&gt;&lt;span style="font-family:arial;"&gt; Transferred interactions are integrated probabilistically as mentioned &lt;/span&gt;&lt;span style="font-family:arial;"&gt;above, and interactions that are reported in two very similary &lt;/span&gt;&lt;span style="font-family:arial;"&gt;organisms (say, mouse and rat), are considered redundant and &lt;/span&gt;&lt;span style="font-family:arial;"&gt;transferred only once. Note that transferred scores are stored&lt;/span&gt; &lt;span style="font-family:arial;"&gt;separately from the 'direct' scores in the database, so that all the &lt;/span&gt;&lt;span style="font-family:arial;"&gt;transferred information can be discarded, if desired.&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-4718276036721055662?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/4718276036721055662/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2008/06/how-we-compute-scores-part-1-experiment.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/4718276036721055662'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/4718276036721055662'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2008/06/how-we-compute-scores-part-1-experiment.html' title=''/><author><name>Christian von Mering</name><uri>http://www.blogger.com/profile/14734418426791670530</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-8500281601654833157</id><published>2008-06-17T11:41:00.004+02:00</published><updated>2008-06-18T12:55:08.000+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='stitch'/><category scheme='http://www.blogger.com/atom/ns#' term='outages'/><category scheme='http://www.blogger.com/atom/ns#' term='string'/><title type='text'>Downtime Wednesday morning</title><content type='html'>The STRING server will get a new disk tomorrow morning (European time), so there will be a downtime for STRING/STITCH. We hope everything will be working again in the early afternoon.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Update:&lt;/span&gt; We're back online, with enough room for the next version of STRING.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-8500281601654833157?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/8500281601654833157/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2008/06/downtime-wednesday-morning.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/8500281601654833157'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/8500281601654833157'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2008/06/downtime-wednesday-morning.html' title='Downtime Wednesday morning'/><author><name>Michael Kuhn</name><uri>https://profiles.google.com/116824032690427957728</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh5.googleusercontent.com/-f9TdFiMLaUM/AAAAAAAAAAI/AAAAAAAAAKA/5gB2YoCPGl8/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-9089335928436923548</id><published>2008-05-19T11:19:00.002+02:00</published><updated>2008-05-19T11:23:04.040+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='api'/><category scheme='http://www.blogger.com/atom/ns#' term='string'/><title type='text'>API also available on STRING</title><content type='html'>When I created the &lt;a href="/2008/02/we-have-api.html"&gt;API&lt;/a&gt;, I only put it on STITCH. Of course there's no reason to not have it also on STRING, so here you go:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://string.embl.de/api/tsv/interactors?identifier=DRD1_HUMAN"&gt;http://string.embl.de/api/tsv/interactors?identifier=DRD1_HUMAN&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;It's in the same state as the STITCH API: Still subject to change, and potentially unstable.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-9089335928436923548?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/9089335928436923548/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2008/05/api-also-available-on-string.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/9089335928436923548'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/9089335928436923548'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2008/05/api-also-available-on-string.html' title='API also available on STRING'/><author><name>Michael Kuhn</name><uri>https://profiles.google.com/116824032690427957728</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh5.googleusercontent.com/-f9TdFiMLaUM/AAAAAAAAAAI/AAAAAAAAAKA/5gB2YoCPGl8/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-1634146803449492620</id><published>2008-04-28T11:54:00.003+02:00</published><updated>2008-04-28T11:59:12.927+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='api'/><title type='text'>Getting identifiers for a list of genes</title><content type='html'>If you want to to quickly get identifiers for a long list of items you can use the following command, which uses wget to repeatedly query the API.&lt;br /&gt;&lt;pre&gt;&lt;code&gt;cat protein_names.txt | xargs -i wget -nv -O - \&lt;br /&gt;'http://stitch.embl.de/api/tsv-no-header/resolve?identifier={}&amp;amp;species=4932&amp;amp;echo_query=1' \&lt;br /&gt;&gt; protein_identifiers.tsv&lt;/code&gt;&lt;/pre&gt;I've also introduced another parameter, echo_query, so that you can see your query item in the output.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-1634146803449492620?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/1634146803449492620/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2008/04/getting-identifiers-for-list-of-genes.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/1634146803449492620'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/1634146803449492620'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2008/04/getting-identifiers-for-list-of-genes.html' title='Getting identifiers for a list of genes'/><author><name>Michael Kuhn</name><uri>https://profiles.google.com/116824032690427957728</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh5.googleusercontent.com/-f9TdFiMLaUM/AAAAAAAAAAI/AAAAAAAAAKA/5gB2YoCPGl8/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-1033763809943460105</id><published>2008-04-23T14:22:00.003+02:00</published><updated>2008-04-24T17:22:51.756+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='outages'/><title type='text'>No Downtime on Saturday, April 26</title><content type='html'>&lt;strike&gt;There'll be an EMBL-wide power cut on Saturday, April 26. Therefore, our servers won't be reachable at this time. We hope that the computer infrastructure will be re-activated by Monday.&lt;/strike&gt;&lt;br /&gt;&lt;br /&gt;Sorry, the plans were changed and not all of EMBL is affected, so we should stay online.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-1033763809943460105?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/1033763809943460105/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2008/04/downtime-on-saturday-april-26.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/1033763809943460105'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/1033763809943460105'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2008/04/downtime-on-saturday-april-26.html' title='No Downtime on Saturday, April 26'/><author><name>Michael Kuhn</name><uri>https://profiles.google.com/116824032690427957728</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh5.googleusercontent.com/-f9TdFiMLaUM/AAAAAAAAAAI/AAAAAAAAAKA/5gB2YoCPGl8/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-5008895457292657406</id><published>2008-03-04T16:49:00.006+01:00</published><updated>2008-03-04T17:35:04.858+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='stitch'/><category scheme='http://www.blogger.com/atom/ns#' term='api'/><category scheme='http://www.blogger.com/atom/ns#' term='string'/><title type='text'>Embedding protein/chemical information into other web pages</title><content type='html'>One of the nice things about STRING/STITCH is that you can click on any item and be presented with a helpful pop-up describing the item in question. :-)&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_xxDT5IJAZWo/R81xnx1tO6I/AAAAAAAAAAM/NCuS1RBc6-g/s1600-h/Picture+1.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://4.bp.blogspot.com/_xxDT5IJAZWo/R81xnx1tO6I/AAAAAAAAAAM/NCuS1RBc6-g/s320/Picture+1.png" alt="" id="BLOGGER_PHOTO_ID_5173916475126004642" border="0" /&gt;&lt;/a&gt;As you can see, for a protein, we show links to different servers, the domain structure and if possible a representative PDB structure. For a chemical we currently  only show the structure and link to &lt;a href="http://pubchem.ncbi.nlm.nih.gov/"&gt;PubChem&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;In order to use the same pop-up in &lt;a href="http://reflect.embl.de/"&gt;Reflect&lt;/a&gt; (a cool tool that recognizes and annotates proteins and chemical any web page), we made a service out of it. The functionality is going to change in the future, but here are two working examples:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://stitch.embl.de/services/iteminfo?node=9606.ENSP00000186982"&gt;http://stitch.embl.de/services/iteminfo?node=9606.ENSP00000186982&lt;/a&gt;&lt;br /&gt;&lt;a href="http://stitch.embl.de/services/iteminfo?node=CID2244"&gt;http://stitch.embl.de/services/iteminfo?node=CID2244&lt;/a&gt;&lt;span style="display: block;" id="formatbar_Buttons"&gt;&lt;span class="on" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;The idea is that you can take this raw HTML and put it into an iframe with JavaScript. (This is left as an exercise to the reader... future versions of STRING may use this technique to make the pages a bit smaller.) For now, it only accepts internal STRING identifiers, which you either get from our &lt;a href="http://string.embl.de/newstring_cgi/show_download_page.pl"&gt;download files&lt;/a&gt; or from the &lt;a href="http://www.blogger.com/2008/02/we-have-api.html"&gt;API&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-5008895457292657406?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/5008895457292657406/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2008/03/embedding-proteinchemical-information.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/5008895457292657406'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/5008895457292657406'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2008/03/embedding-proteinchemical-information.html' title='Embedding protein/chemical information into other web pages'/><author><name>Michael Kuhn</name><uri>https://profiles.google.com/116824032690427957728</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh5.googleusercontent.com/-f9TdFiMLaUM/AAAAAAAAAAI/AAAAAAAAAKA/5gB2YoCPGl8/s512-c/photo.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_xxDT5IJAZWo/R81xnx1tO6I/AAAAAAAAAAM/NCuS1RBc6-g/s72-c/Picture+1.png' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-4313215554604421988</id><published>2008-03-03T16:40:00.003+01:00</published><updated>2008-03-03T17:47:27.528+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='stitch'/><category scheme='http://www.blogger.com/atom/ns#' term='outages'/><category scheme='http://www.blogger.com/atom/ns#' term='string'/><title type='text'>Servers may be unreachable today (March 3)</title><content type='html'>You might have trouble today accessing our servers today because of construction in the server rooms. We apologize and hope that everything will be done soon!&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Update: We're back online.&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-4313215554604421988?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/4313215554604421988/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2008/03/servers-may-be-unreachable-today-march.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/4313215554604421988'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/4313215554604421988'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2008/03/servers-may-be-unreachable-today-march.html' title='Servers may be unreachable today (March 3)'/><author><name>Michael Kuhn</name><uri>https://profiles.google.com/116824032690427957728</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh5.googleusercontent.com/-f9TdFiMLaUM/AAAAAAAAAAI/AAAAAAAAAKA/5gB2YoCPGl8/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-8812946804345017896</id><published>2008-03-03T15:05:00.003+01:00</published><updated>2008-03-03T15:19:32.762+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='stitch'/><category scheme='http://www.blogger.com/atom/ns#' term='api'/><category scheme='http://www.blogger.com/atom/ns#' term='announcements'/><category scheme='http://www.blogger.com/atom/ns#' term='string'/><title type='text'>Scope of the API and current plans</title><content type='html'>When I &lt;a href="/2008/02/we-have-api.html"&gt;announced the API&lt;/a&gt;, I didn't devote much space to the intended scope of the API. To make things clearer:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;REST/SOAP: We'll only provide a REST API plus a Soaplab2 wrapper for Taverna. Perhaps later dedicated programmers can add a SOAP interface if the demand is sufficiently high.&lt;/li&gt;&lt;li&gt;Queries for bulk data: For implementation and licensing reasons, we'll only provide methods to query by individual items, just like on the web site. If you need access to bulk data, you can &lt;a href="http://string.embl.de/newstring_cgi/show_download_page.pl"&gt;download&lt;/a&gt; it.&lt;/li&gt;&lt;li&gt;Miscellaneous records:  We want to add more query options  later for retrieving information from the freely available files. For example: What are the synonyms of this item? To which orthologous group does this protein belong?&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-8812946804345017896?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/8812946804345017896/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2008/03/scope-of-api-and-current-plans.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/8812946804345017896'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/8812946804345017896'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2008/03/scope-of-api-and-current-plans.html' title='Scope of the API and current plans'/><author><name>Michael Kuhn</name><uri>https://profiles.google.com/116824032690427957728</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh5.googleusercontent.com/-f9TdFiMLaUM/AAAAAAAAAAI/AAAAAAAAAKA/5gB2YoCPGl8/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-1598883112383764868</id><published>2008-02-22T14:26:00.003+01:00</published><updated>2008-02-28T18:20:48.063+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='stitch'/><category scheme='http://www.blogger.com/atom/ns#' term='api'/><category scheme='http://www.blogger.com/atom/ns#' term='string'/><title type='text'>Example Taverna workflow</title><content type='html'>&lt;a href="http://taverna.sourceforge.net/"&gt;Taverna&lt;/a&gt; is a tool that lets you connect web services from different sources with each other. I've implemented a &lt;a href="http://stitch.embl.de/blog_dl/stitch_network_diseases.xml"&gt;simple example workflow (.xml)&lt;/a&gt;: You enter a human protein or chemical of interest, STITCH will identify the matching item(s), generate an interaction network and retrieve the 10 most relevant abstracts. The list of Pubmed ids is then passed on to &lt;a href="http://www.ebi.ac.uk/webservices/whatizit/"&gt;Whatizit&lt;/a&gt;, which highlights disease terms in the abstracts. A simple Python script then counts the diseases.&lt;br /&gt;&lt;br /&gt;Try it out; try other things; tell me what you think.&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://stitch.embl.de/blog_dl/stitch_network_diseases.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 320px;" src="http://stitch.embl.de/blog_dl/stitch_network_diseases.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;span style="font-size:85%;"&gt;Note: The &lt;a href="http://stitch.embl.de/blog_dl/whatizit_count.py"&gt;Python script&lt;/a&gt; is called via the &lt;a href="http://stitch.embl.de/soaplab2/"&gt;Soaplab interface&lt;/a&gt;. I tried to do this in Taverna but gave up, it didn't like my XPath (due to the XML produced by Whatizit) and writing a Beanshell script seemed more cumbersome.&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-1598883112383764868?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/1598883112383764868/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2008/02/example-taverna-workflow.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/1598883112383764868'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/1598883112383764868'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2008/02/example-taverna-workflow.html' title='Example Taverna workflow'/><author><name>Michael Kuhn</name><uri>https://profiles.google.com/116824032690427957728</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh5.googleusercontent.com/-f9TdFiMLaUM/AAAAAAAAAAI/AAAAAAAAAKA/5gB2YoCPGl8/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-7843563551796578985</id><published>2008-02-19T17:00:00.016+01:00</published><updated>2009-04-24T11:51:23.930+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='stitch'/><category scheme='http://www.blogger.com/atom/ns#' term='api'/><category scheme='http://www.blogger.com/atom/ns#' term='announcements'/><category scheme='http://www.blogger.com/atom/ns#' term='string'/><title type='text'>We have an API!</title><content type='html'>&lt;span style="font-size:100%;"&gt;I went to the &lt;/span&gt;&lt;span style="font-size:100%;"&gt;&lt;a href="http://hackathon.dbcls.jp/"&gt;BioHackathon 2008&lt;/a&gt; in Tokyo and worked on an API for STRING and STITCH. If you think about using STRING or STITCH with an API, and miss features, please get in touch with us either via the comments or e-mail (e.g. mkuhn//embl.de).&lt;br /&gt;&lt;br /&gt;Here's what we have to offer so far:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;REST interface&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;The URL patterns are: http://stitch.embl.de/api/[format]&lt;format&gt;/[request]&lt;request&gt;?[parameters]&lt;parameters&gt;&lt;br /&gt;http://string.embl.de/api/[format]&lt;format&gt;/[request]&lt;request&gt;?[parameters]&lt;parameters&gt;&lt;br /&gt;&lt;br /&gt;Possible formats:&lt;br /&gt;&lt;/parameters&gt;&lt;/request&gt;&lt;/format&gt;&lt;/parameters&gt;&lt;/request&gt;&lt;/format&gt;&lt;/span&gt;&lt;ul&gt;&lt;li&gt;tsv: tab-separated values, with a header line&lt;br /&gt;&lt;/li&gt;&lt;li&gt;tsv-no-header: as above, but no header&lt;/li&gt;&lt;li&gt;json: JSON format either as a list of hashes/dictionaries, or as a plain list (if there is only one value to be returned per record)&lt;/li&gt;&lt;li&gt;psi-mi: the interaction network is available in &lt;a href="http://www.psidev.info/index.php?q=node/60"&gt;PSI-MI 2.5&lt;/a&gt; XML format&lt;/li&gt;&lt;li&gt;psi-mi-tab: there is also a tab-delimited form, modeled after the &lt;a href="ftp://ftp.ebi.ac.uk/pub/databases/intact/current/psimitab/README"&gt;IntAct specification&lt;/a&gt;. This is easier to parse, but contains less information than the XML format.&lt;/li&gt;&lt;li&gt;url: &lt;a href="http://string-stitch.blogspot.com/2008/07/api-to-generate-network-images.html"&gt;return the URL of the network image&lt;/a&gt;&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;span style="font-size:100%;"&gt;Possible requests:&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;ul&gt;&lt;li&gt;abstracts: return a list of abstracts that contain the query item&lt;br /&gt;&lt;/li&gt;&lt;li&gt;abstractsList: return a list of abstracts that contain any of the query items&lt;/li&gt;&lt;li&gt;interactions: return an interaction network in &lt;a href="http://www.psidev.info/index.php?q=node/60"&gt;PSI-MI 2.5&lt;/a&gt; format (PSI-MI is currently the only format for interactions. Perhaps the PSI-MI tab-delimited form would also make sense? I don't know how a JSON form should look like.)&lt;/li&gt;&lt;li&gt;interactionsList: same as above, but for list of identifiers&lt;br /&gt;&lt;/li&gt;&lt;li&gt;interactors: return a list of interaction partners for the query item&lt;/li&gt;&lt;li&gt;interactorsList: return a list of interaction partners for any of the query item&lt;/li&gt;&lt;li&gt;resolve: return the list of items that match (in name or identifier) the query item&lt;/li&gt;&lt;li&gt;network / networkList: in conjunction with the "url" format, &lt;a href="http://string-stitch.blogspot.com/2008/07/api-to-generate-network-images.html"&gt;return the URL to the network&lt;/a&gt;&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;span style="font-size:100%;"&gt;&lt;strike&gt;For a full list of possible parameters, please refer to our &lt;a href="http://stitch.embl.de/soaplab2/"&gt;STRING Soaplab 2 interface&lt;/a&gt;. With the help of &lt;a href="http://soaplab.sourceforge.net/soaplab2/"&gt;Soaplab2 / Gowlab&lt;/a&gt;, we'll describe the set of possible parameters there. &lt;/strike&gt; (Doesn't work right now. :-/ )&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;br /&gt;Examples&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;To find out which proteins match the description "dopamine receptor" in human, you can use this query:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://stitch.embl.de/api/tsv/resolve?identifier=dopamine%20receptor&amp;amp;species=9606"&gt;http://stitch.embl.de/api/tsv/resolve?identifier=dopamine%20receptor&amp;amp;species=9606&lt;/a&gt;&lt;br /&gt;&lt;a href="http://string.embl.de/api/tsv/resolve?identifier=dopamine%20receptor&amp;amp;species=9606"&gt;http://string.embl.de/api/tsv/resolve?identifier=dopamine%20receptor&amp;amp;species=9606&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;This gives you a lot of additional info. If you just want to get the list of STRING identifiers, you can alter the query a bit:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://stitch.embl.de/api/tsv-no-header/resolve?identifier=dopamine%20receptor&amp;amp;species=9606&amp;amp;format=only-ids"&gt;http://stitch.embl.de/api/tsv-no-header/resolve?identifier=dopamine%20receptor&amp;amp;species=9606&amp;amp;format=only-ids&lt;/a&gt;&lt;br /&gt;&lt;a href="http://string.embl.de/api/tsv-no-header/resolve?identifier=dopamine%20receptor&amp;amp;species=9606&amp;amp;format=only-ids"&gt;http://string.embl.de/api/tsv-no-header/resolve?identifier=dopamine%20receptor&amp;amp;species=9606&amp;amp;format=only-ids&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Now, you'll only receive a bare list of ids that you could pipe into other STRING API functions.&lt;br /&gt;&lt;br /&gt;To illustrate the difference between normal and "list" queries:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://stitch.embl.de/api/tsv/interactors?identifier=DRD1_HUMAN"&gt;http://stitch.embl.de/api/tsv/interactors?identifier=DRD1_HUMAN&lt;/a&gt;&lt;br /&gt;&lt;a href="http://stitch.embl.de/api/tsv/interactorsList?identifiers=DRD1_HUMAN%0DDRD2_HUMAN"&gt;http://stitch.embl.de/api/tsv/interactorsList?identifiers=DRD1_HUMAN%0DDRD2_HUMAN&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://string.embl.de/api/tsv/interactors?identifier=DRD1_HUMAN"&gt;http://string.embl.de/api/tsv/interactors?identifier=DRD1_HUMAN&lt;/a&gt;&lt;br /&gt;&lt;a href="http://string.embl.de/api/tsv/interactorsList?identifiers=DRD1_HUMAN%0DDRD2_HUMAN"&gt;http://string.embl.de/api/tsv/interactorsList?identifiers=DRD1_HUMAN%0DDRD2_HUMAN&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;span&gt;In the second case, the identifier&lt;span style="font-weight: bold;"&gt;s&lt;/span&gt; parameter contains a list of items separated by new line characters (%0A or %0D).&lt;/span&gt;&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;br /&gt;SOAP / Taverna&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;span&gt;&lt;a href="http://www.blogger.com/2008/02/example-taverna-workflow.html"&gt;In a separate post&lt;/a&gt;, I've described an example &lt;a href="http://taverna.sourceforge.net/"&gt;Taverna&lt;/a&gt; workflow. As for SOAP integration, I hope that the Soaplab interface works...&lt;/span&gt;&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;br /&gt;Obligatory beta notice&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;As all good things these days, this is still in beta (internally, everything in fact runs on our beta server, I'm just making it accessible via the normal STITCH domain to expose it to the web). Therefore, the API might change, be down, ... until STITCH 2 / STRING 8 comes out.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Updates&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;03.03.2008: Added clarification – PSI-MI is currently the only interactions format.&lt;br /&gt;04.03.2008: Fixed typo – it's "&lt;/span&gt;interactorsList"&lt;br /&gt;12.03.2008: Add psi-mi-tab format&lt;br /&gt;19.05.2008: Add STRING API (with same specification)&lt;br /&gt;08.07.2008: Add &lt;a href="http://string-stitch.blogspot.com/2008/07/api-to-generate-network-images.html"&gt;API for generating network images&lt;/a&gt;&lt;br /&gt;16.03.2009: Enabled interactionsList&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-7843563551796578985?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/7843563551796578985/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2008/02/we-have-api.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/7843563551796578985'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/7843563551796578985'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2008/02/we-have-api.html' title='We have an API!'/><author><name>Michael Kuhn</name><uri>https://profiles.google.com/116824032690427957728</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh5.googleusercontent.com/-f9TdFiMLaUM/AAAAAAAAAAI/AAAAAAAAAKA/5gB2YoCPGl8/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8549809657959855223.post-1788822435667274775</id><published>2008-02-06T14:11:00.000+01:00</published><updated>2008-02-28T18:20:41.565+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='stitch'/><category scheme='http://www.blogger.com/atom/ns#' term='announcements'/><title type='text'>Mea culpa: missing links from PDB</title><content type='html'>I intended to extract protein–chemical links from the PDB (and we wrote this in the &lt;a href="http://nar.oxfordjournals.org/cgi/content/full/36/suppl_1/D684"&gt;paper&lt;/a&gt;), but somehow I didn't quite finish the import scripts before we finalized STITCH 1.0. I am sorry about this, and apologize if you are missing interactions.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;We are currently preparing the next versions of STRING and STITCH (versions 8 and 2, respectively) and I will import the &lt;a href="http://remediation.wwpdb.org/"&gt;remediated PDB&lt;/a&gt; for the new version. I guess STITCH 2 should come out in Spring 2008.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;(Special thanks to Florian Raible for testing his pet molecule and discovering what was missing.)&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8549809657959855223-1788822435667274775?l=string-stitch.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://string-stitch.blogspot.com/feeds/1788822435667274775/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://string-stitch.blogspot.com/2008/02/mea-culpa-missing-links-from-pdb.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/1788822435667274775'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8549809657959855223/posts/default/1788822435667274775'/><link rel='alternate' type='text/html' href='http://string-stitch.blogspot.com/2008/02/mea-culpa-missing-links-from-pdb.html' title='Mea culpa: missing links from PDB'/><author><name>Michael Kuhn</name><uri>https://profiles.google.com/116824032690427957728</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh5.googleusercontent.com/-f9TdFiMLaUM/AAAAAAAAAAI/AAAAAAAAAKA/5gB2YoCPGl8/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry></feed>
