<?xml version="1.0"?><?xml-stylesheet type="text/xsl" href="/rss.xsl"?><rss version="2.0"><channel><title>UCSDBioLit Wiki &amp; Documentation Rss Feed</title><link>http://www.codeplex.com/UCSDBioLit/Wiki/View.aspx?title=Home</link><description>UCSDBioLit Wiki Rss Description</description><item><title>Updated Wiki: Home</title><link>http://ucsdbiolit.codeplex.com/wikipage?version=28</link><description>&lt;div class="wikidoc"&gt;&lt;b&gt;Project Description&lt;/b&gt;&lt;br /&gt;A Word add-in that enables the annotation of Word documents based on terms that appear in Ontologies&lt;br /&gt;&lt;br /&gt;Read the latest press release &lt;a href="http://www.microsoft.com/presspass/press/2009/mar09/03-11MSCreativeCommonsPR.mspx"&gt;here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Summary&lt;/b&gt;&lt;br /&gt;Microsoft Research Connections&amp;#39; goal with this project is to enable communities who maintain ontologies to more easily experiment and to enhance the experience of authors who use Microsoft Word for content creation, incorporating semantic knowledge into the content.  This add-in should simplify the development and validation of ontologies, by making ontologies more accessible to a wide audience of authors and by enabling semantic content to be integrated in the authoring experience, capturing the author’s intent and knowledge at the source, and facilitating downstream discoverability. &lt;br /&gt;&lt;br /&gt;The goal of the add-in is to assist scientists in writing a manuscript that is easily integrated with existing and pending electronic resources. 
The major aims of this project are to add semantic information as XML mark-up to the manuscript using ontologies and controlled vocabularies (from the National Center for Biomedical Ontology) and identifiers from major biological databases, and to integrate manuscript content with existing public data repositories.&lt;br /&gt;&lt;br /&gt;As part of the publishing workflow and archiving process, the terms added by the add-in, which carry the semantic information, can be extracted from Word files because they are stored as custom XML tags within the content.  The semantic knowledge can then be preserved as the document is converted to other formats, such as HTML or the XML format from the National Library of Medicine, which is commonly used for archiving.&lt;br /&gt;&lt;br /&gt;The full benefit of semantic-rich content will result from an end-to-end approach to the preservation of semantics and metadata through the publishing pipeline, starting with capturing knowledge from the subject experts, the authors, and enabling this knowledge to be preserved when published, as well as made available to search engines and presented to people consuming the content. &lt;br /&gt;&lt;br /&gt;This project resulted from an initial and ongoing collaboration between Microsoft External Research and Dr. Phil Bourne and Dr. Lynn Fink, at the University of California San Diego.  Additional collaboration with the staff from Science Commons aims to make the add-in relevant to a wider audience and also to preserve semantic data along the publishing pipeline.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Audience&lt;/b&gt;&lt;br /&gt;This project is focused on researchers and software developers in domains utilizing ontologies, as well as publishers, archivists, and early adopters in the scientific, technical, and scholarly publishing fields.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Specific features&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Inline Syntax Coloring of Informative Words&lt;/li&gt;
&lt;li&gt;Built-in Knowledge of &lt;a href="http://ucsdbiolit.codeplex.com/wikipage?title=Ontologies%20and%20Controlled%20Vocabularies&amp;referringTitle=Home"&gt;Ontologies and Controlled Vocabularies&lt;/a&gt; maintained and delivered by &lt;a href="http://bioportal.bioontology.org/"&gt;NCBO&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Built-in Knowledge of Biological Databases (&lt;a href="http://www.rcsb.org"&gt;Protein Data Bank&lt;/a&gt;, &lt;a href="http://www.uniprot.org"&gt;UniProtKB&lt;/a&gt;, &lt;a href="http://ncbi.nlm.nih.gov"&gt;NCBI GenBank/RefSeq&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;Automatic Detection of Identifiers&lt;/li&gt;
&lt;li&gt;Custom Semantic Markup&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;&lt;b&gt;Getting Started&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Trying out the add-in:
&lt;ul&gt;&lt;li&gt;You will need Microsoft Word 2007 or Microsoft Word 2010 (32-bit or 64-bit) running on Windows XP, Windows Vista or Windows 7.&lt;/li&gt;
&lt;li&gt;Open the test document from the Releases tab on this page and enable Term Recognition in the Ontology tab within Word.&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;/ul&gt;
&lt;i&gt;If you have installed a previous version of the add-in, you may need to follow &lt;a href="http://ucsdbiolit.codeplex.com/wikipage?title=uninstallhelp&amp;referringTitle=Home"&gt;these instructions&lt;/a&gt; to achieve a full uninstall.&lt;/i&gt;
&lt;ul&gt;&lt;li&gt;Examining the source code and contributing to the project:
&lt;ul&gt;&lt;li&gt;Navigate to the Source Code tab&lt;/li&gt;
&lt;li&gt;You can use the free version of Visual Studio (Visual C# 2008 Express Edition) to build the project&lt;/li&gt;
&lt;li&gt;Add comments in the Discussion tab, and report problems under Issue Tracker&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;&lt;b&gt;Design Documentation&lt;/b&gt;&lt;br /&gt;&lt;a href="http://ucsdbiolit.codeplex.com/wikipage?title=designdocs&amp;referringTitle=Home"&gt;Design Documents&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Semantic Tagging&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;When a word or set of words is tagged by the add-in, the text is wrapped in tags that associate it with the ontology term.  The example below shows the word &amp;quot;astrocyte&amp;quot; being tagged with the term CL:0000127 from the Cell Ontology (CL).&lt;br /&gt;&lt;br /&gt;&lt;div style="color:Black;background-color:White;"&gt;&lt;pre&gt;
&lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;http://biolit.ucsd.edu/biolitschema1&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;named-content&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
  &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
   &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;content-type&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;biolit&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
   &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;id&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;ncbo_id=40962;term_id=CL:0000127;term=astrocyte;url=http://purl.org/obo/owl/CL#CL_0000127&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
  &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
  &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;proofErr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:type&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;gramStart&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
  &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;BioLitTags&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;Term&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
   &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt; &lt;span style="color:Red;"&gt;w:rsidRPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;00FA60F6&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;rPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
     &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;highlight&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;yellow&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;rPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;astrocyte&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt; 
   &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
  &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
  &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;proofErr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:type&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;gramEnd&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;&lt;br /&gt;If the Word file (docx) is to be transformed to other formats, this set of tags needs to be processed using XSLT or other technologies.  Other CodePlex projects implement transformations of docx files to other formats and can serve as a starting point.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Background&lt;/b&gt;&lt;br /&gt;Cyberinfrastructure is integral to all aspects of conducting experimental research and distributing those results. However, it has yet to make a similar impact on the way we communicate that information. Peer-reviewed publications have long been the currency of scientific research as they are the fundamental unit through which scientists communicate with and evaluate each other. However, in striking contrast to the data, publications have yet to benefit from the opportunities offered by cyberinfrastructure. While the means of distributing publications has vastly improved, publishers have done little else to capitalize on the electronic medium. In particular, semantic information describing the content of these publications is sorely lacking, as is the integration of this information with data in public repositories. This is confounding considering that many basic tools for marking up and integrating publication content in this manner already exist, such as a centralized literature database, relevant ontologies, and machine-readable document standards. We propose to address this delay in the maturation of scholarly communication by developing open source tools to facilitate the semantic mark-up of new manuscripts and the submission of those manuscripts directly to a journal’s electronic publishing system. 
&lt;/div&gt;&lt;div class="ClearBoth"&gt;&lt;/div&gt;</description><author>AlexWade</author><pubDate>Mon, 25 Feb 2013 23:00:59 GMT</pubDate><guid isPermaLink="false">Updated Wiki: Home 20130225110059P</guid></item><item><title>Updated Wiki: Home</title><link>http://ucsdbiolit.codeplex.com/wikipage?version=27</link><description>&lt;div class="wikidoc"&gt;&lt;b&gt;Project Description&lt;/b&gt;&lt;br /&gt;A Word add-in that enables the annotation of Word documents based on terms that appear in Ontologies&lt;br /&gt;&lt;br /&gt;Read the latest press release &lt;a href="http://www.microsoft.com/presspass/press/2009/mar09/03-11MSCreativeCommonsPR.mspx" class="externalLink"&gt;here&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Summary&lt;/b&gt;&lt;br /&gt;Microsoft External Research’s goal  with this project is to enable communities who maintain ontologies to more easily experiment and to enhance the experience of authors who use Microsoft Word for content creation, incorporating semantic knowledge into the content.  This add-in should simplify the development and validation of ontologies, by making ontologies more accessible to a wide audience of authors and by enabling semantic content to be integrated in the authoring experience, capturing the author’s intent and knowledge at the source, and facilitating downstream discoverability. &lt;br /&gt;&lt;br /&gt;The goal of the add-in is to assist scientists in writing a manuscript that is easily integrated with existing and pending electronic resources. 
The major aims of this project are to add semantic information as XML mark-up to the manuscript using ontologies and controlled vocabularies (from the National Center for Biomedical Ontology) and identifiers from major biological databases, and to integrate manuscript content with existing public data repositories.&lt;br /&gt;&lt;br /&gt;As part of the publishing workflow and archiving process, the terms added by the add-in, which carry the semantic information, can be extracted from Word files because they are stored as custom XML tags within the content.  The semantic knowledge can then be preserved as the document is converted to other formats, such as HTML or the XML format from the National Library of Medicine, which is commonly used for archiving.&lt;br /&gt;&lt;br /&gt;The full benefit of semantic-rich content will result from an end-to-end approach to the preservation of semantics and metadata through the publishing pipeline, starting with capturing knowledge from the subject experts, the authors, and enabling this knowledge to be preserved when published, as well as made available to search engines and presented to people consuming the content. &lt;br /&gt;&lt;br /&gt;This project resulted from an initial and ongoing collaboration between Microsoft External Research and Dr. Phil Bourne and Dr. Lynn Fink, at the University of California San Diego.  Additional collaboration with the staff from Science Commons aims to make the add-in relevant to a wider audience and also to preserve semantic data along the publishing pipeline.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Audience&lt;/b&gt;&lt;br /&gt;This project is focused on researchers and software developers in domains utilizing ontologies, as well as publishers, archivists, and early adopters in the scientific, technical, and scholarly publishing fields.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Specific features&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Inline Syntax Coloring of Informative Words&lt;/li&gt;
&lt;li&gt;Built-in Knowledge of &lt;a href="http://ucsdbiolit.codeplex.com/wikipage?title=Ontologies%20and%20Controlled%20Vocabularies&amp;referringTitle=Home"&gt;Ontologies and Controlled Vocabularies&lt;/a&gt; maintained and delivered by &lt;a href="http://bioportal.bioontology.org/" class="externalLink"&gt;NCBO&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Built-in Knowledge of Biological Databases (&lt;a href="http://www.rcsb.org" class="externalLink"&gt;Protein Data Bank&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;, &lt;a href="http://www.uniprot.org" class="externalLink"&gt;UniProtKB&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;, &lt;a href="http://ncbi.nlm.nih.gov" class="externalLink"&gt;NCBI GenBank/RefSeq&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;Automatic Detection of Identifiers&lt;/li&gt;
&lt;li&gt;Custom Semantic Markup&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;&lt;b&gt;Getting Started&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Trying out the add-in:
&lt;ul&gt;&lt;li&gt;You will need Microsoft Word 2007 or Microsoft Word 2010 (32-bit or 64-bit) running on Windows XP, Windows Vista or Windows 7.&lt;/li&gt;
&lt;li&gt;Open the test document from the Releases tab on this page and enable Term Recognition in the Ontology tab within Word.&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;/ul&gt;
&lt;i&gt;If you have installed a previous version of the add-in, you may need to follow &lt;a href="http://ucsdbiolit.codeplex.com/wikipage?title=uninstallhelp&amp;referringTitle=Home"&gt;these instructions&lt;/a&gt; to achieve a full uninstall.&lt;/i&gt;
&lt;ul&gt;&lt;li&gt;Examining the source code and contributing to the project:
&lt;ul&gt;&lt;li&gt;Navigate to the Source Code tab&lt;/li&gt;
&lt;li&gt;You can use the free version of Visual Studio (Visual C# 2008 Express Edition) to build the project&lt;/li&gt;
&lt;li&gt;Add comments in the Discussion tab, and report problems under Issue Tracker&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;&lt;b&gt;Design Documentation&lt;/b&gt;&lt;br /&gt;&lt;a href="http://ucsdbiolit.codeplex.com/wikipage?title=designdocs&amp;referringTitle=Home"&gt;Design Documents&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Semantic Tagging&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;When a word or set of words is tagged by the add-in, the text is wrapped in tags that associate it with the ontology term.  The example below shows the word &amp;quot;astrocyte&amp;quot; being tagged with the term CL:0000127 from the Cell Ontology (CL).&lt;br /&gt;&lt;br /&gt;&lt;div style="color:Black;background-color:White;"&gt;&lt;pre&gt;
&lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;http://biolit.ucsd.edu/biolitschema1&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;named-content&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
  &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
   &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;content-type&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;biolit&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
   &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;id&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;ncbo_id=40962;term_id=CL:0000127;term=astrocyte;url=http://purl.org/obo/owl/CL#CL_0000127&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
  &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
  &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;proofErr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:type&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;gramStart&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
  &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;BioLitTags&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;Term&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
   &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt; &lt;span style="color:Red;"&gt;w:rsidRPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;00FA60F6&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;rPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
     &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;highlight&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;yellow&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;rPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;astrocyte&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt; 
   &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
  &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
  &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;proofErr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:type&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;gramEnd&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;&lt;br /&gt;If the Word file (docx) is to be transformed to other formats, this set of tags needs to be processed using XSLT or other technologies.  Other CodePlex projects implement transformations of docx files to other formats and can serve as a starting point.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Background&lt;/b&gt;&lt;br /&gt;Cyberinfrastructure is integral to all aspects of conducting experimental research and distributing those results. However, it has yet to make a similar impact on the way we communicate that information. Peer-reviewed publications have long been the currency of scientific research as they are the fundamental unit through which scientists communicate with and evaluate each other. However, in striking contrast to the data, publications have yet to benefit from the opportunities offered by cyberinfrastructure. While the means of distributing publications has vastly improved, publishers have done little else to capitalize on the electronic medium. In particular, semantic information describing the content of these publications is sorely lacking, as is the integration of this information with data in public repositories. This is confounding considering that many basic tools for marking up and integrating publication content in this manner already exist, such as a centralized literature database, relevant ontologies, and machine-readable document standards. We propose to address this delay in the maturation of scholarly communication by developing open source tools to facilitate the semantic mark-up of new manuscripts and the submission of those manuscripts directly to a journal’s electronic publishing system. 
&lt;/div&gt;&lt;div class="ClearBoth"&gt;&lt;/div&gt;</description><author>erteam</author><pubDate>Fri, 07 May 2010 23:24:37 GMT</pubDate><guid isPermaLink="false">Updated Wiki: Home 20100507112437P</guid></item><item><title>Updated Wiki: uninstallhelp</title><link>http://ucsdbiolit.codeplex.com/wikipage?title=uninstallhelp&amp;version=4</link><description>&lt;div class="wikidoc"&gt;
&lt;h1&gt;Full Uninstall of Earlier Add-in Versions&lt;/h1&gt;
If you have previously installed and uninstalled earlier versions, there may be files that were not fully uninstalled. The following directions will explain how to remove any old files.&lt;br /&gt; 
&lt;h2&gt;Uninstall the Ontology Add-in for Word 2007&lt;/h2&gt; 
&lt;ul&gt;&lt;li&gt;Open the Control Panel&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Select the Programs group&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Select Uninstall a Program&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Select Ontology Add-in for Word 2007 and click Uninstall&lt;/li&gt;&lt;/ul&gt;

&lt;h2&gt;Remove Any Remaining Files&lt;/h2&gt; 
&lt;ul&gt;&lt;li&gt;Start Word&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Select Word Options from the main menu&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Select the Add-ins tab&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;&lt;img src="http://i3.codeplex.com/Project/Download/FileDownload.aspx?ProjectName=UCSDBioLit&amp;DownloadId=97264" alt="image004.jpg" title="image004.jpg" /&gt;&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;In the lower panel, you’ll see a section for Inactive Application Add-ins. &lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Uninstall any remaining XML Schema&lt;/li&gt;&lt;/ul&gt;
 &lt;br /&gt;There may be several items with the type XML Schema listed that are part of the Ontology Add-in. These need to be manually removed as follows:&lt;br /&gt; &lt;br /&gt;At the bottom of the panel is a drop-down box labeled Manage.&lt;br /&gt;&lt;br /&gt;&lt;img src="http://i3.codeplex.com/Project/Download/FileDownload.aspx?ProjectName=UCSDBioLit&amp;DownloadId=97265" alt="image003.png" title="image003.png" /&gt;&lt;br /&gt; 
&lt;ul&gt;&lt;li&gt;Change the selection in the Manage box to XML Schema and click the Go… button&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;The Templates and Add-ins dialog should appear. You’ll notice that the XML schema you need to remove should be on this list.&lt;br /&gt;&lt;br /&gt;&lt;img src="http://i3.codeplex.com/Project/Download/FileDownload.aspx?ProjectName=UCSDBioLit&amp;DownloadId=97266" alt="image005.png" title="image005.png" /&gt;&lt;br /&gt; 
&lt;ul&gt;&lt;li&gt;Click the Schema Library… button to remove them&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;The Schema Library dialog will appear.&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Select each schema you want to remove and click the Delete Schema button&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;&lt;img src="http://i3.codeplex.com/Project/Download/FileDownload.aspx?ProjectName=UCSDBioLit&amp;DownloadId=97267" alt="image006.png" title="image006.png" /&gt;&lt;br /&gt;&lt;br /&gt;Repeat the previous step until you have deleted all Ontology Add-in schemas.&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Click OK to close the Schema Library dialog, then click OK to close the Templates and Add-ins dialog&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Exit Word&lt;/li&gt;&lt;/ul&gt;

&lt;h2&gt;Install the Latest Version&lt;/h2&gt;
&lt;ul&gt;&lt;li&gt;Restart your computer (just to be safe)&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Reinstall the latest Ontology Add-in for Word 2007&lt;/li&gt;&lt;/ul&gt;
 &lt;br /&gt;&lt;i&gt;Contributed by Mike Galos&lt;/i&gt;&lt;/div&gt;&lt;div class="ClearBoth"&gt;&lt;/div&gt;</description><author>jlfink</author><pubDate>Wed, 16 Dec 2009 13:09:08 GMT</pubDate><guid isPermaLink="false">Updated Wiki: uninstallhelp 20091216010908P</guid></item><item><title>Updated Wiki: uninstallhelp</title><link>http://ucsdbiolit.codeplex.com/wikipage?title=uninstallhelp&amp;version=3</link><description>&lt;div class="wikidoc"&gt;
&lt;h1&gt;Full Uninstall of Earlier Add-in Versions&lt;/h1&gt;
If you have previously installed and uninstalled earlier versions, there may be files that were not fully uninstalled. The following directions will explain how to remove any old files.&lt;br /&gt; 
&lt;h2&gt;Uninstall the Ontology Add-in for Word 2007&lt;/h2&gt; 
&lt;ul&gt;&lt;li&gt;Open the Control Panel&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Select the Programs group&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Select Uninstall a Program&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Select Ontology Add-in for Word 2007 and click Uninstall&lt;/li&gt;&lt;/ul&gt;

&lt;h2&gt;Remove Any Remaining Files&lt;/h2&gt; 
&lt;ul&gt;&lt;li&gt;Start Word&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Select Word Options from the main menu&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Select the Add-ins tab&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;&lt;img src="http://i3.codeplex.com/Project/Download/FileDownload.aspx?ProjectName=UCSDBioLit&amp;DownloadId=97264" alt="image004.jpg" title="image004.jpg" /&gt;&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;In the lower panel, you’ll see a section for Inactive Application Add-ins. &lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Uninstall any remaining XML Schema&lt;/li&gt;&lt;/ul&gt;
 &lt;br /&gt;There may be several items with the type XML Schema listed that are part of the Ontology Add-in. These need to be manually removed as follows:&lt;br /&gt; &lt;br /&gt;At the bottom of the panel is a drop-down box labeled Manage.&lt;br /&gt;&lt;br /&gt;&lt;img src="http://i3.codeplex.com/Project/Download/FileDownload.aspx?ProjectName=UCSDBioLit&amp;DownloadId=97265" alt="image003.png" title="image003.png" /&gt;&lt;br /&gt; 
&lt;ul&gt;&lt;li&gt;Change the selection in the Manage box to XML Schema and click the Go… button&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;The Templates and Add-ins dialog should appear. You’ll notice that the XML schema you need to remove should be on this list.&lt;br /&gt;&lt;br /&gt;&lt;img src="http://i3.codeplex.com/Project/Download/FileDownload.aspx?ProjectName=UCSDBioLit&amp;DownloadId=97266" alt="image005.png" title="image005.png" /&gt;&lt;br /&gt; 
&lt;ul&gt;&lt;li&gt;Click the Schema Library… button to remove them&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;The Schema Library dialog will appear.&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Select each schema you want to remove and click the Delete Schema button&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;&lt;img src="http://i3.codeplex.com/Project/Download/FileDownload.aspx?ProjectName=UCSDBioLit&amp;DownloadId=97267" alt="image006.png" title="image006.png" /&gt;&lt;br /&gt;&lt;br /&gt;Repeat the previous step until you have deleted all Ontology Add-in schemas.&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Click OK to close the Schema Library dialog, then click OK to close the Templates and Add-ins dialog&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Exit Word&lt;/li&gt;&lt;/ul&gt;

&lt;h2&gt;Install the Latest Version&lt;/h2&gt;
&lt;ul&gt;&lt;li&gt;Restart your computer (just to be safe)&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Reinstall the latest Ontology Add-in for Word 2007&lt;/li&gt;&lt;/ul&gt;
 &lt;/div&gt;&lt;div class="ClearBoth"&gt;&lt;/div&gt;</description><author>jlfink</author><pubDate>Wed, 16 Dec 2009 13:08:22 GMT</pubDate><guid isPermaLink="false">Updated Wiki: uninstallhelp 20091216010822P</guid></item><item><title>Updated Wiki: uninstallhelp</title><link>http://ucsdbiolit.codeplex.com/wikipage?title=uninstallhelp&amp;version=2</link><description>&lt;div class="wikidoc"&gt;
&lt;h1&gt;Full Uninstall of Earlier Add-in Versions&lt;/h1&gt;
If you have previously installed and uninstalled earlier versions, there may be files that were not fully uninstalled. The following directions will explain how to remove any old files.&lt;br /&gt; 
&lt;h2&gt;Uninstall the Ontology Add-in for Word 2007&lt;/h2&gt; 
&lt;ul&gt;&lt;li&gt;Open the Control Panel&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Select the Programs group&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Select Uninstall a Program&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Select Ontology Add-in for Word 2007 and click Uninstall&lt;/li&gt;&lt;/ul&gt;

&lt;h2&gt;Remove Any Remaining Files&lt;/h2&gt; 
&lt;ul&gt;&lt;li&gt;Start Word&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Select Word Options from the main menu&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Select the Add-ins tab&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;&lt;img src="http://i3.codeplex.com/Project/Download/FileDownload.aspx?ProjectName=UCSDBioLit&amp;DownloadId=97264" alt="image004.jpg" title="image004.jpg" /&gt;&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;In the lower panel, you’ll see a section for Inactive Application Add-ins. &lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Uninstall any remaining XML Schema&lt;/li&gt;&lt;/ul&gt;
 &lt;br /&gt;There may be several items with the type XML Schema listed that are part of the Ontology Add-in. These need to be manually removed as follows:&lt;br /&gt; &lt;br /&gt;At the bottom of the panel is a drop-down box labeled Manage.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://ucsdbiolit.codeplex.com/wikipage?title=image003.png&amp;referringTitle=uninstallhelp"&gt;image&lt;/a&gt;&lt;br /&gt; 
&lt;ul&gt;&lt;li&gt;Change the selection in the Manage box to XML Schema and click the Go… button&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;The Templates and Add-ins dialog should appear. You’ll notice that the XML schema you need to remove should be on this list.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://ucsdbiolit.codeplex.com/wikipage?title=image005.png&amp;referringTitle=uninstallhelp"&gt;image&lt;/a&gt;&lt;br /&gt; 
&lt;ul&gt;&lt;li&gt;Click the Schema Library… button to remove them&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;The Schema Library dialog will appear.&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Select each schema you want to remove and click the Delete Schema button&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;&lt;a href="http://ucsdbiolit.codeplex.com/wikipage?title=image006.png&amp;referringTitle=uninstallhelp"&gt;image&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Repeat the previous step until you have deleted all Ontology Add-in schemas.&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Click OK to close the Schema Library dialog, then click OK to close the Templates and Add-ins dialog&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Exit Word&lt;/li&gt;&lt;/ul&gt;

&lt;h2&gt;Install the Latest Version&lt;/h2&gt;
&lt;ul&gt;&lt;li&gt;Restart your computer (just to be safe)&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Reinstall the latest Ontology Add-in for Word 2007&lt;/li&gt;&lt;/ul&gt;
 &lt;/div&gt;&lt;div class="ClearBoth"&gt;&lt;/div&gt;</description><author>jlfink</author><pubDate>Wed, 16 Dec 2009 13:07:45 GMT</pubDate><guid isPermaLink="false">Updated Wiki: uninstallhelp 20091216010745P</guid></item><item><title>Updated Wiki: Home</title><link>http://ucsdbiolit.codeplex.com/wikipage?version=26</link><description>&lt;div class="wikidoc"&gt;&lt;b&gt;Project Description&lt;/b&gt;&lt;br /&gt;A Word 2007 add-in that enables the annotation of Word documents based on terms that appear in Ontologies&lt;br /&gt;&lt;br /&gt;Read the latest press release &lt;a href="http://www.microsoft.com/presspass/press/2009/mar09/03-11MSCreativeCommonsPR.mspx" class="externalLink"&gt;here&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Summary&lt;/b&gt;&lt;br /&gt;Microsoft External Research’s goal  with this project is to enable communities who maintain ontologies to more easily experiment and to enhance the experience of authors who use Microsoft Word for content creation, incorporating semantic knowledge into the content.  This add-in should simplify the development and validation of ontologies, by making ontologies more accessible to a wide audience of authors and by enabling semantic content to be integrated in the authoring experience, capturing the author’s intent and knowledge at the source, and facilitating downstream discoverability. &lt;br /&gt;&lt;br /&gt;The goal of the add-in is to assist scientists in writing a manuscript that is easily integrated with existing and pending electronic resources. 
The major aims of this project are to add semantic information as XML mark-up to the manuscript using ontologies and controlled vocabularies (from the National Center for Biomedical Ontology) and identifiers from major biological databases, and to integrate manuscript content with existing public data repositories.&lt;br /&gt;&lt;br /&gt;As part of the publishing workflow and archiving process, the terms added by the add-in, which carry the semantic information, can be extracted from Word files because they are stored as custom XML tags within the content.  The semantic knowledge can then be preserved as the document is converted to other formats, such as HTML or the XML format from the National Library of Medicine, which is commonly used for archiving.&lt;br /&gt;&lt;br /&gt;The full benefit of semantic-rich content will result from an end-to-end approach to the preservation of semantics and metadata through the publishing pipeline, starting with capturing knowledge from the subject experts, the authors, and enabling this knowledge to be preserved when published, as well as made available to search engines and presented to people consuming the content. &lt;br /&gt;&lt;br /&gt;This project resulted from an initial and ongoing collaboration between Microsoft External Research and Dr. Phil Bourne and Dr. Lynn Fink, at the University of California San Diego.  Additional collaboration with the staff from Science Commons aims to make the add-in relevant to a wider audience and also to preserve semantic data along the publishing pipeline.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Audience&lt;/b&gt;&lt;br /&gt;This project is focused on researchers and software developers in domains utilizing ontologies, as well as publishers, archivists, and early adopters in the scientific, technical, and scholarly publishing fields.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Specific features&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Inline Syntax Coloring of Informative Words&lt;/li&gt;
&lt;li&gt;Built-in Knowledge of &lt;a href="http://ucsdbiolit.codeplex.com/wikipage?title=Ontologies%20and%20Controlled%20Vocabularies&amp;referringTitle=Home"&gt;Ontologies and Controlled Vocabularies&lt;/a&gt; maintained and delivered by &lt;a href="http://bioportal.bioontology.org/" class="externalLink"&gt;NCBO&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Built-in Knowledge of Biological Databases (&lt;a href="http://www.rcsb.org" class="externalLink"&gt;Protein Data Bank&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;, &lt;a href="http://www.uniprot.org" class="externalLink"&gt;UniProtKB&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;, &lt;a href="http://ncbi.nlm.nih.gov" class="externalLink"&gt;NCBI GenBank/RefSeq&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;Automatic Detection of Identifiers&lt;/li&gt;
&lt;li&gt;Custom Semantic Markup&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;&lt;b&gt;Getting Started&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Trying out the add-in:
&lt;ul&gt;&lt;li&gt;You will need Microsoft Word 2007 (Windows XP or Windows Vista)&lt;/li&gt;
&lt;li&gt;Open the test document from the Releases tab in this page and enable Term Recognition in the Ontology tab within Word&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;/ul&gt;
&lt;i&gt;If you have installed a previous version of the add-in, you may need to follow &lt;a href="http://ucsdbiolit.codeplex.com/wikipage?title=uninstallhelp&amp;referringTitle=Home"&gt;these instructions&lt;/a&gt; to achieve a full uninstall.&lt;/i&gt;
&lt;ul&gt;&lt;li&gt;Examining the source code and contributing to the project:
&lt;ul&gt;&lt;li&gt;Navigate to the Source Code tab&lt;/li&gt;
&lt;li&gt;You can use the free version of Visual Studio (Visual C# 2008 Express Edition) to build the project&lt;/li&gt;
&lt;li&gt;Add comments in the Discussion tab, and report problems under Issue Tracker&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;&lt;b&gt;Design Documentation&lt;/b&gt;&lt;br /&gt;&lt;a href="http://ucsdbiolit.codeplex.com/wikipage?title=designdocs&amp;referringTitle=Home"&gt;Design Documents&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Semantic Tagging&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;When a word or set of words is tagged by the add-in, the word is wrapped with some tags that associate it with the ontology term.  The example below shows the word &amp;quot;astrocyte&amp;quot; being tagged with the Cell Line ontology.&lt;br /&gt;&lt;br /&gt;&lt;div style="color:Black;background-color:White;"&gt;&lt;pre&gt;
&lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;http://biolit.ucsd.edu/biolitschema1&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;named-content&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
 &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
   &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;content-type&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;biolit&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
   &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;id&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;ncbo_id=40962;term_id=CL:0000127;term=astrocyte;url=http://purl.org/obo/owl/CL#CL_0000127&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
  &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
  &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;proofErr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:type&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;gramStart&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
  &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;BioLitTags&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;Term&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
   &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt; &lt;span style="color:Red;"&gt;w:rsidRPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;00FA60F6&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;rPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
     &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;highlight&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;yellow&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;rPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;astrocyte&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt; 
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
  &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
  &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;proofErr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:type&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;gramEnd&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;&lt;br /&gt;If the Word file (docx) is to be transformed to other formats, this set of tags needs to be processed using XSLT or other technologies.  Note that other CodePlex projects implement transformations of docx files to other formats and can serve as a starting point.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Background&lt;/b&gt;&lt;br /&gt;Cyberinfrastructure is integral to all aspects of conducting experimental research and distributing its results. However, it has yet to make a similar impact on the way we communicate that information. Peer-reviewed publications have long been the currency of scientific research, as they are the fundamental unit through which scientists communicate with and evaluate each other. However, in striking contrast to the data, publications have yet to benefit from the opportunities offered by cyberinfrastructure. While the means of distributing publications have vastly improved, publishers have done little else to capitalize on the electronic medium. In particular, semantic information describing the content of these publications is sorely lacking, as is the integration of this information with data in public repositories. This is confounding considering that many basic tools for marking up and integrating publication content in this manner already exist, such as a centralized literature database, relevant ontologies, and machine-readable document standards. We propose to address this delay in the maturation of scholarly communication by developing open source tools to facilitate the semantic mark-up of new manuscripts and the submission of those manuscripts directly to a journal’s electronic publishing system. 
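&lt;br /&gt;&lt;br /&gt;In the Semantic Tagging example earlier on this page, the ontology reference is carried in the w:val of the attribute named id, written as a semicolon-separated list of key=value pairs. The following is a minimal Python sketch of how downstream tooling might split that value into its fields. The function name parse_biolit_id is hypothetical, and the field layout is inferred from the single example on this page rather than from a published specification, so a value whose URL contains a semicolon would need different handling.&lt;br /&gt;

```python
def parse_biolit_id(value):
    """Split a BioLit 'id' attribute value into a dict of its fields.

    Assumption (not confirmed by the project): fields are separated by ";"
    and each field has the form key=value.
    """
    fields = {}
    for part in value.split(";"):
        # Split on the first "=" only, so a value may itself contain "=".
        key, sep, val = part.partition("=")
        if sep:
            fields[key.strip()] = val.strip()
    return fields


# The w:val string copied from the astrocyte example on this page.
example = (
    "ncbo_id=40962;term_id=CL:0000127;term=astrocyte;"
    "url=http://purl.org/obo/owl/CL#CL_0000127"
)
tag = parse_biolit_id(example)
print(tag["term"], tag["term_id"], tag["url"])
```

&lt;br /&gt;A converter could call a helper like this on each customXml element found in the document to carry the term, its identifier, and its URL into the target format.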
&lt;/div&gt;&lt;div class="ClearBoth"&gt;&lt;/div&gt;</description><author>jlfink</author><pubDate>Wed, 16 Dec 2009 13:00:19 GMT</pubDate><guid isPermaLink="false">Updated Wiki: Home 20091216010019P</guid></item><item><title>Updated Wiki: uninstallhelp</title><link>http://ucsdbiolit.codeplex.com/wikipage?title=uninstallhelp&amp;version=1</link><description>&lt;div class="wikidoc"&gt;
&lt;h1&gt;Full Uninstall of Earlier Add-in Versions&lt;/h1&gt;
If you have previously installed and uninstalled earlier versions, there may be files that were not fully uninstalled. The following directions will explain how to remove any old files.&lt;br /&gt; 
&lt;h2&gt;Uninstall the Ontology Add-in for Word 2007&lt;/h2&gt; 
&lt;ul&gt;&lt;li&gt;Open the Control Panel&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Select the Programs group&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Select Uninstall a Program&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Select Ontology Add-in for Word 2007 and click Uninstall&lt;/li&gt;&lt;/ul&gt;

&lt;h2&gt;Remove Any Remaining Files&lt;/h2&gt; 
&lt;ul&gt;&lt;li&gt;Start Word&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Select Word Options from the main menu&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Select the Add-ins tab&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;In the lower panel, you’ll see a section for Inactive Application Add-ins. &lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Uninstall any remaining XML Schema&lt;/li&gt;&lt;/ul&gt;
 &lt;br /&gt;There may be several items with the type XML Schema listed that are part of the Ontology Add-in. These need to be manually removed as follows:&lt;br /&gt; &lt;br /&gt;At the bottom of the panel is a drop-down box labeled Manage.&lt;br /&gt; 
&lt;ul&gt;&lt;li&gt;Change the selection in the Manage box to XML Schema and click the Go… button&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;The Templates and Add-ins dialog should appear. You’ll notice that the XML schema you need to remove should be on this list.&lt;br /&gt; 
&lt;ul&gt;&lt;li&gt;Click the Schema Library… button to remove them&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;The Schema Library dialog will appear.&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Select each schema you want to remove and click the Delete Schema button&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;Repeat the previous step until you have deleted all Ontology Add-in schemas.&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Click OK to close the Schema Library dialog, then click OK to close the Templates and Add-ins dialog&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Exit Word&lt;/li&gt;&lt;/ul&gt;

&lt;h2&gt;Install the Latest Version&lt;/h2&gt;
&lt;ul&gt;&lt;li&gt;Restart your computer (just to be safe)&lt;/li&gt;&lt;/ul&gt;

&lt;ul&gt;&lt;li&gt;Reinstall the latest Ontology Add-in for Word 2007&lt;/li&gt;&lt;/ul&gt;
 &lt;/div&gt;&lt;div class="ClearBoth"&gt;&lt;/div&gt;</description><author>jlfink</author><pubDate>Wed, 16 Dec 2009 13:00:10 GMT</pubDate><guid isPermaLink="false">Updated Wiki: uninstallhelp 20091216010010P</guid></item><item><title>Updated Wiki: Home</title><link>http://ucsdbiolit.codeplex.com/wikipage?version=25</link><description>&lt;div class="wikidoc"&gt;&lt;b&gt;Project Description&lt;/b&gt;&lt;br /&gt;A Word 2007 add-in that enables the annotation of Word documents based on terms that appear in Ontologies&lt;br /&gt;&lt;br /&gt;Read the latest press release &lt;a href="http://www.microsoft.com/presspass/press/2009/mar09/03-11MSCreativeCommonsPR.mspx" class="externalLink"&gt;here&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Summary&lt;/b&gt;&lt;br /&gt;Microsoft External Research’s goal  with this project is to enable communities who maintain ontologies to more easily experiment and to enhance the experience of authors who use Microsoft Word for content creation, incorporating semantic knowledge into the content.  This add-in should simplify the development and validation of ontologies, by making ontologies more accessible to a wide audience of authors and by enabling semantic content to be integrated in the authoring experience, capturing the author’s intent and knowledge at the source, and facilitating downstream discoverability. &lt;br /&gt;&lt;br /&gt;The goal of the add-in is to assist scientists in writing a manuscript that is easily integrated with existing and pending electronic resources. 
The major aims of this project are to add semantic information as XML mark-up to the manuscript using ontologies and controlled vocabularies (from the National Center for Biomedical Ontology) and identifiers from major biological databases, and to integrate manuscript content with existing public data repositories.&lt;br /&gt;&lt;br /&gt;As part of the publishing workflow and archiving process, the terms added by the add-in, which carry the semantic information, can be extracted from Word files because they are stored as custom XML tags within the content.  The semantic knowledge can then be preserved as the document is converted to other formats, such as HTML or the XML format from the National Library of Medicine, which is commonly used for archiving.&lt;br /&gt;&lt;br /&gt;The full benefit of semantic-rich content will result from an end-to-end approach to the preservation of semantics and metadata through the publishing pipeline, starting with capturing knowledge from the subject experts, the authors, and enabling this knowledge to be preserved when published, as well as made available to search engines and presented to people consuming the content. &lt;br /&gt;&lt;br /&gt;This project resulted from an initial and ongoing collaboration between Microsoft External Research and Dr. Phil Bourne and Dr. Lynn Fink, at the University of California San Diego.  Additional collaboration with the staff from Science Commons aims to make the add-in relevant to a wider audience and also to preserve semantic data along the publishing pipeline.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Audience&lt;/b&gt;&lt;br /&gt;This project is focused on researchers and software developers in domains utilizing ontologies, as well as publishers, archivists, and early adopters in the scientific, technical, and scholarly publishing fields.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Specific features&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Inline Syntax Coloring of Informative Words&lt;/li&gt;
&lt;li&gt;Built-in Knowledge of &lt;a href="http://ucsdbiolit.codeplex.com/wikipage?title=Ontologies%20and%20Controlled%20Vocabularies&amp;referringTitle=Home"&gt;Ontologies and Controlled Vocabularies&lt;/a&gt; maintained and delivered by &lt;a href="http://bioportal.bioontology.org/" class="externalLink"&gt;NCBO&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Built-in Knowledge of Biological Databases (&lt;a href="http://www.rcsb.org" class="externalLink"&gt;Protein Data Bank&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;, &lt;a href="http://www.uniprot.org" class="externalLink"&gt;UniProtKB&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;, &lt;a href="http://ncbi.nlm.nih.gov" class="externalLink"&gt;NCBI GenBank/RefSeq&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;)&lt;/li&gt;
&lt;li&gt;Automatic Detection of Identifiers&lt;/li&gt;
&lt;li&gt;Custom Semantic Markup&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;&lt;b&gt;Getting Started&lt;/b&gt;
&lt;ul&gt;&lt;li&gt;Trying out the add-in:
&lt;ul&gt;&lt;li&gt;You will need Microsoft Word 2007 (Windows XP or Windows Vista)&lt;/li&gt;
&lt;li&gt;Download and install the latest Technology Preview release of the add-in&lt;/li&gt;
&lt;li&gt;Open the test document from the Releases tab on this page and enable Term Recognition in the Ontology tab within Word&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;
&lt;li&gt;Examining the source code and contributing to the project:
&lt;ul&gt;&lt;li&gt;Navigate to the Source Code tab&lt;/li&gt;
&lt;li&gt;You can use the free version of Visual Studio (Visual C# 2008 Express Edition) to build the project&lt;/li&gt;
&lt;li&gt;Add comments in the Discussion tab, and report problems under Issue Tracker&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;/ul&gt;
&lt;br /&gt;&lt;b&gt;Design Documentation&lt;/b&gt;&lt;br /&gt;&lt;a href="http://ucsdbiolit.codeplex.com/wikipage?title=designdocs&amp;referringTitle=Home"&gt;Design Documents&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Semantic Tagging&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;When a word or set of words is tagged by the add-in, the text is wrapped in custom XML tags that associate it with the ontology term. The example below shows the word &amp;quot;astrocyte&amp;quot; tagged with a term from the Cell Line ontology.&lt;br /&gt;&lt;br /&gt;&lt;div style="color:Black;background-color:White;"&gt;&lt;pre&gt;
&lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;http://biolit.ucsd.edu/biolitschema1&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;named-content&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
 &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
   &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;content-type&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;biolit&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
   &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;id&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;ncbo_id=40962;term_id=CL:0000127;term=astrocyte;url=http://purl.org/obo/owl/CL#CL_0000127&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
  &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
  &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;proofErr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:type&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;gramStart&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
  &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;BioLitTags&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;Term&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
   &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt; &lt;span style="color:Red;"&gt;w:rsidRPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;00FA60F6&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;rPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
     &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;highlight&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;yellow&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;rPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;astrocyte&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt; 
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
  &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
  &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;proofErr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:type&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;gramEnd&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt;&lt;br /&gt;To transform the Word file (docx) to other formats, this set of tags needs to be processed using XSLT or a similar technology. Other CodePlex projects implement transformations of docx files to other formats and can serve as a starting point.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Background&lt;/b&gt;&lt;br /&gt;Cyberinfrastructure is integral to all aspects of conducting experimental research and distributing its results. However, it has yet to make a similar impact on the way we communicate that information. Peer-reviewed publications have long been the currency of scientific research, as they are the fundamental unit through which scientists communicate with and evaluate each other. Yet, in striking contrast to the data, publications have not benefited from the opportunities offered by cyberinfrastructure. While the means of distributing publications has vastly improved, publishers have done little else to capitalize on the electronic medium. In particular, semantic information describing the content of these publications is sorely lacking, as is the integration of this information with data in public repositories. This is confounding considering that many basic tools for marking up and integrating publication content in this manner already exist, such as a centralized literature database, relevant ontologies, and machine-readable document standards. We propose to address this delay in the maturation of scholarly communication by developing open source tools to facilitate the semantic mark-up of new manuscripts and the submission of those manuscripts directly to a journal’s electronic publishing system. 
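&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Extracting Tagged Terms (illustrative sketch)&lt;/b&gt;&lt;br /&gt;The tagged terms described under Semantic Tagging can also be pulled out of a document with a short script. The following is a minimal Python sketch and is not part of the add-in. It assumes the markup has the shape shown in the Semantic Tagging example (w:customXml elements whose w:customXmlPr child carries w:attr name/value pairs) and that the id value is a semicolon-separated list of key=value pairs. The function names extract_terms, parse_id_value and extract_terms_from_docx are hypothetical.&lt;br /&gt;&lt;br /&gt;&lt;div style="color:Black;background-color:White;"&gt;&lt;pre&gt;

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML main namespace, bound to the w: prefix in docx files.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def w(name):
    """Return the namespace-qualified form of a w: element or attribute name."""
    return "{%s}%s" % (W_NS, name)


def parse_id_value(value):
    """Split a 'key=value;key=value' string into a dict.

    Assumption: neither keys nor values contain a semicolon.
    """
    fields = {}
    for part in value.split(";"):
        key, sep, val = part.partition("=")
        if sep:
            fields[key.strip()] = val.strip()
    return fields


def extract_terms(document_xml):
    """Return a list of (tagged text, attribute dict) for each w:customXml element."""
    root = ET.fromstring(document_xml)
    terms = []
    for node in root.iter(w("customXml")):
        attrs = {}
        props = node.find(w("customXmlPr"))
        if props is not None:
            for attr in props.findall(w("attr")):
                attrs[attr.get(w("name"))] = attr.get(w("val"))
        # Join the text of every run inside the tagged span.
        text = "".join(t.text or "" for t in node.iter(w("t")))
        terms.append((text, attrs))
    return terms


def extract_terms_from_docx(path):
    """Read word/document.xml from a docx package and extract its tagged terms."""
    with zipfile.ZipFile(path) as package:
        return extract_terms(package.read("word/document.xml"))
```

&lt;/pre&gt;&lt;/div&gt;&lt;br /&gt;Under these assumptions, extract_terms collects the tagged text of each w:customXml element together with its attributes, and parse_id_value splits an id value such as the one in the astrocyte example into its ncbo_id, term_id, term and url fields. A value that itself contains a semicolon would need a more careful parser.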
&lt;/div&gt;&lt;div class="ClearBoth"&gt;&lt;/div&gt;</description><author>jlfink</author><pubDate>Wed, 16 Dec 2009 12:22:34 GMT</pubDate><guid isPermaLink="false">Updated Wiki: Home 20091216122234P</guid></item><item><title>Updated Wiki: Home</title><link>http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=Home&amp;version=24</link><description>&lt;div class="wikidoc"&gt;
&lt;b&gt;Project Description&lt;/b&gt;&lt;br /&gt;A Word 2007 add-in that enables the annotation of Word documents based on terms that appear in Ontologies&lt;br /&gt; &lt;br /&gt;Read the latest press release &lt;a href="http://www.microsoft.com/presspass/press/2009/mar09/03-11MSCreativeCommonsPR.mspx" class="externalLink"&gt;here&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Summary&lt;/b&gt;&lt;br /&gt;Microsoft External Research’s goal  with this project is to enable communities who maintain ontologies to more easily experiment and to enhance the experience of authors who use Microsoft Word for content creation, incorporating semantic knowledge into the content.  This add-in should simplify the development and validation of ontologies, by making ontologies more accessible to a wide audience of authors and by enabling semantic content to be integrated in the authoring experience, capturing the author’s intent and knowledge at the source, and facilitating downstream discoverability. &lt;br /&gt; &lt;br /&gt;The goal of the add-in is to assist scientists in writing a manuscript that is easily integrated with existing and pending electronic resources. The major aims of this project are to add semantic information as XML mark-up to the manuscript using ontologies and controlled vocabularies (from the National Center for Biomedical Ontology) and identifiers from major biological databases, and to integrate manuscript content with existing public data repositories.&lt;br /&gt; &lt;br /&gt;As part of the publishing workflow and archiving process, the terms added by the add-in, providing the semantic information, can be extracted from Word files, as they are stored as custom XML tags as part of the content.  
The semantic knowledge can then be preserved as the document is converted to other formats, such as HTML or the XML format from the National Library of Medicine, which is commonly used for archiving.&lt;br /&gt; &lt;br /&gt;The full benefit of semantically rich content will come from an end-to-end approach to preserving semantics and metadata through the publishing pipeline: capturing knowledge from the subject experts (the authors), preserving it through publication, making it available to search engines, and presenting it to the people consuming the content. &lt;br /&gt; &lt;br /&gt;This project resulted from an initial and ongoing collaboration between Microsoft External Research and Dr. Phil Bourne and Dr. Lynn Fink at the University of California San Diego. Additional collaboration with the staff from Science Commons aims to make the add-in relevant to a wider audience and to preserve semantic data along the publishing pipeline.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Audience&lt;/b&gt;&lt;br /&gt;This project is aimed at researchers and software developers in domains that use ontologies, as well as publishers, archivists, and early adopters in the scientific, technical, and scholarly publishing fields.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Specific features&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Inline Syntax Coloring of Informative Words&lt;/li&gt;&lt;li&gt;Built-in Knowledge of &lt;a href="http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=Ontologies%20and%20Controlled%20Vocabularies&amp;amp;referringTitle=Home"&gt;Ontologies and Controlled Vocabularies&lt;/a&gt; maintained and delivered by &lt;a href="http://bioportal.bioontology.org/" class="externalLink"&gt;NCBO&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Built-in Knowledge of Biological Databases (&lt;a href="http://www.rcsb.org" class="externalLink"&gt;Protein Data Bank&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;, &lt;a href="http://www.uniprot.org" class="externalLink"&gt;UniProtKB&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;, &lt;a href="http://ncbi.nlm.nih.gov" class="externalLink"&gt;NCBI GenBank/RefSeq&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;)&lt;/li&gt;&lt;li&gt;Automatic Detection of Identifiers&lt;/li&gt;&lt;li&gt;Custom Semantic Markup&lt;/li&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Getting Started&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Trying out the add-in:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;You will need Microsoft Word 2007 (Windows XP or Windows Vista)&lt;/li&gt;&lt;li&gt;Download and install the latest Technology Preview release of the add-in&lt;/li&gt;&lt;li&gt;Open the test document from the Releases tab on this page and enable Term Recognition in the Ontology tab within Word&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;Examining the source code and contributing to the project:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;Navigate to the Source Code tab&lt;/li&gt;&lt;li&gt;You can use the free version of Visual Studio (Visual C# 2008 Express Edition) to build the project&lt;/li&gt;&lt;li&gt;Add comments in the Discussion tab, and report problems under Issue Tracker&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Design Documentation&lt;/b&gt;&lt;br /&gt;&lt;a href="http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=designdocs&amp;amp;referringTitle=Home"&gt;Design Documents&lt;/a&gt;&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Semantic Tagging&lt;/b&gt;&lt;br /&gt; &lt;br /&gt;When a word or set of words is tagged by the add-in, the text is wrapped in custom XML tags that associate it with the ontology term. The example below shows the word &amp;quot;disease&amp;quot; tagged with a term from the Human Disease ontology.&lt;br /&gt; &lt;br /&gt;&lt;div style="color:Black;background-color:White;"&gt;&lt;pre&gt;
&lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;http://biolit.ucsd.edu/biolitschema&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;biolit-term&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;id&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;DOID:4&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;type&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;Human disease&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;status&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;true&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;OntName&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;Human disease&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;url&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;http://purl.org/obo/owl/DOID#DOID_4&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;BioLitTags&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;tag1&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
            &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;disease&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt; &lt;br /&gt;To transform the Word file (docx) to other formats, this set of tags needs to be processed using XSLT or a similar technology. Other CodePlex projects implement transformations of docx files to other formats and can serve as a starting point.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Background&lt;/b&gt;&lt;br /&gt;Cyberinfrastructure is integral to all aspects of conducting experimental research and distributing its results. However, it has yet to make a similar impact on the way we communicate that information. Peer-reviewed publications have long been the currency of scientific research, as they are the fundamental unit through which scientists communicate with and evaluate each other. Yet, in striking contrast to the data, publications have not benefited from the opportunities offered by cyberinfrastructure. While the means of distributing publications has vastly improved, publishers have done little else to capitalize on the electronic medium. In particular, semantic information describing the content of these publications is sorely lacking, as is the integration of this information with data in public repositories. This is confounding considering that many basic tools for marking up and integrating publication content in this manner already exist, such as a centralized literature database, relevant ontologies, and machine-readable document standards. We propose to address this delay in the maturation of scholarly communication by developing open source tools to facilitate the semantic mark-up of new manuscripts and the submission of those manuscripts directly to a journal’s electronic publishing system. &lt;br /&gt;
&lt;/div&gt;</description><author>jlfink</author><pubDate>Tue, 17 Mar 2009 07:42:59 GMT</pubDate><guid isPermaLink="false">Updated Wiki: Home 20090317074259A</guid></item><item><title>Updated Wiki: Home</title><link>http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=Home&amp;version=23</link><description>&lt;div class="wikidoc"&gt;
&lt;b&gt;Project Description&lt;/b&gt;&lt;br /&gt;A Word 2007 add-in that enables the annotation of Word documents based on terms that appear in Ontologies&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Summary&lt;/b&gt;&lt;br /&gt;Microsoft External Research’s goal  with this project is to enable communities who maintain ontologies to more easily experiment and to enhance the experience of authors who use Microsoft Word for content creation, incorporating semantic knowledge into the content.  This add-in should simplify the development and validation of ontologies, by making ontologies more accessible to a wide audience of authors and by enabling semantic content to be integrated in the authoring experience, capturing the author’s intent and knowledge at the source, and facilitating downstream discoverability. &lt;br /&gt; &lt;br /&gt;The goal of the add-in is to assist scientists in writing a manuscript that is easily integrated with existing and pending electronic resources. The major aims of this project are to add semantic information as XML mark-up to the manuscript using ontologies and controlled vocabularies (from the National Center for Biomedical Ontology) and identifiers from major biological databases, and to integrate manuscript content with existing public data repositories.&lt;br /&gt; &lt;br /&gt;As part of the publishing workflow and archiving process, the terms added by the add-in, providing the semantic information, can be extracted from Word files, as they are stored as custom XML tags as part of the content.  
The semantic knowledge can then be preserved as the document is converted to other formats, such as HTML or the XML format from the National Library of Medicine, which is commonly used for archiving.&lt;br /&gt; &lt;br /&gt;The full benefit of semantically rich content will come from an end-to-end approach to preserving semantics and metadata through the publishing pipeline: capturing knowledge from the subject experts (the authors), preserving it through publication, making it available to search engines, and presenting it to the people consuming the content. &lt;br /&gt; &lt;br /&gt;This project resulted from an initial and ongoing collaboration between Microsoft External Research and Dr. Phil Bourne and Dr. Lynn Fink at the University of California San Diego. Additional collaboration with the staff from Science Commons aims to make the add-in relevant to a wider audience and to preserve semantic data along the publishing pipeline.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Audience&lt;/b&gt;&lt;br /&gt;This project is aimed at researchers and software developers in domains that use ontologies, as well as publishers, archivists, and early adopters in the scientific, technical, and scholarly publishing fields.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Specific features&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Inline Syntax Coloring of Informative Words&lt;/li&gt;&lt;li&gt;Built-in Knowledge of &lt;a href="http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=Ontologies%20and%20Controlled%20Vocabularies&amp;amp;referringTitle=Home"&gt;Ontologies and Controlled Vocabularies&lt;/a&gt; maintained and delivered by &lt;a href="http://bioportal.bioontology.org/" class="externalLink"&gt;NCBO&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Built-in Knowledge of Biological Databases (&lt;a href="http://www.rcsb.org" class="externalLink"&gt;Protein Data Bank&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;, &lt;a href="http://www.uniprot.org" class="externalLink"&gt;UniProtKB&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;, &lt;a href="http://ncbi.nlm.nih.gov" class="externalLink"&gt;NCBI GenBank/RefSeq&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;)&lt;/li&gt;&lt;li&gt;Automatic Detection of Identifiers&lt;/li&gt;&lt;li&gt;Custom Semantic Markup&lt;/li&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Getting Started&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Trying out the add-in:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;You will need Microsoft Word 2007 (Windows XP or Windows Vista)&lt;/li&gt;&lt;li&gt;Download and install the latest Technology Preview release of the add-in&lt;/li&gt;&lt;li&gt;Open the test document from the Releases tab on this page and enable Term Recognition in the Ontology tab within Word&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;Examining the source code and contributing to the project:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;Navigate to the Source Code tab&lt;/li&gt;&lt;li&gt;You can use the free version of Visual Studio (Visual C# 2008 Express Edition) to build the project&lt;/li&gt;&lt;li&gt;Add comments in the Discussion tab, and report problems under Issue Tracker&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Design Documentation&lt;/b&gt;&lt;br /&gt;&lt;a href="http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=designdocs&amp;amp;referringTitle=Home"&gt;Design Documents&lt;/a&gt;&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Semantic Tagging&lt;/b&gt;&lt;br /&gt; &lt;br /&gt;When a word or set of words is tagged by the add-in, the text is wrapped in custom XML tags that associate it with the ontology term. The example below shows the word &amp;quot;disease&amp;quot; tagged with a term from the Human Disease ontology.&lt;br /&gt; &lt;br /&gt;&lt;div style="color:Black;background-color:White;"&gt;&lt;pre&gt;
&lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;http://biolit.ucsd.edu/biolitschema&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;biolit-term&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;id&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;DOID:4&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;type&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;Human disease&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;status&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;true&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;OntName&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;Human disease&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;url&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;http://purl.org/obo/owl/DOID#DOID_4&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;BioLitTags&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;tag1&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
            &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;disease&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt; &lt;br /&gt;If the Word file (docx) is to be transformed to other formats, this set of tags needs to be processed using XSLT or other technologies.  Other CodePlex projects implement transformations of docx files to other formats and can serve as a starting point.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Background&lt;/b&gt;&lt;br /&gt;Cyberinfrastructure is integral to all aspects of conducting experimental research and distributing those results. However, it has yet to make a similar impact on the way we communicate that information. Peer-reviewed publications have long been the currency of scientific research as they are the fundamental unit through which scientists communicate with and evaluate each other. However, in striking contrast to the data, publications have yet to benefit from the opportunities offered by cyberinfrastructure. While the means of distributing publications has vastly improved, publishers have done little else to capitalize on the electronic medium. In particular, semantic information describing the content of these publications is sorely lacking, as is the integration of this information with data in public repositories. This is confounding considering that many basic tools for marking up and integrating publication content in this manner already exist, such as a centralized literature database, relevant ontologies, and machine-readable document standards. We propose to address this delay in the maturation of scholarly communication by developing open source tools to facilitate the semantic mark-up of new manuscripts and the submission of those manuscripts directly to a journal’s electronic publishing system. &lt;br /&gt;
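&lt;br /&gt;&lt;b&gt;Semantic Tagging: extraction sketch&lt;/b&gt;&lt;br /&gt;The Semantic Tagging example above notes that the tags must be processed when a docx file is converted to other formats. The following is a minimal sketch of that first step, not part of the add-in itself: it reads the tagged terms back out of the document XML. It is written in Python purely for illustration, the function names extract_biolit_terms and read_document_xml are hypothetical, and it assumes the w: prefix is bound to the standard WordprocessingML namespace and that the main document part is stored at word/document.xml inside the docx archive.&lt;br /&gt; &lt;br /&gt;&lt;div style="color:Black;background-color:White;"&gt;&lt;pre&gt;

```python
import xml.etree.ElementTree as ET
import zipfile

# WordprocessingML namespace that the w: prefix is assumed to be bound to.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
# Schema URI and element name copied from the tagging example.
BIOLIT_URI = "http://biolit.ucsd.edu/biolitschema"
BIOLIT_ELEMENT = "biolit-term"


def w(name):
    """Return a namespace-qualified name in ElementTree's {uri}local form."""
    return "{" + W_NS + "}" + name


def read_document_xml(docx_path):
    """Read the main document part from a docx archive (hypothetical helper)."""
    with zipfile.ZipFile(docx_path) as archive:
        return archive.read("word/document.xml")


def extract_biolit_terms(document_xml):
    """Return one dict per biolit-term element: the tagged text and its w:attr pairs."""
    root = ET.fromstring(document_xml)
    terms = []
    for node in root.iter(w("customXml")):
        # Skip custom XML elements that belong to other schemas.
        if node.get(w("uri")) != BIOLIT_URI:
            continue
        if node.get(w("element")) != BIOLIT_ELEMENT:
            continue
        # Collect the name/value pairs stored under w:customXmlPr.
        attributes = {}
        properties = node.find(w("customXmlPr"))
        if properties is not None:
            for attr in properties.findall(w("attr")):
                attributes[attr.get(w("name"))] = attr.get(w("val"))
        # Join every text run inside the element to recover the tagged words.
        text = "".join(t.text or "" for t in node.iter(w("t")))
        terms.append({"text": text, "attributes": attributes})
    return terms
```

&lt;/pre&gt;&lt;/div&gt; &lt;br /&gt;For a document containing the fragment shown above, extract_biolit_terms(read_document_xml(&amp;quot;manuscript.docx&amp;quot;)) should return a single entry whose text is &amp;quot;disease&amp;quot; and whose attributes include id = DOID:4; the file name is only a placeholder. A converter to HTML or another XML format could then emit these values in whatever form the target format supports.&lt;br /&gt;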
&lt;/div&gt;</description><author>jlfink</author><pubDate>Fri, 13 Mar 2009 18:45:41 GMT</pubDate><guid isPermaLink="false">Updated Wiki: Home 20090313064541P</guid></item><item><title>Updated Wiki: Ontologies and Controlled Vocabularies</title><link>http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=Ontologies and Controlled Vocabularies&amp;version=1</link><description>&lt;div class="wikidoc"&gt;
&lt;h1&gt;
NCBO Ontology Availability
&lt;/h1&gt; &lt;br /&gt;The add-in currently downloads the following NCBO ontologies:&lt;br /&gt; &lt;br /&gt; &lt;br /&gt;Amphibian gross anatomy &lt;br /&gt; &lt;br /&gt;Biological imaging methods&lt;br /&gt; &lt;br /&gt;Biological process&lt;br /&gt; &lt;br /&gt;BRENDA tissue / enzyme source&lt;br /&gt; &lt;br /&gt;C. elegans development&lt;br /&gt; &lt;br /&gt;C. elegans phenotype&lt;br /&gt; &lt;br /&gt;Cell type&lt;br /&gt; &lt;br /&gt;Cellular component&lt;br /&gt; &lt;br /&gt;Cereal plant trait&lt;br /&gt; &lt;br /&gt;Dictyostelium discoideum anatomy&lt;br /&gt; &lt;br /&gt;Drosophila development&lt;br /&gt; &lt;br /&gt;Drosophila gross anatomy&lt;br /&gt; &lt;br /&gt;Environment Ontology&lt;br /&gt; &lt;br /&gt;Event (INOH pathway ontology)&lt;br /&gt; &lt;br /&gt;Fungal gross anatomy&lt;br /&gt; &lt;br /&gt;Human developmental anatomy, abstract version&lt;br /&gt; &lt;br /&gt;Human developmental anatomy, timed version&lt;br /&gt; &lt;br /&gt;Human disease&lt;br /&gt; &lt;br /&gt;Infectious disease&lt;br /&gt; &lt;br /&gt;Mammalian phenotype&lt;br /&gt; &lt;br /&gt;Mass spectrometry&lt;br /&gt; &lt;br /&gt;Molecular function&lt;br /&gt; &lt;br /&gt;Mosquito gross anatomy&lt;br /&gt; &lt;br /&gt;Mouse adult gross anatomy&lt;br /&gt; &lt;br /&gt;Mouse pathology&lt;br /&gt; &lt;br /&gt;Pathogen transmission&lt;br /&gt; &lt;br /&gt;Pathway ontology&lt;br /&gt; &lt;br /&gt;Physico-chemical methods and properties&lt;br /&gt; &lt;br /&gt;Physico-chemical process&lt;br /&gt; &lt;br /&gt;Plant environmental conditions&lt;br /&gt; &lt;br /&gt;Plant growth and developmental stage&lt;br /&gt; &lt;br /&gt;Plant structure&lt;br /&gt; &lt;br /&gt;Protein-protein interaction&lt;br /&gt; &lt;br /&gt;Spider Ontology&lt;br /&gt; &lt;br /&gt;Teleost anatomy and development&lt;br /&gt; &lt;br /&gt;Tick gross anatomy&lt;br /&gt; &lt;br /&gt;Xenopus anatomy and development&lt;br /&gt; &lt;br /&gt;Zebrafish anatomy and development&lt;br /&gt;
&lt;/div&gt;</description><author>jlfink</author><pubDate>Fri, 13 Mar 2009 03:02:50 GMT</pubDate><guid isPermaLink="false">Updated Wiki: Ontologies and Controlled Vocabularies 20090313030250A</guid></item><item><title>Updated Wiki: Home</title><link>http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=Home&amp;version=22</link><description>&lt;div class="wikidoc"&gt;
&lt;b&gt;Project Description&lt;/b&gt;&lt;br /&gt;A Word 2007 add-in that enables the annotation of Word documents based on terms that appear in ontologies&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Summary&lt;/b&gt;&lt;br /&gt;Microsoft External Research’s goal with this project is to enable communities who maintain ontologies to experiment more easily, and to enhance the experience of authors who use Microsoft Word for content creation by incorporating semantic knowledge into the content.  This add-in should simplify the development and validation of ontologies by making them accessible to a wide audience of authors and by integrating semantic content into the authoring experience, capturing the author’s intent and knowledge at the source and facilitating downstream discoverability. &lt;br /&gt; &lt;br /&gt;The goal of the add-in is to assist scientists in writing a manuscript that is easily integrated with existing and pending electronic resources. The major aims of this project are to add semantic information as XML mark-up to the manuscript using ontologies and controlled vocabularies (using OBO), and to integrate manuscript content with existing public data repositories.&lt;br /&gt; &lt;br /&gt;As part of the publishing workflow and archiving process, the terms added by the add-in, which provide the semantic information, can be extracted from Word files because they are stored as custom XML tags within the content.  
The semantic knowledge can then be preserved as the document is converted to other formats, such as HTML or the XML format from the National Library of Medicine, which is commonly used for archiving.&lt;br /&gt; &lt;br /&gt;The full benefit of semantically rich content will result from an end-to-end approach to preserving semantics and metadata through the publishing pipeline: capturing knowledge from the subject experts (the authors), preserving this knowledge when the content is published, and making it available to search engines and to people consuming the content. &lt;br /&gt; &lt;br /&gt;This project resulted from an initial and ongoing collaboration between Microsoft External Research and Dr. Phil Bourne and Dr. Lynn Fink at the University of California San Diego.  Additional collaboration with the staff from Science Commons aims to make the add-in relevant to a wider audience and also to preserve semantic data along the publishing pipeline.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Audience&lt;/b&gt;&lt;br /&gt;This project is focused on researchers and software developers in domains that use ontologies, as well as publishers, archivists, and early adopters in the scientific, technical, and scholarly publishing fields.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Specific features&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Inline Syntax Coloring of Informative Words&lt;/li&gt;&lt;li&gt;Built-in Knowledge of &lt;a href="http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=Ontologies%20and%20Controlled%20Vocabularies&amp;amp;referringTitle=Home"&gt;Ontologies and Controlled Vocabularies&lt;/a&gt; maintained and delivered by &lt;a href="http://bioportal.bioontology.org/" class="externalLink"&gt;NCBO&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Built-in Knowledge of Biological Databases (&lt;a href="http://www.rcsb.org" class="externalLink"&gt;Protein Data Bank&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;, &lt;a href="http://www.uniprot.org" class="externalLink"&gt;UniProtKB&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;, &lt;a href="http://ncbi.nlm.nih.gov" class="externalLink"&gt;NCBI GenBank/RefSeq&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;)&lt;/li&gt;&lt;li&gt;Automatic Detection of Identifiers&lt;/li&gt;&lt;li&gt;Custom Semantic Markup&lt;/li&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Getting Started&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Trying out the add-in:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;You will need Microsoft Word 2007 (Windows XP or Windows Vista)&lt;/li&gt;&lt;li&gt;Download and install the latest Technology Preview release of the add-in&lt;/li&gt;&lt;li&gt;Open the test document from the Releases tab in this page and enable Term Recognition in the Ontology tab within Word&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;Examining the source code and contributing to the project:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;Navigate to the Source Code tab&lt;/li&gt;&lt;li&gt;You can use the free version of Visual Studio (Visual C# 2008 Express Edition) to build the project&lt;/li&gt;&lt;li&gt;Add comments in the Discussion tab, and report problems under Issue Tracker&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Design Documentation&lt;/b&gt;&lt;br /&gt;&lt;a href="http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=designdocs&amp;amp;referringTitle=Home"&gt;Design Documents&lt;/a&gt;&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Semantic Tagging&lt;/b&gt;&lt;br /&gt; &lt;br /&gt;When a word or phrase is tagged by the add-in, it is wrapped in custom XML tags that associate it with the ontology term.  The example below shows the word &amp;quot;disease&amp;quot; tagged with the Human disease ontology (term DOID:4).&lt;br /&gt; &lt;br /&gt;&lt;div style="color:Black;background-color:White;"&gt;&lt;pre&gt;
&lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;http://biolit.ucsd.edu/biolitschema&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;biolit-term&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;id&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;DOID:4&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;type&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;Human disease&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;status&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;true&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;OntName&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;Human disease&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;url&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;http://purl.org/obo/owl/DOID#DOID_4&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;BioLitTags&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;tag1&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
            &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;disease&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt; &lt;br /&gt;If the Word file (docx) is to be transformed to other formats, this set of tags needs to be processed using XSLT or other technologies.  Other CodePlex projects implement transformations of docx files to other formats and can serve as a starting point.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Background&lt;/b&gt;&lt;br /&gt;Cyberinfrastructure is integral to all aspects of conducting experimental research and distributing those results. However, it has yet to make a similar impact on the way we communicate that information. Peer-reviewed publications have long been the currency of scientific research as they are the fundamental unit through which scientists communicate with and evaluate each other. However, in striking contrast to the data, publications have yet to benefit from the opportunities offered by cyberinfrastructure. While the means of distributing publications has vastly improved, publishers have done little else to capitalize on the electronic medium. In particular, semantic information describing the content of these publications is sorely lacking, as is the integration of this information with data in public repositories. This is confounding considering that many basic tools for marking up and integrating publication content in this manner already exist, such as a centralized literature database, relevant ontologies, and machine-readable document standards. We propose to address this delay in the maturation of scholarly communication by developing open source tools to facilitate the semantic mark-up of new manuscripts and the submission of those manuscripts directly to a journal’s electronic publishing system. &lt;br /&gt;
&lt;/div&gt;</description><author>jlfink</author><pubDate>Fri, 13 Mar 2009 02:59:14 GMT</pubDate><guid isPermaLink="false">Updated Wiki: Home 20090313025914A</guid></item><item><title>Updated Wiki: Home</title><link>http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=Home&amp;version=21</link><description>&lt;div class="wikidoc"&gt;
&lt;b&gt;Project Description&lt;/b&gt;&lt;br /&gt;A Word 2007 add-in that enables the annotation of Word documents based on terms that appear in ontologies&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Summary&lt;/b&gt;&lt;br /&gt;Microsoft External Research’s goal with this project is to enable communities who maintain ontologies to experiment more easily, and to enhance the experience of authors who use Microsoft Word for content creation by incorporating semantic knowledge into the content.  This add-in should simplify the development and validation of ontologies by making them accessible to a wide audience of authors and by integrating semantic content into the authoring experience, capturing the author’s intent and knowledge at the source and facilitating downstream discoverability. &lt;br /&gt; &lt;br /&gt;The goal of the add-in is to assist scientists in writing a manuscript that is easily integrated with existing and pending electronic resources. The major aims of this project are to add semantic information as XML mark-up to the manuscript using ontologies and controlled vocabularies (using OBO), and to integrate manuscript content with existing public data repositories.&lt;br /&gt; &lt;br /&gt;As part of the publishing workflow and archiving process, the terms added by the add-in, which provide the semantic information, can be extracted from Word files because they are stored as custom XML tags within the content.  
The semantic knowledge can then be preserved as the document is converted to other formats, such as HTML or the XML format from the National Library of Medicine, which is commonly used for archiving.&lt;br /&gt; &lt;br /&gt;The full benefit of semantically rich content will result from an end-to-end approach to preserving semantics and metadata through the publishing pipeline: capturing knowledge from the subject experts (the authors), preserving this knowledge when the content is published, and making it available to search engines and to people consuming the content. &lt;br /&gt; &lt;br /&gt;This project resulted from an initial and ongoing collaboration between Microsoft External Research and Dr. Phil Bourne and Dr. Lynn Fink at the University of California San Diego.  Additional collaboration with the staff from Science Commons aims to make the add-in relevant to a wider audience and also to preserve semantic data along the publishing pipeline.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Audience&lt;/b&gt;&lt;br /&gt;This project is focused on researchers and software developers in domains that use ontologies, as well as publishers, archivists, and early adopters in the scientific, technical, and scholarly publishing fields.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Specific features&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Inline Syntax Coloring of Informative Words&lt;/li&gt;&lt;li&gt;Built-in Knowledge of Ontologies and Controlled Vocabularies maintained and delivered by &lt;a href="http://bioportal.bioontology.org/" class="externalLink"&gt;NCBO&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Built-in Knowledge of Biological Databases (&lt;a href="http://www.rcsb.org" class="externalLink"&gt;Protein Data Bank&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;, &lt;a href="http://www.uniprot.org" class="externalLink"&gt;UniProtKB&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;, &lt;a href="http://ncbi.nlm.nih.gov" class="externalLink"&gt;NCBI GenBank/RefSeq&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;)&lt;/li&gt;&lt;li&gt;Automatic Detection of Identifiers&lt;/li&gt;&lt;li&gt;Custom Semantic Markup&lt;/li&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Getting Started&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Trying out the add-in:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;You will need Microsoft Word 2007 (Windows XP or Windows Vista)&lt;/li&gt;&lt;li&gt;Download and install the latest Technology Preview release of the add-in&lt;/li&gt;&lt;li&gt;Open the test document from the Releases tab in this page and enable Term Recognition in the Ontology tab within Word&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;Examining the source code and contributing to the project:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;Navigate to the Source Code tab&lt;/li&gt;&lt;li&gt;You can use the free version of Visual Studio (Visual C# 2008 Express Edition) to build the project&lt;/li&gt;&lt;li&gt;Add comments in the Discussion tab, and report problems under Issue Tracker&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Design Documentation&lt;/b&gt;&lt;br /&gt;&lt;a href="http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=designdocs&amp;amp;referringTitle=Home"&gt;Design Documents&lt;/a&gt;&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Semantic Tagging&lt;/b&gt;&lt;br /&gt; &lt;br /&gt;When a word or phrase is tagged by the add-in, it is wrapped in custom XML tags that associate it with the ontology term.  The example below shows the word &amp;quot;disease&amp;quot; tagged with the Human disease ontology (term DOID:4).&lt;br /&gt; &lt;br /&gt;&lt;div style="color:Black;background-color:White;"&gt;&lt;pre&gt;
&lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;http://biolit.ucsd.edu/biolitschema&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;biolit-term&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;id&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;DOID:4&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;type&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;Human disease&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;status&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;true&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;OntName&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;Human disease&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;url&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;http://purl.org/obo/owl/DOID#DOID_4&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;BioLitTags&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;tag1&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
            &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;disease&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt; &lt;br /&gt;If the Word file (docx) is to be transformed to other formats, this set of tags needs to be processed using XSLT or a similar technology.  Other CodePlex projects already implement transformations of docx files to other formats and can serve as a starting point.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Background&lt;/b&gt;&lt;br /&gt;Cyberinfrastructure is integral to all aspects of conducting experimental research and distributing those results. However, it has yet to make a similar impact on the way we communicate that information. Peer-reviewed publications have long been the currency of scientific research as they are the fundamental unit through which scientists communicate with and evaluate each other. However, in striking contrast to the data, publications have yet to benefit from the opportunities offered by cyberinfrastructure. While the means of distributing publications has vastly improved, publishers have done little else to capitalize on the electronic medium. In particular, semantic information describing the content of these publications is sorely lacking, as is the integration of this information with data in public repositories. This is confounding considering that many basic tools for marking up and integrating publication content in this manner already exist, such as a centralized literature database, relevant ontologies, and machine-readable document standards. We propose to address this delay in the maturation of scholarly communication by developing open source tools to facilitate the semantic mark-up of new manuscripts and the submission of those manuscripts directly to a journal’s electronic publishing system. &lt;br /&gt;
&lt;/div&gt;</description><author>jlfink</author><pubDate>Fri, 13 Mar 2009 02:21:18 GMT</pubDate><guid isPermaLink="false">Updated Wiki: Home 20090313022118A</guid></item><item><title>Updated Wiki: Home</title><link>http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=Home&amp;version=20</link><description>&lt;div class="wikidoc"&gt;
&lt;b&gt;Project Description&lt;/b&gt;&lt;br /&gt;A Word 2007 add-in that enables the annotation of Word documents based on terms that appear in Ontologies&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Summary&lt;/b&gt;&lt;br /&gt;Microsoft External Research’s goal  with this project is to enable communities who maintain ontologies to more easily experiment and to enhance the experience of authors who use Microsoft Word for content creation, incorporating semantic knowledge into the content.  This add-in should simplify the development and validation of ontologies, by making ontologies more accessible to a wide audience of authors and by enabling semantic content to be integrated in the authoring experience, capturing the author’s intent and knowledge at the source, and facilitating downstream discoverability. &lt;br /&gt; &lt;br /&gt;The goal of the add-in is to assist scientists in writing a manuscript that is easily integrated with existing and pending electronic resources. The major aims of this project are to add semantic information as XML mark-up to the manuscript using ontologies and controlled vocabularies (using OBO), and to integrate manuscript content with existing public data repositories.&lt;br /&gt; &lt;br /&gt;As part of the publishing workflow and archiving process, the terms added by the add-in, providing the semantic information, can be extracted from Word files, as they are stored as custom XML tags as part of the content.  
The semantic knowledge can then be preserved as the document is converted to other formats, such as HTML or the XML format from the National Library of Medicine, which is commonly used for archiving.&lt;br /&gt; &lt;br /&gt;The full benefit of semantic-rich content will result from an end-to-end approach to the preservation of semantics and metadata through the publishing pipeline, starting with capturing knowledge from the subject experts, the authors, and enabling this knowledge to be preserved when published, as well as made available to search engines and presented to people consuming the content. &lt;br /&gt; &lt;br /&gt;This project resulted from an initial and ongoing collaboration between Microsoft External Research and Dr. Phil Bourne and Dr. Lynn Fink, at the University of California San Diego.  Additional collaboration with the staff from Science Commons aims to make the add-in relevant to a wider audience and also to preserve semantic data along the publishing pipeline.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Audience&lt;/b&gt;&lt;br /&gt;This project is focused on researchers and software developers in domains utilizing ontologies, as well as publishers, archivists, and early adopters in the scientific, technical, and scholarly publishing fields.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Specific features&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Inline Syntax Coloring of Informative Words&lt;/li&gt;&lt;li&gt;Built-in Knowledge of Ontologies and Controlled Vocabularies maintained and delivered by &lt;a href="http://bioportal.bioontology.org/" class="externalLink"&gt;NCBO&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;&lt;/li&gt;&lt;li&gt;Built-in Knowledge of Biological Databases (&lt;a href="http://www.rcsb.org" class="externalLink"&gt;Protein Data Bank&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;, &lt;a href="http://www.uniprot.org" class="externalLink"&gt;UniProtKB&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;, &lt;a href="http://ncbi.nlm.nih.gov" class="externalLink"&gt;NCBI GenBank/RefSeq&lt;span class="externalLinkIcon"&gt;&lt;/span&gt;&lt;/a&gt;)&lt;/li&gt;&lt;li&gt;Automatic Detection of Identifiers&lt;/li&gt;&lt;li&gt;Custom Semantic Markup&lt;/li&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Getting Started&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Trying out the add-in:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;You will need Microsoft Word 2007 (Windows XP or Windows Vista)&lt;/li&gt;&lt;li&gt;Download and install the latest Technology Preview release of the add-in&lt;/li&gt;&lt;li&gt;Open the test document from the Releases tab on this page and enable Term Recognition in the Ontology tab within Word&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;Examining the source code and contributing to the project:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;Navigate to the Source Code tab&lt;/li&gt;&lt;li&gt;You can use the free version of Visual Studio (Visual C# 2008 Express Edition) to build the project&lt;/li&gt;&lt;li&gt;Add comments in the Discussion tab, and report problems under Issue Tracker&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Design Documentation&lt;/b&gt;&lt;br /&gt;&lt;a href="http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=designdocs&amp;amp;referringTitle=Home"&gt;Design Documents&lt;/a&gt;&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Semantic Tagging&lt;/b&gt;&lt;br /&gt; &lt;br /&gt;When a word or set of words is tagged by the add-in, the text is wrapped in custom XML tags that associate it with the ontology term.  The example below shows the word &amp;quot;disease&amp;quot; tagged with a term from the Human Disease ontology.&lt;br /&gt; &lt;br /&gt;&lt;div style="color:Black;background-color:White;"&gt;&lt;pre&gt;
&lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;http://biolit.ucsd.edu/biolitschema&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;biolit-term&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;id&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;DOID:4&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;type&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;Human disease&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;status&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;true&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;OntName&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;Human disease&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;url&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;http://purl.org/obo/owl/DOID#DOID_4&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;BioLitTags&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;tag1&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
            &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;disease&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt; &lt;br /&gt;If the Word file (docx) is to be transformed to other formats, this set of tags needs to be processed using XSLT or a similar technology.  Other CodePlex projects already implement transformations of docx files to other formats and can serve as a starting point.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Background&lt;/b&gt;&lt;br /&gt;Cyberinfrastructure is integral to all aspects of conducting experimental research and distributing those results. However, it has yet to make a similar impact on the way we communicate that information. Peer-reviewed publications have long been the currency of scientific research as they are the fundamental unit through which scientists communicate with and evaluate each other. However, in striking contrast to the data, publications have yet to benefit from the opportunities offered by cyberinfrastructure. While the means of distributing publications has vastly improved, publishers have done little else to capitalize on the electronic medium. In particular, semantic information describing the content of these publications is sorely lacking, as is the integration of this information with data in public repositories. This is confounding considering that many basic tools for marking up and integrating publication content in this manner already exist, such as a centralized literature database, relevant ontologies, and machine-readable document standards. We propose to address this delay in the maturation of scholarly communication by developing open source tools to facilitate the semantic mark-up of new manuscripts and the submission of those manuscripts directly to a journal’s electronic publishing system. &lt;br /&gt;
&lt;/div&gt;</description><author>jlfink</author><pubDate>Fri, 13 Mar 2009 02:20:52 GMT</pubDate><guid isPermaLink="false">Updated Wiki: Home 20090313022052A</guid></item><item><title>Updated Wiki: Home</title><link>http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=Home&amp;version=19</link><description>&lt;div class="wikidoc"&gt;
&lt;b&gt;Project Description&lt;/b&gt;&lt;br /&gt;A Word 2007 add-in that enables the annotation of Word documents based on terms that appear in Ontologies&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Summary&lt;/b&gt;&lt;br /&gt;Microsoft External Research’s goal  with this project is to enable communities who maintain ontologies to more easily experiment and to enhance the experience of authors who use Microsoft Word for content creation, incorporating semantic knowledge into the content.  This add-in should simplify the development and validation of ontologies, by making ontologies more accessible to a wide audience of authors and by enabling semantic content to be integrated in the authoring experience, capturing the author’s intent and knowledge at the source, and facilitating downstream discoverability. &lt;br /&gt; &lt;br /&gt;The goal of the add-in is to assist scientists in writing a manuscript that is easily integrated with existing and pending electronic resources. The major aims of this project are to add semantic information as XML mark-up to the manuscript using ontologies and controlled vocabularies (using OBO), and to integrate manuscript content with existing public data repositories.&lt;br /&gt; &lt;br /&gt;As part of the publishing workflow and archiving process, the terms added by the add-in, providing the semantic information, can be extracted from Word files, as they are stored as custom XML tags as part of the content.  
The semantic knowledge can then be preserved as the document is converted to other formats, such as HTML or the XML format from the National Library of Medicine, which is commonly used for archiving.&lt;br /&gt; &lt;br /&gt;The full benefit of semantic-rich content will result from an end-to-end approach to the preservation of semantics and metadata through the publishing pipeline, starting with capturing knowledge from the subject experts, the authors, and enabling this knowledge to be preserved when published, as well as made available to search engines and presented to people consuming the content. &lt;br /&gt; &lt;br /&gt;This project resulted from an initial and ongoing collaboration between Microsoft External Research and Dr. Phil Bourne and Dr. Lynn Fink, at the University of California San Diego.  Additional collaboration with the staff from Science Commons aims to make the add-in relevant to a wider audience and also to preserve semantic data along the publishing pipeline.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Audience&lt;/b&gt;&lt;br /&gt;This project is focused on researchers and software developers in domains utilizing ontologies, as well as publishers, archivists, and early adopters in the scientific, technical, and scholarly publishing fields.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Specific features&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Inline Syntax Coloring of Informative Words&lt;/li&gt;&lt;li&gt;Built-in Knowledge of Ontologies and Controlled Vocabularies&lt;/li&gt;&lt;li&gt;Automatic Detection of Identifiers&lt;/li&gt;&lt;li&gt;Custom Semantic Markup&lt;/li&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Getting Started&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Trying out the add-in:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;You will need Microsoft Word 2007 (Windows XP or Windows Vista)&lt;/li&gt;&lt;li&gt;Download and install the latest Technology Preview release of the add-in&lt;/li&gt;&lt;li&gt;Open the test document from the Releases tab on this page and enable Term Recognition in the Ontology tab within Word&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;Examining the source code and contributing to the project:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;Navigate to the Source Code tab&lt;/li&gt;&lt;li&gt;You can use the free version of Visual Studio (Visual C# 2008 Express Edition) to build the project&lt;/li&gt;&lt;li&gt;Add comments in the Discussion tab, and report problems under Issue Tracker&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Design Documentation&lt;/b&gt;&lt;br /&gt;&lt;a href="http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=designdocs&amp;amp;referringTitle=Home"&gt;Design Documents&lt;/a&gt;&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Semantic Tagging&lt;/b&gt;&lt;br /&gt; &lt;br /&gt;When a word or set of words is tagged by the add-in, the text is wrapped in custom XML tags that associate it with the ontology term.  The example below shows the word &amp;quot;disease&amp;quot; tagged with a term from the Human Disease ontology.&lt;br /&gt; &lt;br /&gt;&lt;div style="color:Black;background-color:White;"&gt;&lt;pre&gt;
&lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;http://biolit.ucsd.edu/biolitschema&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;biolit-term&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;id&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;DOID:4&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;type&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;Human disease&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;status&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;true&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;OntName&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;Human disease&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;url&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;http://purl.org/obo/owl/DOID#DOID_4&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;BioLitTags&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;tag1&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
            &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;disease&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt; &lt;br /&gt;If the Word file (docx) is to be transformed to other formats, this set of tags needs to be processed using XSLT or other technologies.  Other CodePlex projects implement transformations of docx files to other formats and can serve as a starting point.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Background&lt;/b&gt;&lt;br /&gt;Cyberinfrastructure is integral to all aspects of conducting experimental research and distributing those results. However, it has yet to make a similar impact on the way we communicate that information. Peer-reviewed publications have long been the currency of scientific research as they are the fundamental unit through which scientists communicate with and evaluate each other. However, in striking contrast to the data, publications have yet to benefit from the opportunities offered by cyberinfrastructure. While the means of distributing publications has vastly improved, publishers have done little else to capitalize on the electronic medium. In particular, semantic information describing the content of these publications is sorely lacking, as is the integration of this information with data in public repositories. This is confounding considering that many basic tools for marking up and integrating publication content in this manner already exist, such as a centralized literature database, relevant ontologies, and machine-readable document standards. We propose to address this delay in the maturation of scholarly communication by developing open source tools to facilitate the semantic mark-up of new manuscripts and the submission of those manuscripts directly to a journal’s electronic publishing system. &lt;br /&gt; &lt;br /&gt; &lt;br /&gt; &lt;br /&gt; &lt;br /&gt;
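As an alternative to XSLT, the biolit-term markup shown in the Semantic Tagging example can also be read with a short script. The following is a minimal sketch, not part of the add-in: the function names are hypothetical, and it assumes the input is well-formed XML in which the w: prefix is bound to the standard WordprocessingML namespace (the wiki example omits that declaration).

```python
import xml.etree.ElementTree as ET

# WordprocessingML main namespace used by the w: prefix in docx files.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def w(name):
    """Return the fully qualified ElementTree name for a w: element or attribute."""
    return "{%s}%s" % (W_NS, name)


def extract_biolit_terms(xml_text):
    """Collect the attributes and tagged text of every biolit-term element.

    Hypothetical helper: it assumes biolit-term elements are not nested
    inside one another.
    """
    root = ET.fromstring(xml_text)
    terms = []
    # iter() also yields the root itself, so a bare fragment works too.
    for node in root.iter(w("customXml")):
        if node.get(w("element")) != "biolit-term":
            continue
        props = {}
        for attr in node.iter(w("attr")):
            props[attr.get(w("name"))] = attr.get(w("val"))
        # Join all text runs wrapped by this tag.
        text = "".join(t.text or "" for t in node.iter(w("t")))
        terms.append({"text": text, "attributes": props})
    return terms
```

For a real document, the XML would typically come from the word/document.xml part of the docx package, which can be read with the standard zipfile module before being passed to extract_biolit_terms.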
&lt;/div&gt;</description><author>pablofe</author><pubDate>Tue, 10 Mar 2009 21:24:06 GMT</pubDate><guid isPermaLink="false">Updated Wiki: Home 20090310092406P</guid></item><item><title>Updated Wiki: Home</title><link>http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=Home&amp;version=18</link><description>&lt;div class="wikidoc"&gt;
&lt;b&gt;Project Description&lt;/b&gt;&lt;br /&gt;A Word 2007 add-in that enables the annotation of Word documents based on terms that appear in Ontologies&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Summary&lt;/b&gt;&lt;br /&gt;Microsoft External Research’s goal  with this project is to enable communities who maintain ontologies to more easily experiment and to enhance the experience of authors who use Microsoft Word for content creation, incorporating semantic knowledge into the content.  This add-in should simplify the development and validation of ontologies, by making ontologies more accessible to a wide audience of authors and by enabling semantic content to be integrated in the authoring experience, capturing the author’s intent and knowledge at the source, and facilitating downstream discoverability. &lt;br /&gt; &lt;br /&gt;The goal of the add-in is to assist scientists in writing a manuscript that is easily integrated with existing and pending electronic resources. The major aims of this project are to add semantic information as XML mark-up to the manuscript using ontologies and controlled vocabularies (using OBO), and to integrate manuscript content with existing public data repositories.&lt;br /&gt; &lt;br /&gt;As part of the publishing workflow and archiving process, the terms added by the add-in, providing the semantic information, can be extracted from Word files, as they are stored as custom XML tags as part of the content.  
The semantic knowledge can then be preserved as the document is converted to other formats, such as HTML or the XML format from the National Library of Medicine, which is commonly used for archiving.&lt;br /&gt; &lt;br /&gt;The full benefit of semantic-rich content will result from an end-to-end approach to the preservation of semantics and metadata through the publishing pipeline, starting with capturing knowledge from the subject experts, the authors, and enabling this knowledge to be preserved when published, as well as made available to search engines and presented to people consuming the content. &lt;br /&gt; &lt;br /&gt;This project resulted from an initial and ongoing collaboration between Microsoft External Research and Dr. Phil Bourne and Dr. Lynn Fink, at the University of California San Diego.  Additional collaboration with the staff from Science Commons aims to make the add-in relevant to a wider audience and also to preserve semantic data along the publishing pipeline.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Audience&lt;/b&gt;&lt;br /&gt;This project is focused on researchers and software developers in domains utilizing ontologies, as well as publishers, archivists, and early adopters in the scientific, technical, and scholarly publishing fields.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Specific features&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Inline Syntax Coloring of Informative Words&lt;/li&gt;&lt;li&gt;Built-in Knowledge of Ontologies and Controlled Vocabularies&lt;/li&gt;&lt;li&gt;Automatic Detection of Identifiers&lt;/li&gt;&lt;li&gt;Custom Semantic Markup&lt;/li&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Getting Started&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Trying out the add-in:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;You will need Microsoft Word 2007 (Windows XP or Windows Vista)&lt;/li&gt;&lt;li&gt;Download and install the latest Technology Preview release of the add-in&lt;/li&gt;&lt;li&gt;Check that the SmartTags feature in Word is enabled:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;Click the Microsoft Office Button (circle on the top left)&lt;/li&gt;&lt;li&gt;Click the Word Options button&lt;/li&gt;&lt;li&gt;In the categories pane click Add-ins&lt;/li&gt;&lt;li&gt;In the Manage box, towards the bottom of the dialog, select Smart Tags, and then click Go&lt;/li&gt;&lt;li&gt;Select the Label data with smart tags check box&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;Open the test document from the Releases tab and follow the instructions on the page&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;Examining the source code and contributing to the project:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;Navigate to the Source Code tab&lt;/li&gt;&lt;li&gt;You can use the free version of Visual Studio (Visual C# 2008 Express Edition) to build the project&lt;/li&gt;&lt;li&gt;Add comments in the Discussion tab, and report problems under Issue Tracker&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Design Documentation&lt;/b&gt;&lt;br /&gt;&lt;a href="http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=designdocs&amp;amp;referringTitle=Home"&gt;Design Documents&lt;/a&gt;&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Semantic Tagging&lt;/b&gt;&lt;br /&gt; &lt;br /&gt;When a word or set of words is tagged by the add-in, the text is wrapped in tags that associate it with the ontology term.  The example below shows the word &amp;quot;disease&amp;quot; being tagged with a term from the Human Disease ontology.&lt;br /&gt; &lt;br /&gt;&lt;div style="color:Black;background-color:White;"&gt;&lt;pre&gt;
&lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;http://biolit.ucsd.edu/biolitschema&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;biolit-term&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;id&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;DOID:4&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;type&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;Human disease&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;status&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;true&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;OntName&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;Human disease&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;url&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;http://purl.org/obo/owl/DOID#DOID_4&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;BioLitTags&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;tag1&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
            &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;disease&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt; &lt;br /&gt;If the Word file (docx) is to be transformed to other formats, this set of tags needs to be processed using XSLT or other technologies.  Other CodePlex projects implement transformations of docx files to other formats and can serve as a starting point.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Background&lt;/b&gt;&lt;br /&gt;Cyberinfrastructure is integral to all aspects of conducting experimental research and distributing those results. However, it has yet to make a similar impact on the way we communicate that information. Peer-reviewed publications have long been the currency of scientific research as they are the fundamental unit through which scientists communicate with and evaluate each other. However, in striking contrast to the data, publications have yet to benefit from the opportunities offered by cyberinfrastructure. While the means of distributing publications has vastly improved, publishers have done little else to capitalize on the electronic medium. In particular, semantic information describing the content of these publications is sorely lacking, as is the integration of this information with data in public repositories. This is confounding considering that many basic tools for marking up and integrating publication content in this manner already exist, such as a centralized literature database, relevant ontologies, and machine-readable document standards. We propose to address this delay in the maturation of scholarly communication by developing open source tools to facilitate the semantic mark-up of new manuscripts and the submission of those manuscripts directly to a journal’s electronic publishing system. &lt;br /&gt; &lt;br /&gt; &lt;br /&gt; &lt;br /&gt; &lt;br /&gt;
&lt;/div&gt;</description><author>pablofe</author><pubDate>Tue, 10 Mar 2009 21:21:04 GMT</pubDate><guid isPermaLink="false">Updated Wiki: Home 20090310092104P</guid></item><item><title>Updated Wiki: Home</title><link>http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=Home&amp;version=17</link><description>&lt;div class="wikidoc"&gt;
&lt;b&gt;Project Description&lt;/b&gt;&lt;br /&gt;A Word 2007 add-in that enables the annotation of Word documents based on terms that appear in Ontologies&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Summary&lt;/b&gt;&lt;br /&gt;Microsoft External Research’s goal  with this project is to enable communities who maintain ontologies to more easily experiment and to enhance the experience of authors who use Microsoft Word for content creation, incorporating semantic knowledge into the content.  This add-in should simplify the development and validation of ontologies, by making ontologies more accessible to a wide audience of authors and by enabling semantic content to be integrated in the authoring experience, capturing the author’s intent and knowledge at the source, and facilitating downstream discoverability. &lt;br /&gt; &lt;br /&gt;The goal of the add-in is to assist scientists in writing a manuscript that is easily integrated with existing and pending electronic resources. The major aims of this project are to add semantic information as XML mark-up to the manuscript using ontologies and controlled vocabularies (using OBO), and to integrate manuscript content with existing public data repositories.&lt;br /&gt; &lt;br /&gt;As part of the publishing workflow and archiving process, the terms added by the add-in, providing the semantic information, can be extracted from Word files, as they are stored as custom XML tags as part of the content.  
The semantic knowledge can then be preserved as the document is converted to other formats, such as HTML or the XML format from the National Library of Medicine, which is commonly used for archiving.&lt;br /&gt; &lt;br /&gt;The full benefit of semantic-rich content will result from an end-to-end approach to the preservation of semantics and metadata through the publishing pipeline, starting with capturing knowledge from the subject experts, the authors, and enabling this knowledge to be preserved when published, as well as made available to search engines and presented to people consuming the content. &lt;br /&gt; &lt;br /&gt;This project resulted from an initial and ongoing collaboration between Microsoft External Research and Dr. Phil Bourne and Dr. Lynn Fink, at the University of California San Diego.  Additional collaboration with the staff from Science Commons aims to make the add-in relevant to a wider audience and also to preserve semantic data along the publishing pipeline.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Audience&lt;/b&gt;&lt;br /&gt;This project is focused on researchers and software developers in domains utilizing ontologies, as well as publishers, archivists, and early adopters in the scientific, technical, and scholarly publishing fields.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Specific features&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Inline Syntax Coloring of Informative Words&lt;/li&gt;&lt;li&gt;Built-in Knowledge of Ontologies and Controlled Vocabularies&lt;/li&gt;&lt;li&gt;Automatic Detection of Identifiers&lt;/li&gt;&lt;li&gt;Custom Semantic Markup&lt;/li&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Getting Started&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Trying out the add-in:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;You will need Microsoft Word 2007 (Windows XP or Windows Vista)&lt;/li&gt;&lt;li&gt;Download and install the latest Technology Preview release of the add-in&lt;/li&gt;&lt;li&gt;Check that the SmartTags feature in Word is enabled:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;Click the Microsoft Office Button (circle on the top left)&lt;/li&gt;&lt;li&gt;Click the Word Options button&lt;/li&gt;&lt;li&gt;In the categories pane click Add-ins&lt;/li&gt;&lt;li&gt;In the Manage box, towards the bottom of the dialog, select Smart Tags, and then click Go&lt;/li&gt;&lt;li&gt;Select the Label data with smart tags check box&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;Open the test document from the Releases tab and follow the instructions on the page&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;Examining the source code and contributing to the project:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;Navigate to the Source Code tab&lt;/li&gt;&lt;li&gt;You can use the free version of Visual Studio (Visual C# 2008 Express Edition) to build the project&lt;/li&gt;&lt;li&gt;Add comments in the Discussion tab, and report problems under Issue Tracker&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Design Documentation&lt;/b&gt;&lt;br /&gt;&lt;a href="http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=designdocs&amp;amp;referringTitle=Home"&gt;Design Documents&lt;/a&gt;&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Semantic Tagging&lt;/b&gt;&lt;br /&gt; &lt;br /&gt;When a word or set of words is tagged by the add-in, the text is wrapped in tags that associate it with the ontology term.  The example below shows the word &amp;quot;disease&amp;quot; being tagged with a term from the Human Disease ontology.&lt;br /&gt; &lt;br /&gt;&lt;div style="color:Black;background-color:White;"&gt;&lt;pre&gt;
&lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;http://biolit.ucsd.edu/biolitschema&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;biolit-term&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;id&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;DOID:4&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;type&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;Human disease&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;status&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;true&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;OntName&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;Human disease&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;attr&lt;/span&gt; &lt;span style="color:Red;"&gt;w:name&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;url&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:val&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;http://purl.org/obo/owl/DOID#DOID_4&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Blue;"&gt;/&amp;gt;&lt;/span&gt; 
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXmlPr&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt; &lt;span style="color:Red;"&gt;w:uri&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;BioLitTags&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt; &lt;span style="color:Red;"&gt;w:element&lt;/span&gt;&lt;span style="color:Blue;"&gt;=&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;tag1&lt;/span&gt;&lt;span style="color:Black;"&gt;&amp;quot;&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
        &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
            &lt;span style="color:Blue;"&gt;&amp;lt;&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;disease&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;t&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt; 
        &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;r&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
    &lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;smartTag&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
&lt;span style="color:Blue;"&gt;&amp;lt;/&lt;/span&gt;&lt;span style="color:#A31515;"&gt;w&lt;/span&gt;&lt;span style="color:Blue;"&gt;:&lt;/span&gt;&lt;span style="color:#A31515;"&gt;customXml&lt;/span&gt;&lt;span style="color:Blue;"&gt;&amp;gt;&lt;/span&gt;
&lt;/pre&gt;&lt;/div&gt; &lt;br /&gt;If the Word file (docx) needs to be transformed to other formats, this set of tags needs to be processed using XSLT or other technologies. Note that other CodePlex projects implement transformations of docx files to other formats and can serve as a starting point.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Background&lt;/b&gt;&lt;br /&gt;Cyberinfrastructure is integral to all aspects of conducting experimental research and distributing those results. However, it has yet to make a similar impact on the way we communicate that information. Peer-reviewed publications have long been the currency of scientific research as they are the fundamental unit through which scientists communicate with and evaluate each other. However, in striking contrast to the data, publications have yet to benefit from the opportunities offered by cyberinfrastructure. While the means of distributing publications has vastly improved, publishers have done little else to capitalize on the electronic medium. In particular, semantic information describing the content of these publications is sorely lacking, as is the integration of this information with data in public repositories. This is confounding considering that many basic tools for marking up and integrating publication content in this manner already exist, such as a centralized literature database, relevant ontologies, and machine-readable document standards. We propose to address this delay in the maturation of scholarly communication by developing open source tools to facilitate the semantic mark-up of new manuscripts and the submission of those manuscripts directly to a journal’s electronic publishing system. &lt;br /&gt; &lt;br /&gt; &lt;br /&gt; &lt;br /&gt; &lt;br /&gt;
&lt;/div&gt;</description><author>pablofe</author><pubDate>Tue, 10 Mar 2009 21:19:18 GMT</pubDate><guid isPermaLink="false">Updated Wiki: Home 20090310091918P</guid></item><item><title>Updated Wiki: Home</title><link>http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=Home&amp;version=16</link><description>&lt;div class="wikidoc"&gt;
&lt;b&gt;Project Description&lt;/b&gt;&lt;br /&gt;A Word 2007 add-in that enables the annotation of Word documents based on terms that appear in Ontologies&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Summary&lt;/b&gt;&lt;br /&gt;Microsoft External Research’s goal  with this project is to enable communities who maintain ontologies to more easily experiment and to enhance the experience of authors who use Microsoft Word for content creation, incorporating semantic knowledge into the content.  This add-in should simplify the development and validation of ontologies, by making ontologies more accessible to a wide audience of authors and by enabling semantic content to be integrated in the authoring experience, capturing the author’s intent and knowledge at the source, and facilitating downstream discoverability. &lt;br /&gt; &lt;br /&gt;The goal of the add-in is to assist scientists in writing a manuscript that is easily integrated with existing and pending electronic resources. The major aims of this project are to add semantic information as XML mark-up to the manuscript using ontologies and controlled vocabularies (using OBO), and to integrate manuscript content with existing public data repositories.&lt;br /&gt; &lt;br /&gt;As part of the publishing workflow and archiving process, the terms added by the add-in, providing the semantic information, can be extracted from Word files, as they are stored as custom XML tags as part of the content.  
The semantic knowledge can then be preserved as the document is converted to other formats, such as HTML or the XML format from the National Library of Medicine, which is commonly used for archiving.&lt;br /&gt; &lt;br /&gt;The full benefit of semantic-rich content will result from an end-to-end approach to the preservation of semantics and metadata through the publishing pipeline, starting with capturing knowledge from the subject experts (the authors) and enabling this knowledge to be preserved when published, as well as made available to search engines and presented to people consuming the content. &lt;br /&gt; &lt;br /&gt;This project resulted from an initial and ongoing collaboration between Microsoft External Research and Dr. Phil Bourne and Dr. Lynn Fink at the University of California San Diego. Additional collaboration with the staff from Science Commons aims to make the add-in relevant to a wider audience and also to preserve semantic data along the publishing pipeline.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Audience&lt;/b&gt;&lt;br /&gt;This project is focused on researchers and software developers in domains utilizing ontologies, as well as publishers, archivists, and early adopters in the scientific, technical, and scholarly publishing fields.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Specific features&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Inline Syntax Coloring of Informative Words&lt;/li&gt;&lt;li&gt;Built-in Knowledge of Ontologies and Controlled Vocabularies&lt;/li&gt;&lt;li&gt;Automatic Detection of Identifiers&lt;/li&gt;&lt;li&gt;Custom Semantic Markup&lt;/li&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Getting Started&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Trying out the add-in:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;You will need Microsoft Word 2007 (Windows XP or Windows Vista)&lt;/li&gt;&lt;li&gt;Download and install the latest Technology Preview release of the add-in&lt;/li&gt;&lt;li&gt;Check that the SmartTags feature in Word is enabled:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;Click the Microsoft Office Button (circle on the top left)&lt;/li&gt;&lt;li&gt;Click the Word Options button&lt;/li&gt;&lt;li&gt;In the categories pane click Add-ins&lt;/li&gt;&lt;li&gt;In the Manage box, towards the bottom of the dialog, select Smart Tags, and then click Go&lt;/li&gt;&lt;li&gt;Select the Label data with smart tags check box&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;Open the test document from the Releases tab and follow the instructions on the page&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;Examining the source code and contributing to the project:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;Navigate to the Source Code tab&lt;/li&gt;&lt;li&gt;You can use the free version of Visual Studio (Visual C# 2008 Express Edition) to build the project&lt;/li&gt;&lt;li&gt;Add comments in the Discussion tab, and report problems under Issue Tracker&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Design Documentation&lt;/b&gt;&lt;br /&gt;&lt;a href="http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=designdocs&amp;amp;referringTitle=Home"&gt;Design Documents&lt;/a&gt;&lt;br /&gt; &lt;br /&gt;&amp;quot;Semantic Tagging&amp;quot;&lt;br /&gt; &lt;br /&gt;When a word or set of words is tagged by the add-in, the text is wrapped in tags that associate it with the ontology term.  The example below shows the word &amp;quot;disease&amp;quot; being tagged with the Human Disease ontology.&lt;br /&gt; &lt;br /&gt;&amp;lt;w:customXml w:uri=&amp;quot;http://biolit.ucsd.edu/biolitschema&amp;quot; w:element=&amp;quot;biolit-term&amp;quot;&amp;gt;&lt;br /&gt;    &amp;lt;w:customXmlPr&amp;gt;&lt;br /&gt;        &amp;lt;w:attr w:name=&amp;quot;id&amp;quot; w:val=&amp;quot;DOID:4&amp;quot; /&amp;gt; &lt;br /&gt;        &amp;lt;w:attr w:name=&amp;quot;type&amp;quot; w:val=&amp;quot;Human disease&amp;quot; /&amp;gt; &lt;br /&gt;        &amp;lt;w:attr w:name=&amp;quot;status&amp;quot; w:val=&amp;quot;true&amp;quot; /&amp;gt; &lt;br /&gt;        &amp;lt;w:attr w:name=&amp;quot;OntName&amp;quot; w:val=&amp;quot;Human disease&amp;quot; /&amp;gt; &lt;br /&gt;        &amp;lt;w:attr w:name=&amp;quot;url&amp;quot; w:val=&amp;quot;http://purl.org/obo/owl/DOID#DOID_4&amp;quot; /&amp;gt; &lt;br /&gt;    &amp;lt;/w:customXmlPr&amp;gt;&lt;br /&gt;    &amp;lt;w:smartTag w:uri=&amp;quot;BioLitTags&amp;quot; w:element=&amp;quot;tag1&amp;quot;&amp;gt;&lt;br /&gt;        &amp;lt;w:r&amp;gt;&lt;br /&gt;            &amp;lt;w:t&amp;gt;disease&amp;lt;/w:t&amp;gt; &lt;br /&gt;        &amp;lt;/w:r&amp;gt;&lt;br /&gt;    &amp;lt;/w:smartTag&amp;gt;&lt;br /&gt;&amp;lt;/w:customXml&amp;gt;&lt;br /&gt; &lt;br /&gt;If the Word file (docx) needs to be transformed to other formats, this set of tags needs to be processed using XSLT or other technologies.
Note that other CodePlex projects implement transformations of docx files to other formats and can serve as a starting point.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Background&lt;/b&gt;&lt;br /&gt;Cyberinfrastructure is integral to all aspects of conducting experimental research and distributing those results. However, it has yet to make a similar impact on the way we communicate that information. Peer-reviewed publications have long been the currency of scientific research as they are the fundamental unit through which scientists communicate with and evaluate each other. However, in striking contrast to the data, publications have yet to benefit from the opportunities offered by cyberinfrastructure. While the means of distributing publications has vastly improved, publishers have done little else to capitalize on the electronic medium. In particular, semantic information describing the content of these publications is sorely lacking, as is the integration of this information with data in public repositories. This is confounding considering that many basic tools for marking up and integrating publication content in this manner already exist, such as a centralized literature database, relevant ontologies, and machine-readable document standards. We propose to address this delay in the maturation of scholarly communication by developing open source tools to facilitate the semantic mark-up of new manuscripts and the submission of those manuscripts directly to a journal’s electronic publishing system. &lt;br /&gt; &lt;br /&gt; &lt;br /&gt; &lt;br /&gt; &lt;br /&gt;
&lt;/div&gt;</description><author>pablofe</author><pubDate>Tue, 10 Mar 2009 20:57:21 GMT</pubDate><guid isPermaLink="false">Updated Wiki: Home 20090310085721P</guid></item><item><title>Updated Wiki: Home</title><link>http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=Home&amp;version=15</link><description>&lt;div class="wikidoc"&gt;
&lt;b&gt;Project Description&lt;/b&gt;&lt;br /&gt;A Word 2007 add-in that enables the annotation of Word documents based on terms that appear in Ontologies&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Summary&lt;/b&gt;&lt;br /&gt;Microsoft External Research’s goal  with this project is to enable communities who maintain ontologies to more easily experiment and to enhance the experience of authors who use Microsoft Word for content creation, incorporating semantic knowledge into the content.  This add-in should simplify the development and validation of ontologies, by making ontologies more accessible to a wide audience of authors and by enabling semantic content to be integrated in the authoring experience, capturing the author’s intent and knowledge at the source, and facilitating downstream discoverability. &lt;br /&gt; &lt;br /&gt;The goal of the add-in is to assist scientists in writing a manuscript that is easily integrated with existing and pending electronic resources. The major aims of this project are to add semantic information as XML mark-up to the manuscript using ontologies and controlled vocabularies (using OBO), and to integrate manuscript content with existing public data repositories.&lt;br /&gt; &lt;br /&gt;As part of the publishing workflow and archiving process, the terms added by the add-in, providing the semantic information, can be extracted from Word files, as they are stored as custom XML tags as part of the content.  
The semantic knowledge can then be preserved as the document is converted to other formats, such as HTML or the XML format from the National Library of Medicine, which is commonly used for archiving.&lt;br /&gt; &lt;br /&gt;The full benefit of semantic-rich content will result from an end-to-end approach to the preservation of semantics and metadata through the publishing pipeline, starting with capturing knowledge from the subject experts (the authors) and enabling this knowledge to be preserved when published, as well as made available to search engines and presented to people consuming the content. &lt;br /&gt; &lt;br /&gt;This project resulted from an initial and ongoing collaboration between Microsoft External Research and Dr. Phil Bourne and Dr. Lynn Fink at the University of California San Diego. Additional collaboration with the staff from Science Commons aims to make the add-in relevant to a wider audience and also to preserve semantic data along the publishing pipeline.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Audience&lt;/b&gt;&lt;br /&gt;This project is focused on researchers and software developers in domains utilizing ontologies, as well as publishers, archivists, and early adopters in the scientific, technical, and scholarly publishing fields.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Specific features&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Inline Syntax Coloring of Informative Words&lt;/li&gt;&lt;li&gt;Built-in Knowledge of Ontologies and Controlled Vocabularies&lt;/li&gt;&lt;li&gt;Automatic Detection of Identifiers&lt;/li&gt;&lt;li&gt;Custom Semantic Markup&lt;/li&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Getting Started&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Trying out the add-in:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;You will need Microsoft Word 2007 (Windows XP or Windows Vista)&lt;/li&gt;&lt;li&gt;Download and install the latest Technology Preview release of the add-in&lt;/li&gt;&lt;li&gt;Check that the SmartTags feature in Word is enabled:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;Click the Microsoft Office Button (circle on the top left)&lt;/li&gt;&lt;li&gt;Click the Word Options button&lt;/li&gt;&lt;li&gt;In the categories pane click Add-ins&lt;/li&gt;&lt;li&gt;In the Manage box, towards the bottom of the dialog, select Smart Tags, and then click Go&lt;/li&gt;&lt;li&gt;Select the Label data with smart tags check box&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;Open the test document from the Releases tab and follow the instructions on the page&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;Examining the source code and contributing to the project:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;Navigate to the Source Code tab&lt;/li&gt;&lt;li&gt;You can use the free version of Visual Studio (Visual C# 2008 Express Edition) to build the project&lt;/li&gt;&lt;li&gt;Add comments in the Discussion tab, and report problems under Issue Tracker&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Design Documentation&lt;/b&gt;&lt;br /&gt;&lt;a href="http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=designdocs&amp;amp;referringTitle=Home"&gt;Design Documents&lt;/a&gt;&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Background&lt;/b&gt;&lt;br /&gt;Cyberinfrastructure is integral to all aspects of conducting experimental research and distributing those results. However, it has yet to make a similar impact on the way we communicate that information. Peer-reviewed publications have long been the currency of scientific research as they are the fundamental unit through which scientists communicate with and evaluate each other. However, in striking contrast to the data, publications have yet to benefit from the opportunities offered by cyberinfrastructure. While the means of distributing publications has vastly improved, publishers have done little else to capitalize on the electronic medium. In particular, semantic information describing the content of these publications is sorely lacking, as is the integration of this information with data in public repositories. This is confounding considering that many basic tools for marking-up and integrating publication content in this manner already exist, such as a centralized literature database, relevant ontologies, and machine-readable document standards. We propose to address this delay in the maturation of scholarly communication by developing open source tools to facilitate the semantic mark-up of new manuscripts and the submission of those manuscripts directly to a journal’s electronic publishing system. &lt;br /&gt; &lt;br /&gt; &lt;br /&gt; &lt;br /&gt; &lt;br /&gt;
&lt;/div&gt;</description><author>pablofe</author><pubDate>Tue, 10 Mar 2009 17:02:38 GMT</pubDate><guid isPermaLink="false">Updated Wiki: Home 20090310050238P</guid></item><item><title>Updated Wiki: Home</title><link>http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=Home&amp;version=14</link><description>&lt;div class="wikidoc"&gt;
&lt;b&gt;Project Description&lt;/b&gt;&lt;br /&gt;A Word 2007 add-in that enables the annotation of Word documents based on terms that appear in Ontologies&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Summary&lt;/b&gt;&lt;br /&gt;Microsoft External Research’s goal  with this project is to enable communities who maintain ontologies to more easily experiment and to enhance the experience of authors who use Microsoft Word for content creation, incorporating semantic knowledge into the content.  This add-in should simplify the development and validation of ontologies, by making ontologies more accessible to a wide audience of authors and by enabling semantic content to be integrated in the authoring experience, capturing the author’s intent and knowledge at the source, and facilitating downstream discoverability. &lt;br /&gt; &lt;br /&gt;The goal of the add-in is to assist scientists in writing a manuscript that is easily integrated with existing and pending electronic resources. The major aims of this project are to add semantic information as XML mark-up to the manuscript using ontologies and controlled vocabularies (using OBO), and to integrate manuscript content with existing public data repositories.&lt;br /&gt; &lt;br /&gt;As part of the publishing workflow and archiving process, the terms added by the add-in, providing the semantic information, can be extracted from Word files, as they are stored as custom XML tags as part of the content.  
The semantic knowledge can then be preserved as the document is converted to other formats, such as HTML or the XML format from the National Library of Medicine, which is commonly used for archiving.&lt;br /&gt; &lt;br /&gt;The full benefit of semantic-rich content will result from an end-to-end approach to the preservation of semantics and metadata through the publishing pipeline, starting with capturing knowledge from the subject experts (the authors) and enabling this knowledge to be preserved when published, as well as made available to search engines and presented to people consuming the content. &lt;br /&gt; &lt;br /&gt;This project resulted from an initial and ongoing collaboration between Microsoft External Research and Dr. Phil Bourne and Dr. Lynn Fink at the University of California San Diego. Additional collaboration with the staff from Science Commons aims to make the add-in relevant to a wider audience and also to preserve semantic data along the publishing pipeline.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Audience&lt;/b&gt;&lt;br /&gt;This project is focused on researchers and software developers in domains utilizing ontologies, as well as publishers, archivists, and early adopters in the scientific, technical, and scholarly publishing fields.&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Specific features&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Inline Syntax Coloring of Informative Words&lt;/li&gt;&lt;li&gt;Built-in Knowledge of Ontologies and Controlled Vocabularies&lt;/li&gt;&lt;li&gt;Automatic Detection of Identifiers&lt;/li&gt;&lt;li&gt;Custom Semantic Markup&lt;/li&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Getting Started&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;
&lt;li&gt;Trying out the add-in:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;You will need Microsoft Word 2007 (Windows XP or Windows Vista)&lt;/li&gt;&lt;li&gt;Download and install the latest Technology Preview release of the add-in&lt;/li&gt;&lt;li&gt;Check that the SmartTags feature in Word is enabled:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;Click the Microsoft Office Button (circle on the top left)&lt;/li&gt;&lt;li&gt;Click the Word Options button&lt;/li&gt;&lt;li&gt;In the categories pane click Add-ins&lt;/li&gt;&lt;li&gt;In the Manage box, towards the bottom of the dialog, select Smart Tags, and then click Go&lt;/li&gt;&lt;li&gt;Select the Label data with smart tags check box&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;Open the test document from the Releases tab and follow the instructions on the page&lt;/li&gt;
&lt;/ul&gt;&lt;li&gt;Examining the source code and contributing to the project:&lt;/li&gt;&lt;ul&gt;
&lt;li&gt;Navigate to the Source Code tab&lt;/li&gt;&lt;li&gt;You can use the free version of Visual Studio (Visual C# 2008 Express Edition) to build the project&lt;/li&gt;&lt;li&gt;Add comments in the Discussion tab, and report problems under Issue Tracker&lt;/li&gt;
&lt;/ul&gt;
&lt;/ul&gt; &lt;br /&gt;&lt;b&gt;Architecture&lt;/b&gt;&lt;br /&gt;To be added...&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Design Documentation&lt;/b&gt;&lt;br /&gt;&lt;a href="http://ucsdbiolit.codeplex.com/Wiki/View.aspx?title=designdocs&amp;amp;referringTitle=Home"&gt;Design Documents&lt;/a&gt;&lt;br /&gt; &lt;br /&gt;&lt;b&gt;Background&lt;/b&gt;&lt;br /&gt;Cyberinfrastructure is integral to all aspects of conducting experimental research and distributing those results. However, it has yet to make a similar impact on the way we communicate that information. Peer-reviewed publications have long been the currency of scientific research as they are the fundamental unit through which scientists communicate with and evaluate each other. However, in striking contrast to the data, publications have yet to benefit from the opportunities offered by cyberinfrastructure. While the means of distributing publications has vastly improved, publishers have done little else to capitalize on the electronic medium. In particular, semantic information describing the content of these publications is sorely lacking, as is the integration of this information with data in public repositories. This is confounding considering that many basic tools for marking-up and integrating publication content in this manner already exist, such as a centralized literature database, relevant ontologies, and machine-readable document standards. We propose to address this delay in the maturation of scholarly communication by developing open source tools to facilitate the semantic mark-up of new manuscripts and the submission of those manuscripts directly to a journal’s electronic publishing system. &lt;br /&gt; &lt;br /&gt; &lt;br /&gt; &lt;br /&gt; &lt;br /&gt;
&lt;/div&gt;</description><author>pablofe</author><pubDate>Tue, 10 Mar 2009 17:01:54 GMT</pubDate><guid isPermaLink="false">Updated Wiki: Home 20090310050154P</guid></item></channel></rss>