<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Lotus Base blog</title>
    <description>The developer's blog for Lotus Base
</description>
    <link>https://lotus.au.dk/blog/</link>
    <atom:link href="https://lotus.au.dk/blog/feed.xml" rel="self" type="application/rss+xml"/>
    <pubDate>Tue, 21 Apr 2020 00:03:02 +0200</pubDate>
    <lastBuildDate>Tue, 21 Apr 2020 00:03:02 +0200</lastBuildDate>
    <generator>Jekyll v3.8.5</generator>
    
      <item>
        <title>&lt;em&gt;L. japonicus&lt;/em&gt; Gifu v1.2 genome</title>
        <description>&lt;p&gt;The &lt;em&gt;Lotus japonicus&lt;/em&gt; Gifu genome assembly v1.2 is now officially released and &lt;a href=&quot;https://www.biorxiv.org/content/10.1101/2020.04.17.042473v1&quot;&gt;an associated pre-publication manuscript has been submitted to bioXriv&lt;/a&gt;. Datasets on &lt;em&gt;Lotus&lt;/em&gt; Base has therefore seen some restructuring to support incoming data from a new genome assembly.&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;&lt;strong&gt;Note to users with elevated privileges that have access to the v1.1 assembly preview:&lt;/strong&gt;&lt;/p&gt;

  &lt;p&gt;v1.1 contains gene predictions that have been reworked and with a deprecated naming nomenclature, so there is no one-to-one mapping between the gene IDs from v1.1 to v1.2. We strongly encourage you to use the v1.2 data going forward.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h3 id=&quot;gifu-data-available-today&quot;&gt;Gifu data available today&lt;/h3&gt;

&lt;p&gt;The following tools on the site have been updated to allow access to &lt;em&gt;L. japonicus&lt;/em&gt; Gifu data.&lt;/p&gt;

&lt;h4 id=&quot;genome-browser&quot;&gt;Genome browser&lt;/h4&gt;

&lt;p&gt;The Gifu genome v1.2 is now accessible as a new dataset on our JBrowse implementation, &lt;a href=&quot;/genome/?data=genomes/lotus-japonicus/gifu/v1.2&quot;&gt;the Genome Browser&lt;/a&gt;. At the time of writing, the new genome contains the following datasets:&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;Gene model with human readable annotations, GO annotations, and InterPro domain predictions (GFF3 file is available for download, see below)&lt;/li&gt;
  &lt;li&gt;Non-coding RNAs&lt;/li&gt;
  &lt;li&gt;Genome gaps&lt;/li&gt;
  &lt;li&gt;Repeats&lt;/li&gt;
&lt;/ul&gt;

&lt;h4 id=&quot;expression-atlas-expat&quot;&gt;Expression Atlas (ExpAt)&lt;/h4&gt;

&lt;p&gt;The ExpAt tool has now been updated with new RNAseq data mapped to the Gifu genome by Dugald Reid. Here is a &lt;a href=&quot;/expat/?ids-input=&amp;amp;ids=LotjaGi3g1v0307700%2CLotjaGi2g1v0343300%2CLotjaGi1g1v0643700%2CLotjaGi3g1v0414350%2CLotjaGi1g1v0257100%2CLotjaGi4g1v0343900%2CLotjaGi5g1v0106700%2CLotjaGi1g1v0001500%2CLotjaGi3g1v0512000&amp;amp;dataset=reidd-2020-gifuatlas&amp;amp;conditions=&amp;amp;data_transform=normalize&amp;amp;idtype=geneid&quot;&gt;sample expression heatmap produced&lt;/a&gt; using the candidate genes published in the manuscript:&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/dist/images/content/20200420/expat.svg&quot; alt=&quot;Sample heatmap generated using Gifu predicted proteins&quot; title=&quot;Sample heatmap generated using Gifu predicted proteins&quot; /&gt;&lt;/p&gt;

&lt;p&gt;List of genes included in the heatmap:&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;&lt;a href=&quot;/view/gene/LotjaGi3g1v0307700&quot;&gt;LotjaGi3g1v0307700&lt;/a&gt;, &lt;em&gt;LjCCaMK&lt;/em&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href=&quot;/view/gene/LotjaGi2g1v0343300&quot;&gt;LotjaGi2g1v0343300&lt;/a&gt;, &lt;em&gt;LjCyclops&lt;/em&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href=&quot;/view/gene/LotjaGi1g1v0643700&quot;&gt;LotjaGi1g1v0643700&lt;/a&gt;, &lt;em&gt;LjErn1&lt;/em&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href=&quot;/view/gene/LotjaGi3g1v0414350&quot;&gt;LotjaGi3g1v0414350&lt;/a&gt;, &lt;em&gt;LjNin&lt;/em&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href=&quot;/view/gene/LotjaGi1g1v0257100&quot;&gt;LotjaGi1g1v0257100&lt;/a&gt;, &lt;em&gt;LjNsp2&lt;/em&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href=&quot;/view/gene/LotjaGi4g1v0343900&quot;&gt;LotjaGi4g1v0343900&lt;/a&gt;, &lt;em&gt;LjNf-yb1&lt;/em&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href=&quot;/view/gene/LotjaGi5g1v0106700&quot;&gt;LotjaGi5g1v0106700&lt;/a&gt;, &lt;em&gt;LjNf-ya1&lt;/em&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href=&quot;/view/gene/LotjaGi1g1v0001500&quot;&gt;LotjaGi1g1v0001500&lt;/a&gt;, &lt;em&gt;LjNin&lt;/em&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href=&quot;/view/gene/LotjaGi3g1v0512000&quot;&gt;LotjaGi3g1v0512000&lt;/a&gt;, &lt;em&gt;LjHar1&lt;/em&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h4 id=&quot;view&quot;&gt;View&lt;/h4&gt;

&lt;p&gt;The &lt;a href=&quot;/view/&quot;&gt;View tool&lt;/a&gt; is seen as a replacement for Transcript Explorer (TrEx), which allows you to have a quick overview for individual transcripts, genes, GO annotations and more. For example, if you are interested in all the data associated with the gene LjNin (LotjaGi1g1v0001500), you can search for it on the View page, or &lt;a href=&quot;/view/gene/LotjaGi1g1v0001500&quot;&gt;access the link directly&lt;/a&gt;.&lt;/p&gt;

&lt;h4 id=&quot;downloadable-data&quot;&gt;Downloadable data&lt;/h4&gt;

&lt;p&gt;All Gifu-related downloadable data can be &lt;a href=&quot;/data/download?search=Gifu&quot;&gt;accessed from our data page&lt;/a&gt;. The newly published files are:&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;FASTA files for the genome assembly, coding sequences, and predicted protein sequences&lt;/li&gt;
  &lt;li&gt;GFF3 file for Gifu predicted gene annotations, containing human readable annotations, GO annotations, and InterPro domain predictions&lt;/li&gt;
  &lt;li&gt;Gene Ontology file&lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;future-roadmap&quot;&gt;Future roadmap&lt;/h3&gt;

&lt;p&gt;In the next few months, we will be gradually mapping additional data to the Gifu genome:&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;&lt;em&gt;LORE1&lt;/em&gt; insertion data&lt;/li&gt;
&lt;/ul&gt;
</description>
        <pubDate>Mon, 20 Apr 2020 00:00:00 +0200</pubDate>
        <link>https://lotus.au.dk/blog/announcement/2020/04/20/gifu-genome.html</link>
        <guid isPermaLink="true">https://lotus.au.dk/blog/announcement/2020/04/20/gifu-genome.html</guid>
        
        <category>lore1</category>
        
        <category>gifu</category>
        
        <category>genome</category>
        
        
        <category>announcement</category>
        
      </item>
    
      <item>
        <title>Post-mortem on downtime experienced by BLAST-related tools</title>
        <description>&lt;p&gt;Over the course of the weekend of November 17–18 of 2018 and the subsequent working week that follows, all BLAST-related tools on &lt;em&gt;Lotus&lt;/em&gt; Base became inaccessible. The affected modules were:&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;&lt;em&gt;Lotus&lt;/em&gt; BLAST, which runs SequenceServer v1.0.9 as a Passenger app&lt;/li&gt;
  &lt;li&gt;The Sequence Retrieval tool (SeqRet), which relies on being able to sniff out BLAST database metadata by executing the &lt;code class=&quot;highlighter-rouge&quot;&gt;blastdbcmd&lt;/code&gt; binary&lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;diagnosing-the-issue&quot;&gt;Diagnosing the issue&lt;/h3&gt;

&lt;p&gt;The issue was two-fold:&lt;/p&gt;

&lt;ol&gt;
  &lt;li&gt;SequenceServer was running as a Phusion Passenger app initialized by an arbitrarily named user via the &lt;code class=&quot;highlighter-rouge&quot;&gt;PassengerUser&lt;/code&gt; option in the &lt;code class=&quot;highlighter-rouge&quot;&gt;httpd.conf&lt;/code&gt; file. The user should have been &lt;code class=&quot;highlighter-rouge&quot;&gt;apache&lt;/code&gt;, so that the processes spawned by Apache will have the correct read permissions to access all app binaries.&lt;/li&gt;
  &lt;li&gt;The Sequence Retrieval tool calls an internal API endpoint which relies on being able to execute the &lt;code class=&quot;highlighter-rouge&quot;&gt;blastdbcmd&lt;/code&gt; binary. However, since the binary belongs to a different user group, the API wil fail and return an empty array: this causes PHP to throw an error when attempting to display BLAST database-related metadata.&lt;/li&gt;
&lt;/ol&gt;

&lt;h3 id=&quot;what-was-done-to-fix-it&quot;&gt;What was done to fix it?&lt;/h3&gt;

&lt;p&gt;By updating the read permissions for the BLAST binaries and changing the &lt;code class=&quot;highlighter-rouge&quot;&gt;PassengerUser&lt;/code&gt; for Sequence Server fixes the issue.&lt;/p&gt;
</description>
        <pubDate>Wed, 21 Nov 2018 00:00:00 +0100</pubDate>
        <link>https://lotus.au.dk/blog/announcement/2018/11/21/lotus-blast-postmortem.html</link>
        <guid isPermaLink="true">https://lotus.au.dk/blog/announcement/2018/11/21/lotus-blast-postmortem.html</guid>
        
        <category>site</category>
        
        <category>postmortem</category>
        
        
        <category>announcement</category>
        
      </item>
    
      <item>
        <title>GateKeeper—Migrating away from IP-based controlled access</title>
        <description>&lt;p&gt;We are announcing in a change in user access to controlled, internal data available to CARB members &lt;strong&gt;with immediate effect&lt;/strong&gt;. Traditionally, we have been offering access to CARB members based on their IP address (and VPN connection). This strategy worked out fine for quite awhile as there is no need to fine tune access to internal data.&lt;/p&gt;

&lt;p&gt;However, in light of the changing personnel in the lab, coupled with collaborators who we wish to grant access to sensitive data, detecting IP addresses to restrict data access will become overwhelmingly tedious and unreliable.&lt;/p&gt;

&lt;h3 id=&quot;ensuring-continue-undisrupted-acccess&quot;&gt;Ensuring continue, undisrupted acccess&lt;/h3&gt;

&lt;p&gt;Therefore, all CARB users will be required to &lt;a href=&quot;/users/register&quot;&gt;register for a &lt;em&gt;Lotus Base&lt;/em&gt; account&lt;/a&gt; in order to access internal files, if they have not done so already. Features that are affected by this change are:&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;&lt;a href=&quot;/expat/&quot;&gt;Expression Atlas&lt;/a&gt; (ExpAt)&lt;/li&gt;
  &lt;li&gt;&lt;a href=&quot;/genome/&quot;&gt;Genome Browser&lt;/a&gt; (JBrowse)&lt;/li&gt;
  &lt;li&gt;&lt;a href=&quot;/tools/primers&quot;&gt;&lt;em&gt;LORE1&lt;/em&gt; genotyping primers&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href=&quot;/blast/&quot;&gt;&lt;em&gt;Lotus&lt;/em&gt; BLAST&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href=&quot;/tools/seqret&quot;&gt;Sequence Retriever&lt;/a&gt; (SeqRet)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The current system administrator (Terry, &lt;a href=&quot;mailto:terry@mbg.au.dk&quot;&gt;terry@mbg.au.dk&lt;/a&gt;) will be responsible for adding verified/validated CARB members into a user group that has exclusive access to internal data. He will be notified when new CARB members have registered for accounts, and will perform necessary validation with new users before granting access.&lt;/p&gt;

&lt;p class=&quot;user-message reminder&quot;&gt;&lt;span class=&quot;icon-info-circled&quot;&gt;&lt;/span&gt;Registering for an account does not automatically grant access to internally-available resources.&lt;/p&gt;

&lt;p class=&quot;user-message&quot;&gt;&lt;span class=&quot;icon-info-circled&quot;&gt;&lt;/span&gt;If you are a CARB collaborator who wish to have access to internal data, &lt;a href=&quot;/meta/contact&quot;&gt;please do not hesitate to reach out to us&lt;/a&gt;.&lt;/p&gt;

&lt;h3 id=&quot;how-are-you-affected&quot;&gt;How are you affected?&lt;/h3&gt;
&lt;p&gt;&lt;strong&gt;Pre-existing CARB users with accounts with &lt;em&gt;Lotus&lt;/em&gt; Base will not see any service disruptions&lt;/strong&gt;—you have been automatically migrated over to the new controlled access system. To access internally available data, simply remember to &lt;a href=&quot;/users/login&quot;&gt;log in&lt;/a&gt;. Users that do not have an account with &lt;em&gt;Lotus&lt;/em&gt; Base, however, are strongly encouraged to register. Terry will keep in touch with you once you have registered for an account.&lt;/p&gt;

&lt;h3 id=&quot;your-security-is-our-priority&quot;&gt;Your security is our priority&lt;/h3&gt;
&lt;p&gt;In order to prevent session hijacking, we recycle user sessions frequently. This means that you might be logged off within 24 hours of logging in, unless you have explicitly asked to be logged in for a week when signing in. You are encouraged not to save your login credentials on public terminals.&lt;/p&gt;

&lt;p&gt;If you suspect your account is being compromised or you have misplaced your login credentials, you can &lt;a href=&quot;/users/reset&quot;&gt;reset your password&lt;/a&gt; and regain control over your account.&lt;/p&gt;
</description>
        <pubDate>Mon, 12 Dec 2016 00:00:00 +0100</pubDate>
        <link>https://lotus.au.dk/blog/announcement/2016/12/12/controlled-access.html</link>
        <guid isPermaLink="true">https://lotus.au.dk/blog/announcement/2016/12/12/controlled-access.html</guid>
        
        <category>site</category>
        
        <category>feature</category>
        
        <category>security</category>
        
        <category>users</category>
        
        
        <category>announcement</category>
        
      </item>
    
      <item>
        <title>Using InterProScan like a pro</title>
        <description>&lt;p&gt;Biologists are often challenged with this question when working with proteins:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;Now… what does your protein do?&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h3 id=&quot;domain-predictionbest-friend-or-worst-nightmare&quot;&gt;Domain prediction—best friend or worst nightmare?&lt;/h3&gt;

&lt;p&gt;People want to know &lt;em&gt;everything&lt;/em&gt; about your-favourite-protein-1 (YFP1). How does
it look like? What are the predicted domains? Do these domains have any
functions and processes associated with them? Are they located in specific parts
of the cell?&lt;/p&gt;

&lt;p&gt;A very simplified pipeline would be as follow:&lt;/p&gt;

&lt;ol&gt;
  &lt;li&gt;Check the amino acid sequence of YFP1 against various domain prediction programs&lt;/li&gt;
  &lt;li&gt;Obtain domain and/or structural predictions of YFP1&lt;/li&gt;
  &lt;li&gt;Infer biological function, molecular processes, and/or cellular components
associated from said domain predictions&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;However, there are so many domain prediction algorithms out there, and an
overwhelming bunch of them using Hidden Markov Models. These algorithms—such as
&lt;a href=&quot;http://www.pantherdb.org/tools/hmmScoreForm.jsp&quot;&gt;PANTHER&lt;/a&gt;,
&lt;a href=&quot;http://phobius.sbc.su.se/&quot;&gt;Phobius&lt;/a&gt;, &lt;a href=&quot;http://pfam.xfam.org/&quot;&gt;Pfam&lt;/a&gt;,
&lt;a href=&quot;http://supfam.org/SUPERFAMILY/hmm.html&quot;&gt;SuperFamily&lt;/a&gt;,
&lt;a href=&quot;http://www.cbs.dtu.dk/services/TMHMM/&quot;&gt;TMHMM&lt;/a&gt;—offer simple web interfaces that
allows end-users to submit single (or a small number of, at best) sequences.
EMBL-EBI offers &lt;a href=&quot;http://www.ebi.ac.uk/interpro/&quot;&gt;InterPro&lt;/a&gt;, which integrates all
of these prediction algorithms, but again only allows single sequence submission
from their web interface.&lt;/p&gt;

&lt;p&gt;There appears to be no simple way of submitting a set of protein sequence to
multiple prediction algorithms—through a web interface, at least. If you are
willing to dive into the world of command line interfaces, things start to look
a bit better.&lt;/p&gt;

&lt;p&gt;This article is written based on my experience with using InterPro, and my work
with using RESTful services made available by EMBL-EBI on offering comprehensive
&lt;em&gt;Lotus&lt;/em&gt; data to legume researchers around the world.&lt;/p&gt;

&lt;h3 id=&quot;example-use-case-lotus-base&quot;&gt;Example use case: &lt;em&gt;Lotus&lt;/em&gt; Base&lt;/h3&gt;

&lt;p&gt;As the principle developer and designer behind &lt;a href=&quot;https://lotus.au.dk/&quot;&gt;Lotus
Base&lt;/a&gt;, I have worked on performing predictions on the
entire set of predicted proteins using the most recently published &lt;em&gt;Lotus
japonicus&lt;/em&gt; genome — meaning 50,000+ predicted proteins in total that has to be
parsed. The screenshot below shows an example of how I have pulled
protein-specific data from a MySQL database built based on InterProScan’s domain
prediction results, and merged the data with additional metadata obtained with
the EB-eye REST service.&lt;/p&gt;

&lt;figure&gt;
&lt;img src=&quot;/dist/images/content/20161128/domain_prediction.png&quot; alt=&quot;Domain predictions for my-favourite-gene, the flagellin receptor
LjFls2&quot; title=&quot;Domain predictions for my-favourite-gene, the flagellin receptor
LjFls2&quot; /&gt;
&lt;figcaption&gt;Domain predictions for my-favourite-gene, the flagellin receptor
&lt;a href=&quot;https://lotus.au.dk/view/transcript/Lj4g3v0281040.1&quot;&gt;&lt;em&gt;LjFls2&lt;/em&gt;&lt;/a&gt;. Domain prediction
graph made using &lt;a href=&quot;https://github.com/d3/d3&quot;&gt;d3.js&lt;/a&gt;.&lt;/figcaption&gt;
&lt;/figure&gt;

&lt;p&gt;Of course, &lt;em&gt;Lotus&lt;/em&gt; Base presents itself as a rather extreme use case due to the
large volume of predicted proteins analysed. However, the methods described
below would be just as applicable to a researcher who say, has obtained a list
of proteins that are significantly enriched in one biological sample compared to
another. The first step towards unraveling the functions of these proteins,
based on their gene ontology predictions, would be to obtain their domain
predictions first.&lt;/p&gt;

&lt;h3 id=&quot;interproscan-vs-interpro-restful-service&quot;&gt;InterProScan vs InterPro RESTful service&lt;/h3&gt;

&lt;p&gt;You have two options from here on—if you are blessed with access to a computing
cluster running on Linux, you can download and install a local version of
&lt;a href=&quot;https://github.com/ebi-pf-team/interproscan&quot;&gt;InterProScan&lt;/a&gt;, and run
InterProScan with FASTA files containing *n *number of sequences in parallel or
in queue. The second, less handy option—but also the most accessible one—is to
take advantage of the &lt;a href=&quot;http://www.ebi.ac.uk/Tools/webservices/services/pfa/iprscan5_rest&quot;&gt;RESTful
service&lt;/a&gt;
provided by EMBL-EBI. The latter can be run on any computer, although preferably
one running Unix/Linux (because that’s what my code will be running on). The
only drawback is that EMBL-EBI’s fair use agreement only allows you to run
InterProScan on 30 sequences at any one time.&lt;/p&gt;

&lt;p&gt;Both services will give you the most up-to-date domain predictions, and
necessitates re-running your proteins if they have included additional datasets.
When InterProScan includes additional prediction algorithms, you can simply
select to run said algorithms—instead of the entire set—on your sequences, and
simply join the output with existing predictions.&lt;/p&gt;

&lt;hr /&gt;

&lt;h3 id=&quot;option-a-using-embl-ebis-interpro-rest-service&quot;&gt;Option A: Using EMBL-EBI’s InterPro REST service&lt;/h3&gt;

&lt;p&gt;Using the REST service provided by EMBL-EBI is a way to perform domain
predictions on your protein(s) of interest without needing to invest in an
expensive computing cluster, or obtaining access to one. For this part of the
tutorial to work, you will need to ensure that Python3 is installed (InterPro
provides a Python2 client library, but that is not covered in this section).&lt;/p&gt;

&lt;h4 id=&quot;explode-fasta-file-into-individual-sequence-files&quot;&gt;Explode FASTA file into individual sequence files&lt;/h4&gt;

&lt;p&gt;As the InterPro REST service only accepts single sequences, the easiest way is
to split a multi-sequence FASTA file into individual sequence files. If your
FASTA files are formatted such that each entry takes up two lines — one for the
header and one for the sequence—you can do something like:&lt;/p&gt;

&lt;div class=&quot;highlighter-rouge&quot;&gt;&lt;div class=&quot;highlight&quot;&gt;&lt;pre class=&quot;highlight&quot;&gt;&lt;code&gt;split -l 2 /path/to/your/fasta/file
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;However, this is often not the case, as FASTA files are recommended to be broken
into lines containing no more than 60 characters long. If that is the case, you
might want to rely on BioPython to do the parsing for you:&lt;/p&gt;

&lt;script src=&quot;https://gist.github.com/74b89de2116b52f06e2917d9ec8ce0ad.js?file=&quot;&gt;&lt;/script&gt;
&lt;div&gt;&lt;noscript&gt;&lt;pre&gt;&lt;code&gt;400: Invalid request
&lt;/code&gt;&lt;/pre&gt;&lt;/noscript&gt;&lt;/div&gt;

&lt;h4 id=&quot;hand-individual-fasta-file-off-to-the-rest-service&quot;&gt;Hand individual FASTA file off to the REST service&lt;/h4&gt;

&lt;p&gt;When you have generated We can then iterate through these individual FASTA files
and pass them to InterPro’s REST service. InterPro has provided us with various
clients to interface with their REST service—I have chosen to work with their
&lt;a href=&quot;http://www.ebi.ac.uk/Tools/webservices/download_clients/python/urllib/iprscan5_urllib3.py&quot;&gt;Python3
client&lt;/a&gt;.
I did not modify their client script, with the exception of commenting out the
line that prints the status in the  function, so that my console will not be
crowded with  printouts.&lt;/p&gt;

&lt;p&gt;It is important to respect the 30 sequences per batch limit of the InterPro REST
service. Therefore, we will use a simple bash script that, while iterating
through all individual FASTA file, stops after 30 files until the outcome from
all 30 jobs have been returned:&lt;/p&gt;

&lt;script src=&quot;https://gist.github.com/889f28314643d429496881adbdd40039.js?file=&quot;&gt;&lt;/script&gt;
&lt;div&gt;&lt;noscript&gt;&lt;pre&gt;&lt;code&gt;400: Invalid request
&lt;/code&gt;&lt;/pre&gt;&lt;/noscript&gt;&lt;/div&gt;

&lt;p&gt;If you want to obtain other output formats, remember to modify the  option.
According to my experience, each batch (of 30 sequences) takes around 2 minutes
to complete.&lt;/p&gt;

&lt;p&gt;The major drawback of this method is that it is a rather nuclear option if you
are attempting to scan the entire collection of predicted proteins/transcripts.
&lt;strong&gt;Use InterProScan on a computing cluster, if ever possible.&lt;/strong&gt;&lt;/p&gt;

&lt;hr /&gt;

&lt;h3 id=&quot;option-b-interproscan-on-a-computing-cluster&quot;&gt;Option B: InterProScan on a computing cluster&lt;/h3&gt;

&lt;h4 id=&quot;installing-interproscan&quot;&gt;Installing InterProScan&lt;/h4&gt;

&lt;p&gt;Follow the &lt;a href=&quot;https://github.com/ebi-pf-team/interproscan/wiki/HowToDownload&quot;&gt;published instructions on installing
InterProScan&lt;/a&gt;. I
have ran into small hiccups, such as accidentally using an outdated version of
Java (≤1.7) and having a dated GCC library. Loading the most updated one ensured
that both InterProScan and the bundled BLAST+ package can be executed properly.&lt;/p&gt;

&lt;h4 id=&quot;adding-proprietary-algorithms&quot;&gt;Adding proprietary algorithms&lt;/h4&gt;

&lt;p&gt;Note that InterProScan does not come with
&lt;a href=&quot;http://phobius.sbc.su.se/data.html&quot;&gt;Phobius&lt;/a&gt;,
&lt;a href=&quot;http://www.cbs.dtu.dk/services/SignalP/&quot;&gt;SignalP&lt;/a&gt;, and
&lt;a href=&quot;http://www.cbs.dtu.dk/services/TMHMM/&quot;&gt;TMHMM&lt;/a&gt; preinstalled. You will have to
request for the compiled binaries of these algorithms, and upload them to their
respective folders in the  directory.&lt;/p&gt;

&lt;blockquote&gt;
  &lt;p&gt;If you are unable to get hold of these libraries, you will have to retrieve the
output of these algorithms via InterPro REST service.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;A hitch with SignalP is that it assumes a fixed directory for loading the
library files. This causes a fatal error where FASTA.pm cannot be
loaded—remember to update the environment so that it points to the signalp
directory (it will load libraries from the  subfolder automagically).&lt;/p&gt;

&lt;div class=&quot;highlighter-rouge&quot;&gt;&lt;div class=&quot;highlight&quot;&gt;&lt;pre class=&quot;highlight&quot;&gt;&lt;code&gt;# full path to the signalp-4.1 directory on your system (mandatory)
BEGIN {
    $ENV{SIGNALP} = '&amp;lt;path/to/interproscan&amp;gt;/bin/signalp/4.1';
}
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;/div&gt;

&lt;h4 id=&quot;check-that-all-prediction-algorithms-are-loaded&quot;&gt;Check that all prediction algorithms are loaded&lt;/h4&gt;

&lt;p&gt;After you’ve done that, ensure that  file is properly updated with the file
paths of your binaries for the added libraries. After that, proceed to run 
without any arguments. It will print out all the algorithms that were detected
and loaded correctly. Ensure that none is left behind—InterProScan will inform
you if any of them has failed to load.&lt;/p&gt;

&lt;p&gt;Depending on the number of sequences you want to submit per batch, you will have
to update&lt;/p&gt;

&lt;h4 id=&quot;getting-your-fasta-files-ready&quot;&gt;Getting your FASTA files ready&lt;/h4&gt;

&lt;p&gt;You would want to process FASTA files in batches instead of all at one go. I
have decided to split a unified FASTA file that contains all 50,000+ of the
amino acid sequences into files containing 500 entries each. If your FASTA files
are formatted such that each entry takes up two lines—one for the header and one
for the sequence—you can do something like:&lt;/p&gt;

&lt;div class=&quot;highlighter-rouge&quot;&gt;&lt;div class=&quot;highlight&quot;&gt;&lt;pre class=&quot;highlight&quot;&gt;&lt;code&gt;split -l 1000 /path/to/your/fasta/file
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;…assuming that you want 500 entries per file. However, this is often not the
case, as FASTA files are recommended to be broken into lines containing no more
than 60 characters long. If that is the case, you might want to rely on
BioPython to do the parsing for you. The first step is to create a filtered
FASTA file that is formatted such that each entry occupies two lines, generating
a  file. The second step is to batch parse this filtered file using itertools,
to create batches of FASTA files containing 500 entries (i.e. 1000 lines) each:&lt;/p&gt;

&lt;script src=&quot;https://gist.github.com/1ecb80f22afb9a6e6600d5355b80351d.js?file=&quot;&gt;&lt;/script&gt;
&lt;div&gt;&lt;noscript&gt;&lt;pre&gt;&lt;code&gt;400: Invalid request
&lt;/code&gt;&lt;/pre&gt;&lt;/noscript&gt;&lt;/div&gt;

&lt;h4 id=&quot;submit-your-jobs-to-iteratively-to-the-computing-cluster&quot;&gt;Submit your jobs to iteratively to the computing cluster&lt;/h4&gt;

&lt;p&gt;In this case, I am using SLURM for batch job submission. I will not go into
details on how the job submission is done, as it is highly dependent on the
configuration of individual clusters. The actual command is quite simple:&lt;/p&gt;

&lt;div class=&quot;highlighter-rouge&quot;&gt;&lt;div class=&quot;highlight&quot;&gt;&lt;pre class=&quot;highlight&quot;&gt;&lt;code&gt;/path/to/interproscan.sh \
-i /path/to/fasta.fa -dp -iprlookup --goterms --pathways
&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;Note that I have turned off precalcualted match lookup using the  flag because
the computing cluster I am on blocks external connections for security reasons.
Moreover, the *Lotus japonicus *proteins are yet to be submitted to UniProt so
it is highly unlikely that we will find too many matching proteins in the public
database.&lt;/p&gt;

&lt;p&gt;Here is an example of how a batch job submission template you can use:&lt;/p&gt;

&lt;script src=&quot;https://gist.github.com/db98b4e869d82948bfa383cdcf01ac43.js?file=&quot;&gt;&lt;/script&gt;
&lt;div&gt;&lt;noscript&gt;&lt;pre&gt;&lt;code&gt;400: Invalid request
&lt;/code&gt;&lt;/pre&gt;&lt;/noscript&gt;&lt;/div&gt;

&lt;p&gt;Boom! Run it and wait for magic to happen.&lt;/p&gt;

&lt;h4 id=&quot;performance&quot;&gt;Performance&lt;/h4&gt;

&lt;p&gt;In the case of &lt;em&gt;Lotus&lt;/em&gt; Base and our collection of predicted transcripts, we have
49,598 sequences scanned in batches of 1,000, creating 50 jobs. The jobs were
run with an allocated 24Gb memory over 12 cores, on nodes equipped with Intel
“Sandy Bridge” E5–2670 (2.67GHz)or “Haswell” E5–2680v3 (2.5GHz) CPUs. After
normalizing for processor speeds and library sizes, the real time consumed per
job stands at 2.50±0.28h (CPU time: 4.32±0.35h).&lt;/p&gt;

&lt;hr /&gt;

&lt;h3 id=&quot;parsing-interprointerproscan-outputs&quot;&gt;Parsing InterPro/InterProScan outputs&lt;/h3&gt;

&lt;p&gt;The file that contains all the juicy data is the TSV file, which you can easily
import into a relational database such as MySQL. The &lt;a href=&quot;https://github.com/ebi-pf-team/interproscan/wiki/OutputFormats#tab-separated-values-format-tsv&quot;&gt;InterProScan
wiki&lt;/a&gt;
has the information on what does each individual column in the TSV file contain.&lt;/p&gt;

&lt;p&gt;For &lt;em&gt;Lotus&lt;/em&gt; Base, I simply imported the TSV file *as-is *into a MySQL table, and
used  statements to merge transcript metadata from additional tables we have.
It’s as simple as that!&lt;/p&gt;

&lt;hr /&gt;

&lt;p&gt;This article is also published on &lt;a href=&quot;https://medium.com/@teddyrised/using-interproscan-like-a-pro-ad18b8c3ccc0#.fj9o7v89z&quot;&gt;Medium.com&lt;/a&gt;.&lt;/p&gt;
</description>
        <pubDate>Mon, 28 Nov 2016 00:00:00 +0100</pubDate>
        <link>https://lotus.au.dk/blog/bioinformatics/2016/11/28/interproscan.html</link>
        <guid isPermaLink="true">https://lotus.au.dk/blog/bioinformatics/2016/11/28/interproscan.html</guid>
        
        <category>programming</category>
        
        <category>python</category>
        
        <category>EMBL-EBI</category>
        
        
        <category>bioinformatics</category>
        
      </item>
    
      <item>
        <title>Introducing ExpAt, the &lt;em&gt;Lotus japonicus&lt;/em&gt; Expression Atlas</title>
        <description>&lt;p&gt;Expression data from the model legume &lt;em&gt;Lotus japonicus&lt;/em&gt;, while publicly available through other online resources, face a fragmented landscape that lacks accessibility and options for analysis and visualisation. Here we introduce &lt;strong&gt;ExpAt&lt;/strong&gt;, the &lt;em&gt;L. japonicus&lt;/em&gt; Expression Atlas, that offers features that empower legume researchers without the need for extensive knowledge in computation or skills in data visualisation.&lt;/p&gt;

&lt;p class=&quot;align-center&quot;&gt;&lt;a href=&quot;/expat&quot; title=&quot;Expression Atlas for Lotus japonicus&quot; class=&quot;button&quot;&gt;Give &lt;strong&gt;ExPat&lt;/strong&gt; a try&lt;/a&gt;&lt;/p&gt;

&lt;h3 id=&quot;what-does-expat-do&quot;&gt;What does ExpAt do?&lt;/h3&gt;

&lt;p&gt;ExpAt is a tool that allows you to query for the expression levels of your genes/transcripts of interest. It generates almost publication-ready, vector-based graphics based on the retrieved expression data, and presents them in a line graph and a heatmap. The line graph feature will be turned off when too many genes/transcripts were used in a single search, as it offers little insight on the expression patterns. You may export all the relevant data and charts by visiting the “export data” options that appears above the charts.&lt;/p&gt;

&lt;p&gt;Here is an example of an unmodified ExpAt chart, showing a line graph and a clustered heatmap with one dendrogram on each axis:&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/dist/images/content/expat01.png&quot; alt=&quot;ExpAt raw example&quot; /&gt;&lt;/p&gt;

&lt;p&gt;Here is an example of a ExpAt chart which is slightly tweaked in Adobe Illustrator, and used in a publication (Mun et al., in review):&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/dist/images/content/expat02.png&quot; alt=&quot;ExpAt modified example&quot; /&gt;&lt;/p&gt;

&lt;h4 id=&quot;data-transformation&quot;&gt;Data transformation&lt;/h4&gt;

&lt;p&gt;For easing quick visual comparison across genes with signicantly different levels of absolute expression—measured by either (1) reads per kilobase of transcript (RPKM) for RNAseq datasets, or (2) arbitrary Affymetrix units for Affymetrix MicroArray datasets—we included two possibilities to transform the expression levels, by normalisation or standardisation.&lt;/p&gt;

&lt;p&gt;Data normalisation is simply the rescaling of expression values to the domain $[0, 1]$, by subtracting the log-transformed sample expression levels $x_s$ with the lowest log-transformed expression level, $(\log_{10} x)_{\min}$, followed by the division of the difference between the log-transformed maximum and minimum expression levels. In order to allow comparison for extreme values, expression values are $\log_{10}$-transformed prior to normalisation.&lt;/p&gt;

&lt;script type=&quot;math/tex; mode=display&quot;&gt;x^\prime_s = \frac{(\log_{10} x_s) - (\log_{10} x)_{\min}}{(\log_{10} x)_{\max} - (\log_{10} x)_{\min}}&lt;/script&gt;

&lt;p&gt;Meanwhile, data standardisation serves to rescale the expression levels on a per row basis, across conditions, to have a mean of zero and a standard deviation of one. This is performed by subtracting the sample expression levels $x_s$ by the average expression level $\mu$ across all samples, and dividing the didderence with the sample standard deviation computed across all samples $\sigma$. This strategy is however erroneously labelled as “normalisation” in some studies.&lt;/p&gt;

&lt;script type=&quot;math/tex; mode=display&quot;&gt;x^\prime_s = \frac{x_s - \mu}{\sigma}&lt;/script&gt;

&lt;h4 id=&quot;clustering&quot;&gt;Clustering&lt;/h4&gt;

&lt;p&gt;ExpAt offers the possibility to cluster your expression level data by gene/transcript idenifiers and/or conditions/samples. This two-dimensional data is referred to as a matrix—when this matrix has a dimension of $1 \times n$ or $n \times 1$, &lt;em&gt;k&lt;/em&gt;-means clustering is used; when this matrix is larger than that, hierarchical agglomerative clustering is used. Changes to the clustering parameters can be modified on the fly and the heatmap and/or line graphs will be updated accordingly.&lt;/p&gt;

&lt;h3 id=&quot;dataset-availability&quot;&gt;Dataset availability&lt;/h3&gt;

&lt;p&gt;We have integrated several publicly-available datasets:&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;The &lt;em&gt;L. japonicus&lt;/em&gt; gene expression (LjGEA)&lt;sup&gt;1–5&lt;/sup&gt;, with probe identifiers mapped to &lt;em&gt;L. japonicus&lt;/em&gt; v3.0 proteins using NCBI BLAST, and&lt;/li&gt;
  &lt;li&gt;The early &lt;em&gt;Lotus&lt;/em&gt; root responses to germinating spore exudates from arbuscular mycorrhizal fungi&lt;sup&gt;6&lt;/sup&gt;.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Should you want to add your own expression data to the &lt;strong&gt;ExpAt&lt;/strong&gt; tool, feel free to reach out to us via the &lt;a href=&quot;/meta/contact&quot;&gt;contact form&lt;/a&gt;.&lt;/p&gt;

&lt;h3 id=&quot;citation&quot;&gt;Citation&lt;/h3&gt;

&lt;p&gt;If you have used ExpAt for data transformation, analysis (&lt;em&gt;k&lt;/em&gt;-means or hierarchical clustering), and/or visualisation, we ask that you cite &lt;em&gt;Lotus&lt;/em&gt; Base&lt;sup&gt;7&lt;/sup&gt;, and the relevant publications that generated the said dataset.&lt;/p&gt;

&lt;h3 class=&quot;refs&quot;&gt;References&lt;/h3&gt;

&lt;ol&gt;
  &lt;li&gt;Verdier, J., Torres-Jerez, I., Wang, M., Andriankaja, A., Allen, S. N., He, J., Tang, Y., Murray, J. D., and Udvardi, M. K. (2013). Establishment of the &lt;em&gt;Lotus japonicus&lt;/em&gt; gene expression atlas (LjGEA) and its use to explore legume seed maturation. &lt;em&gt;Plant J&lt;/em&gt;, 74(&lt;strong&gt;2&lt;/strong&gt;):351–362.&lt;/li&gt;
  &lt;li&gt;Díaz, P., Betti, M., Sánchez, D. H., Udvardi, M. K., Monza, J., and Márquez, A. J. (2010). De ciency in plastidic glutamine synthetase alters proline metabolism and transcriptomic response in &lt;em&gt;Lotus japonicus&lt;/em&gt; under drought stress. &lt;em&gt;New Phytol&lt;/em&gt;, 188(&lt;strong&gt;4&lt;/strong&gt;):1001–1013.&lt;/li&gt;
  &lt;li&gt;Guether, M., Balestrini, R., Hannah, M., He, J., Udvardi, M. K., and Bonfante, P. (2009). Genome-wide reprogramming of regulatory networks, transport, cell wall and membrane biogenesis during arbuscular mycorrhizal symbiosis in &lt;em&gt;Lotus japonicus&lt;/em&gt;. &lt;em&gt;New Phytol&lt;/em&gt;, 182(&lt;strong&gt;1&lt;/strong&gt;):200–212.&lt;/li&gt;
  &lt;li&gt;Høgslund, N., Radutoiu, S., Krusell, L., Voroshilova, V., Hannah, M. A., Go ard, N., Sanchez, D. H., Lippold, F., Ott, T., Sato, S., Tabata, S., Liboriussen, P., Lohmann, G. V., Schauser, L., Weiller, G. F., Udvardi, M. K., and Stougaard, J. (2009). Dissection of symbiosis and organ development by integrated transcrip- tome analysis of &lt;em&gt;Lotus japonicus&lt;/em&gt; mutant and wild-type plants. &lt;em&gt;PLoS One&lt;/em&gt;, 4(&lt;strong&gt;8&lt;/strong&gt;):e6556.&lt;/li&gt;
  &lt;li&gt;Sanchez, D. H., Lippold, F., Redestig, H., Hannah, M. A., Erban, A., Krämer, U., Kopka, J., and Udvardi, M. K. (2008). Integrative functional genomics of salt acclimatization in the model legume &lt;em&gt;Lotus japonicus&lt;/em&gt;. &lt;em&gt;Plant J&lt;/em&gt;, 53(&lt;strong&gt;6&lt;/strong&gt;):973–987.&lt;/li&gt;
  &lt;li&gt;Giovannetti, M., Mari, A., Novero, M., and Bonfante, P. (2015). Early &lt;em&gt;Lotus japonicus&lt;/em&gt; root transcriptomic responses to symbiotic and pathogenic fungal exudates. &lt;em&gt;Front Plant Sci&lt;/em&gt;, 6:480.&lt;/li&gt;
  &lt;li&gt;Mun, T., Bachmann, A., Gupta, V., Stougaard, J., and Andersen, S. U. (under review). &lt;em&gt;Lotus&lt;/em&gt; base, an integrated information portal for &lt;em&gt;Lotus japonicus&lt;/em&gt;.&lt;/li&gt;
&lt;/ol&gt;
</description>
        <pubDate>Fri, 26 Aug 2016 00:00:00 +0200</pubDate>
        <link>https://lotus.au.dk/blog/announcement/2016/08/26/expat.html</link>
        <guid isPermaLink="true">https://lotus.au.dk/blog/announcement/2016/08/26/expat.html</guid>
        
        <category>expat</category>
        
        <category>expression</category>
        
        <category>tools</category>
        
        
        <category>announcement</category>
        
      </item>
    
      <item>
        <title>Mapping transcripts across &lt;em&gt;Lotus&lt;/em&gt; genome versions</title>
        <description>&lt;p&gt;The availability of various versions of the &lt;em&gt;L. japonicus&lt;/em&gt; genome, while proving to be an important resource in legume research, makes it difficult for users to map annotated genes and/or transcripts from one version to another. The &lt;a href=&quot;/tools/tram&quot;&gt;Transcript Mapper (TRAM) tool&lt;/a&gt; can be used for exactly this purpose, and we currently support versions 2.5 and 3.0. As with all the other tooklits provided with &lt;em&gt;Lotus&lt;/em&gt; Base, TRAM provides deep-linking to other tools for your convenience.&lt;/p&gt;
</description>
        <pubDate>Wed, 24 Aug 2016 00:00:00 +0200</pubDate>
        <link>https://lotus.au.dk/blog/announcement/2016/08/24/tram.html</link>
        <guid isPermaLink="true">https://lotus.au.dk/blog/announcement/2016/08/24/tram.html</guid>
        
        <category>tram</category>
        
        <category>tools</category>
        
        
        <category>announcement</category>
        
      </item>
    
      <item>
        <title>Redesigned &lt;em&gt;LORE1&lt;/em&gt; search form, and pan-version TREX searches</title>
        <description>&lt;p&gt;&lt;em&gt;Lotus&lt;/em&gt; Base was originally conceived as a very simple web interface for the searching for, and ordering of, LORE1 lines, but over the years it gradually evolved into a fully-fledged &lt;em&gt;Lotus japonicus&lt;/em&gt; online resource. Therefore, it is not surprising that the LORE1 search form is one of the most antiquated and complicated components of the site, which we never really got around to upgrading it.&lt;/p&gt;

&lt;p&gt;Now the &lt;a href=&quot;/lore1/search&quot;&gt;LORE1 line search page&lt;/a&gt; has been revamped and brought up to date with the cleaner style of the site in general. To improve user experience, we have removed the step-form-like search flow, which complicated the decidedly simple purpose of the form anyway—to search for LORE1 lines of interest.&lt;/p&gt;

&lt;p&gt;In other news, we have enabled pan-&lt;em&gt;Lj&lt;/em&gt;-genome-version Transcript Explorer (TREX) searches. Although the form defaults to the &lt;strong&gt;latest version of the genome&lt;/strong&gt; (at the time of writing, this would be &lt;strong&gt;v3.0&lt;/strong&gt;), it is possible to select from other versions, either in a standalone or combinatory manner, of all hitherto published &lt;em&gt;L. japonicus&lt;/em&gt; genomes. Do note that due to the way the genome is assembled, &lt;strong&gt;genome coordinates are not preserved across versions&lt;/strong&gt;. For example, position 65,536 on chromosome 1 in v2.5 will not be position 65,536 on chromosome 1 in v3.0.&lt;/p&gt;
</description>
        <pubDate>Sat, 05 Mar 2016 00:00:00 +0100</pubDate>
        <link>https://lotus.au.dk/blog/announcement/2016/03/05/redesigned-lore1-search-form-pan-version-trex-search.html</link>
        <guid isPermaLink="true">https://lotus.au.dk/blog/announcement/2016/03/05/redesigned-lore1-search-form-pan-version-trex-search.html</guid>
        
        <category>lore1</category>
        
        <category>trex</category>
        
        <category>tools</category>
        
        
        <category>announcement</category>
        
      </item>
    
      <item>
        <title>JBrowse updated to v1.12.0</title>
        <description>&lt;blockquote class=&quot;twitter-tweet&quot; data-lang=&quot;en&quot;&gt;&lt;p lang=&quot;en&quot; dir=&quot;ltr&quot;&gt;Just upgraded the &lt;em&gt;Lotus japonicus&lt;/em&gt; &lt;a href=&quot;https://twitter.com/hashtag/genome?src=hash&quot;&gt;#genome&lt;/a&gt; browser to the latest version of JBrowse, v1.12.0 &lt;a href=&quot;https://t.co/ttAO0z7O8o&quot;&gt;https://t.co/ttAO0z7O8o&lt;/a&gt;—thank you, &lt;a href=&quot;https://twitter.com/usejbrowse&quot;&gt;@usejbrowse&lt;/a&gt;!&lt;/p&gt;&amp;mdash; LotusBase (@lotusbase) &lt;a href=&quot;https://twitter.com/lotusbase/status/704761942020399107&quot;&gt;March 1, 2016&lt;/a&gt;&lt;/blockquote&gt;
&lt;script async=&quot;&quot; src=&quot;https://platform.twitter.com/widgets.js&quot; charset=&quot;utf-8&quot;&gt;&lt;/script&gt;

&lt;p&gt;We have successfully upgraded our jBrowse installation, used to power visualization of the &lt;a href=&quot;/genome&quot;&gt;&lt;em&gt;L. japonicus&lt;/em&gt; genome&lt;/a&gt;, to &lt;a href=&quot;http://jbrowse.org/jbrowse-1-12-0/&quot;&gt;version 1.12.0&lt;/a&gt;. The new version comes with several exciting features, such as:&lt;/p&gt;

&lt;blockquote&gt;
  &lt;ul&gt;
    &lt;li&gt;Added ability to open a new genome in FASTA format from the browser. Also supports indexed FASTA.&lt;/li&gt;
    &lt;li&gt;Support for inline reference sequence configurations.&lt;/li&gt;
  &lt;/ul&gt;
  &lt;p class=&quot;align-right&quot;&gt;Source: &lt;a href=&quot;http://jbrowse.org/jbrowse-1-12-0/&quot; title=&quot;JBrowse-1.12.0: Open new genome from FASTA, in-line refseqs, NeatFeatures, Desktop&quot;&gt;http://jbrowse.org/jbrowse-1-12-0/&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;
</description>
        <pubDate>Tue, 01 Mar 2016 00:00:00 +0100</pubDate>
        <link>https://lotus.au.dk/blog/announcement/2016/03/01/jbrowse-updated-to-v1.12.0.html</link>
        <guid isPermaLink="true">https://lotus.au.dk/blog/announcement/2016/03/01/jbrowse-updated-to-v1.12.0.html</guid>
        
        <category>jbrowse</category>
        
        <category>tools</category>
        
        
        <category>announcement</category>
        
      </item>
    
      <item>
        <title>&lt;em&gt;Lotus&lt;/em&gt; Base Soft Launch</title>
        <description>&lt;p&gt;&lt;img src=&quot;/dist/images/content/lotusbase-soft-launch.jpg&quot; alt=&quot;Lotus Base&quot; /&gt;&lt;/p&gt;

&lt;p&gt;After in a few years of continuous development and a total of 160,000+ lines of code written, we are soft-launching &lt;em&gt;Lotus&lt;/em&gt; Base, an online resource platform for the model legume &lt;em&gt;Lotus japonicus&lt;/em&gt;.&lt;/p&gt;

&lt;h3 id=&quot;what-is-lotus-base&quot;&gt;What is &lt;em&gt;Lotus&lt;/em&gt; Base?&lt;/h3&gt;
&lt;p&gt;&lt;em&gt;Lotus&lt;/em&gt; Base aims to be a one-stop online resource for everything concering the model legume &lt;em&gt;Lotus&lt;/em&gt; japonicus. We are currently hosting both v2.5 and v3.0 of the &lt;em&gt;L. japonicus&lt;/em&gt; genome, as well as other databases associated with it, such as LORE1 insertions, protein sequences, coding sequences, mRNA libraries and more.&lt;/p&gt;

&lt;h3 id=&quot;lore1-lines-ordering&quot;&gt;LORE1 lines ordering&lt;/h3&gt;
&lt;p&gt;For now we are sticking to the old site for processing LORE1 orders, due to delays in implementing the new order system. We will let you know once this migration in complete. Otherwise, the ordering protocol is the same as usual.&lt;/p&gt;

&lt;h3 id=&quot;made-by-users-for-users&quot;&gt;Made by users, for users&lt;/h3&gt;
&lt;p&gt;&lt;em&gt;Lotus&lt;/em&gt; Base is developed in-house in the Centre for Carboydrate Recognition and Signalling, Aarhus University, Denmark. Before this public release, we have tested the site extensively with the help of our own users—the user experience of the site is tailored to the needs of researchers like us.&lt;/p&gt;

&lt;p&gt;We have an extensive set of tools for the end-user, and have integrated various third-party, open-source project to make most of our resources easily available to the public. For example, we are using &lt;a href=&quot;http://www.sequenceserver.com&quot;&gt;SequenceServer&lt;/a&gt; as a wrapper for &lt;a href=&quot;https://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE_TYPE=BlastDocs&amp;amp;DOC_TYPE=Download&quot;&gt;NCBI BLAST binaries&lt;/a&gt; to power our &lt;a href=&quot;/blast&quot;&gt;customized BLAST&lt;/a&gt;, as well as &lt;a href=&quot;http://jbrowse.org&quot;&gt;JBrowse&lt;/a&gt; for our &lt;a href=&quot;/genome&quot;&gt;genome browser&lt;/a&gt;. We will update these tools on a regular basis whenever we see fit.&lt;/p&gt;

&lt;p&gt;We have some tools that are only available for internal access, marked with a “Closed Beta” message if you ever encounter them. If you want to gain access to them, please &lt;a href=&quot;/meta/contact&quot;&gt;contact us&lt;/a&gt;.&lt;/p&gt;

&lt;h3 id=&quot;help-us-make-lotus-base-better&quot;&gt;Help us make &lt;em&gt;Lotus&lt;/em&gt; Base better&lt;/h3&gt;
&lt;p&gt;If you have encountered any technical issues with using the site, feel free to &lt;a href=&quot;/issues&quot;&gt;let us know via our issue tracker&lt;/a&gt;. We have a small but dedicated team of developers working on the project.&lt;/p&gt;

&lt;h3 id=&quot;stay-updated&quot;&gt;Stay updated&lt;/h3&gt;
&lt;p&gt;&lt;a href=&quot;#mc-embedded-subscribe-form&quot;&gt;Subscribe to the &lt;em&gt;Lotus&lt;/em&gt; Base newsletter&lt;/a&gt; to stay updated with the most recent news.&lt;/p&gt;
</description>
        <pubDate>Wed, 27 Jan 2016 00:00:00 +0100</pubDate>
        <link>https://lotus.au.dk/blog/announcement/2016/01/27/lotus-base-soft-launch.html</link>
        <guid isPermaLink="true">https://lotus.au.dk/blog/announcement/2016/01/27/lotus-base-soft-launch.html</guid>
        
        <category>site</category>
        
        
        <category>announcement</category>
        
      </item>
    
      <item>
        <title>Change in v3.0 gene nomenclature</title>
        <description>&lt;p&gt;Due to the upcoming release of version 4 of the &lt;em&gt;Lotus&lt;/em&gt; genome and gene accession IDs, and that we are expecting coordinates to change drastically, we pre-empted a possible clash in the namespace of gene accessions. Therefore, we have implemented a change in version 3.0 gene accessions for &lt;em&gt;Lotus japonicus&lt;/em&gt; with immediate effect.&lt;/p&gt;

&lt;p&gt;For example, the old accessions ID for the gene “ATP synthase D chain-related protein” is &lt;code class=&quot;highlighter-rouge&quot;&gt;Lj1g2536050.1&lt;/code&gt;. With the updated nomenclature where the version number is appended after the chromosome name, the new accession ID for the same gene will be &lt;code&gt;Lj1g&lt;strong&gt;3v&lt;/strong&gt;2536050.1&lt;/code&gt;.&lt;/p&gt;

&lt;p&gt;A quick way to convert your existing gene IDs, should you want to search them against our databases, would be to append &lt;code class=&quot;highlighter-rouge&quot;&gt;3v&lt;/code&gt; after the &lt;code class=&quot;highlighter-rouge&quot;&gt;Lj[…]g[…]&lt;/code&gt; text in your gene accession ID so that it becomes &lt;code class=&quot;highlighter-rouge&quot;&gt;Lj[…]g3v[…]&lt;/code&gt;. The databases and site features affected by this update in nomenclature:&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;&lt;a href=&quot;/lore1/search&quot;&gt;LORE1 search&lt;/a&gt;&lt;/li&gt;
  &lt;li&gt;&lt;a href=&quot;/blast/&quot;&gt;BLAST&lt;/a&gt; databases&lt;/li&gt;
  &lt;li&gt;Gene annotations (≥v3.0)&lt;/li&gt;
  &lt;li&gt;Genic and exonic insertions databases (≥v3.0)&lt;/li&gt;
  &lt;li&gt;&lt;a href=&quot;/expat/&quot;&gt;Expression Atlas (ExpAt)&lt;/a&gt; databases&lt;/li&gt;
&lt;/ul&gt;
</description>
        <pubDate>Thu, 13 Aug 2015 00:00:00 +0200</pubDate>
        <link>https://lotus.au.dk/blog/announcement/2015/08/13/change-in-v30-gene-nomenclature.html</link>
        <guid isPermaLink="true">https://lotus.au.dk/blog/announcement/2015/08/13/change-in-v30-gene-nomenclature.html</guid>
        
        <category>gene</category>
        
        <category>nomenclature</category>
        
        
        <category>announcement</category>
        
      </item>
    
  </channel>
</rss>
