<?xml version="1.0"?><?xml-stylesheet type="text/xsl" href="/rss.xsl"?><rss version="2.0"><channel><title>mbf Discussions Rss Feed</title><link>http://mbf.codeplex.com/Thread/List.aspx</link><description>mbf Discussions Rss Description</description><item><title>New Post: A utility like UCSC liftover </title><link>http://mbf.codeplex.com/discussions/274965</link><description>&lt;div style="line-height: normal;"&gt;&lt;p&gt;&lt;span style="color: #1f497d; font-size: 11pt;"&gt;The LiftOver utility provided by UCSC converts genome annotations from one assembly to another. This requires a defined transform between the source and target assembly, or a route from source to target via intermediate assemblies &amp;ndash; for the UCSC utility, these transforms are supplied in the form of chain files.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d; font-size: 11pt;"&gt;MBF does not provide equivalent functionality currently; while some of the groundwork to support annotations has been done, MBF as a whole does not yet contain a full-featured annotation model. This is one of the things we would like to see added to MBF by the community, and would appreciate any feedback on what the requirements would be, how it would be used by the community, and how to improve on the current LiftOver utility. We would be happy to provide assistance to any programmer interested in extending MBF in this way.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d; font-size: 11pt;"&gt;An alternative would simply be to port LiftOver to .NET &amp;ndash; if done correctly, the resulting application could run on Windows, Linux and other platforms.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d; font-size: 11pt;"&gt;Rick Benge &lt;/span&gt;&lt;span style="color: #1f497d; font-size: 11pt;"&gt;for the MBF team&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><author>Rickbe</author><pubDate>Fri, 07 Oct 2011 22:01:26 GMT</pubDate><guid isPermaLink="false">New Post: A utility like UCSC liftover  20111007100126P</guid></item><item><title>New Post: A utility like UCSC liftover </title><link>http://mbf.codeplex.com/discussions/274965</link><description>&lt;div style="line-height: normal;"&gt;
&lt;p&gt;Just wondering if MBF supply a liftover utility like the one provided by UCSC&amp;nbsp;&lt;a href="http://genome.ucsc.edu/cgi-bin/hgLiftOver"&gt;http://genome.ucsc.edu/cgi-bin/hgLiftOver&lt;/a&gt;&amp;nbsp;&lt;/p&gt;
&lt;/div&gt;</description><author>mkhushi</author><pubDate>Thu, 06 Oct 2011 11:42:59 GMT</pubDate><guid isPermaLink="false">New Post: A utility like UCSC liftover  20111006114259A</guid></item><item><title>New Post: use of HPC pack 2008 to launch Microsoft Assembler like a cluster apliccation</title><link>http://mbf.codeplex.com/discussions/274141</link><description>&lt;div style="line-height: normal;"&gt;&lt;p&gt;&lt;span style="color: #000080;"&gt;Leonard and I have exchanged some emails and I wanted to recap for the community here.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d;"&gt;The Sequence Assembler application&amp;nbsp; is provided as a demonstration application only &amp;ndash; it shows what can be done with MBF when the library is integrated with a GUI application, in this case one built using Windows Presentation Foundation and Silverlight. The Sequence Assembler application&amp;nbsp;has not been tested for&amp;nbsp;lab use, and would be&amp;nbsp;non performant&amp;nbsp;with any significant dataset.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d;"&gt;None of our work has been designed to take advantge of a cluster at this point. In other words it will work in that environ but it is not architected specifically to take advantage of clustered hardware.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d;"&gt;In our exchanges using his dataset he had used a Kmer value &amp;gt;32 as a parameter to padeNA and got the message&amp;nbsp;&amp;lsquo;values of K over 32 are not supported&amp;rsquo;. Further details are available in the Padena paper, but kmer length is a parameter for the Padena assembly, not the length of the input sequences from the FASTA file. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d;"&gt;Any dataset&amp;nbsp;that contains many &amp;lsquo;N&amp;rsquo; ambiguity characters,&amp;nbsp;any characters other than ACGT, are not compatible with the current version of the Padena algorithm. There are two approaches to dealing with this:&lt;/span&gt;&lt;/p&gt;
&lt;p style="padding-left: 30px;"&gt;&lt;span style="color: #1f497d;"&gt;&lt;span style="color: #1f497d; font-size: 11pt;"&gt;1. Filter the reads, excluding those that contain N characters. Use the smaller remaining dataset in the assembly. This will work, but will lose a lot of data that could be valuable.&amp;nbsp;&lt;/span&gt;&amp;nbsp;&lt;/span&gt;&lt;/p&gt;
&lt;p style="padding-left: 30px;"&gt;&lt;span style="color: #1f497d; font-size: 11pt;"&gt;2. Split reads containing N characters, so the string AAAAAAAAAANGGGGGGGGGG would become two reads, AAAAAAAAAA and GGGGGGGGGG, etc. This will preserve more of the data and would be a better solution&lt;/span&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d;"&gt;The filtered reads can be assembled using PadenaUtil commandline utility, and should only take a minute or so to complete with the dataset he provided.&amp;nbsp; I would recommend setting the parameter k to 32 initially, but for the best results you should try varying all the parameters; read the Padena paper to learn more.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d;"&gt;I hope this information is helpful to other members of the community as well.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><author>Rickbe</author><pubDate>Wed, 05 Oct 2011 21:18:48 GMT</pubDate><guid isPermaLink="false">New Post: use of HPC pack 2008 to launch Microsoft Assembler like a cluster apliccation 20111005091848P</guid></item><item><title>New Post: use of HPC pack 2008 to launch Microsoft Assembler like a cluster apliccation</title><link>http://mbf.codeplex.com/discussions/274141</link><description>&lt;div style="line-height: normal;"&gt;&lt;p&gt;Leonard, thanks for your post. Just to clarify you are trying to run the Sequence assembler GUI based sample app on a cluster. I guess the first thing to examine is your file location spelling. Presumably you have done that but just to make sure here.&lt;/p&gt;
&lt;p&gt;If you want to flesh out more of what you are trying to accomplish here perhaps we can assist you more. I was just curious as to why for instance you wouldn't be using PadeNA assembler in this case.&lt;/p&gt;
&lt;p&gt;As a side note - we are planning our V2 release shortly (although in this case that should not make a difference for this report)&lt;/p&gt;
&lt;p&gt;If you care to share your location and MBF plans please either post here or you can reply to myself privately at &lt;a href="mailto:a-rickbe@microsoft.com"&gt;a-rickbe@microsoft.com&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Thanks&lt;/p&gt;&lt;/div&gt;</description><author>Rickbe</author><pubDate>Wed, 28 Sep 2011 18:05:42 GMT</pubDate><guid isPermaLink="false">New Post: use of HPC pack 2008 to launch Microsoft Assembler like a cluster apliccation 20110928060542P</guid></item><item><title>New Post: use of HPC pack 2008 to launch Microsoft Assembler like a cluster apliccation</title><link>http://mbf.codeplex.com/discussions/274141</link><description>&lt;div style="line-height: normal;"&gt;hey everybody!!!! i'm using a cluster with microsoft technologys with 3 nodes, using windows server HPC 2008, with HPC pack 2008 , i need to run microsoft assembler, but its imposible, because i have the next error: Unable to open standard input file on
 node NODO1:Exception 'Failed to open standard input file 'D:\aplicaciones\Microsoft Biology Initiative\2.0\MBT\Sequence Assembler', Access is denied' reported creating the task. after i had this: Aborting: failed to launch 'Sequence Assembler.exe' on nodo1
 Error (2) The system cannot find the file specified. i think i'm lauching the job to the right form. thanks for the help,&lt;/div&gt;</description><author>leonardoBio</author><pubDate>Wed, 28 Sep 2011 17:06:18 GMT</pubDate><guid isPermaLink="false">New Post: use of HPC pack 2008 to launch Microsoft Assembler like a cluster apliccation 20110928050618P</guid></item><item><title>New Post: Attendance at Hackathon 2011</title><link>http://mbf.codeplex.com/discussions/268601</link><description>&lt;div style="line-height: normal;"&gt;
&lt;p&gt;I was just curious if anyone in the community might be attending Hackathon 2011 -
&lt;a href="http://2011.biohackathon.org/home"&gt;http://2011.biohackathon.org/home&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;I would love to see an MBF entry but I haven't heard of anyone at this point and it starts in 11 days.&lt;/p&gt;
&lt;p&gt;If you do plan to enter or attend would love to correspond about your experience and especially if it is using MBF.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;&lt;span style="color:#1f497d"&gt;Rick Benge &lt;/span&gt;&lt;/strong&gt;&lt;strong&gt;&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="color:#1f497d; font-size:9pt"&gt;Community Program Manager &lt;/span&gt;&lt;span style="color:#1f497d"&gt;&amp;ndash;
&lt;/span&gt;&lt;span style="color:#1f497d; font-size:9pt"&gt;&lt;a href="http://research.microsoft.com/en-us/projects/bio/"&gt;&lt;span style="color:blue"&gt;Microsoft Biology Foundation&lt;/span&gt;&lt;/a&gt;&lt;/span&gt;&lt;span style="color:red; font-size:9pt"&gt;&amp;nbsp;|
&lt;/span&gt;&lt;span style="color:#1f497d; font-size:9pt"&gt;Microsoft Research Connections&lt;/span&gt;&lt;span style="color:red; font-size:9pt"&gt;&amp;nbsp;|
&lt;/span&gt;&lt;span style="color:#1f497d; font-size:9pt"&gt;Office &amp;#43;1 425-538-4921&lt;/span&gt;&lt;span style="color:red; font-size:9pt"&gt;
&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;/div&gt;</description><author>Rickbe</author><pubDate>Wed, 10 Aug 2011 21:43:58 GMT</pubDate><guid isPermaLink="false">New Post: Attendance at Hackathon 2011 20110810094358P</guid></item><item><title>New Post: Another genome browser framework</title><link>http://mbf.codeplex.com/discussions/265751</link><description>&lt;div style="line-height: normal;"&gt;&lt;p&gt;Hello Wang,&lt;/p&gt;
&lt;p&gt;Thanks for your interest in MBF, I'm sorry I missed you at the poster. I would be very interested in learning more about your project, and if there is any way in which MBF can assist you. As you know from the poster, we have a prototype genome browser built on top of MBF and which will be released in the coming months.&lt;/p&gt;
&lt;p&gt;Please fell free to email me with more details, and we can discuss,&lt;/p&gt;
&lt;p&gt;Many thanks,&lt;/p&gt;
&lt;p&gt;Simon Mercer&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;/div&gt;</description><author>sjmercer</author><pubDate>Thu, 21 Jul 2011 20:20:33 GMT</pubDate><guid isPermaLink="false">New Post: Another genome browser framework 20110721082033P</guid></item><item><title>New Post: Another genome browser framework</title><link>http://mbf.codeplex.com/discussions/265751</link><description>&lt;div style="line-height: normal;"&gt;&lt;p&gt;You attended the presentation done by Simon Mercer. His Microsoft email address is &lt;a href="mailto:simon.mercer@microsoft.com"&gt;simon.mercer@microsoft.com&lt;/a&gt;. Feel free to include myself (Rick Benge) in the correspondence. I am the community program manager for MBF. My email address is &lt;a href="mailto:a-rickbe@microsoft.com"&gt;a-rickbe@microsoft.com&lt;/a&gt;. There are several programs and additions that use MBF - you may have seen GenoZoom2 at his presentation. You can get additional information from this primary link &lt;a href="http://research.microsoft.com/en-US/projects/bio/mbf.aspx"&gt;http://research.microsoft.com/en-US/projects/bio/mbf.aspx&lt;/a&gt;&amp;nbsp;but feel free to ask questions on our discussion area. Thanks for your interest.&lt;/p&gt;
&lt;p&gt;Rick Benge&lt;/p&gt;
&lt;p&gt;Community Program Manager for MBF&lt;/p&gt;&lt;/div&gt;</description><author>Rickbe</author><pubDate>Tue, 19 Jul 2011 16:26:03 GMT</pubDate><guid isPermaLink="false">New Post: Another genome browser framework 20110719042603P</guid></item><item><title>New Post: Another genome browser framework</title><link>http://mbf.codeplex.com/discussions/265751</link><description>&lt;div style="line-height: normal;"&gt;
&lt;p&gt;I just listened the MBF introduction in vienna ISMB 2011 conference, and it's really beautiful. And I want to talk to you in face during poster session, but I did not find somebody in front of your poster...&lt;/p&gt;
&lt;p&gt;Actually, we are now developing a genome browser framework for annotation visulization: http://www.abrowse.org/. Moreover, user can write on the genome instantly besides browsing. And built-in query system and BioMart are all supported for users to fetch
 bulk dataset.&lt;/p&gt;
&lt;p&gt;Hope to learn from you. Thank you very much.&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;Wang Jun&lt;/p&gt;
&lt;p&gt;Center for Bioinformatics&lt;/p&gt;
&lt;p&gt;Peking University&lt;/p&gt;
&lt;p&gt;P.R.China&lt;/p&gt;
&lt;/div&gt;</description><author>kathleenxppp</author><pubDate>Tue, 19 Jul 2011 16:00:18 GMT</pubDate><guid isPermaLink="false">New Post: Another genome browser framework 20110719040018P</guid></item><item><title>New Post: Venn Diagram based on Genomic Intervals issue</title><link>http://mbf.codeplex.com/discussions/265428</link><description>&lt;div style="line-height: normal;"&gt;Thank you for the reply.
&lt;div&gt;&lt;br&gt;
&lt;/div&gt;
&lt;div&gt;I am trying to do the following.&lt;/div&gt;
&lt;div&gt;&lt;br&gt;
&lt;/div&gt;
&lt;div&gt;I have two ChipSeq results. These consist of interval data, chromosome, start, end, peak location and peak height.&lt;/div&gt;
&lt;div&gt;&lt;br&gt;
&lt;/div&gt;
&lt;div&gt;I want to see the number of peaks that overlap in these two datasets and the number that is unique to each set.
&lt;/div&gt;
&lt;div&gt;&lt;br&gt;
&lt;/div&gt;
&lt;div&gt;Exp A has 2000 peaks and Exp B has 5000 peaks. &lt;/div&gt;
&lt;div&gt;&lt;br&gt;
&lt;/div&gt;
&lt;div&gt;The Excel Addin for Venn Diagram is creating more than 9000 'regions' from these two data. This is incorrect for my purposes (because I only have 7000 total peaks).&lt;/div&gt;
&lt;div&gt;&lt;br&gt;
&lt;/div&gt;
&lt;div&gt;I can send you some of my data as well as the analysis I have done so far.&lt;/div&gt;
&lt;div&gt;&lt;br&gt;
&lt;/div&gt;
&lt;div&gt;Please tell me how to send the data.&lt;/div&gt;
&lt;div&gt;&lt;br&gt;
&lt;/div&gt;
&lt;div&gt;Best,&lt;/div&gt;
&lt;div&gt;Vivek&lt;/div&gt;
&lt;div&gt;&lt;br&gt;
&lt;/div&gt;
&lt;div&gt;&lt;br&gt;
&lt;/div&gt;
&lt;div&gt;&lt;br&gt;
&lt;/div&gt;
&lt;div&gt;&lt;br&gt;
&lt;/div&gt;
&lt;/div&gt;</description><author>kumar35885</author><pubDate>Mon, 18 Jul 2011 16:55:08 GMT</pubDate><guid isPermaLink="false">New Post: Venn Diagram based on Genomic Intervals issue 20110718045508P</guid></item><item><title>New Post: Venn Diagram based on Genomic Intervals issue</title><link>http://mbf.codeplex.com/discussions/265428</link><description>&lt;div style="line-height: normal;"&gt;&lt;p&gt;Hi Vivek,&lt;/p&gt;
&lt;p&gt;It has been a while since I looked at the interval or&amp;nbsp;VennDiagram code, so I am not sure exactly what you are trying to do and the steps you are using to do it.&amp;nbsp; &lt;br /&gt;It sounds like you want to join&amp;nbsp;the two intervals A and B together to create the 'A and B' interval using the IntersectOutputType::OverlappingIntervals rather than the OverlappingPiecesOfIntervals modifier for the SequenceRangeGrouping in the CreateSequenceRangeGroupingForVennDiagram in VennToNodeXL.cs.&lt;/p&gt;
&lt;p&gt;If you are writing C# and using the library to produce the Venn diagram, you might be able to&amp;nbsp;clone a bit of the code into your source and set the IntersetOutputType for what you want to do.&amp;nbsp; If you are doing it in excel, the plug-in and/or the library will have to be modified.&lt;/p&gt;
&lt;p&gt;If you have a set of small sample files and the steps you are taking, we could arrange some discuss what you want to and how to do it.&lt;/p&gt;
&lt;p&gt;-bobd-&lt;/p&gt;&lt;/div&gt;</description><author>bobd00</author><pubDate>Mon, 18 Jul 2011 08:52:03 GMT</pubDate><guid isPermaLink="false">New Post: Venn Diagram based on Genomic Intervals issue 20110718085203A</guid></item><item><title>New Post: Venn Diagram based on Genomic Intervals issue</title><link>http://mbf.codeplex.com/discussions/265428</link><description>&lt;div style="line-height: normal;"&gt;
&lt;p&gt;Hello Everyone,&lt;/p&gt;
&lt;p&gt;I have results from two ChipSeq experiments. &amp;nbsp;&lt;/p&gt;
&lt;p&gt;Experiment A has 1900 peaks and Experiment B has 4000 peaks.&lt;/p&gt;
&lt;p&gt;When I create Venn Diagram using the interval data, the total number of resulting regions numbers almost 10000. &amp;nbsp;This is much higher than the sum of the two groups.&lt;/p&gt;
&lt;p&gt;This occurs because certain interval from the two Experiments are being split. &amp;nbsp;&lt;/p&gt;
&lt;p&gt;So if interval 1 is 10-100 and interval 2 is 50-150, this results in 3 separate genomic regions &amp;nbsp;(0-50 (A only), 50-100 (A and B), and 100-150 (B only)).&lt;/p&gt;
&lt;p&gt;Is there a way to control this behavior. For instance, I do not want these intervals to be split into three, but treated as one. &amp;nbsp;So even if there is only one base pair overlap, it should be called A and B. &amp;nbsp;&lt;/p&gt;
&lt;p&gt;Otherwise, 5900 peaks end up in 10,000 genomic regions which is hard to explain to biologists.&lt;/p&gt;
&lt;p&gt;Any help is appreciated on this matter.&lt;/p&gt;
&lt;p&gt;Vivek&lt;/p&gt;
&lt;/div&gt;</description><author>kumar35885</author><pubDate>Sun, 17 Jul 2011 03:10:54 GMT</pubDate><guid isPermaLink="false">New Post: Venn Diagram based on Genomic Intervals issue 20110717031054A</guid></item><item><title>New Post: NUCmer parameters "--maxmatch" and "--nooptimize"</title><link>http://mbf.codeplex.com/discussions/262550</link><description>&lt;div style="line-height: normal;"&gt;&lt;p&gt;Sending for Bob, who is on vacation:&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d; font-size: 11pt;"&gt;Hi Robert,&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d; font-size: 11pt;"&gt;You should be able to use the -maxmatch flag to nucmerutil.exe.&amp;nbsp; (I think the short form is -x)&amp;nbsp; You can look at the code in nucmerutil if you want to see how it is used to get the library to do what you want.&amp;nbsp; I found the use of a default parameter and setting it to true can be a bit confusing, so be careful there.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d; font-size: 11pt;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d; font-size: 11pt;"&gt;I don&amp;rsquo;t know the situation on &amp;ndash;noopt off the top of my head, and I am on vacation right now, but I&amp;rsquo;ll see what I can dig up when I get back if it has not been addressed before then.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d; font-size: 11pt;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d; font-size: 11pt;"&gt;-bobd-&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;/div&gt;</description><author>sjmercer</author><pubDate>Fri, 24 Jun 2011 16:47:33 GMT</pubDate><guid isPermaLink="false">New Post: NUCmer parameters "--maxmatch" and "--nooptimize" 20110624044733P</guid></item><item><title>New Post: NUCmer parameters "--maxmatch" and "--nooptimize"</title><link>http://mbf.codeplex.com/discussions/262550</link><description>&lt;div style="line-height: normal;"&gt;
&lt;p&gt;Hi,&lt;/p&gt;
&lt;p&gt;The NUCmer aligner in MBF uses different names for some of the MUMmer/NUCmer parameters.&amp;nbsp; Why&amp;nbsp;didn't you use the same parameter names in MBF as in NUCmer/MUMmer?&lt;/p&gt;
&lt;p&gt;In addition, some of these parameters don't seem to be exposed as public properties at all, or at least not yet.&lt;/p&gt;
&lt;p&gt;For example, at the link below there's an excellent set of instructions from the MUMmer developers for using NUCmer to align short DNA sequence reads to a genome:&lt;/p&gt;
&lt;p&gt;How to use MUMmer to align short reads to the human genome: &lt;a href="http://www.cbcb.umd.edu/research/mummer_reads.shtml"&gt;
http://www.cbcb.umd.edu/research/mummer_reads.shtml&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Unfortunately, you can't set the following two parameters in MBF's NUCmer, which as the instructions in the link mentioned above indicate, are important for aligning short DNA sequence reads to a genome:&lt;/p&gt;
&lt;p&gt;--maxmatch&lt;br&gt;
&amp;nbsp;Necessary; otherwise legitimate hits may be missed&lt;br&gt;
--nooptimize&lt;br&gt;
&amp;nbsp;This encourages nucmer to extend the alignment all the way to the end of the read; without this, nucmer may fail to include ends where a substitution occurs close to an end&lt;/p&gt;
&lt;p&gt;Looking at the MBF source code, it seems straightforward to hard code the &amp;quot;--maxmatch&amp;quot; parameter, i.e., in your own custom version of &amp;quot;Bio.dll&amp;quot;.&amp;nbsp; In the &amp;quot;NUCmer.cs&amp;quot; source code file you just need to change:&lt;/p&gt;
&lt;p&gt;isUniqueInReference = true&lt;/p&gt;
&lt;p&gt;to&lt;/p&gt;
&lt;p&gt;isUniqueInReference = false&lt;/p&gt;
&lt;p&gt;at two points in the source code (see below)&lt;/p&gt;
&lt;p&gt;***********************************************************************&lt;/p&gt;
&lt;p&gt;NUCmer.cs (excerpts from the original, in build 78593)&lt;/p&gt;
&lt;p&gt;ln 211-215:&lt;/p&gt;
&lt;p&gt;/// &amp;lt;param name=&amp;quot;isUniqueInReference&amp;quot;&amp;gt;flag to indicate that the matches should be unique in reference.&amp;lt;/param&amp;gt;&lt;br&gt;
/// &amp;lt;returns&amp;gt;Returns clusters.&amp;lt;/returns&amp;gt;&lt;br&gt;
public IList&amp;lt;Cluster&amp;gt; GetClusters(&lt;br&gt;
&amp;nbsp;ISequence querySequence,&lt;br&gt;
&amp;nbsp;bool isUniqueInReference = true)&lt;/p&gt;
&lt;p&gt;ln 277-281:&lt;/p&gt;
&lt;p&gt;/// &amp;lt;param name=&amp;quot;isUniqueInReference&amp;quot;&amp;gt;Whether MUMs are unique in query or not.&amp;lt;/param&amp;gt;&lt;br&gt;
/// &amp;lt;returns&amp;gt;List of enumerable of delta alignments.&amp;lt;/returns&amp;gt;&lt;br&gt;
public IEnumerable&amp;lt;DeltaAlignment&amp;gt; GetDeltaAlignments(&lt;br&gt;
&amp;nbsp;ISequence querySequence,&lt;br&gt;
&amp;nbsp;bool isUniqueInReference = true)&lt;/p&gt;
&lt;p&gt;***********************************************************************&lt;/p&gt;
&lt;p&gt;However, I'm not exactly sure how to set the &amp;quot;--nooptimize&amp;quot; flag in the source code (I don't mind compiling my own derivative of Bio.dll with the &amp;quot;--maxmatch&amp;quot; and &amp;quot;--nooptimize&amp;quot; flags set).&lt;/p&gt;
&lt;p&gt;At the following NUCmer.cs lines:&lt;/p&gt;
&lt;p&gt;715&lt;br&gt;
781&lt;br&gt;
790&lt;br&gt;
803&lt;br&gt;
898&lt;br&gt;
915&lt;/p&gt;
&lt;p&gt;There is a statement (where the tooltip message is shown as a trailing comment)&lt;/p&gt;
&lt;p&gt;methodName |= ModifiedSmithWaterman.OptimalFlag; // Maximise the alignment score&lt;/p&gt;
&lt;p&gt;My guess is that to set the &amp;quot;--nooptimize&amp;quot; flag, at one or more of these lines the value used should be as follows:&lt;/p&gt;
&lt;p&gt;methodName |= ModifiedSmithWaterman.SeqendFlag;&amp;nbsp; // Align till end of shortest sequence&lt;/p&gt;
&lt;p&gt;But I'm not sure which lines I should change.&lt;/p&gt;
&lt;p&gt;So, if a developer needs to set the &amp;quot;--nooptimize&amp;quot; flag in the source code and compile their own derivative of Bio.dll with the &amp;quot;--maxmatch&amp;quot; and &amp;quot;--nooptimize&amp;quot; flags set for NUCmer, could you please describe&amp;nbsp;exactly how that should be accomplished?&lt;/p&gt;
&lt;p&gt;Thanks in advance for your help!&lt;/p&gt;
&lt;p&gt;Robert&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&lt;br&gt;
&amp;nbsp;&lt;/p&gt;
&lt;/div&gt;</description><author>rboissy</author><pubDate>Thu, 23 Jun 2011 18:58:14 GMT</pubDate><guid isPermaLink="false">New Post: NUCmer parameters "--maxmatch" and "--nooptimize" 20110623065814P</guid></item><item><title>New Post: Setting Similarity Matrix in MBF</title><link>http://mbf.codeplex.com/discussions/258705</link><description>&lt;div style="line-height: normal;"&gt;&lt;pre&gt;&lt;span style="color: #1f497d;"&gt;Hi &lt;a class="UserProfileLink" href="http://www.codeplex.com/site/users/view/kponcodeplex2011"&gt;kponcodeplex2011&lt;/a&gt;&lt;/span&gt;&lt;/pre&gt;
&lt;pre&gt;&lt;span style="color: #1f497d;"&gt;One of our Developers gave me the below infromation: &lt;/span&gt;&lt;/pre&gt;
&lt;pre&gt;&lt;span style="color: #1f497d;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/pre&gt;
&lt;pre&gt;&lt;span style="color: #1f497d;"&gt;We have 11 well defined Similarity Matrices in Bio.dll. These matrices are constructed from the resource files (.txt). You can find the .txt files under &amp;hellip;.Bio\Source\Framework\Bio\SimilarityMatrices\Resources\SimilarityMatrices. We have generic class &lt;/span&gt;&lt;span style="color: black;"&gt;SimilarityMatrix &lt;/span&gt;&lt;span style="color: #1f497d;"&gt;which does the construction based our &amp;nbsp;&lt;/span&gt;&lt;span style="color: #2b91af;"&gt;StandardSimilarityMatrix &lt;/span&gt;&lt;span style="color: #1f497d;"&gt;enum.&lt;/span&gt;&lt;/pre&gt;
&lt;pre&gt;&lt;span style="color: #1f497d;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/pre&gt;
&lt;pre&gt;&lt;span style="color: #1f497d;"&gt;Example:&lt;/span&gt;&lt;/pre&gt;
&lt;pre&gt;&lt;span style="color: black;"&gt;sm&amp;nbsp;=&amp;nbsp;&lt;/span&gt;&lt;span style="color: blue;"&gt;new&lt;/span&gt;&lt;span style="color: black;"&gt;&amp;nbsp;&lt;/span&gt;&lt;span style="color: #2b91af;"&gt;SimilarityMatrix&lt;/span&gt;&lt;span style="color: black;"&gt;(&lt;/span&gt;&lt;span style="color: #2b91af;"&gt;SimilarityMatrix&lt;/span&gt;&lt;span style="color: black;"&gt;.&lt;/span&gt;&lt;span style="color: #2b91af;"&gt;StandardSimilarityMatrix&lt;/span&gt;&lt;span style="color: black;"&gt;.Blosum50);&lt;/span&gt;&lt;/pre&gt;
&lt;pre&gt;&lt;span style="color: #1f497d;"&gt;This will get you Blosum50 matrix.&lt;/span&gt;&lt;/pre&gt;
&lt;pre&gt;&lt;span style="color: #1f497d;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/pre&gt;
&lt;pre&gt;&lt;span style="color: #1f497d;"&gt;We also support custom matrix by passing your file. Currently, we support ('\t',&amp;nbsp;'&amp;nbsp;',&amp;nbsp;',') delimited matrix files.&lt;/span&gt;&lt;/pre&gt;
&lt;pre&gt;&lt;span style="color: #2b91af;"&gt;SimilarityMatrix&lt;/span&gt;&lt;span style="color: black;"&gt;&amp;nbsp;sm&amp;nbsp;=&amp;nbsp;&lt;/span&gt;&lt;span style="color: blue;"&gt;new&lt;/span&gt;&lt;span style="color: black;"&gt;&amp;nbsp;&lt;/span&gt;&lt;span style="color: #2b91af;"&gt;SimilarityMatrix&lt;/span&gt;&lt;span style="color: black;"&gt;(blosumFilePath);&lt;/span&gt;&lt;/pre&gt;
&lt;pre&gt;&lt;span style="color: black;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/pre&gt;
&lt;p&gt;&lt;span style="color: #1f497d; font-size: 10pt;"&gt;Please let me know if you need any more clarification.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d; font-size: 10pt;"&gt;Fadi Fakhouri on behalf of the MBF Team&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d; font-size: 10pt;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;/div&gt;</description><author>FadiF</author><pubDate>Tue, 24 May 2011 21:55:32 GMT</pubDate><guid isPermaLink="false">New Post: Setting Similarity Matrix in MBF 20110524095532P</guid></item><item><title>New Post: Setting Similarity Matrix in MBF</title><link>http://mbf.codeplex.com/discussions/258705</link><description>&lt;div style="line-height: normal;"&gt;
&lt;p&gt;Greetings,&lt;/p&gt;
&lt;p&gt;I am using Bio library (MBF 2.0) for sequence alignment, and I have a general Question.&amp;nbsp; While we can select various options for an aligner, such as Gap Open Cost, etc, including similarity matrix, I cannot tell exactly where the matrix data comes from.&amp;nbsp;
 For example, if I choose Blosum50 as my similarity matrix, where is this matrix actually constructed&amp;nbsp;that the algorithm will look for by default.&amp;nbsp; Is each and every algorithm creates a matrix based on the options specified on the fly?&lt;/p&gt;
&lt;p&gt;If you can show me exactly where in Bio.dll this is constructed or selected, I would greatly appreciate it.&amp;nbsp; I know you set similarity matrix that an aligner will use, but where is this matrix?&lt;/p&gt;
&lt;p&gt;Thank you, KPOnCodeplex2011&lt;/p&gt;
&lt;/div&gt;</description><author>kponcodeplex2011</author><pubDate>Mon, 23 May 2011 23:32:07 GMT</pubDate><guid isPermaLink="false">New Post: Setting Similarity Matrix in MBF 20110523113207P</guid></item><item><title>New Post: Where and how do you set match and mismatch score before doing alignment</title><link>http://mbf.codeplex.com/discussions/256408</link><description>&lt;div style="line-height: normal;"&gt;&lt;p&gt;Hi Chris, and thanks for your post.&lt;/p&gt;
&lt;p&gt;One of our developers have suggested the following, hope it helps:&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d; font-size: 11pt;"&gt;Match and Mismatch scores are properties of &amp;nbsp;similarity matrices and not aligners. It is used only with DiagonalSimilarityMatrix. Other similarity matrices have their own standard and well defined scores(which are called as BLOSUM50 etc&amp;hellip;)&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d; font-size: 11pt;"&gt;&amp;nbsp;&lt;/span&gt;&lt;span style="color: #1f497d; font-size: 11pt;"&gt;So if one has to use custom scores, he can always go ahead and use a DSM which has a constructor which takes Match and Mismatch scores and assign this SM to any aligner.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: #1f497d; font-size: 11pt;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="color: blue; font-size: 9.5pt;"&gt;public&lt;/span&gt;&lt;span style="font-size: 9.5pt;"&gt; DiagonalSimilarityMatrix(&lt;span style="color: blue;"&gt;int&lt;/span&gt; matchValue, &lt;span style="color: blue;"&gt;int&lt;/span&gt; mismatchValue)&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: 9.5pt;"&gt;Please let us know if this resolves your question. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: 9.5pt;"&gt;Thanks&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: 9.5pt;"&gt;The MBF Team&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;</description><author>FadiF</author><pubDate>Thu, 05 May 2011 15:17:09 GMT</pubDate><guid isPermaLink="false">New Post: Where and how do you set match and mismatch score before doing alignment 20110505031709P</guid></item><item><title>New Post: Where and how do you set match and mismatch score before doing alignment</title><link>http://mbf.codeplex.com/discussions/256408</link><description>&lt;div style="line-height: normal;"&gt;
&lt;p&gt;I know we can set GapOpenCost or GapExtensionCost etc. to aligner, for example:&lt;/p&gt;
&lt;p&gt;Bio.Algorithms.MUMmer.MUMmerAligner myMummer = new Bio.Algorithms.MUMmer.MUMmerAligner ();&lt;br&gt;
SimilarityMatrix mySimMatrix =&amp;nbsp; new SimilarityMatrix(SimilarityMatrix.StandardSimilarityMatrix.Blosum50);&lt;br&gt;
myMummer .GapOpenCost = -1;&lt;br&gt;
myMummer.SimilarityMatrix = mySimMatrix;&lt;/p&gt;
&lt;p&gt;//now perform the alignment&lt;/p&gt;
&lt;p&gt;Where and how do you actually set MatchScore and MisMatchScore before doing alignment?&amp;nbsp; A code snippet would be quite useful.&amp;nbsp; Thank you&lt;br&gt;
Chris&lt;/p&gt;
&lt;/div&gt;</description><author>kponcodeplex2011</author><pubDate>Wed, 04 May 2011 23:41:24 GMT</pubDate><guid isPermaLink="false">New Post: Where and how do you set match and mismatch score before doing alignment 20110504114124P</guid></item><item><title>New Post: MBF installer - wrong .dll reference</title><link>http://mbf.codeplex.com/discussions/256072</link><description>&lt;div style="line-height: normal;"&gt;
&lt;p&gt;The V2 beta 1 MBF installer will install .CSproj files to your machine, however the references are set to the wrong .dll's. If you open the project from the beta you will be missing references.&lt;/p&gt;
&lt;p&gt;We are tracking this under TFS item 28559 - you can manually correct the binaries or remove the .csproj file as noted in the work item.&lt;/p&gt;
&lt;p&gt;I am posting because I thought this visibility would help prevent any initial frustrations in getting this working.&lt;/p&gt;
&lt;p&gt;Rick Benge&lt;/p&gt;
&lt;p&gt;Comunity Program Manager MBF&lt;/p&gt;
&lt;/div&gt;</description><author>Rickbe</author><pubDate>Mon, 02 May 2011 23:56:36 GMT</pubDate><guid isPermaLink="false">New Post: MBF installer - wrong .dll reference 20110502115636P</guid></item><item><title>New Post: Building MBF v2.0 Preview</title><link>http://mbf.codeplex.com/discussions/243359</link><description>&lt;div style="line-height: normal;"&gt;&lt;p&gt;You can unblock the zip file before extracting and then all extracted files become unblocked as well. Also extracting with WinRar should produce unblocked files since WinRar doesn't care about this attribute on the archive.&lt;/p&gt;&lt;/div&gt;</description><author>bitdisaster</author><pubDate>Sat, 30 Apr 2011 21:30:00 GMT</pubDate><guid isPermaLink="false">New Post: Building MBF v2.0 Preview 20110430093000P</guid></item></channel></rss>