PIE fragments report progress

by Brian | 21st October 2009

Fragments Report Null modifications Carboxymethyl (C) Acetyl (N-term) Carbamidomethyl (C) Oxidation (M) Phospho (ST) Or in bar form…. Null modifications Carboxymethyl (C) Acetyl (N-term) Carbamidomethyl (C) Oxidation (M) Phospho (ST) Read More

GFS Performance experiments

by Brian | 12th October 2009

The trick with problem sets so sparsely filled with noteworthy information is skipping as many processing steps as possible without marring the final results by skipping something important. I have made some modifications to GFS and am testing those now: If MSMSFit goes through all of the y-ions and finds less than 12% match then [...] Read More

Tandem Mass data hash

by Brian | 9th October 2009

“Tandem mass data hash,” try to say that five times fast. To get a better feel for how MS/MS data will be represented as a hash I wrote a quick visualizer. How the MSMS data is hashed: an NxM bin matrix is computed where N is the number of bins to store mass and M [...] Read More

XCode not recognizing break points

by Brian | 9th October 2009

This was confusing the hell out of me. I’d have an fprintf call that would say something like “loading spectra…” and a breakpoint before that and in the console it would still print “loading spectra…” and continue on its merry way like the breakpoint didn’t even exist. So what’s the real problem? I don’t know [...] Read More

How GFS handles MS/MS files

by Brian | 8th October 2009

I’m curious if a job on one DTA/PKL file with N spectra will run faster than a job that processes the exact same data broken into N different files. I just wrote a quick java app to take OutputFile2.dta which has 112 spectra and split it into 112 different dta files. At the moment I [...] Read More

A guy walks into a barbershop…

by Brian | 30th September 2009

…and sits down in the barber’s chair awaiting his hair cut. “That’ll be eight dollars, please,” says the barber. “What about my hair cut?” “I gave you one. It’s a null hair cut.” haha. Read More

cleanSpectra with local maxima

by Brian | 24th September 2009

Some example intensity data of MS/MS data. My thought is that perhaps selecting local maxima may be a good way to cull the peaks in our PKL and DTA files. The basic algorithm: Select a peak if its intensity is greater than both the previous peak and the next peak. Peaks that would be selected [...] Read More

Morgan’s summary of proteogenomic mapping

by Brian | 18th September 2009

Read More

Stuarts first graphic

by Brian | 15th September 2009

Of course he doesn’t read this blog, so he won’t know until this afternoon that I’ve already created the visualizer for his first set of data: (Larger View) Read More

Storing MySQL database in memory

by Brian | 10th September 2009

This assuages my performance fears of porting to a MySQL database. The MEMORY (HEAP) Storage Engine Read More

Fixed readDTA

by Brian | 8th September 2009

in Spectrum.m. It’s a minor retooling of readPKL. Working on combining the two…. Read More

Objective-C MySQL API

by Brian | 4th September 2009

The one everyone thinks is useful, the one on SourceForge, is out of date. Google Code has a more recent fork of the project HERE. Usage: Google on MCPConnection. In general I’m surprised at the low amount of MySQL support in OS X. Read More

Parallelizing GFS links

by Brian | 2nd September 2009

Grand Central Dispatch: Wikipedia Apple PDF Read More

proteomic database fields

by Brian | 1st September 2009

For a basic database which holds peptides from in silico digestion of a genome, I think these fields will be helpful: peptide – the chain sequence of amino acids location – This will probably be many fields which will track location information mass – (theoretical) size – the number of amino acids cleavage – number [...] Read More

Redundant mass list loading?

by Brian | 27th August 2009

It looks like finding mass list files goes on in runListsWithParams in ClientManager loadTandemData in FHMSMSCategory. tandemFilesContainedInDirectory in ClientManager At first glance it looks like the code in runListsWithParams in ClientManager is useless… this is also where a bug is popping up. Read More

More on cleanSpectra and cleaning MSMS data

by Brian | 26th August 2009

Jainab and I have discussed this before, but I want to get it down: Taking your peak list, sorting by intensity, then selecting the top X of them can be detrimental for this reason: Some locations where fragments are very likely to occur and with high intensities can act as intensity hogs. They will be [...] Read More

Git first CVS import attempt

by Brian | 18th August 2009

For my reference more than for yours: frenchbroad:gfs.git risk2$ git cvsimport -v -d :local:/Volumes/LabShare/cvsroot GFS-Vec-V2 Initialized empty Git repository in /Users/risk2/Documents/gfs.git/.git/ Running cvsps... cvs_direct initialized to CVSROOT /Volumes/LabShare/cvsroot cvs [rlog aborted]: -t/-f wrappers not supported by this version of CVS A legacy version of cvs with -t/-f wrapper support is available as: /usr/bin/ocvs. Read More

download git-cvsimport tool

by Brian | 17th August 2009

You probably don’t need to. If you have the Git tool installed it is probably one of the tools that comes with it. To check try this on the command line: git help -a | grep cvsimport I got that from this site on how to import CVS into Git. The official Git page on [...] Read More

readDTA

by Brian | 6th August 2009

need to work on getting this method (if that is what it is called) to work. Right now we are having to convert from DTA to PKL. NOTE: reference readPKL — it works with all newline characters. Read More

Processing a directory of PKL files

by Brian | 5th August 2009

So MSMSFit may be a good replacement for seqtagscore, but it doesn’t solve the redundant overhead processing that comes with comparing a directory of fasta files to a directory of pkl files. The way we are doing it now is like an individual job for each comparison; that means that if there are 100 pkl [...] Read More

Page 2 of 512345