Posts Tagged ‘Research life’

Last two weeks in the blogs…

Monday, June 8th, 2009

Ok, with the RSI I didn’t collect a list for last week, so here’s a list spanning the last two weeks instead…

Fun

Genetics

Geology

Programming

Public health

Research life

159-162=-3

Lots of links about commenting…

Thursday, May 28th, 2009

A Blog Around the Clock has a list of links to blog posts about commenting on (scientific) papers.

There have been quite a few posts over the last few days about commenting, in particular about posting comments, notes and ratings on scientific papers. But this also related to commenting on blogs and social networks, commenting on newspaper online articles, the question of moderation vs. non-moderation, and the question of anonymity vs. pseudonymity vs. RL identity.

Read the post to get all the links.

I must admit that I have never left a comment on an online paper.  If I blog about a paper, I leave a traceback, but that is as far as it goes.

Since putting a review of a paper on my blog, just to add a comment, is a lot of work, I guess I should just get used to leaving comments instead.

Still, I am reluctant to comment on papers.  I don’t mind firing off a half thought through comment off at a blog, but I feel that for a scientific paper I should make sure I understand all the details of the paper before I start commenting on it.  I guess I just have to overcome that feeling.

147-158=-11

Updating BiRC’s web pages

Wednesday, May 20th, 2009

At AU they are trying to move all the various departments’ web pages to a common TYPO3 CMS with a uniform look’n'feel.  See examples below:

We resisted a bit at BiRC, mainly because of the price the IT department at the Dept. of Computer Science wanted for hosting it.  We were quite happy with our own Skeletonz CMS and were hosting it for free.

Anyway, the Faculty of Science moved the web group from computer science to work directly under the faculty, and with that move made hosting free for all groups under the faculty, so we decided to try it out.

Yesterday I moved some of our existing pages to the new CMS.  I find TYPO3 a lot harder to work with than Skeletonz, but I’m sure I can get the hang of it eventually.  One thing I really like about it is that it is easy to duplicate web elements between pages so you only need to edit any particular element once and it will be updated on all the pages refering to it.  I use that extensively on the page describing our various projects.

Another selling point is that the CMS will soon be integrated with PU:RE, our publication management system.  We have to report all our publications – and various other activities – to that system, so being able to extract information from that database will reduce the work in maintaining the web pages significantly.  The plugin for that integration should be released shortly, and after that, I think we are ready for the move.

139-149=-10

Now how exactly was it I did that?

Thursday, May 14th, 2009

RRResearch has some thoughts about keeping records of computer work:

When I do benchwork I consistently keep pretty good notes.  I write down everything I do as I do it, on numbered and dated sheets of paper that go into looseleaf binders, organized by experiment.

But I don’t seem to be able to apply these good record-keeping habits when I’m working with computers.  Instead everything I do feels ‘exploratory’, as if everything I do is just a preliminary check to see what effect a modification will have, before I do something worth writing down.

I recognise this all too well.

It is not so much a problem when I do some exploratory data analysis.  I will have my R log to see what I actually did, and if I find an interesting pattern I know what I found and I don’t really need the history of how I got there so much.

When writing programs I don’t have the problem either.  There I have source control and bug trackers to help me.

My problem is with scripts.

I write a small script to format my data into something I can analyse.  Run a program or two on the data. Write another script to re-format the data.  A small script to pull out relevant data.  Look at that.  Then I need to just check a few things, and that is easy as another little script.

Very soon I have ten to twenty small scripts of five to ten lines each. None of them are really worth putting in version control or cleaning up or anything, ’cause it was all just exploratory anyway, but if I come back to the data a few weeks later, I have no way of reproducing what I did.

It is really horrible.

Ideally, once I know what I want to do with the data, I should clean up the pipeline, put it under version control and document it, but by then I am already done with the data analysis so I rarely bother.

Until I have to do it all again a few weeks or month later on some new data.

At that point I should really clean up the pipeline, but most likely I need to do something slightly different.  Not drastically different, but a few of the steps should be modified anyway, and depending on the results I need a few more scripts and it just spirals out of control.

I don’t really know how to solve this, I only know that what I am doing is quite sub-optimal.

134-146=-12

Last week in the blogs

Monday, May 11th, 2009

Another week, another list…

Computing/programming

Flu

Genetics

Genomics

Paleontology

Research life

Statistics

131-144=-13