Larsblog - technology

Bitcoin: how it works

Imagine an alternative digital currency based on cryptographic principles, designed and launched by persons unknown, running as an uncontrolled and uncontrollable peer-to-peer network. The network has no central authority, and no single point of failure. It is, simply, made up of a self-regulated network of computers connected to each other, speaking the Bitcoin protocol between one another, and nothing more. ...

Read | 2013-06-12 17:03 | 0 comment(s)

RDF triple stores — an overview

There's a huge range of triple stores out there, and it's not trivial to find the one most suited for your exact needs. I reviewed all those I could find earlier this year for a project, and here is the result. I've evaluated the stores against the requirements that mattered for that particular project. I haven't summarized the scores, as everyone's weights for these requirements will be different. ...

Read | 2012-09-17 19:56 | 41 comment(s)

Experiments in genetic programming

I made an engine called Duke that can automatically match records to see if they represent the same thing. For more background, see a previous post about it. The biggest problem people seem to have with using it is coming up with a sensible configuration. I stumbled across a paper that described using so-called genetic programming to configure a record linkage engine, and decided to basically steal the idea. ...

Read | 2012-03-18 10:06 | 4 comment(s)

Bayesian identity resolution

Stian Danenbarger has been telling me for a while about entity resolution (as he and many others call it), or identity resolution (as Wikipedia calls it). Basically, it's the process of working out which records/entities/objects actually represent the same real-world things by comparing their properties. Once Stian confirmed that Bayesian inferencing was a common method for this, I suddenly saw how you can actually do a poor man's version of this with just a little basic scripting. ...

Read | 2011-02-11 13:23 | 18 comment(s)

What's up?

While RSS and Atom are a great way to stay up to date on what is published around the web, I think the feed-centric approach taken by most feed readers is suboptimal. For some feeds I want to read everything that is posted, but for others I want to read only those few posts which are about subjects I care about, or by authors I like particularly. Another problem is that some feeds (for example those of newspapers) have hundreds of posts every day. Staying on top of that is just too much manual effort. ...

Read | 2011-02-03 19:50 | 11 comment(s)

The applications of SDshare

Graham Moore a few years ago came up with the idea of publishing changes to topic maps using Atom, and a CEN project has now developed and published a specification for it called SDshare. Work is also underway to make SDshare a full ISO standard. ...

Read | 2010-11-21 14:29 | 0 comment(s)

My report on OOXML and ODF

Disclaimer: Work on this in the Norwegian government has been going on for years. I worked on this for four months, producing a 45-page report. This blog posting oversimplifies most of the way through in the interests of brevity. ...

Read | 2010-05-09 20:47 | 13 comment(s)

A path language for Topic Maps

I sketched a little path-based query language for Topic Maps this summer, mostly to explore what such a language might look like. My TMQL co-editor, Rani Pinchuk, asked me to write up a more detailed description of it, and that's what this blog posting is. ...

Read | 2009-09-23 11:01 | 13 comment(s)

Datatype validation with TMCL

It's long been generally assumed that TMCL (the Topic Maps Constraint Language) should be able to validate datatyped values, but very little thought has so far been devoted to exactly how. It may look like a trivial issue, but in fact datatypes is an enormous tangle of complex problems. To pick one example at random, consider the ordering of time durations in XML Schema. This posting is an attempt to consider what TMCL should and, equally important, should not do. ...

Read | 2009-07-20 14:37 | 3 comment(s)

A Topic Maps file system

The idea of a Topic Maps file system is not new. Robert Barta presented one such at TMRA 2008, and Inge Henriksen is also working on one. However, I had my own take on this that I wanted to realize for several years. The starting point was the Mac screensaver which shows all photos from a given directory as a kind of slide show. I've set it to the root folder I store my photos in, but then it shows all photos, which is not always that pleasant when you're on a projector in a meeting, for example. ...

Read | 2009-06-03 16:25 | 5 comment(s)

<< Previous

Last comments
RSS

Michael@programming on 7 tips on writing cl...

Nasreddine on RDF triple stores &m...

Vilhelm Erichsen on 10 tips on presentin...

Lars Marius on RDF triple stores &m...

Fahad M Ali on What is an informati...

Jerven Bolleman on RDF triple stores &m...

Sergei Sheinin on RDF triple stores &m...

Lars Marius on 7 tips on writing cl...

Grant on 7 tips on writing cl...

mahrukh on 10 tips on presentin...