
A

AbstractMatchListener - Class in no.priv.garshol.duke.matchers
Convenience implementation with dummy methods, since most implementations will only implement matches().
AbstractMatchListener() - Constructor for class no.priv.garshol.duke.matchers.AbstractMatchListener
 
AbstractRuleBasedCleaner - Class in no.priv.garshol.duke.cleaners
Helper class for building regular-expression based cleaners.
AbstractRuleBasedCleaner() - Constructor for class no.priv.garshol.duke.cleaners.AbstractRuleBasedCleaner
Initializes an empty cleaner.
add(long) - Method in class no.priv.garshol.duke.Bucket
 
add(String, String) - Method in class no.priv.garshol.duke.cleaners.AbstractRuleBasedCleaner
Adds a rule replacing all substrings matching the regular expression with the replacement string.
add(String, String, int) - Method in class no.priv.garshol.duke.cleaners.AbstractRuleBasedCleaner
Adds a rule replacing all substrings matching the specified group within the regular expression with the replacement string.
add(Record) - Method in class no.priv.garshol.duke.datasources.InMemoryDataSource
Adds a record to the collection held by the source.
addBooleanOption(String, char) - Method in class no.priv.garshol.duke.utils.CommandLineParser
 
addColumn(Column) - Method in class no.priv.garshol.duke.datasources.ColumnarDataSource
 
addDataSource(int, DataSource) - Method in class no.priv.garshol.duke.ConfigurationImpl
Adds a data source to the configuration.
addLink(String, String) - Method in interface no.priv.garshol.duke.EquivalenceClassDatabase
Add a new link between two records.
addLink(String, String) - Method in class no.priv.garshol.duke.InMemoryClassDatabase
 
addLink(String, String) - Method in class no.priv.garshol.duke.JDBCEquivalenceClassDatabase
 
addMatchListener(MatchListener) - Method in class no.priv.garshol.duke.Processor
Adds a listener to be notified of processing events.
addRow(String[]) - Method in class no.priv.garshol.duke.utils.SparqlResult
 
addStringOption(String, char) - Method in class no.priv.garshol.duke.utils.CommandLineParser
 
addUnescaped(String) - Method in class no.priv.garshol.duke.utils.XMLPrettyPrinter
INTERNAL: Add given text unmodified and unescaped to the output.
addValue(String, String) - Method in class no.priv.garshol.duke.datasources.RecordBuilder
 
addValue(Column, String) - Method in class no.priv.garshol.duke.datasources.RecordBuilder
 
addValue(String, String) - Method in class no.priv.garshol.duke.RecordImpl
 
addVariable(String) - Method in class no.priv.garshol.duke.utils.SparqlResult
 
Aspect - Class in no.priv.garshol.duke.genetic
Represents an aspect of a Configuration that might be changed by the genetic algorithm.
Aspect() - Constructor for class no.priv.garshol.duke.genetic.Aspect
 
assertLink(Link) - Method in class no.priv.garshol.duke.InMemoryLinkDatabase
 
assertLink(Link) - Method in interface no.priv.garshol.duke.LinkDatabase
Assert a link.
assertLink(Link) - Method in class no.priv.garshol.duke.RDBMSLinkDatabase
 
atStart() - Method in class no.priv.garshol.duke.comparators.Matcher
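The two add(...) entries for AbstractRuleBasedCleaner above describe regular-expression rules that replace either the whole match or one group within it. The sketch below illustrates that idea only. The class name RuleCleanerSketch, the Rule holder, treating the replacement as literal text, and applying rules in insertion order are all assumptions made for illustration, not Duke's implementation.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical stand-in for illustration; this is not Duke's AbstractRuleBasedCleaner.
class RuleCleanerSketch {
  // One rule: a compiled pattern, a literal replacement, and the group to replace.
  private static final class Rule {
    final Pattern pattern;
    final String replacement;
    final int group;

    Rule(String regex, String replacement, int group) {
      this.pattern = Pattern.compile(regex);
      this.replacement = replacement;
      this.group = group;
    }
  }

  private final List<Rule> rules = new ArrayList<>();

  // Replaces every whole match (group 0) with the replacement.
  void add(String regex, String replacement) {
    add(regex, replacement, 0);
  }

  // Replaces only the text captured by the given group in every match.
  void add(String regex, String replacement, int group) {
    rules.add(new Rule(regex, replacement, group));
  }

  // Applies the rules in the order they were added (an assumption).
  String clean(String value) {
    for (Rule rule : rules) {
      value = apply(rule, value);
    }
    return value;
  }

  private static String apply(Rule rule, String value) {
    Matcher m = rule.pattern.matcher(value);
    StringBuilder out = new StringBuilder();
    int last = 0;
    while (m.find()) {
      int start = m.start(rule.group);
      if (start < 0) continue; // the group did not take part in this match
      out.append(value, last, start).append(rule.replacement);
      last = m.end(rule.group);
    }
    return out.append(value, last, value.length()).toString();
  }

  public static void main(String[] args) {
    RuleCleanerSketch cleaner = new RuleCleanerSketch();
    cleaner.add("\\s+", " ");                  // collapse runs of whitespace
    cleaner.add("(\\d+)(st|nd|rd|th)", "", 2); // drop ordinal suffixes only
    System.out.println(cleaner.clean("5th    avenue")); // prints "5 avenue"
  }
}
```

The group-based variant is useful when the surrounding context is needed to recognise a match but must itself be kept, as with the digits in front of the ordinal suffix here.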
 

B

BasicTimer - Class in no.priv.garshol.duke.server
A basic timer implementation that will work in any context, but which does not (unfortunately) provide managed threads in more advanced servlet containers.
BasicTimer() - Constructor for class no.priv.garshol.duke.server.BasicTimer
 
basicTokens(String, int) - Static method in class no.priv.garshol.duke.comparators.QGramComparator
Produces basic q-grams, so that 'gail' -> 'ga', 'ai', 'il'.
batchDone() - Method in class no.priv.garshol.duke.matchers.AbstractMatchListener
 
batchDone() - Method in class no.priv.garshol.duke.matchers.ClassDatabaseMatchListener
 
batchDone() - Method in class no.priv.garshol.duke.matchers.LinkDatabaseMatchListener
 
batchDone() - Method in interface no.priv.garshol.duke.matchers.MatchListener
Notification that Duke has finished processing a batch of records.
batchDone() - Method in class no.priv.garshol.duke.Processor.Profiler
 
batchDone() - Method in class no.priv.garshol.duke.server.DukeController
 
batchProcessed() - Method in class no.priv.garshol.duke.RecordIterator
Informs the iterator that the latest batch of records retrieved from the iterator has been processed.
batchReady(int) - Method in class no.priv.garshol.duke.matchers.AbstractMatchListener
 
batchReady(int) - Method in class no.priv.garshol.duke.matchers.LinkDatabaseMatchListener
 
batchReady(int) - Method in interface no.priv.garshol.duke.matchers.MatchListener
Notification that Duke is about to process a new batch of records.
batchReady(int) - Method in class no.priv.garshol.duke.matchers.PrintMatchListener
 
batchReady(int) - Method in class no.priv.garshol.duke.Processor.Profiler
 
Bucket - Class in no.priv.garshol.duke
 
Bucket() - Constructor for class no.priv.garshol.duke.Bucket
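The basicTokens(String, int) entry above gives the example 'gail' -> 'ga', 'ai', 'il'. A minimal sketch of that sliding-window tokenization follows. The class name BasicQGramSketch, the List return type, and returning an empty list for strings shorter than q are assumptions, not taken from QGramComparator.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of basic q-gram tokenization; not Duke's code.
class BasicQGramSketch {
  // Returns every substring of length q, moving one character at a time.
  static List<String> basicTokens(String s, int q) {
    List<String> tokens = new ArrayList<>();
    for (int i = 0; i + q <= s.length(); i++) {
      tokens.add(s.substring(i, i + q));
    }
    return tokens;
  }

  public static void main(String[] args) {
    System.out.println(basicTokens("gail", 2)); // prints [ga, ai, il]
  }
}
```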
 

C

CapitalCleaner - Class in no.priv.garshol.duke.examples
 
CapitalCleaner() - Constructor for class no.priv.garshol.duke.examples.CapitalCleaner
 
ChainedCleaner - Class in no.priv.garshol.duke.cleaners
Internal cleaner used to implement chaining of multiple cleaners.
ChainedCleaner(Cleaner[]) - Constructor for class no.priv.garshol.duke.cleaners.ChainedCleaner
 
characters(char[], int, int) - Method in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
checkConsistency() - Method in class no.priv.garshol.duke.InMemoryLinkDatabase
 
ClassDatabaseMatchListener - Class in no.priv.garshol.duke.matchers
Writes recorded matches to an EquivalenceClassDatabase.
ClassDatabaseMatchListener(Configuration, EquivalenceClassDatabase) - Constructor for class no.priv.garshol.duke.matchers.ClassDatabaseMatchListener
 
classdb - Variable in class no.priv.garshol.duke.matchers.ClassDatabaseMatchListener
 
clean(String) - Method in interface no.priv.garshol.duke.Cleaner
Returns a cleaned value.
clean(String) - Method in class no.priv.garshol.duke.cleaners.AbstractRuleBasedCleaner
 
clean(String) - Method in class no.priv.garshol.duke.cleaners.ChainedCleaner
 
clean(String) - Method in class no.priv.garshol.duke.cleaners.DigitsOnlyCleaner
 
clean(String) - Method in class no.priv.garshol.duke.cleaners.FamilyCommaGivenCleaner
 
clean(String) - Method in class no.priv.garshol.duke.cleaners.GenericValueCleaner
 
clean(String) - Method in class no.priv.garshol.duke.cleaners.LowerCaseNormalizeCleaner
 
clean(String) - Method in class no.priv.garshol.duke.cleaners.MappingFileCleaner
 
clean(String) - Method in class no.priv.garshol.duke.cleaners.NorwegianAddressCleaner
 
clean(String) - Method in class no.priv.garshol.duke.cleaners.NorwegianCompanyNameCleaner
 
clean(String) - Method in class no.priv.garshol.duke.cleaners.PersonNameCleaner
 
clean(String) - Method in class no.priv.garshol.duke.cleaners.PhoneNumberCleaner
 
clean(String) - Method in class no.priv.garshol.duke.cleaners.RegexpCleaner
 
clean(String) - Method in class no.priv.garshol.duke.cleaners.TrimCleaner
 
clean(String) - Method in class no.priv.garshol.duke.examples.CapitalCleaner
 
clean(String) - Method in class no.priv.garshol.duke.examples.CountryNameCleaner
 
Cleaner - Interface in no.priv.garshol.duke
A function which can turn a value into a normalized value suitable for comparison.
clear() - Method in class no.priv.garshol.duke.datasources.InMemoryDataSource
Removes all records held by the data source.
clear() - Method in class no.priv.garshol.duke.InMemoryLinkDatabase
 
clear() - Method in interface no.priv.garshol.duke.LinkDatabase
Removes all links from the database.
clear() - Method in class no.priv.garshol.duke.RDBMSLinkDatabase
Empties the link database.
close() - Method in interface no.priv.garshol.duke.Database
Stores state to disk and closes all open resources.
close() - Method in class no.priv.garshol.duke.datasources.JDBCDataSource.JDBCIterator
 
close() - Method in class no.priv.garshol.duke.InMemoryDatabase
Stores state to disk and closes all open resources.
close() - Method in class no.priv.garshol.duke.InMemoryKeyValueStore
 
close() - Method in class no.priv.garshol.duke.InMemoryLinkDatabase
 
close() - Method in class no.priv.garshol.duke.KeyValueDatabase
Stores state to disk and closes all open resources.
close() - Method in interface no.priv.garshol.duke.KeyValueStore
Stores state to disk and closes all open resources.
close() - Method in interface no.priv.garshol.duke.LinkDatabase
Shuts down the database, releasing resources.
close() - Method in class no.priv.garshol.duke.LuceneDatabase
Stores state to disk and closes all open resources.
close() - Method in class no.priv.garshol.duke.Processor
Commits all state to disk and frees up resources.
close() - Method in class no.priv.garshol.duke.RDBMSLinkDatabase
 
close() - Method in class no.priv.garshol.duke.RecordIterator
Releases any resources held by this iterator, and cleans up any temporary storage.
close() - Method in class no.priv.garshol.duke.server.DukeController
Shuts down the controller, releasing all resources.
close() - Method in class no.priv.garshol.duke.utils.CSVReader
 
close(Statement) - Static method in class no.priv.garshol.duke.utils.JDBCUtils
Closes the JDBC statement and its associated connection.
Column - Class in no.priv.garshol.duke.datasources
 
Column(String, String, String, Cleaner) - Constructor for class no.priv.garshol.duke.datasources.Column
 
ColumnarDataSource - Class in no.priv.garshol.duke.datasources
Abstract class for sharing code that is common to column-based data sources.
ColumnarDataSource() - Constructor for class no.priv.garshol.duke.datasources.ColumnarDataSource
 
columns - Variable in class no.priv.garshol.duke.datasources.ColumnarDataSource
 
CommandLineParser - Class in no.priv.garshol.duke.utils
 
CommandLineParser() - Constructor for class no.priv.garshol.duke.utils.CommandLineParser
 
CommandLineParser.BooleanOption - Class in no.priv.garshol.duke.utils
 
CommandLineParser.BooleanOption(String, char) - Constructor for class no.priv.garshol.duke.utils.CommandLineParser.BooleanOption
 
CommandLineParser.BooleanOption(String) - Constructor for class no.priv.garshol.duke.utils.CommandLineParser.BooleanOption
 
CommandLineParser.CommandLineParserException - Exception in no.priv.garshol.duke.utils
 
CommandLineParser.CommandLineParserException(String) - Constructor for exception no.priv.garshol.duke.utils.CommandLineParser.CommandLineParserException
 
CommandLineParser.Option - Class in no.priv.garshol.duke.utils
 
CommandLineParser.Option(String) - Constructor for class no.priv.garshol.duke.utils.CommandLineParser.Option
 
CommandLineParser.Option(String, char) - Constructor for class no.priv.garshol.duke.utils.CommandLineParser.Option
 
CommandLineParser.StringOption - Class in no.priv.garshol.duke.utils
 
CommandLineParser.StringOption(String, char) - Constructor for class no.priv.garshol.duke.utils.CommandLineParser.StringOption
 
commit() - Method in interface no.priv.garshol.duke.Database
Flushes all changes to disk.
commit() - Method in interface no.priv.garshol.duke.EquivalenceClassDatabase
Commit changes made to persistent store.
commit() - Method in class no.priv.garshol.duke.InMemoryClassDatabase
 
commit() - Method in class no.priv.garshol.duke.InMemoryDatabase
Flushes all changes to disk.
commit() - Method in class no.priv.garshol.duke.InMemoryKeyValueStore
 
commit() - Method in class no.priv.garshol.duke.InMemoryLinkDatabase
 
commit() - Method in class no.priv.garshol.duke.JDBCEquivalenceClassDatabase
 
commit() - Method in class no.priv.garshol.duke.KeyValueDatabase
Flushes all changes to disk.
commit() - Method in interface no.priv.garshol.duke.KeyValueStore
Flushes all changes to disk.
commit() - Method in interface no.priv.garshol.duke.LinkDatabase
Commit asserted links to persistent store.
commit() - Method in class no.priv.garshol.duke.LuceneDatabase
Flushes all changes to disk.
commit() - Method in class no.priv.garshol.duke.RDBMSLinkDatabase
 
CommonJTimer - Class in no.priv.garshol.duke.server
Timer implementation which uses the JSR-236 API, in order to provide managed threads within servlet containers that support them.
CommonJTimer() - Constructor for class no.priv.garshol.duke.server.CommonJTimer
 
compactDistance(String, String) - Static method in class no.priv.garshol.duke.comparators.Levenshtein
Optimized version of the Wagner & Fischer algorithm that only keeps a single column in the matrix in memory at a time.
compactDistance(String, String, WeightedLevenshtein.WeightEstimator) - Static method in class no.priv.garshol.duke.comparators.WeightedLevenshtein
Optimized version of the Wagner & Fischer algorithm that only keeps a single column in the matrix in memory at a time.
Comparator - Interface in no.priv.garshol.duke
An operator which compares two values for similarity, and returns a number in the range 0.0 to 1.0 indicating the degree of similarity.
ComparatorAspect - Class in no.priv.garshol.duke.genetic
Sets the comparator.
ComparatorAspect(Property) - Constructor for class no.priv.garshol.duke.genetic.ComparatorAspect
 
compare(String, String) - Method in interface no.priv.garshol.duke.Comparator
 
compare(String, String) - Method in class no.priv.garshol.duke.comparators.DiceCoefficientComparator
 
compare(String, String) - Method in class no.priv.garshol.duke.comparators.DifferentComparator
 
compare(String, String) - Method in class no.priv.garshol.duke.comparators.ExactComparator
 
compare(String, String) - Method in class no.priv.garshol.duke.comparators.GeopositionComparator
 
compare(String, String) - Method in class no.priv.garshol.duke.comparators.JaccardIndexComparator
 
compare(String, String) - Method in class no.priv.garshol.duke.comparators.JaroWinkler
 
compare(String, String) - Method in class no.priv.garshol.duke.comparators.JaroWinklerTokenized
 
compare(String, String) - Method in class no.priv.garshol.duke.comparators.Levenshtein
 
compare(String, String) - Method in class no.priv.garshol.duke.comparators.MetaphoneComparator
 
compare(String, String) - Method in class no.priv.garshol.duke.comparators.NorphoneComparator
 
compare(String, String) - Method in class no.priv.garshol.duke.comparators.NumericComparator
 
compare(String, String) - Method in class no.priv.garshol.duke.comparators.PersonNameComparator
 
compare(String, String) - Method in class no.priv.garshol.duke.comparators.QGramComparator
 
compare(String, String) - Method in class no.priv.garshol.duke.comparators.SoundexComparator
 
compare(String, String) - Method in class no.priv.garshol.duke.comparators.WeightedLevenshtein
 
compare(Record, Record) - Method in class no.priv.garshol.duke.Processor
Compares two records and returns the probability that they represent the same real-world entity.
compare(String, String) - Method in interface no.priv.garshol.duke.Property
Returns the probability that the records the values v1 and v2 came from represent the same entity, based on high and low probability settings etc.
compare(String, String) - Method in class no.priv.garshol.duke.PropertyImpl
Returns the probability that the records the values v1 and v2 came from represent the same entity, based on high and low probability settings etc.
compareCandidatesBest(Record, Collection<Record>) - Method in class no.priv.garshol.duke.Processor
Passes on only the best match for each record.
compareCandidatesSimple(Record, Collection<Record>) - Method in class no.priv.garshol.duke.Processor
Passes on all matches found.
compareTo(Bucket) - Method in class no.priv.garshol.duke.Bucket
 
compareTo(GeneticConfiguration) - Method in class no.priv.garshol.duke.genetic.GeneticConfiguration
 
compareTo(KeyValueDatabase.Score) - Method in class no.priv.garshol.duke.KeyValueDatabase.Score
 
computeBayes(double, double) - Static method in class no.priv.garshol.duke.utils.Utils
Combines two probabilities using Bayes' theorem.
ConfigLoader - Class in no.priv.garshol.duke
Can read XML configuration files and return a fully set up configuration.
ConfigLoader() - Constructor for class no.priv.garshol.duke.ConfigLoader
 
Configuration - Interface in no.priv.garshol.duke
 
ConfigurationImpl - Class in no.priv.garshol.duke
Holds the configuration details for a dataset.
ConfigurationImpl() - Constructor for class no.priv.garshol.duke.ConfigurationImpl
 
ConfigWriter - Class in no.priv.garshol.duke
Can write XML configuration files.
ConfigWriter() - Constructor for class no.priv.garshol.duke.ConfigWriter
 
ConsoleOracle - Class in no.priv.garshol.duke.genetic
This oracle asks the user via the console.
ConsoleOracle() - Constructor for class no.priv.garshol.duke.genetic.ConsoleOracle
 
contains(long) - Method in class no.priv.garshol.duke.Bucket
 
copy() - Method in interface no.priv.garshol.duke.Configuration
Returns an exact copy of the configuration.
copy() - Method in class no.priv.garshol.duke.ConfigurationImpl
 
copy() - Method in interface no.priv.garshol.duke.Property
Returns a copy of the property.
copy() - Method in class no.priv.garshol.duke.PropertyImpl
 
counter - Variable in class no.priv.garshol.duke.genetic.Pair
 
CountryNameCleaner - Class in no.priv.garshol.duke.examples
 
CountryNameCleaner() - Constructor for class no.priv.garshol.duke.examples.CountryNameCleaner
 
create() - Method in class no.priv.garshol.duke.genetic.GeneticPopulation
Creates the initial population.
createDatabase(boolean) - Method in interface no.priv.garshol.duke.Configuration
 
createDatabase(boolean) - Method in class no.priv.garshol.duke.ConfigurationImpl
 
createIndexableFields(String) - Method in class no.priv.garshol.duke.GeoProperty
Geoindexes the coordinates.
CSVDataSource - Class in no.priv.garshol.duke.datasources
 
CSVDataSource() - Constructor for class no.priv.garshol.duke.datasources.CSVDataSource
 
CSVDataSource.CSVRecordIterator - Class in no.priv.garshol.duke.datasources
 
CSVDataSource.CSVRecordIterator(CSVReader) - Constructor for class no.priv.garshol.duke.datasources.CSVDataSource.CSVRecordIterator
 
CSVReader - Class in no.priv.garshol.duke.utils
 
CSVReader(Reader) - Constructor for class no.priv.garshol.duke.utils.CSVReader
 
CSVReader(Reader, int) - Constructor for class no.priv.garshol.duke.utils.CSVReader
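Two entries in this section describe small numeric routines: compactDistance (the Wagner & Fischer algorithm keeping a single column in memory) and computeBayes (combining two probabilities using Bayes' theorem). The sketch below shows one plausible form of each. The class name NumericSketches is hypothetical, unit edit costs are assumed, and the Bayes formula shown is the usual combination of two independent probabilities with an even prior, which is an assumption about what Utils.computeBayes computes.

```java
// Hypothetical illustrations; not Duke's Levenshtein or Utils classes.
class NumericSketches {
  // Edit distance with unit costs, keeping only one column of the matrix.
  static int compactDistance(String s1, String s2) {
    int[] column = new int[s1.length() + 1];
    for (int i = 0; i <= s1.length(); i++) {
      column[i] = i; // distance from a prefix of s1 to the empty string
    }
    for (int j = 1; j <= s2.length(); j++) {
      int diagonal = column[0]; // cell (0, j-1)
      column[0] = j;
      for (int i = 1; i <= s1.length(); i++) {
        int previousRowCell = column[i];     // cell (i, j-1), not yet overwritten
        int previousColCell = column[i - 1]; // cell (i-1, j), already updated
        int cost = s1.charAt(i - 1) == s2.charAt(j - 1) ? 0 : 1;
        column[i] = Math.min(Math.min(previousRowCell + 1, previousColCell + 1),
                             diagonal + cost);
        diagonal = previousRowCell; // becomes cell (i, j-1) for the next i
      }
    }
    return column[s1.length()];
  }

  // Combines two probabilities, assuming independence and an even prior.
  // Yields NaN when one input is 1.0 and the other is 0.0.
  static double computeBayes(double p1, double p2) {
    return (p1 * p2) / ((p1 * p2) + ((1.0 - p1) * (1.0 - p2)));
  }

  public static void main(String[] args) {
    System.out.println(compactDistance("kitten", "sitting")); // prints 3
    System.out.println(computeBayes(0.5, 0.8));
  }
}
```

With this formula a probability of 0.5 is neutral: combining it with any other value leaves that value unchanged, while two values above 0.5 reinforce each other.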
 

D

Database - Interface in no.priv.garshol.duke
Used to store and index records for later matching.
database - Variable in class no.priv.garshol.duke.Processor
 
DatabaseProperties - Class in no.priv.garshol.duke
A class representing configurable properties on the record database.
DatabaseProperties() - Constructor for class no.priv.garshol.duke.DatabaseProperties
 
DatabaseProperties.DatabaseImplementation - Enum in no.priv.garshol.duke
 
DataSource - Interface in no.priv.garshol.duke
Any class which implements this interface can be used as a data source, so you can plug in your own data sources.
debug(String) - Method in class no.priv.garshol.duke.DummyLogger
 
debug(String) - Method in interface no.priv.garshol.duke.Logger
 
DebugCompare - Class in no.priv.garshol.duke
Compare two specific records to understand their scores.
DebugCompare() - Constructor for class no.priv.garshol.duke.DebugCompare
 
deduplicate() - Method in class no.priv.garshol.duke.Processor
Reads all available records from the data sources and processes them in batches, notifying the listeners throughout.
deduplicate(int) - Method in class no.priv.garshol.duke.Processor
Reads all available records from the data sources and processes them in batches, notifying the listeners throughout.
deduplicate(Collection<DataSource>, int) - Method in class no.priv.garshol.duke.Processor
Reads all available records from the data sources and processes them in batches, notifying the listeners throughout.
deduplicate(Collection<Record>) - Method in class no.priv.garshol.duke.Processor
Deduplicates a newly arrived batch of records.
DefaultRecordIterator - Class in no.priv.garshol.duke.utils
 
DefaultRecordIterator(Iterator<Record>) - Constructor for class no.priv.garshol.duke.utils.DefaultRecordIterator
 
delete(int, char) - Method in class no.priv.garshol.duke.comparators.WeightedLevenshtein.DefaultWeightEstimator
 
delete(int, char) - Method in interface no.priv.garshol.duke.comparators.WeightedLevenshtein.WeightEstimator
 
destroy() - Method in class no.priv.garshol.duke.server.StatusServlet
 
dice(int, Set<String>, Set<String>) - Static method in class no.priv.garshol.duke.comparators.QGramComparator
 
DiceCoefficientComparator - Class in no.priv.garshol.duke.comparators
An implementation of the Dice coefficient that uses exact matching by default, but can be overridden to use any sub-comparator.
DiceCoefficientComparator() - Constructor for class no.priv.garshol.duke.comparators.DiceCoefficientComparator
 
DifferentComparator - Class in no.priv.garshol.duke.comparators
A comparator which returns 0.0 if two values are exactly equal, and 1.0 if they are different.
DifferentComparator() - Constructor for class no.priv.garshol.duke.comparators.DifferentComparator
 
DigitsOnlyCleaner - Class in no.priv.garshol.duke.cleaners
Cleaner which removes all characters except the digits 0-9.
DigitsOnlyCleaner() - Constructor for class no.priv.garshol.duke.cleaners.DigitsOnlyCleaner
 
distance(double, double, double, double) - Static method in class no.priv.garshol.duke.comparators.GeopositionComparator
Returns the distance between the two points in meters.
distance(String, String) - Static method in class no.priv.garshol.duke.comparators.Levenshtein
This is the original, naive implementation, using the Wagner & Fischer algorithm from 1974.
distance(String, String, WeightedLevenshtein.WeightEstimator) - Static method in class no.priv.garshol.duke.comparators.WeightedLevenshtein
 
DocumentRecord - Class in no.priv.garshol.duke
Wraps a Lucene Document to provide a representation of it as a Record.
DocumentRecord(int, Document) - Constructor for class no.priv.garshol.duke.DocumentRecord
 
doGet(HttpServletRequest, HttpServletResponse) - Method in class no.priv.garshol.duke.server.StatusServlet
 
done() - Method in class no.priv.garshol.duke.utils.NTriplesWriter
 
doPost(HttpServletRequest, HttpServletResponse) - Method in class no.priv.garshol.duke.server.StatusServlet
 
Driver - Class in no.priv.garshol.duke.genetic
Command-line interface to the genetic algorithm.
Driver() - Constructor for class no.priv.garshol.duke.genetic.Driver
 
dropControlChars - Variable in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
Duke - Class in no.priv.garshol.duke
Command-line interface to the engine.
Duke() - Constructor for class no.priv.garshol.duke.Duke
 
DukeConfigException - Exception in no.priv.garshol.duke
Thrown when there is an error in the configuration of Duke.
DukeConfigException(String) - Constructor for exception no.priv.garshol.duke.DukeConfigException
 
DukeController - Class in no.priv.garshol.duke.server
The central class that receives notifications from the UI and timer threads, controlling the actual work performed.
DukeController(Properties) - Constructor for class no.priv.garshol.duke.server.DukeController
 
DukeException - Exception in no.priv.garshol.duke
Used to signal that something has gone wrong during Duke processing.
DukeException(String) - Constructor for exception no.priv.garshol.duke.DukeException
 
DukeException(String, Throwable) - Constructor for exception no.priv.garshol.duke.DukeException
 
DukeException(Throwable) - Constructor for exception no.priv.garshol.duke.DukeException
 
DukeTimer - Interface in no.priv.garshol.duke.server
 
DummyLogger - Class in no.priv.garshol.duke
 
DummyLogger() - Constructor for class no.priv.garshol.duke.DummyLogger
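The GeopositionComparator.distance(double, double, double, double) entry above returns the distance between two points in meters. The index does not say which formula is used. The sketch below assumes the haversine great-circle formula, a mean earth radius of 6,371,000 m, and the argument order (lat1, lon1, lat2, lon2) in degrees. All three are assumptions, as is the class name HaversineSketch.

```java
// Hypothetical illustration; not Duke's GeopositionComparator.
class HaversineSketch {
  static final double EARTH_RADIUS_M = 6_371_000.0; // assumed mean radius

  // Great-circle distance in meters between two points given in degrees.
  static double distance(double lat1, double lon1, double lat2, double lon2) {
    double dLat = Math.toRadians(lat2 - lat1);
    double dLon = Math.toRadians(lon2 - lon1);
    double sinLat = Math.sin(dLat / 2);
    double sinLon = Math.sin(dLon / 2);
    double a = sinLat * sinLat
        + Math.cos(Math.toRadians(lat1)) * Math.cos(Math.toRadians(lat2))
          * sinLon * sinLon;
    double c = 2 * Math.atan2(Math.sqrt(a), Math.sqrt(1 - a));
    return EARTH_RADIUS_M * c;
  }

  public static void main(String[] args) {
    // One degree of latitude along a meridian, roughly 111 km.
    System.out.println(distance(0.0, 0.0, 1.0, 0.0));
  }
}
```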
 

E

encodeCharsFrom - Variable in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
encoding - Variable in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
endDocument() - Method in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
endElement(String) - Method in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
endProcessing() - Method in class no.priv.garshol.duke.matchers.AbstractMatchListener
 
endProcessing() - Method in interface no.priv.garshol.duke.matchers.MatchListener
Notification that this processing run is over.
endProcessing() - Method in class no.priv.garshol.duke.matchers.PrintMatchListener
 
endProcessing() - Method in class no.priv.garshol.duke.matchers.TestFileListener
 
endProcessing() - Method in class no.priv.garshol.duke.Processor.Profiler
 
endRecord() - Method in class no.priv.garshol.duke.server.DukeController
 
endRecord_() - Method in class no.priv.garshol.duke.matchers.LinkDatabaseMatchListener
 
endsTokens(String, int) - Static method in class no.priv.garshol.duke.comparators.QGramComparator
Produces q-grams with padding, so that 'gail' -> '.g', 'ga', 'ai', 'il', 'l.'.
equals(Object) - Method in class no.priv.garshol.duke.DocumentRecord
 
equals(Object) - Method in class no.priv.garshol.duke.genetic.Pair
 
equals(Object) - Method in class no.priv.garshol.duke.Link
 
EquivalenceClassDatabase - Interface in no.priv.garshol.duke
A tool for collecting matching records into groups where all records are considered to match.
error(String) - Method in class no.priv.garshol.duke.DummyLogger
 
error(String, Throwable) - Method in class no.priv.garshol.duke.DummyLogger
 
error(String) - Method in interface no.priv.garshol.duke.Logger
 
error(String, Throwable) - Method in interface no.priv.garshol.duke.Logger
 
escapeAttrValue(String, Writer) - Method in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
evolve(int) - Method in class no.priv.garshol.duke.genetic.GeneticAlgorithm
Creates a new generation.
ExactComparator - Class in no.priv.garshol.duke.comparators
Comparator which compares two values exactly.
ExactComparator() - Constructor for class no.priv.garshol.duke.comparators.ExactComparator
 
execute(String, String) - Static method in class no.priv.garshol.duke.utils.SparqlClient
 
ExemplarsTracker - Class in no.priv.garshol.duke.genetic
A listener to decide which potential matches to ask the oracle about.
ExemplarsTracker(Configuration, Comparator) - Constructor for class no.priv.garshol.duke.genetic.ExemplarsTracker
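The endsTokens(String, int) entry above gives the padded example 'gail' -> '.g', 'ga', 'ai', 'il', 'l.'. A minimal sketch that reproduces that output for q = 2 follows. Padding each end with q - 1 '.' characters for other values of q is an assumption, as are the class name PaddedQGramSketch and the List return type.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of padded q-gram tokenization; not Duke's code.
class PaddedQGramSketch {
  // Pads both ends with '.' and returns every substring of length q.
  static List<String> endsTokens(String s, int q) {
    StringBuilder padded = new StringBuilder();
    for (int i = 0; i < q - 1; i++) padded.append('.');
    padded.append(s);
    for (int i = 0; i < q - 1; i++) padded.append('.');

    List<String> tokens = new ArrayList<>();
    for (int i = 0; i + q <= padded.length(); i++) {
      tokens.add(padded.substring(i, i + q));
    }
    return tokens;
  }

  public static void main(String[] args) {
    System.out.println(endsTokens("gail", 2)); // prints [.g, ga, ai, il, l.]
  }
}
```

Padding gives the first and last characters their own tokens, so two strings that differ only at the start or end share fewer q-grams than they would under the basic scheme.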
 

F

FamilyCommaGivenCleaner - Class in no.priv.garshol.duke.cleaners
Experimental cleaner for person names of the form "Smith, John".
FamilyCommaGivenCleaner() - Constructor for class no.priv.garshol.duke.cleaners.FamilyCommaGivenCleaner
 
findCandidateMatches(Record) - Method in interface no.priv.garshol.duke.Database
Look up potentially matching records.
findCandidateMatches(Record) - Method in class no.priv.garshol.duke.InMemoryDatabase
Look up potentially matching records.
findCandidateMatches(Record) - Method in class no.priv.garshol.duke.KeyValueDatabase
Look up potentially matching records.
findCandidateMatches(Record) - Method in class no.priv.garshol.duke.LuceneDatabase
Look up potentially matching records.
findRecordById(String) - Method in interface no.priv.garshol.duke.Database
Look up record by identity.
findRecordById(String) - Method in class no.priv.garshol.duke.InMemoryDatabase
Look up record by identity.
findRecordById(String) - Method in class no.priv.garshol.duke.InMemoryKeyValueStore
 
findRecordById(long) - Method in class no.priv.garshol.duke.InMemoryKeyValueStore
 
findRecordById(String) - Method in class no.priv.garshol.duke.KeyValueDatabase
Look up record by identity.
findRecordById(String) - Method in interface no.priv.garshol.duke.KeyValueStore
Returns the record with the given external ID.
findRecordById(long) - Method in interface no.priv.garshol.duke.KeyValueStore
Returns the record with the given internal ID.
findRecordById(String) - Method in class no.priv.garshol.duke.LuceneDatabase
Look up record by identity.
flush(Writer) - Method in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 

G

GenericValueCleaner - Class in no.priv.garshol.duke.cleaners
A cleaner which returns values as they are, but removes specific values.
GenericValueCleaner() - Constructor for class no.priv.garshol.duke.cleaners.GenericValueCleaner
 
GeneticAlgorithm - Class in no.priv.garshol.duke.genetic
The class that actually runs the genetic algorithm.
GeneticAlgorithm(Configuration, String, boolean) - Constructor for class no.priv.garshol.duke.genetic.GeneticAlgorithm
Creates the algorithm.
GeneticConfiguration - Class in no.priv.garshol.duke.genetic
A configuration created by the genetic algorithm.
GeneticConfiguration(Configuration) - Constructor for class no.priv.garshol.duke.genetic.GeneticConfiguration
Creates an initial copy of the starting configuration, with no changes.
GeneticConfiguration(GeneticConfiguration) - Constructor for class no.priv.garshol.duke.genetic.GeneticConfiguration
Creates a copy of the starting configuration, keeping the aspects list.
GeneticPopulation - Class in no.priv.garshol.duke.genetic
Keeps track of the population.
GeneticPopulation(Configuration) - Constructor for class no.priv.garshol.duke.genetic.GeneticPopulation
 
GeopositionComparator - Class in no.priv.garshol.duke.comparators
Comparator which compares two geographic positions, given as coordinates, by the distance between them along the earth's surface.
GeopositionComparator() - Constructor for class no.priv.garshol.duke.comparators.GeopositionComparator
 
GeoProperty - Class in no.priv.garshol.duke
All spatial Lucene search functionality is located in this class.
GeoProperty(Property) - Constructor for class no.priv.garshol.duke.GeoProperty
 
geoSearch(String) - Method in class no.priv.garshol.duke.GeoProperty
Returns a geoquery.
get(Properties, String) - Static method in class no.priv.garshol.duke.utils.PropertyUtils
Returns the value of a required property; throws an exception if the property is not specified.
get(Properties, String, String) - Static method in class no.priv.garshol.duke.utils.PropertyUtils
Returns the value of an optional property, if the property is set.
get(Properties, String, int) - Static method in class no.priv.garshol.duke.utils.PropertyUtils
Returns the value of an optional property, if the property is set.
getAllLinks() - Method in class no.priv.garshol.duke.InMemoryLinkDatabase
 
getAllLinks() - Method in interface no.priv.garshol.duke.LinkDatabase
Get all links.
getAllLinks() - Method in class no.priv.garshol.duke.RDBMSLinkDatabase
 
getAllLinksFor(String) - Method in class no.priv.garshol.duke.InMemoryLinkDatabase
 
getAllLinksFor(String) - Method in interface no.priv.garshol.duke.LinkDatabase
Get all links for this identity.
getAllLinksFor(String) - Method in class no.priv.garshol.duke.RDBMSLinkDatabase
 
getBestConfiguration() - Method in class no.priv.garshol.duke.genetic.GeneticAlgorithm
 
getBestConfiguration() - Method in class no.priv.garshol.duke.genetic.GeneticPopulation
Returns the best configuration.
getbyid(String) - Static method in enum no.priv.garshol.duke.DatabaseProperties.DatabaseImplementation
 
getbyid(int) - Static method in enum no.priv.garshol.duke.LinkKind
 
getbyid(int) - Static method in enum no.priv.garshol.duke.LinkStatus
 
getChangesSince(long) - Method in class no.priv.garshol.duke.InMemoryLinkDatabase
 
getChangesSince(long) - Method in interface no.priv.garshol.duke.LinkDatabase
Returns all links modified since the given time.
getChangesSince(long) - Method in class no.priv.garshol.duke.RDBMSLinkDatabase
 
getChangesSince(long, long) - Method in class no.priv.garshol.duke.RDBMSLinkDatabase
 
getChangesSince(long, long, int) - Method in class no.priv.garshol.duke.RDBMSLinkDatabase
 
getClass(String) - Method in interface no.priv.garshol.duke.EquivalenceClassDatabase
Get all records linked to the given record (that is, all records in the same equivalence class as the given record).
getClass(String) - Method in class no.priv.garshol.duke.InMemoryClassDatabase
 
getClass(String) - Method in class no.priv.garshol.duke.JDBCEquivalenceClassDatabase
 
getClassCount() - Method in interface no.priv.garshol.duke.EquivalenceClassDatabase
Returns the number of equivalence classes in the database.
getClassCount() - Method in class no.priv.garshol.duke.InMemoryClassDatabase
 
getClassCount() - Method in class no.priv.garshol.duke.JDBCEquivalenceClassDatabase
 
getClasses() - Method in interface no.priv.garshol.duke.EquivalenceClassDatabase
Returns an iterator over all the classes in the database.
getClasses() - Method in class no.priv.garshol.duke.InMemoryClassDatabase
 
getClasses() - Method in class no.priv.garshol.duke.JDBCEquivalenceClassDatabase
 
getClassId(String) - Method in class no.priv.garshol.duke.JDBCEquivalenceClassDatabase
 
getCleaner() - Method in class no.priv.garshol.duke.datasources.Column
 
getColumn(String) - Method in class no.priv.garshol.duke.datasources.ColumnarDataSource
 
getColumns() - Method in class no.priv.garshol.duke.datasources.ColumnarDataSource
 
getComparator() - Method in interface no.priv.garshol.duke.Property
 
getComparator() - Method in class no.priv.garshol.duke.PropertyImpl
 
getComparisonCount() - Method in class no.priv.garshol.duke.Processor
Returns the number of records that have been compared.
getConfigs() - Method in class no.priv.garshol.duke.genetic.GeneticPopulation
Returns all configurations in the current generation.
getConfiguration() - Method in class no.priv.garshol.duke.genetic.GeneticConfiguration
Returns the underlying Duke configuration.
getConnectionString() - Method in class no.priv.garshol.duke.datasources.JDBCDataSource
 
getCreateTable() - Method in enum no.priv.garshol.duke.RDBMSLinkDatabase.DatabaseType
 
getDatabase() - Method in class no.priv.garshol.duke.Processor
Returns the actual Lucene index being used.
getDatabaseImplementation() - Method in class no.priv.garshol.duke.DatabaseProperties
 
getDatabaseProperties() - Method in interface no.priv.garshol.duke.Configuration
 
getDatabaseProperties() - Method in class no.priv.garshol.duke.ConfigurationImpl
 
getDataSources() - Method in interface no.priv.garshol.duke.Configuration
Returns the data sources to use (in deduplication mode; don't use this method in record linkage mode).
getDataSources(int) - Method in interface no.priv.garshol.duke.Configuration
Returns the data sources belonging to a particular group of data sources.
getDataSources() - Method in class no.priv.garshol.duke.ConfigurationImpl
Returns the data sources to use (in deduplication mode; don't use this method in record linkage mode).
getDataSources(int) - Method in class no.priv.garshol.duke.ConfigurationImpl
Returns the data sources belonging to a particular group of data sources.
getDriverClass() - Method in class no.priv.garshol.duke.datasources.JDBCDataSource
 
getEnumConstantByName(Class, String) - Static method in class no.priv.garshol.duke.utils.ObjectUtils
Returns the enum constant from the given enum class representing the constant with the given identifier/name.
getExemplars() - Method in class no.priv.garshol.duke.genetic.ExemplarsTracker
 
getFNumber() - Method in class no.priv.garshol.duke.genetic.GeneticConfiguration
Returns the F-score of this configuration.
getFNumber() - Method in class no.priv.garshol.duke.matchers.TestFileListener
 
getHighProbability() - Method in interface no.priv.garshol.duke.Property
 
getHighProbability() - Method in class no.priv.garshol.duke.PropertyImpl
 
getId() - Method in enum no.priv.garshol.duke.DatabaseProperties.DatabaseImplementation
 
getId() - Method in enum no.priv.garshol.duke.LinkKind
 
getId() - Method in enum no.priv.garshol.duke.LinkStatus
 
getID1() - Method in class no.priv.garshol.duke.Link
 
getID2() - Method in class no.priv.garshol.duke.Link
 
getIdentityProperties() - Method in interface no.priv.garshol.duke.Configuration
The properties which are used to identify records, rather than compare them.
getIdentityProperties() - Method in class no.priv.garshol.duke.ConfigurationImpl
The properties which are used to identify records, rather than compare them.
getJndiPath() - Method in class no.priv.garshol.duke.datasources.JNDIDataSource
 
getKind() - Method in class no.priv.garshol.duke.Link
 
getLastCheck() - Method in class no.priv.garshol.duke.server.DukeController
 
getLastRecord() - Method in class no.priv.garshol.duke.server.DukeController
 
getLimit(int) - Method in enum no.priv.garshol.duke.RDBMSLinkDatabase.DatabaseType
 
getLinkDatabase() - Method in class no.priv.garshol.duke.genetic.LinkFileOracle
 
getLinkKind(String, String) - Method in class no.priv.garshol.duke.genetic.ConsoleOracle
 
getLinkKind(String, String) - Method in class no.priv.garshol.duke.genetic.LinkFileOracle
 
getLinkKind(String, String) - Method in interface no.priv.garshol.duke.genetic.Oracle
Asks the oracle whether the two IDs represent the same thing or not, and returns the answer.
getLinks() - Method in interface no.priv.garshol.duke.LinkSource
Returns the links known by the source.
getListeners() - Method in class no.priv.garshol.duke.Processor
Returns all registered listeners.
getLongname() - Method in class no.priv.garshol.duke.utils.CommandLineParser.Option
 
getLookupBehaviour() - Method in interface no.priv.garshol.duke.Property
 
getLookupBehaviour() - Method in class no.priv.garshol.duke.PropertyImpl
 
getLookupProperties() - Method in interface no.priv.garshol.duke.Configuration
Returns the properties Duke queries for in the Lucene index.
getLookupProperties() - Method in class no.priv.garshol.duke.ConfigurationImpl
Returns the properties Duke queries for in the Lucene index.
getLowProbability() - Method in interface no.priv.garshol.duke.Property
 
getLowProbability() - Method in class no.priv.garshol.duke.PropertyImpl
 
getMatchCount() - Method in class no.priv.garshol.duke.matchers.PrintMatchListener
 
getMaxDistance() - Method in class no.priv.garshol.duke.comparators.GeopositionComparator
 
getMaxSearchHits() - Method in class no.priv.garshol.duke.DatabaseProperties
 
getMaybeThreshold() - Method in interface no.priv.garshol.duke.Configuration
The probability threshold used to decide whether two records may represent the same entity.
getMaybeThreshold() - Method in class no.priv.garshol.duke.ConfigurationImpl
The probability threshold used to decide whether two records may represent the same entity.
getMetaTableName() - Method in enum no.priv.garshol.duke.RDBMSLinkDatabase.DatabaseType
 
getMinRelevance() - Method in class no.priv.garshol.duke.DatabaseProperties
 
getName() - Method in class no.priv.garshol.duke.datasources.Column
 
getName() - Method in class no.priv.garshol.duke.GeoProperty
Returns the name of the property.
getName() - Method in interface no.priv.garshol.duke.Property
Returns the name of the property.
getName() - Method in class no.priv.garshol.duke.PropertyImpl
 
getNow() - Method in enum no.priv.garshol.duke.RDBMSLinkDatabase.DatabaseType
 
getOption(String) - Method in class no.priv.garshol.duke.utils.CommandLineParser
 
getOption(char) - Method in class no.priv.garshol.duke.utils.CommandLineParser
 
getOptionInteger(String, int) - Method in class no.priv.garshol.duke.utils.CommandLineParser
 
getOptionState(String) - Method in class no.priv.garshol.duke.utils.CommandLineParser
 
getOptionValue(String) - Method in class no.priv.garshol.duke.utils.CommandLineParser
 
getOtherId(String) - Method in class no.priv.garshol.duke.Link
Get the ID (out of ID1 and ID2) which is not equal to the given ID.
getParent() - Method in class no.priv.garshol.duke.genetic.GeneticConfiguration
Returns the configuration this configuration was derived from, if any.
getPassword() - Method in class no.priv.garshol.duke.datasources.JDBCDataSource
 
getPath() - Method in interface no.priv.garshol.duke.Configuration
Returns the path to the Lucene index directory.
getPath() - Method in class no.priv.garshol.duke.ConfigurationImpl
Returns the path to the Lucene index directory.
getPopulation() - Method in class no.priv.garshol.duke.genetic.GeneticAlgorithm
 
getPrefix() - Method in class no.priv.garshol.duke.datasources.Column
 
getProfiler() - Method in class no.priv.garshol.duke.Processor
Returns the performance profiler, if any.
getProperties() - Method in interface no.priv.garshol.duke.Configuration
The set of properties Duke records can have, and their associated cleaners, comparators, and probabilities.
getProperties() - Method in class no.priv.garshol.duke.ConfigurationImpl
The set of properties Duke records can have, and their associated cleaners, comparators, and probabilities.
getProperties() - Method in class no.priv.garshol.duke.DocumentRecord
 
getProperties() - Method in interface no.priv.garshol.duke.Record
The names of the properties this record has.
getProperties() - Method in class no.priv.garshol.duke.RecordImpl
 
getProperty() - Method in class no.priv.garshol.duke.datasources.Column
 
getPropertyByName(String) - Method in interface no.priv.garshol.duke.Configuration
Returns the property with the given name, or null if there is no such property.
getPropertyByName(String) - Method in class no.priv.garshol.duke.ConfigurationImpl
Returns the property with the given name, or null if there is no such property.
getQuery() - Method in class no.priv.garshol.duke.datasources.JDBCDataSource
 
getRank() - Method in class no.priv.garshol.duke.genetic.GeneticConfiguration
Returns the ranking of this configuration within its generation.
getRecord() - Method in class no.priv.garshol.duke.datasources.RecordBuilder
 
getRecordCount() - Method in class no.priv.garshol.duke.server.DukeController
 
getRecords() - Method in interface no.priv.garshol.duke.DataSource
Return an iterator over all the records in this data source.
getRecords() - Method in class no.priv.garshol.duke.datasources.CSVDataSource
 
getRecords() - Method in class no.priv.garshol.duke.datasources.InMemoryDataSource
 
getRecords() - Method in class no.priv.garshol.duke.datasources.JDBCDataSource
 
getRecords() - Method in class no.priv.garshol.duke.datasources.JNDIDataSource
 
getRecords() - Method in class no.priv.garshol.duke.datasources.NTriplesDataSource
 
getRecords() - Method in class no.priv.garshol.duke.datasources.SparqlDataSource
 
getRows() - Method in class no.priv.garshol.duke.utils.SparqlResult
 
getScore() - Method in class no.priv.garshol.duke.Bucket
 
getShortname() - Method in class no.priv.garshol.duke.utils.CommandLineParser.Option
 
getSourceName() - Method in class no.priv.garshol.duke.datasources.ColumnarDataSource
 
getSourceName() - Method in class no.priv.garshol.duke.datasources.CSVDataSource
 
getSourceName() - Method in class no.priv.garshol.duke.datasources.JDBCDataSource
 
getSourceName() - Method in class no.priv.garshol.duke.datasources.JNDIDataSource
 
getSourceName() - Method in class no.priv.garshol.duke.datasources.NTriplesDataSource
 
getSourceName() - Method in class no.priv.garshol.duke.datasources.SparqlDataSource
 
getState() - Method in class no.priv.garshol.duke.utils.CommandLineParser.BooleanOption
 
getStatus() - Method in class no.priv.garshol.duke.Link
 
getStatus() - Method in class no.priv.garshol.duke.server.DukeController
 
getThreads() - Method in class no.priv.garshol.duke.Processor
Returns the number of threads.
getThreshold() - Method in interface no.priv.garshol.duke.Configuration
The probability threshold used to decide whether two records represent the same entity.
getThreshold() - Method in class no.priv.garshol.duke.ConfigurationImpl
The probability threshold used to decide whether two records represent the same entity.
getTimestamp() - Method in class no.priv.garshol.duke.Link
 
getUserName() - Method in class no.priv.garshol.duke.datasources.JDBCDataSource
 
getValue(String) - Method in class no.priv.garshol.duke.DocumentRecord
 
getValue(String) - Method in interface no.priv.garshol.duke.Record
Returns a value for the named property.
getValue(String) - Method in class no.priv.garshol.duke.RecordImpl
 
getValue() - Method in class no.priv.garshol.duke.utils.CommandLineParser.StringOption
 
getValues(String) - Method in class no.priv.garshol.duke.DocumentRecord
 
getValues(String) - Method in interface no.priv.garshol.duke.Record
All values for the named property.
getValues(String) - Method in class no.priv.garshol.duke.RecordImpl
 
getVariables() - Method in class no.priv.garshol.duke.utils.SparqlResult
 
getVersionString() - Static method in class no.priv.garshol.duke.Duke
 
getWhereLimit(int) - Method in enum no.priv.garshol.duke.RDBMSLinkDatabase.DatabaseType
 
getWorstConfiguration() - Method in class no.priv.garshol.duke.genetic.GeneticPopulation
Returns the worst configuration.

H

hashCode() - Method in class no.priv.garshol.duke.DocumentRecord
 
hashCode() - Method in class no.priv.garshol.duke.genetic.Pair
 
hashCode() - Method in class no.priv.garshol.duke.Link
 
hasNext() - Method in class no.priv.garshol.duke.comparators.Matcher
 
hasNext() - Method in class no.priv.garshol.duke.datasources.CSVDataSource.CSVRecordIterator
 
hasNext() - Method in class no.priv.garshol.duke.datasources.JDBCDataSource.JDBCIterator
 
hasNext() - Method in class no.priv.garshol.duke.utils.DefaultRecordIterator
 
HighProbabilityAspect - Class in no.priv.garshol.duke.genetic
Sets the high probability.
HighProbabilityAspect(Property) - Constructor for class no.priv.garshol.duke.genetic.HighProbabilityAspect
 
htmlCompare(Record, Record, double, String, List<Property>) - Static method in class no.priv.garshol.duke.matchers.PrintMatchListener
 

I

id - Variable in class no.priv.garshol.duke.KeyValueDatabase.Score
 
id1 - Variable in class no.priv.garshol.duke.genetic.Pair
 
id2 - Variable in class no.priv.garshol.duke.genetic.Pair
 
ignorableWhitespace(char[], int, int) - Method in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
indent() - Method in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
index(Record) - Method in interface no.priv.garshol.duke.Database
Add the record to the index.
index(Record) - Method in class no.priv.garshol.duke.InMemoryDatabase
Add the record to the index.
index(Record) - Method in class no.priv.garshol.duke.KeyValueDatabase
Add the record to the index.
index(Record) - Method in class no.priv.garshol.duke.LuceneDatabase
Add the record to the index.
index(Collection<DataSource>, int) - Method in class no.priv.garshol.duke.Processor
Index all new records from the given data sources.
inferLink(String, String) - Method in class no.priv.garshol.duke.InMemoryLinkDatabase
 
inferLink(String, String) - Method in interface no.priv.garshol.duke.LinkDatabase
Infers the relationship between the two identities from the links already known. Returns null if the relationship is unknown.
inferLink(String, String) - Method in class no.priv.garshol.duke.RDBMSLinkDatabase
 
info(String) - Method in class no.priv.garshol.duke.DummyLogger
 
info(String) - Method in interface no.priv.garshol.duke.Logger
 
init() - Method in class no.priv.garshol.duke.RDBMSLinkDatabase
This method must be called to initialize the database.
init(Properties) - Method in class no.priv.garshol.duke.server.BasicTimer
 
init(Properties) - Method in class no.priv.garshol.duke.server.CommonJTimer
 
init(Properties) - Method in interface no.priv.garshol.duke.server.DukeTimer
Initializes the timer, giving it access to configuration settings.
init(ServletConfig) - Method in class no.priv.garshol.duke.server.StatusServlet
 
InMemoryClassDatabase - Class in no.priv.garshol.duke
An equivalence class database which maintains the entire structure in memory.
InMemoryClassDatabase() - Constructor for class no.priv.garshol.duke.InMemoryClassDatabase
Instantiates an empty class database.
InMemoryDatabase - Class in no.priv.garshol.duke
Naïve in-memory store of records.
InMemoryDatabase(Configuration) - Constructor for class no.priv.garshol.duke.InMemoryDatabase
 
InMemoryDataSource - Class in no.priv.garshol.duke.datasources
Data source which can be passed Record objects, and which then returns them.
InMemoryDataSource() - Constructor for class no.priv.garshol.duke.datasources.InMemoryDataSource
Creates an empty source.
InMemoryDataSource(Collection<Record>) - Constructor for class no.priv.garshol.duke.datasources.InMemoryDataSource
Creates a source populated with the records in the records parameter.
InMemoryKeyValueStore - Class in no.priv.garshol.duke
A simple key value store that keeps all data in memory.
InMemoryKeyValueStore() - Constructor for class no.priv.garshol.duke.InMemoryKeyValueStore
 
InMemoryLinkDatabase - Class in no.priv.garshol.duke
A link database implementation which keeps everything in memory.
InMemoryLinkDatabase() - Constructor for class no.priv.garshol.duke.InMemoryLinkDatabase
 
insert(int, char) - Method in class no.priv.garshol.duke.comparators.WeightedLevenshtein.DefaultWeightEstimator
 
insert(int, char) - Method in interface no.priv.garshol.duke.comparators.WeightedLevenshtein.WeightEstimator
 
instantiate(String) - Static method in class no.priv.garshol.duke.utils.ObjectUtils
 
isAnalyzedProperty() - Method in interface no.priv.garshol.duke.Property
 
isAnalyzedProperty() - Method in class no.priv.garshol.duke.PropertyImpl
 
isDebugEnabled() - Method in class no.priv.garshol.duke.DummyLogger
 
isDebugEnabled() - Method in interface no.priv.garshol.duke.Logger
 
isDeduplicationMode() - Method in interface no.priv.garshol.duke.Configuration
Returns true iff we are in deduplication mode.
isDeduplicationMode() - Method in class no.priv.garshol.duke.ConfigurationImpl
Returns true iff we are in deduplication mode.
isEmpty() - Method in class no.priv.garshol.duke.RecordImpl
 
isErrorBlocked() - Method in class no.priv.garshol.duke.server.DukeController
 
isErrorEnabled() - Method in class no.priv.garshol.duke.DummyLogger
 
isErrorEnabled() - Method in interface no.priv.garshol.duke.Logger
 
isIdProperty() - Method in interface no.priv.garshol.duke.Property
Returns true iff the property is an identifying property.
isIdProperty() - Method in class no.priv.garshol.duke.PropertyImpl
 
isIgnoreProperty() - Method in interface no.priv.garshol.duke.Property
Returns true iff the property should not be used for comparing records.
isIgnoreProperty() - Method in class no.priv.garshol.duke.PropertyImpl
Returns true iff the property should not be used for comparing records.
isInfoEnabled() - Method in class no.priv.garshol.duke.DummyLogger
 
isInfoEnabled() - Method in interface no.priv.garshol.duke.Logger
 
isInMemory() - Method in interface no.priv.garshol.duke.Database
Returns true iff the database is held entirely in memory, and thus is not persistent.
isInMemory() - Method in class no.priv.garshol.duke.InMemoryDatabase
Returns true iff the database is held entirely in memory, and thus is not persistent.
isInMemory() - Method in class no.priv.garshol.duke.InMemoryKeyValueStore
 
isInMemory() - Method in class no.priv.garshol.duke.KeyValueDatabase
Returns true iff the database is held entirely in memory, and thus is not persistent.
isInMemory() - Method in interface no.priv.garshol.duke.KeyValueStore
Returns true iff the database is held entirely in memory, and thus is not persistent.
isInMemory() - Method in class no.priv.garshol.duke.LuceneDatabase
Returns true iff the Lucene index is held in memory rather than on disk.
isLast() - Method in class no.priv.garshol.duke.comparators.Matcher
 
isNext(char) - Method in class no.priv.garshol.duke.comparators.Matcher
 
isRecordEmpty() - Method in class no.priv.garshol.duke.datasources.RecordBuilder
 
isRunning() - Method in class no.priv.garshol.duke.server.BasicTimer
 
isRunning() - Method in class no.priv.garshol.duke.server.CommonJTimer
Returns true iff the background thread is running.
isRunning() - Method in interface no.priv.garshol.duke.server.DukeTimer
Returns true iff the background thread is running.
isSplit() - Method in class no.priv.garshol.duke.datasources.Column
Returns true iff this column needs to be split into multiple values.
isTokenized() - Method in interface no.priv.garshol.duke.Comparator
Returns true if the comparator breaks string values up into tokens when comparing.
isTokenized() - Method in class no.priv.garshol.duke.comparators.DiceCoefficientComparator
 
isTokenized() - Method in class no.priv.garshol.duke.comparators.DifferentComparator
 
isTokenized() - Method in class no.priv.garshol.duke.comparators.ExactComparator
 
isTokenized() - Method in class no.priv.garshol.duke.comparators.GeopositionComparator
 
isTokenized() - Method in class no.priv.garshol.duke.comparators.JaccardIndexComparator
 
isTokenized() - Method in class no.priv.garshol.duke.comparators.JaroWinkler
 
isTokenized() - Method in class no.priv.garshol.duke.comparators.JaroWinklerTokenized
 
isTokenized() - Method in class no.priv.garshol.duke.comparators.Levenshtein
 
isTokenized() - Method in class no.priv.garshol.duke.comparators.MetaphoneComparator
 
isTokenized() - Method in class no.priv.garshol.duke.comparators.NorphoneComparator
 
isTokenized() - Method in class no.priv.garshol.duke.comparators.NumericComparator
 
isTokenized() - Method in class no.priv.garshol.duke.comparators.PersonNameComparator
 
isTokenized() - Method in class no.priv.garshol.duke.comparators.QGramComparator
 
isTokenized() - Method in class no.priv.garshol.duke.comparators.SoundexComparator
 
isTokenized() - Method in class no.priv.garshol.duke.comparators.WeightedLevenshtein
 
isTraceEnabled() - Method in class no.priv.garshol.duke.DummyLogger
 
isTraceEnabled() - Method in interface no.priv.garshol.duke.Logger
 
isWarnEnabled() - Method in class no.priv.garshol.duke.DummyLogger
 
isWarnEnabled() - Method in interface no.priv.garshol.duke.Logger
 
isWindowsOS() - Static method in class no.priv.garshol.duke.utils.Utils
Returns true iff we are running on Windows.

J

jaccard(int, Set<String>, Set<String>) - Static method in class no.priv.garshol.duke.comparators.QGramComparator
 
JaccardIndexComparator - Class in no.priv.garshol.duke.comparators
An implementation of the Jaccard index that uses exact matching by default, but can be configured to use any sub-comparator.
JaccardIndexComparator() - Constructor for class no.priv.garshol.duke.comparators.JaccardIndexComparator
 
JaroWinkler - Class in no.priv.garshol.duke.comparators
An implementation of the Jaro-Winkler string similarity measure.
JaroWinkler() - Constructor for class no.priv.garshol.duke.comparators.JaroWinkler
 
JaroWinklerTokenized - Class in no.priv.garshol.duke.comparators
A tokenized approach to string similarity, based on Jaccard equivalence and the Jaro-Winkler metric.
JaroWinklerTokenized() - Constructor for class no.priv.garshol.duke.comparators.JaroWinklerTokenized
 
JDBCDataSource - Class in no.priv.garshol.duke.datasources
 
JDBCDataSource() - Constructor for class no.priv.garshol.duke.datasources.JDBCDataSource
 
JDBCDataSource.JDBCIterator - Class in no.priv.garshol.duke.datasources
 
JDBCDataSource.JDBCIterator(ResultSet) - Constructor for class no.priv.garshol.duke.datasources.JDBCDataSource.JDBCIterator
 
JDBCEquivalenceClassDatabase - Class in no.priv.garshol.duke
An equivalence class database backed by JDBC.
JDBCEquivalenceClassDatabase(String, String, String, Properties) - Constructor for class no.priv.garshol.duke.JDBCEquivalenceClassDatabase
 
JDBCLinkDatabase - Class in no.priv.garshol.duke
A link database which can maintain a set of links in an H2 or Oracle database over JDBC.
JDBCLinkDatabase(String, String, String, Properties) - Constructor for class no.priv.garshol.duke.JDBCLinkDatabase
 
JDBCUtils - Class in no.priv.garshol.duke.utils
Utilities for making life with JDBC easier.
JDBCUtils() - Constructor for class no.priv.garshol.duke.utils.JDBCUtils
 
JNDIDataSource - Class in no.priv.garshol.duke.datasources
Data source which retrieves a JDBC connection from JNDI.
JNDIDataSource() - Constructor for class no.priv.garshol.duke.datasources.JNDIDataSource
 
JNDILinkDatabase - Class in no.priv.garshol.duke
A link database that gets its connection via JNDI lookup.
JNDILinkDatabase(String, String) - Constructor for class no.priv.garshol.duke.JNDILinkDatabase
 
join(String[]) - Static method in class no.priv.garshol.duke.utils.StringUtils
 
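Example (illustrative only): the JaccardIndexComparator entry above refers to the Jaccard index with exact matching. The sketch below shows that measure on whitespace-separated tokens. It is not Duke's implementation and does not use the Duke API; the class name JaccardSketch, the tokens() helper, and the choice to treat two empty sets as identical are assumptions made for this example.

```java
import java.util.HashSet;
import java.util.Set;

// Illustrative sketch only; not Duke's JaccardIndexComparator.
final class JaccardSketch {

    // Hypothetical tokenization: split on runs of whitespace.
    static Set<String> tokens(String s) {
        Set<String> out = new HashSet<>();
        for (String t : s.trim().split("\\s+")) {
            if (!t.isEmpty()) {
                out.add(t);
            }
        }
        return out;
    }

    // Jaccard index: |A intersect B| / |A union B|, using exact token equality.
    static double jaccard(Set<String> a, Set<String> b) {
        if (a.isEmpty() && b.isEmpty()) {
            return 1.0; // convention chosen for this sketch
        }
        Set<String> intersection = new HashSet<>(a);
        intersection.retainAll(b);
        Set<String> union = new HashSet<>(a);
        union.addAll(b);
        return (double) intersection.size() / union.size();
    }

    public static void main(String[] args) {
        // Two shared tokens out of four distinct tokens: prints 0.5
        System.out.println(jaccard(tokens("acme trading ltd"),
                                   tokens("acme trading limited")));
    }
}
```

Replacing the exact equality test with a fuzzy per-token comparison is what the entry means by a sub-comparator; that variation is not shown here.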

K

KeyValueDatabase - Class in no.priv.garshol.duke
A database that uses a key-value store to index and find records.
KeyValueDatabase(Configuration, DatabaseProperties) - Constructor for class no.priv.garshol.duke.KeyValueDatabase
 
KeyValueDatabase.PriorityQueue - Class in no.priv.garshol.duke
 
KeyValueDatabase.PriorityQueue(KeyValueDatabase.Score[]) - Constructor for class no.priv.garshol.duke.KeyValueDatabase.PriorityQueue
 
KeyValueDatabase.Score - Class in no.priv.garshol.duke
 
KeyValueDatabase.Score(long) - Constructor for class no.priv.garshol.duke.KeyValueDatabase.Score
 
KeyValueStore - Interface in no.priv.garshol.duke
An interface encapsulating the way KeyValueDatabase interacts with the underlying database.

L

level - Variable in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
Levenshtein - Class in no.priv.garshol.duke.comparators
An implementation of the Levenshtein distance metric.
Levenshtein() - Constructor for class no.priv.garshol.duke.comparators.Levenshtein
 
Link - Class in no.priv.garshol.duke
Immutable representation of a link between two identities.
Link(String, String, LinkStatus, LinkKind) - Constructor for class no.priv.garshol.duke.Link
 
Link(String, String, LinkStatus, LinkKind, long) - Constructor for class no.priv.garshol.duke.Link
 
link() - Method in class no.priv.garshol.duke.Processor
Does record linkage across the two groups, but does not link records within each group.
link(Collection<DataSource>, Collection<DataSource>, int) - Method in class no.priv.garshol.duke.Processor
Does record linkage across the two groups, but does not link records within each group.
link(Collection<DataSource>, Collection<DataSource>, boolean, int) - Method in class no.priv.garshol.duke.Processor
Does record linkage across the two groups, but does not link records within each group.
LinkDatabase - Interface in no.priv.garshol.duke
A LinkDatabase is a class which can keep track of links between entities.
LinkDatabaseMatchListener - Class in no.priv.garshol.duke.matchers
Maintains a LinkDatabase of the recorded matches.
LinkDatabaseMatchListener(Configuration, LinkDatabase) - Constructor for class no.priv.garshol.duke.matchers.LinkDatabaseMatchListener
 
LinkDatabaseUtils - Class in no.priv.garshol.duke.utils
Utilities for dealing with link databases.
LinkDatabaseUtils() - Constructor for class no.priv.garshol.duke.utils.LinkDatabaseUtils
 
LinkFileOracle - Class in no.priv.garshol.duke.genetic
This oracle looks up the answer in a link file.
LinkFileOracle(String) - Constructor for class no.priv.garshol.duke.genetic.LinkFileOracle
 
LinkFileWriter - Class in no.priv.garshol.duke.utils
Utility class for writing link files.
LinkFileWriter(Writer) - Constructor for class no.priv.garshol.duke.utils.LinkFileWriter
 
LinkFileWriter(Writer, Configuration) - Constructor for class no.priv.garshol.duke.utils.LinkFileWriter
 
LinkKind - Enum in no.priv.garshol.duke
Represents the meaning of a link between two identities.
linkRecords(Collection<DataSource>) - Method in class no.priv.garshol.duke.Processor
Retrieve new records from data sources, and match them to previously indexed records.
linkRecords(Collection<DataSource>, boolean) - Method in class no.priv.garshol.duke.Processor
Retrieve new records from data sources, and match them to previously indexed records.
linkRecords(Collection<DataSource>, boolean, int) - Method in class no.priv.garshol.duke.Processor
Retrieve new records from data sources, and match them to previously indexed records.
LinkSource - Interface in no.priv.garshol.duke
Experimental interface for retrieving link information from outside sources for use inside the Duke processing.
LinkStatus - Enum in no.priv.garshol.duke
Represents the status of a link between two identities.
load(String) - Static method in class no.priv.garshol.duke.ConfigLoader
Note that if the file name starts with 'classpath:', the resource is looked up on the classpath instead.
load(String) - Static method in class no.priv.garshol.duke.utils.TestFileUtils
 
loadResultSet(InputSource) - Static method in class no.priv.garshol.duke.utils.SparqlClient
 
loadTestFile(String) - Static method in class no.priv.garshol.duke.utils.LinkDatabaseUtils
Loads a test file into an in-memory link database.
loadTestFile(String, LinkDatabase) - Static method in class no.priv.garshol.duke.utils.LinkDatabaseUtils
Loads a test file into an in-memory link database.
logger - Variable in class no.priv.garshol.duke.datasources.ColumnarDataSource
 
Logger - Interface in no.priv.garshol.duke
Experimental attempt at internal log handling which works naturally on the command-line, doesn't introduce dependencies, and at the same time allows integration with a full logging system.
lookup(Property, String) - Method in class no.priv.garshol.duke.LuceneDatabase
 
lookupToken(String, String) - Method in class no.priv.garshol.duke.InMemoryKeyValueStore
 
lookupToken(String, String) - Method in interface no.priv.garshol.duke.KeyValueStore
Returns the IDs of all records which have the given token in a value for this property.
LowerCaseNormalizeCleaner - Class in no.priv.garshol.duke.cleaners
A cleaner which removes leading and trailing whitespace, normalizes internal whitespace, lowercases all characters, and (by default) strips accents.
LowerCaseNormalizeCleaner() - Constructor for class no.priv.garshol.duke.cleaners.LowerCaseNormalizeCleaner
 
LowProbabilityAspect - Class in no.priv.garshol.duke.genetic
Sets the low probability.
LowProbabilityAspect(Property) - Constructor for class no.priv.garshol.duke.genetic.LowProbabilityAspect
 
LuceneDatabase - Class in no.priv.garshol.duke
Represents the Lucene index, and implements record linkage services on top of it.
LuceneDatabase(Configuration, boolean, DatabaseProperties) - Constructor for class no.priv.garshol.duke.LuceneDatabase
 
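Example (illustrative only): the Levenshtein entry above names the Levenshtein distance metric. The sketch below computes the plain edit distance with the standard two-row dynamic programming table. It is not Duke's Levenshtein class and makes no claim about how that class turns a distance into a similarity score; the class name LevenshteinSketch is an assumption made for this example.

```java
// Illustrative sketch only; not Duke's Levenshtein comparator.
final class LevenshteinSketch {

    // Minimum number of single-character insertions, deletions,
    // and substitutions needed to turn s into t.
    static int distance(String s, String t) {
        int[] prev = new int[t.length() + 1];
        int[] curr = new int[t.length() + 1];

        // Turning the empty prefix of s into t[0..j) takes j insertions.
        for (int j = 0; j <= t.length(); j++) {
            prev[j] = j;
        }

        for (int i = 1; i <= s.length(); i++) {
            curr[0] = i; // i deletions to reach the empty prefix of t
            for (int j = 1; j <= t.length(); j++) {
                int cost = s.charAt(i - 1) == t.charAt(j - 1) ? 0 : 1;
                int insert = curr[j - 1] + 1;
                int delete = prev[j] + 1;
                int substitute = prev[j - 1] + cost;
                curr[j] = Math.min(Math.min(insert, delete), substitute);
            }
            // Reuse the two rows instead of allocating a full table.
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[t.length()];
    }

    public static void main(String[] args) {
        System.out.println(distance("kitten", "sitting")); // prints 3
    }
}
```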

M

main(String[]) - Static method in class no.priv.garshol.duke.DebugCompare
 
main(String[]) - Static method in class no.priv.garshol.duke.Duke
 
main(String[]) - Static method in class no.priv.garshol.duke.genetic.Driver
 
main(String[]) - Static method in class no.priv.garshol.duke.RecordSearch
 
main_(String[]) - Static method in class no.priv.garshol.duke.Duke
 
makeNewRecordId() - Method in class no.priv.garshol.duke.InMemoryKeyValueStore
 
makeNewRecordId() - Method in interface no.priv.garshol.duke.KeyValueStore
Returns a new internal record ID.
makeRandomCopy() - Method in class no.priv.garshol.duke.genetic.GeneticConfiguration
Returns a randomized copy of the configuration.
makeStartLineBuffer(int) - Method in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
makeSubelements(int) - Method in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
MappingFileCleaner - Class in no.priv.garshol.duke.cleaners
A cleaner which loads a mapping file in CSV format and maps values according to that file.
MappingFileCleaner() - Constructor for class no.priv.garshol.duke.cleaners.MappingFileCleaner
 
Matcher - Class in no.priv.garshol.duke.comparators
Wraps the input string to simplify the code.
Matcher(String) - Constructor for class no.priv.garshol.duke.comparators.Matcher
 
matches(Record, Record, double) - Method in class no.priv.garshol.duke.genetic.ExemplarsTracker
 
matches(Record, Record, double) - Method in class no.priv.garshol.duke.matchers.AbstractMatchListener
 
matches(Record, Record, double) - Method in class no.priv.garshol.duke.matchers.ClassDatabaseMatchListener
 
matches(Record, Record, double) - Method in class no.priv.garshol.duke.matchers.LinkDatabaseMatchListener
 
matches(Record, Record, double) - Method in interface no.priv.garshol.duke.matchers.MatchListener
Notification that the two records match.
matches(Record, Record, double) - Method in class no.priv.garshol.duke.matchers.PrintMatchListener
 
matches(Record, Record, double) - Method in class no.priv.garshol.duke.matchers.TestFileListener
 
matchesPerhaps(Record, Record, double) - Method in class no.priv.garshol.duke.matchers.AbstractMatchListener
 
matchesPerhaps(Record, Record, double) - Method in class no.priv.garshol.duke.matchers.LinkDatabaseMatchListener
 
matchesPerhaps(Record, Record, double) - Method in interface no.priv.garshol.duke.matchers.MatchListener
Notification that the two records might match.
matchesPerhaps(Record, Record, double) - Method in class no.priv.garshol.duke.matchers.PrintMatchListener
 
MatchListener - Interface in no.priv.garshol.duke.matchers
Interface implemented by code which can receive notifications that two records are considered to match.
mateWith(GeneticConfiguration) - Method in class no.priv.garshol.duke.genetic.GeneticConfiguration
Mates this configuration with another configuration.
merge(Record) - Method in class no.priv.garshol.duke.DocumentRecord
 
merge(Record) - Method in interface no.priv.garshol.duke.Record
Merges the other record into this one.
merge(Record) - Method in class no.priv.garshol.duke.RecordImpl
 
metaphone(String) - Static method in class no.priv.garshol.duke.comparators.MetaphoneComparator
Produces the Metaphone key for the given string.
MetaphoneComparator - Class in no.priv.garshol.duke.comparators
An implementation of the Metaphone algorithm, and a comparator which considers strings to have a score of 0.9 if their Metaphone values match.
MetaphoneComparator() - Constructor for class no.priv.garshol.duke.comparators.MetaphoneComparator
 
mutate() - Method in class no.priv.garshol.duke.genetic.GeneticConfiguration
Makes one random change to the configuration.

N

newRecord() - Method in class no.priv.garshol.duke.datasources.RecordBuilder
 
next() - Method in class no.priv.garshol.duke.comparators.Matcher
 
next() - Method in class no.priv.garshol.duke.datasources.CSVDataSource.CSVRecordIterator
 
next() - Method in class no.priv.garshol.duke.datasources.JDBCDataSource.JDBCIterator
 
next() - Method in class no.priv.garshol.duke.KeyValueDatabase.PriorityQueue
 
next() - Method in class no.priv.garshol.duke.utils.CSVReader
 
next() - Method in class no.priv.garshol.duke.utils.DefaultRecordIterator
 
nextfree - Variable in class no.priv.garshol.duke.Bucket
 
nextIsLast() - Method in class no.priv.garshol.duke.comparators.Matcher
 
no.priv.garshol.duke - package no.priv.garshol.duke
The main Duke API is here.
no.priv.garshol.duke.cleaners - package no.priv.garshol.duke.cleaners
Duke's built-in cleaners, plus utility classes for building your own cleaners.
no.priv.garshol.duke.comparators - package no.priv.garshol.duke.comparators
Duke's built-in comparators.
no.priv.garshol.duke.datasources - package no.priv.garshol.duke.datasources
Duke's built-in data sources.
no.priv.garshol.duke.examples - package no.priv.garshol.duke.examples
This package contains helper classes for the examples, like cleaners and comparators which are too specific to include as part of Duke.
no.priv.garshol.duke.genetic - package no.priv.garshol.duke.genetic
Implementation of the genetic algorithm for automatically creating Duke configurations.
no.priv.garshol.duke.matchers - package no.priv.garshol.duke.matchers
Duke's event listeners for receiving matches between records.
no.priv.garshol.duke.server - package no.priv.garshol.duke.server
Contains classes for running Duke as an app in a servlet container, allowing it to incrementally process new and changed data as it arrives from a remote service.
no.priv.garshol.duke.utils - package no.priv.garshol.duke.utils
Various helper classes used by Duke.
noMatchFor(Record) - Method in class no.priv.garshol.duke.matchers.AbstractMatchListener
 
noMatchFor(Record) - Method in class no.priv.garshol.duke.matchers.LinkDatabaseMatchListener
 
noMatchFor(Record) - Method in interface no.priv.garshol.duke.matchers.MatchListener
Called if no link is found for the record.
noMatchFor(Record) - Method in class no.priv.garshol.duke.matchers.PrintMatchListener
 
noMatchFor(Record) - Method in class no.priv.garshol.duke.matchers.TestFileListener
 
normalizeWS(String) - Static method in class no.priv.garshol.duke.utils.StringUtils
Removes trailing and leading whitespace, and also reduces each sequence of internal whitespace to a single space.
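As a rough illustration of the behaviour described for normalizeWS, the sketch below trims the ends and collapses each internal whitespace run to one space. It is not Duke's implementation: the class name NormalizeWsSketch and the method normalizeWs are hypothetical, and the exact set of characters Duke treats as whitespace is an assumption.

```java
public class NormalizeWsSketch {
  // Hypothetical stand-in for StringUtils.normalizeWS; not Duke's actual code.
  // Assumes "whitespace" means what String.trim() and the regex class \s cover.
  static String normalizeWs(String value) {
    // trim() drops leading and trailing whitespace,
    // then every remaining run of whitespace becomes a single space
    return value.trim().replaceAll("\\s+", " ");
  }

  public static void main(String[] args) {
    System.out.println("[" + normalizeWs("  a \t b\n c ") + "]"); // prints "[a b c]"
  }
}
```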
norphone(String) - Static method in class no.priv.garshol.duke.comparators.NorphoneComparator
Produces the Norphone key for the given string.
NorphoneComparator - Class in no.priv.garshol.duke.comparators
My own algorithm for phonetic matching of Norwegian names, inspired by Metaphone.
NorphoneComparator() - Constructor for class no.priv.garshol.duke.comparators.NorphoneComparator
 
NorwegianAddressCleaner - Class in no.priv.garshol.duke.cleaners
 
NorwegianAddressCleaner() - Constructor for class no.priv.garshol.duke.cleaners.NorwegianAddressCleaner
 
NorwegianCompanyNameCleaner - Class in no.priv.garshol.duke.cleaners
 
NorwegianCompanyNameCleaner() - Constructor for class no.priv.garshol.duke.cleaners.NorwegianCompanyNameCleaner
 
NTriplesDataSource - Class in no.priv.garshol.duke.datasources
A data source which can read RDF data from NTriples files.
NTriplesDataSource() - Constructor for class no.priv.garshol.duke.datasources.NTriplesDataSource
 
NTriplesParser - Class in no.priv.garshol.duke.utils
A basic NTriples parser used by NTriplesDataSource.
NTriplesParser(StatementHandler) - Constructor for class no.priv.garshol.duke.utils.NTriplesParser
Alternate entry point to the parser for when the driving loop is outside the parser.
NTriplesWriter - Class in no.priv.garshol.duke.utils
A simplified NTriples serializer, used for writing the link file to NTriples format.
NTriplesWriter(OutputStream) - Constructor for class no.priv.garshol.duke.utils.NTriplesWriter
 
NumericComparator - Class in no.priv.garshol.duke.comparators
Comparator which compares two values numerically.
NumericComparator() - Constructor for class no.priv.garshol.duke.comparators.NumericComparator
 

O

ObjectUtils - Class in no.priv.garshol.duke.utils
 
ObjectUtils() - Constructor for class no.priv.garshol.duke.utils.ObjectUtils
 
offset - Variable in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
open(String) - Static method in class no.priv.garshol.duke.utils.JDBCUtils
Get a configured database connection via JNDI.
open(String, String, Properties) - Static method in class no.priv.garshol.duke.utils.JDBCUtils
Opens a JDBC connection with the given parameters.
openSearchers() - Method in class no.priv.garshol.duke.LuceneDatabase
 
Oracle - Interface in no.priv.garshol.duke.genetic
An oracle can say whether a given match is correct or not.
overlap(int, Set<String>, Set<String>) - Static method in class no.priv.garshol.duke.comparators.QGramComparator
 
overrides(Link) - Method in class no.priv.garshol.duke.Link
Returns true if the information in this link should take precedence over the information in the other link.

P

pagesize - Variable in class no.priv.garshol.duke.datasources.SparqlDataSource
 
Pair - Class in no.priv.garshol.duke.genetic
Represents a pair of records.
Pair(String, String) - Constructor for class no.priv.garshol.duke.genetic.Pair
 
parse(String[]) - Method in class no.priv.garshol.duke.utils.CommandLineParser
 
parse(Reader, StatementHandler) - Static method in class no.priv.garshol.duke.utils.NTriplesParser
Reads the NTriples file from the reader, pushing statements into the handler.
parseLine(String) - Method in class no.priv.garshol.duke.utils.NTriplesParser
Push a line into the parser.
parseTokens(BooleanQuery, String, String, boolean) - Method in class no.priv.garshol.duke.LuceneDatabase
Parses a Lucene query.
PersonNameCleaner - Class in no.priv.garshol.duke.cleaners
Experimental cleaner for person names, which understands abbreviations like "joe" for "joseph", etc.
PersonNameCleaner() - Constructor for class no.priv.garshol.duke.cleaners.PersonNameCleaner
 
PersonNameComparator - Class in no.priv.garshol.duke.comparators
A comparator which knows about comparing names.
PersonNameComparator() - Constructor for class no.priv.garshol.duke.comparators.PersonNameComparator
 
PhoneNumberCleaner - Class in no.priv.garshol.duke.cleaners
Cleaner for international phone numbers.
PhoneNumberCleaner() - Constructor for class no.priv.garshol.duke.cleaners.PhoneNumberCleaner
 
pickRandomConfig() - Method in class no.priv.garshol.duke.genetic.GeneticPopulation
Returns a random configuration.
positionalTokens(String, int) - Static method in class no.priv.garshol.duke.comparators.QGramComparator
Produces positional q-grams, so that 'gail' -> 'ga1', 'ai2', 'il3'.
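The 'gail' example above can be reproduced with a minimal sketch like the following. The class PositionalQGramSketch and the method positionalQGrams are hypothetical names, and the return type (an insertion-ordered Set<String>) is an assumption; only the example mapping for q = 2 comes from the description.

```java
import java.util.LinkedHashSet;
import java.util.Set;

public class PositionalQGramSketch {
  // Hypothetical stand-in for QGramComparator.positionalTokens; not Duke's actual code.
  // Each q-gram is suffixed with its 1-based starting position in the string.
  static Set<String> positionalQGrams(String s, int q) {
    Set<String> tokens = new LinkedHashSet<>();
    for (int ix = 0; ix + q <= s.length(); ix++) {
      tokens.add(s.substring(ix, ix + q) + (ix + 1));
    }
    return tokens;
  }

  public static void main(String[] args) {
    System.out.println(positionalQGrams("gail", 2)); // prints "[ga1, ai2, il3]"
  }
}
```

Because the position is part of each token, the same q-gram occurring at different offsets in two strings does not count as shared.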
prettyCompare(Record, Record, double, String, List<Property>) - Static method in class no.priv.garshol.duke.matchers.PrintMatchListener
 
prettyPrint(Record, List<Property>) - Static method in class no.priv.garshol.duke.matchers.PrintMatchListener
 
previousOneOf(String) - Method in class no.priv.garshol.duke.comparators.Matcher
 
PrintMatchListener - Class in no.priv.garshol.duke.matchers
Match listener which prints events to standard out.
PrintMatchListener(boolean, boolean, boolean, boolean, List<Property>, boolean) - Constructor for class no.priv.garshol.duke.matchers.PrintMatchListener
Creates a new listener.
process() - Method in class no.priv.garshol.duke.server.DukeController
Runs the record linkage process.
processingInstruction(String, String) - Method in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
Processor - Class in no.priv.garshol.duke
The class that implements the actual deduplication and record linkage logic.
Processor(Configuration) - Constructor for class no.priv.garshol.duke.Processor
Creates a new processor, overwriting the existing Lucene index.
Processor(Configuration, boolean) - Constructor for class no.priv.garshol.duke.Processor
Creates a new processor.
Processor(Configuration, Database) - Constructor for class no.priv.garshol.duke.Processor
Creates a new processor, bound to the given database.
Processor.Profiler - Class in no.priv.garshol.duke
 
Processor.Profiler() - Constructor for class no.priv.garshol.duke.Processor.Profiler
 
Property - Interface in no.priv.garshol.duke
Represents a property.
Property.Lookup - Enum in no.priv.garshol.duke
The lookup behaviour for this property.
PropertyImpl - Class in no.priv.garshol.duke
The default implementation of the Property interface.
PropertyImpl(String) - Constructor for class no.priv.garshol.duke.PropertyImpl
 
PropertyImpl(String, Comparator, double, double) - Constructor for class no.priv.garshol.duke.PropertyImpl
 
PropertyUtils - Class in no.priv.garshol.duke.utils
Utilities for making Java Properties objects easier to deal with.
PropertyUtils() - Constructor for class no.priv.garshol.duke.utils.PropertyUtils
 

Q

QGramComparator - Class in no.priv.garshol.duke.comparators
An implementation of q-grams comparison that can tokenize a few different ways, and also use a couple different formulas to compute the final score.
QGramComparator() - Constructor for class no.priv.garshol.duke.comparators.QGramComparator
 
QGramComparator.Formula - Enum in no.priv.garshol.duke.comparators
Represents the different formulas we can use to compute similarity.
QGramComparator.Tokenizer - Enum in no.priv.garshol.duke.comparators
Represents the different ways we can tokenize a string into a set of q-grams for a given q.
qgrams(String) - Method in class no.priv.garshol.duke.comparators.QGramComparator
 
queryForInt(Statement, String, int) - Static method in class no.priv.garshol.duke.utils.JDBCUtils
Runs a query that returns a single int.
queryHasResult(Statement, String) - Static method in class no.priv.garshol.duke.utils.JDBCUtils
Returns true if the query result has at least one row.

R

RDBMSLinkDatabase - Class in no.priv.garshol.duke
An abstract SQL-based link database implementation which can maintain a set of links in an H2 or Oracle database over JDBC.
RDBMSLinkDatabase(String) - Constructor for class no.priv.garshol.duke.RDBMSLinkDatabase
 
RDBMSLinkDatabase.DatabaseType - Enum in no.priv.garshol.duke
 
Record - Interface in no.priv.garshol.duke
Represents a record, which may be a single source record from a data source, or a record created from merging data from many records.
RecordBuilder - Class in no.priv.garshol.duke.datasources
Helper class for building records, to avoid having to copy all the cleaning logic etc in each single data source.
RecordBuilder(ColumnarDataSource) - Constructor for class no.priv.garshol.duke.datasources.RecordBuilder
 
RecordImpl - Class in no.priv.garshol.duke
The default implementation of the Record interface.
RecordImpl(Map<String, Collection<String>>) - Constructor for class no.priv.garshol.duke.RecordImpl
 
RecordImpl() - Constructor for class no.priv.garshol.duke.RecordImpl
 
RecordIterator - Class in no.priv.garshol.duke
Special Iterator class for Record collections, in order to add some extra methods for resource management.
RecordIterator() - Constructor for class no.priv.garshol.duke.RecordIterator
 
records - Variable in class no.priv.garshol.duke.Bucket
 
RecordSearch - Class in no.priv.garshol.duke
Search for records and display the matching ones.
RecordSearch() - Constructor for class no.priv.garshol.duke.RecordSearch
 
RegexpCleaner - Class in no.priv.garshol.duke.cleaners
Cleaner which returns the part of the input string matched by either the entire regular expression or a group in the regexp.
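The idea of returning either the whole match or one group can be sketched with java.util.regex alone. This is not RegexpCleaner itself: the class RegexpExtractSketch and the method extract are hypothetical, and returning null when nothing matches is an assumption about the desired behaviour.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class RegexpExtractSketch {
  // Hypothetical helper, not Duke's RegexpCleaner.
  // group 0 means the entire match; group n means the n-th capturing group.
  static String extract(String regexp, int group, String value) {
    Matcher m = Pattern.compile(regexp).matcher(value);
    if (!m.find()) {
      return null; // assumption: no match yields no value
    }
    return m.group(group);
  }

  public static void main(String[] args) {
    String re = "(\\d{4}) (\\w+)";
    System.out.println(extract(re, 0, "zip 0150 Oslo")); // prints "0150 Oslo"
    System.out.println(extract(re, 1, "zip 0150 Oslo")); // prints "0150"
  }
}
```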
RegexpCleaner() - Constructor for class no.priv.garshol.duke.cleaners.RegexpCleaner
 
registerId(long, String) - Method in class no.priv.garshol.duke.InMemoryKeyValueStore
 
registerId(long, String) - Method in interface no.priv.garshol.duke.KeyValueStore
Records that this external ID refers to the given internal record ID.
registerOption(CommandLineParser.Option) - Method in class no.priv.garshol.duke.utils.CommandLineParser
 
registerRecord(long, Record) - Method in class no.priv.garshol.duke.InMemoryKeyValueStore
 
registerRecord(long, Record) - Method in interface no.priv.garshol.duke.KeyValueStore
Stores the entire record under the given internal record ID.
registerToken(long, String, String) - Method in class no.priv.garshol.duke.InMemoryKeyValueStore
 
registerToken(long, String, String) - Method in interface no.priv.garshol.duke.KeyValueStore
Records that the given token occurred in the given record.
remove(String) - Method in class no.priv.garshol.duke.RecordImpl
 
remove() - Method in class no.priv.garshol.duke.RecordIterator
 
removeMatchListener(MatchListener) - Method in class no.priv.garshol.duke.Processor
Removes a listener from being notified of the processing events.
replaceAnyOf(String, String, char) - Static method in class no.priv.garshol.duke.utils.StringUtils
Replaces all characters in the second parameter found in the first parameter with the final character.
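Read literally, the description of replaceAnyOf means: every character of the first parameter that also occurs in the second parameter is replaced by the third. A minimal sketch under that reading follows; the class ReplaceAnyOfSketch is hypothetical and this is not Duke's actual code.

```java
public class ReplaceAnyOfSketch {
  // Hypothetical stand-in for StringUtils.replaceAnyOf; not Duke's actual code.
  static String replaceAnyOf(String value, String chars, char replacement) {
    StringBuilder buf = new StringBuilder(value.length());
    for (int ix = 0; ix < value.length(); ix++) {
      char ch = value.charAt(ix);
      // replace ch only if it is one of the listed characters
      buf.append(chars.indexOf(ch) >= 0 ? replacement : ch);
    }
    return buf.toString();
  }

  public static void main(String[] args) {
    System.out.println(replaceAnyOf("a-b_c", "-_", ' ')); // prints "a b c"
  }
}
```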
retract() - Method in class no.priv.garshol.duke.Link
Changes the link status to retracted, and updates the timestamp.
run() - Method in class no.priv.garshol.duke.genetic.GeneticAlgorithm
Actually runs the genetic algorithm.
run() - Method in class no.priv.garshol.duke.server.BasicTimer
 
runQuery(String, String) - Method in class no.priv.garshol.duke.datasources.SparqlDataSource
An extension point so we can control how the query gets executed.

S

score - Variable in class no.priv.garshol.duke.KeyValueDatabase.Score
 
setAcceptTypes(String) - Method in class no.priv.garshol.duke.datasources.NTriplesDataSource
 
setActive(boolean) - Method in class no.priv.garshol.duke.genetic.GeneticAlgorithm
 
setBeanProperty(Object, String, String, Map<String, Object>) - Static method in class no.priv.garshol.duke.utils.ObjectUtils
Calls the named bean setter property on the object, converting the given value to the correct type.
setComparator(Comparator) - Method in class no.priv.garshol.duke.comparators.DiceCoefficientComparator
 
setComparator(Comparator) - Method in class no.priv.garshol.duke.comparators.JaccardIndexComparator
 
setComparator(Comparator) - Method in interface no.priv.garshol.duke.Property
Sets the comparator used for this property.
setComparator(Comparator) - Method in class no.priv.garshol.duke.PropertyImpl
Sets the comparator used for this property.
setConfigOutput(String) - Method in class no.priv.garshol.duke.genetic.GeneticAlgorithm
 
setConnectionString(String) - Method in class no.priv.garshol.duke.datasources.JDBCDataSource
 
setDatabaseImplementation(String) - Method in class no.priv.garshol.duke.DatabaseProperties
 
setDigitWeight(double) - Method in class no.priv.garshol.duke.comparators.WeightedLevenshtein.DefaultWeightEstimator
 
setDiscardGroup(boolean) - Method in class no.priv.garshol.duke.cleaners.RegexpCleaner
 
setDocumentLocator(Locator) - Method in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
setDoInference(boolean) - Method in class no.priv.garshol.duke.InMemoryLinkDatabase
 
setDriverClass(String) - Method in class no.priv.garshol.duke.datasources.JDBCDataSource
 
setDropControlCharacters(boolean) - Method in class no.priv.garshol.duke.utils.XMLPrettyPrinter
INTERNAL: If this property is true, control characters are dropped from the resulting document.
setEncodeCharactersFrom(int) - Method in class no.priv.garshol.duke.utils.XMLPrettyPrinter
INTERNAL: Encodes element content as decimal character entities for characters from the given character number.
setEncoding(String) - Method in class no.priv.garshol.duke.datasources.CSVDataSource
 
setEndpoint(String) - Method in class no.priv.garshol.duke.datasources.SparqlDataSource
 
setEstimator(WeightedLevenshtein.WeightEstimator) - Method in class no.priv.garshol.duke.comparators.WeightedLevenshtein
 
setFNumber(double) - Method in class no.priv.garshol.duke.genetic.GeneticConfiguration
Sets the F-score of this configuration.
setFormula(QGramComparator.Formula) - Method in class no.priv.garshol.duke.comparators.QGramComparator
Tells the comparator what formula to use to compute the actual similarity.
setFromOther(Configuration, Configuration) - Method in class no.priv.garshol.duke.genetic.Aspect
Set this aspect of the configuration to be the same as that of the other configuration.
setFromOther(Configuration, Configuration) - Method in class no.priv.garshol.duke.genetic.ComparatorAspect
 
setFromOther(Configuration, Configuration) - Method in class no.priv.garshol.duke.genetic.HighProbabilityAspect
 
setFromOther(Configuration, Configuration) - Method in class no.priv.garshol.duke.genetic.LowProbabilityAspect
 
setFromOther(Configuration, Configuration) - Method in class no.priv.garshol.duke.genetic.ThresholdAspect
 
setGenerations(int) - Method in class no.priv.garshol.duke.genetic.GeneticAlgorithm
 
setGeneric(String) - Method in class no.priv.garshol.duke.cleaners.GenericValueCleaner
 
setGroup(int) - Method in class no.priv.garshol.duke.cleaners.RegexpCleaner
 
setHeaderLine(boolean) - Method in class no.priv.garshol.duke.datasources.CSVDataSource
 
setHighProbability(double) - Method in interface no.priv.garshol.duke.Property
Sets the high probability used for this property.
setHighProbability(double) - Method in class no.priv.garshol.duke.PropertyImpl
Sets the high probability used for this property.
setIgnoreProperty(boolean) - Method in interface no.priv.garshol.duke.Property
Makes Duke skip this property when comparing records.
setIgnoreProperty(boolean) - Method in class no.priv.garshol.duke.PropertyImpl
Makes Duke skip this property when comparing records.
setIncrementalMode(boolean) - Method in class no.priv.garshol.duke.datasources.NTriplesDataSource
 
setInputFile(String) - Method in class no.priv.garshol.duke.datasources.CSVDataSource
 
setInputFile(String) - Method in class no.priv.garshol.duke.datasources.NTriplesDataSource
 
setJndiPath(String) - Method in class no.priv.garshol.duke.datasources.JNDIDataSource
 
setLetterWeight(double) - Method in class no.priv.garshol.duke.comparators.WeightedLevenshtein.DefaultWeightEstimator
 
setLinkFile(String) - Method in class no.priv.garshol.duke.genetic.ConsoleOracle
 
setLinkFile(String) - Method in class no.priv.garshol.duke.genetic.GeneticAlgorithm
 
setLogger(Logger) - Method in interface no.priv.garshol.duke.DataSource
Gives the data source a logger to report diagnostic information to.
setLogger(Logger) - Method in class no.priv.garshol.duke.datasources.ColumnarDataSource
 
setLogger(Logger) - Method in class no.priv.garshol.duke.datasources.InMemoryDataSource
 
setLogger(Logger) - Method in class no.priv.garshol.duke.Processor
Sets the logger to report to.
setLogger(Logger) - Method in class no.priv.garshol.duke.RDBMSLinkDatabase
 
setLookupBehaviour(Property.Lookup) - Method in interface no.priv.garshol.duke.Property
Sets the lookup behaviour of this property.
setLookupBehaviour(Property.Lookup) - Method in class no.priv.garshol.duke.PropertyImpl
Sets the lookup behaviour of this property.
setLowProbability(double) - Method in interface no.priv.garshol.duke.Property
Sets the low probability used for this property.
setLowProbability(double) - Method in class no.priv.garshol.duke.PropertyImpl
Sets the low probability used for this property.
setMappingFile(String) - Method in class no.priv.garshol.duke.cleaners.MappingFileCleaner
 
setMaxDistance(double) - Method in class no.priv.garshol.duke.comparators.GeopositionComparator
 
setMaximumArguments(int) - Method in class no.priv.garshol.duke.utils.CommandLineParser
 
setMaxSearchHits(int) - Method in class no.priv.garshol.duke.DatabaseProperties
 
setMaybeThreshold(double) - Method in class no.priv.garshol.duke.ConfigurationImpl
Sets the probability threshold for considering two records possibly equivalent.
setMinimumArguments(int) - Method in class no.priv.garshol.duke.utils.CommandLineParser
 
setMinRatio(double) - Method in class no.priv.garshol.duke.comparators.NumericComparator
 
setMinRelevance(float) - Method in class no.priv.garshol.duke.DatabaseProperties
 
setNewGeneration(List<GeneticConfiguration>) - Method in class no.priv.garshol.duke.genetic.GeneticPopulation
 
setOtherWeight(double) - Method in class no.priv.garshol.duke.comparators.WeightedLevenshtein.DefaultWeightEstimator
 
setOutput(Writer) - Method in class no.priv.garshol.duke.Processor.Profiler
Sets the Writer to receive performance statistics.
setPageSize(int) - Method in class no.priv.garshol.duke.datasources.SparqlDataSource
If pagesize is set to 0, paging is disabled.
setPassword(String) - Method in class no.priv.garshol.duke.datasources.JDBCDataSource
 
setPath(String) - Method in class no.priv.garshol.duke.ConfigurationImpl
The path to the Lucene index directory.
setPerformanceProfiling(boolean) - Method in class no.priv.garshol.duke.Processor
Used to turn performance profiling on and off.
setPessimistic(boolean) - Method in class no.priv.garshol.duke.matchers.TestFileListener
 
setPopulation(int) - Method in class no.priv.garshol.duke.genetic.GeneticAlgorithm
 
setProperties(List<Property>) - Method in class no.priv.garshol.duke.ConfigurationImpl
The set of properties Duke is to work with.
setPunctuationWeight(double) - Method in class no.priv.garshol.duke.comparators.WeightedLevenshtein.DefaultWeightEstimator
 
setQ(int) - Method in class no.priv.garshol.duke.comparators.QGramComparator
Sets the value of q, that is, the size of the q-grams.
setQuery(String) - Method in class no.priv.garshol.duke.datasources.JDBCDataSource
 
setQuery(String) - Method in class no.priv.garshol.duke.datasources.SparqlDataSource
 
setQuestions(int) - Method in class no.priv.garshol.duke.genetic.GeneticAlgorithm
 
setQuiet(boolean) - Method in class no.priv.garshol.duke.matchers.TestFileListener
 
setRandomly(Configuration) - Method in class no.priv.garshol.duke.genetic.Aspect
Randomly modify this aspect of the Configuration.
setRandomly(Configuration) - Method in class no.priv.garshol.duke.genetic.ComparatorAspect
 
setRandomly(Configuration) - Method in class no.priv.garshol.duke.genetic.HighProbabilityAspect
 
setRandomly(Configuration) - Method in class no.priv.garshol.duke.genetic.LowProbabilityAspect
 
setRandomly(Configuration) - Method in class no.priv.garshol.duke.genetic.ThresholdAspect
 
setRank(int) - Method in class no.priv.garshol.duke.genetic.GeneticConfiguration
Sets the rank of this configuration within its generation.
setReader(Reader) - Method in class no.priv.garshol.duke.datasources.CSVDataSource
 
setReader(Reader) - Method in class no.priv.garshol.duke.datasources.NTriplesDataSource
 
setRegexp(String) - Method in class no.priv.garshol.duke.cleaners.RegexpCleaner
 
setSize(int) - Method in class no.priv.garshol.duke.genetic.GeneticPopulation
Sets the size of the population.
setSkipLines(int) - Method in class no.priv.garshol.duke.datasources.CSVDataSource
 
setSplitOn(String) - Method in class no.priv.garshol.duke.datasources.Column
 
setStrict(boolean) - Method in class no.priv.garshol.duke.comparators.GeopositionComparator
 
setStripAccents(boolean) - Method in class no.priv.garshol.duke.cleaners.LowerCaseNormalizeCleaner
Controls whether accents are stripped (that is, "é" becomes "e", and so on).
setTablePrefix(String) - Method in class no.priv.garshol.duke.RDBMSLinkDatabase
 
setThreads(int) - Method in class no.priv.garshol.duke.genetic.GeneticAlgorithm
 
setThreads(int) - Method in class no.priv.garshol.duke.Processor
Sets the number of threads to use for processing.
setThreshold(double) - Method in interface no.priv.garshol.duke.Configuration
Sets the threshold.
setThreshold(double) - Method in class no.priv.garshol.duke.ConfigurationImpl
Sets the probability threshold for considering two records equivalent.
setTokenizer(QGramComparator.Tokenizer) - Method in class no.priv.garshol.duke.comparators.QGramComparator
Tells the comparator what tokenizer to use to produce q-grams.
setTripleMode(boolean) - Method in class no.priv.garshol.duke.datasources.SparqlDataSource
 
setUserName(String) - Method in class no.priv.garshol.duke.datasources.JDBCDataSource
 
setValue(String, String) - Method in class no.priv.garshol.duke.datasources.RecordBuilder
 
setValue(Column, String) - Method in class no.priv.garshol.duke.datasources.RecordBuilder
 
setValue(String) - Method in class no.priv.garshol.duke.utils.CommandLineParser.StringOption
 
show(Record, Record, double, String, List<Property>) - Static method in class no.priv.garshol.duke.matchers.PrintMatchListener
 
show(Record, Record, double, String, List<Property>, boolean) - Static method in class no.priv.garshol.duke.matchers.PrintMatchListener
 
similarity(String, String) - Static method in class no.priv.garshol.duke.comparators.JaroWinkler
Returns normalized score, with 0.0 meaning no similarity at all, and 1.0 meaning full equality.
size() - Method in class no.priv.garshol.duke.genetic.GeneticPopulation
Returns the size of the population.
skip() - Method in class no.priv.garshol.duke.comparators.Matcher
 
sort() - Method in class no.priv.garshol.duke.Bucket
 
sort() - Method in class no.priv.garshol.duke.genetic.GeneticPopulation
Sorts the population by their achieved F-numbers.
soundex(String) - Static method in class no.priv.garshol.duke.comparators.SoundexComparator
Produces the Soundex key for the given string.
SoundexComparator - Class in no.priv.garshol.duke.comparators
An implementation of the Soundex algorithm, and a comparator which considers strings to have a score of 0.9 if their Soundex values match.
SoundexComparator() - Constructor for class no.priv.garshol.duke.comparators.SoundexComparator
 
SparqlClient - Class in no.priv.garshol.duke.utils
 
SparqlClient() - Constructor for class no.priv.garshol.duke.utils.SparqlClient
 
SparqlDataSource - Class in no.priv.garshol.duke.datasources
 
SparqlDataSource() - Constructor for class no.priv.garshol.duke.datasources.SparqlDataSource
 
SparqlResult - Class in no.priv.garshol.duke.utils
Represents the result of a SPARQL query.
SparqlResult() - Constructor for class no.priv.garshol.duke.utils.SparqlResult
 
spawnThread(DukeController, int) - Method in class no.priv.garshol.duke.server.BasicTimer
 
spawnThread(DukeController, int) - Method in class no.priv.garshol.duke.server.CommonJTimer
Starts a background thread which calls the controller every check_interval milliseconds.
spawnThread(DukeController, int) - Method in interface no.priv.garshol.duke.server.DukeTimer
Starts a background thread which calls the controller every check_interval seconds.
split(String) - Method in class no.priv.garshol.duke.datasources.Column
Splits the given string into multiple values.
split(String) - Static method in class no.priv.garshol.duke.utils.StringUtils
 
startDocument() - Method in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
startElement(String, AttributeList) - Method in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
startline - Variable in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
startProcessing() - Method in class no.priv.garshol.duke.matchers.AbstractMatchListener
 
startProcessing() - Method in interface no.priv.garshol.duke.matchers.MatchListener
Notification that the processing run is beginning.
startProcessing() - Method in class no.priv.garshol.duke.Processor.Profiler
 
startRecord_(Record) - Method in class no.priv.garshol.duke.matchers.LinkDatabaseMatchListener
 
statement(String, String, String, boolean) - Method in interface no.priv.garshol.duke.StatementHandler
 
statement(String, String, String, boolean) - Method in class no.priv.garshol.duke.utils.NTriplesWriter
 
StatementHandler - Interface in no.priv.garshol.duke
Event-handler which receives parsed statements.
StatusServlet - Class in no.priv.garshol.duke.server
Starts up Duke processing, and provides a web interface containing some minimal information about the status of the service.
StatusServlet() - Constructor for class no.priv.garshol.duke.server.StatusServlet
 
stmt - Variable in class no.priv.garshol.duke.RDBMSLinkDatabase
 
stop() - Method in class no.priv.garshol.duke.server.BasicTimer
 
stop() - Method in class no.priv.garshol.duke.server.CommonJTimer
Stops the background thread.
stop() - Method in interface no.priv.garshol.duke.server.DukeTimer
Stops the background thread.
StringUtils - Class in no.priv.garshol.duke.utils
 
StringUtils() - Constructor for class no.priv.garshol.duke.utils.StringUtils
 
subelements - Variable in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
substitute(int, char, char) - Method in class no.priv.garshol.duke.comparators.WeightedLevenshtein.DefaultWeightEstimator
 
substitute(int, char, char) - Method in interface no.priv.garshol.duke.comparators.WeightedLevenshtein.WeightEstimator
 

T

TestFileListener - Class in no.priv.garshol.duke.matchers
A match listener for evaluating Duke configurations against a set of known correct mappings.
TestFileListener(String, Configuration, boolean, Processor, boolean, boolean) - Constructor for class no.priv.garshol.duke.matchers.TestFileListener
Creates a test file listener.
TestFileListener(LinkDatabase, Configuration, boolean, Processor, boolean, boolean) - Constructor for class no.priv.garshol.duke.matchers.TestFileListener
Creates a test file listener.
TestFileUtils - Class in no.priv.garshol.duke.utils
The concept of a special syntax for link files is a bit dubious.
TestFileUtils() - Constructor for class no.priv.garshol.duke.utils.TestFileUtils
 
text(String) - Method in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
ThresholdAspect - Class in no.priv.garshol.duke.genetic
Sets the threshold.
ThresholdAspect() - Constructor for class no.priv.garshol.duke.genetic.ThresholdAspect
 
timerExpired(Timer) - Method in class no.priv.garshol.duke.server.CommonJTimer
This is the callback from the timer service, letting us know it's time to do something.
timing(String, String) - Static method in class no.priv.garshol.duke.comparators.WeightedLevenshtein
Utility function for testing Levenshtein performance.
toString() - Method in class no.priv.garshol.duke.DocumentRecord
 
toString() - Method in class no.priv.garshol.duke.genetic.GeneticConfiguration
Returns the brief summary used in the command-line output.
toString() - Method in class no.priv.garshol.duke.InMemoryDatabase
 
toString() - Method in class no.priv.garshol.duke.InMemoryKeyValueStore
 
toString() - Method in class no.priv.garshol.duke.InMemoryLinkDatabase
 
toString() - Method in class no.priv.garshol.duke.KeyValueDatabase
 
toString() - Method in class no.priv.garshol.duke.Link
 
toString() - Method in class no.priv.garshol.duke.LuceneDatabase
 
toString(Record) - Static method in class no.priv.garshol.duke.matchers.PrintMatchListener
 
toString(Record, List<Property>) - Static method in class no.priv.garshol.duke.matchers.PrintMatchListener
 
toString() - Method in class no.priv.garshol.duke.PropertyImpl
 
toString() - Method in class no.priv.garshol.duke.RecordImpl
 
trace(String) - Method in class no.priv.garshol.duke.DummyLogger
 
trace(String) - Method in interface no.priv.garshol.duke.Logger
 
Transform - Class in no.priv.garshol.duke.cleaners
Helper class used by AbstractRuleBasedCleaner.
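A rule built from a regular expression, a replacement string and a group number rewrites only the text captured by that group in each match. The following is a minimal standalone sketch of that idea using java.util.regex, not Duke's actual Transform source; the class name GroupReplaceSketch and the method replaceGroup are hypothetical.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only; not part of the Duke API.
final class GroupReplaceSketch {

    // Replaces the text captured by the given group in every match of regex.
    // Group 0 stands for the whole match.
    static String replaceGroup(String value, String regex,
                               String replacement, int group) {
        Matcher m = Pattern.compile(regex).matcher(value);
        StringBuilder out = new StringBuilder();
        int pos = 0; // end of the last region copied to the output
        while (m.find()) {
            if (m.start(group) < 0) {
                continue; // the group did not take part in this match
            }
            out.append(value, pos, m.start(group)); // text before the group
            out.append(replacement);                // literal replacement
            pos = m.end(group);
        }
        out.append(value, pos, value.length());     // remaining tail
        return out.toString();
    }
}
```

With group 1, a pattern such as `\d+ (st)` rewrites only the captured "st" and leaves the digits in front of it untouched. The replacement is inserted literally here; whether Duke interprets `$1`-style references in the replacement string is not stated in this index.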
Transform(String, String) - Constructor for class no.priv.garshol.duke.cleaners.Transform
 
Transform(String, String, int) - Constructor for class no.priv.garshol.duke.cleaners.Transform
 
transform(String) - Method in class no.priv.garshol.duke.cleaners.Transform
 
traverseAll(String, Set<String>) - Method in class no.priv.garshol.duke.InMemoryLinkDatabase
 
TrimCleaner - Class in no.priv.garshol.duke.cleaners
A cleaner which removes leading and trailing whitespace, without making any other changes.
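As a rough standalone illustration of that behaviour (not the TrimCleaner source itself), the sketch below strips whitespace at both ends and leaves interior whitespace and letter case alone. The class name TrimSketch is hypothetical, and the use of String.trim(), which removes leading and trailing characters up to and including U+0020, is an assumption about what counts as whitespace.

```java
// Hypothetical illustration only; not part of the Duke API.
final class TrimSketch {

    // Removes leading and trailing whitespace and changes nothing else.
    static String clean(String value) {
        return value.trim();
    }
}
```

Unlike cleaners that also lowercase or collapse runs of spaces, this keeps "  New   York " as "New   York".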
TrimCleaner() - Constructor for class no.priv.garshol.duke.cleaners.TrimCleaner
 
turnon() - Method in class no.priv.garshol.duke.utils.CommandLineParser.BooleanOption
 

U

Utils - Class in no.priv.garshol.duke.utils
 
Utils() - Constructor for class no.priv.garshol.duke.utils.Utils
 

V

validate() - Method in interface no.priv.garshol.duke.Configuration
Validates the configuration to verify that it makes sense.
validate() - Method in class no.priv.garshol.duke.ConfigurationImpl
Validates the configuration to verify that it makes sense.
validate(Statement) - Static method in class no.priv.garshol.duke.utils.JDBCUtils
Verifies that the connection is still alive.
validateConnection() - Method in class no.priv.garshol.duke.InMemoryLinkDatabase
 
validateConnection() - Method in class no.priv.garshol.duke.JDBCLinkDatabase
 
validateConnection() - Method in class no.priv.garshol.duke.JNDILinkDatabase
 
validateConnection() - Method in interface no.priv.garshol.duke.LinkDatabase
Verifies that we still have a connection to the database, and reestablishes it, if not.
valueOf(String) - Static method in enum no.priv.garshol.duke.comparators.QGramComparator.Formula
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum no.priv.garshol.duke.comparators.QGramComparator.Tokenizer
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum no.priv.garshol.duke.DatabaseProperties.DatabaseImplementation
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum no.priv.garshol.duke.LinkKind
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum no.priv.garshol.duke.LinkStatus
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum no.priv.garshol.duke.Property.Lookup
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum no.priv.garshol.duke.RDBMSLinkDatabase.DatabaseType
Returns the enum constant of this type with the specified name.
values() - Static method in enum no.priv.garshol.duke.comparators.QGramComparator.Formula
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum no.priv.garshol.duke.comparators.QGramComparator.Tokenizer
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum no.priv.garshol.duke.DatabaseProperties.DatabaseImplementation
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum no.priv.garshol.duke.LinkKind
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum no.priv.garshol.duke.LinkStatus
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum no.priv.garshol.duke.Property.Lookup
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum no.priv.garshol.duke.RDBMSLinkDatabase.DatabaseType
Returns an array containing the constants of this enum type, in the order they are declared.
verifyProperty(String, String) - Method in class no.priv.garshol.duke.datasources.ColumnarDataSource
 

W

warn(String) - Method in class no.priv.garshol.duke.DummyLogger
 
warn(String, Throwable) - Method in class no.priv.garshol.duke.DummyLogger
 
warn(String) - Method in interface no.priv.garshol.duke.Logger
 
warn(String, Throwable) - Method in interface no.priv.garshol.duke.Logger
 
WeightedLevenshtein - Class in no.priv.garshol.duke.comparators
An implementation of the Levenshtein distance metric that uses weights, so that not all editing operations are considered equal.
WeightedLevenshtein() - Constructor for class no.priv.garshol.duke.comparators.WeightedLevenshtein
 
WeightedLevenshtein.DefaultWeightEstimator - Class in no.priv.garshol.duke.comparators
 
WeightedLevenshtein.DefaultWeightEstimator() - Constructor for class no.priv.garshol.duke.comparators.WeightedLevenshtein.DefaultWeightEstimator
 
WeightedLevenshtein.WeightEstimator - Interface in no.priv.garshol.duke.comparators
The object which supplies the actual weights for editing operations.
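To make the idea of weighted editing operations concrete, here is a minimal standalone sketch of a weighted Levenshtein distance. It is not Duke's implementation. The nested Weights interface is hypothetical: only substitute(int, char, char) mirrors a signature listed in this index, while the delete and insert methods, their double return type, and the digit-heavy example weights are assumptions made for illustration.

```java
// Hypothetical illustration only; not part of the Duke API.
final class WeightedEditDistanceSketch {

    // Supplies the cost of each editing operation (assumed shape).
    interface Weights {
        double substitute(int pos, char from, char to);
        double delete(int pos, char ch);
        double insert(int pos, char ch);
    }

    // Example weights: edits touching a digit cost 2.0, all others 1.0.
    static final Weights DIGITS_HEAVY = new Weights() {
        private double cost(char ch) {
            return Character.isDigit(ch) ? 2.0 : 1.0;
        }
        public double substitute(int pos, char from, char to) {
            return Math.max(cost(from), cost(to));
        }
        public double delete(int pos, char ch) { return cost(ch); }
        public double insert(int pos, char ch) { return cost(ch); }
    };

    // Classic dynamic-programming edit distance with per-operation weights.
    static double distance(String s1, String s2, Weights w) {
        double[][] d = new double[s1.length() + 1][s2.length() + 1];
        for (int i = 1; i <= s1.length(); i++) {
            d[i][0] = d[i - 1][0] + w.delete(i - 1, s1.charAt(i - 1));
        }
        for (int j = 1; j <= s2.length(); j++) {
            d[0][j] = d[0][j - 1] + w.insert(j - 1, s2.charAt(j - 1));
        }
        for (int i = 1; i <= s1.length(); i++) {
            char a = s1.charAt(i - 1);
            for (int j = 1; j <= s2.length(); j++) {
                char b = s2.charAt(j - 1);
                double sub = d[i - 1][j - 1]
                        + (a == b ? 0.0 : w.substitute(i - 1, a, b));
                double del = d[i - 1][j] + w.delete(i - 1, a);
                double ins = d[i][j - 1] + w.insert(j - 1, b);
                d[i][j] = Math.min(sub, Math.min(del, ins));
            }
        }
        return d[s1.length()][s2.length()];
    }
}
```

With these example weights, changing one letter costs 1.0 while changing one digit costs 2.0, so two strings that differ only in a digit are judged further apart than two that differ only in a letter.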
write(Configuration, String) - Static method in class no.priv.garshol.duke.ConfigWriter
Writes the given configuration to the given file.
write(Record, Record, boolean) - Method in class no.priv.garshol.duke.utils.LinkFileWriter
 
write(String, String, boolean) - Method in class no.priv.garshol.duke.utils.LinkFileWriter
 
write(Writer, String) - Method in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
write(Writer, char) - Method in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
write(Writer, char[], int, int) - Method in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 
writer - Variable in class no.priv.garshol.duke.utils.XMLPrettyPrinter
 

X

XMLPrettyPrinter - Class in no.priv.garshol.duke.utils
A SAX document handler that writes pretty-printed XML to a Writer.
XMLPrettyPrinter(OutputStream) - Constructor for class no.priv.garshol.duke.utils.XMLPrettyPrinter
Creates an XMLPrettyPrinter that writes to the given OutputStream.
XMLPrettyPrinter(OutputStream, String) - Constructor for class no.priv.garshol.duke.utils.XMLPrettyPrinter
Creates an XMLPrettyPrinter that writes to the given OutputStream in the requested character encoding.
XMLPrettyPrinter(Writer, String) - Constructor for class no.priv.garshol.duke.utils.XMLPrettyPrinter
Creates an XMLPrettyPrinter that writes to the given Writer.

Y

YesNoConsole - Class in no.priv.garshol.duke.utils
 
YesNoConsole() - Constructor for class no.priv.garshol.duke.utils.YesNoConsole
 
yesorno() - Method in class no.priv.garshol.duke.utils.YesNoConsole
 

Copyright © 2013. All Rights Reserved.