About Wilma Installing Wilma Getting Started WilmaDesktop Display Index Properties Search ..Search Phrases ..Regular Expressions ..Referrals Menus ..File Menu ..Edit Menu ..Index Menu ..View Menu ..Custom Menu ..Help Menu Server Command Interpreter ..Command Language News |
The index properties dialog has five sections accessed by the tabs across the top: It also contains the following buttons accessible from all the tabs:
Root PathsThis tab consists of a list of top level folders where Wilma should start looking for included files. All the folders that these root folders contain and the folders that they in turn contain and so forth, will also be searched. Thus if you wanted to index all the files in your Documents folder on a Windows machine, you might have an entry: C:\Documents and Settings\yourname\Documents while on a Mac it might be: /Users/yourname/Documents and on Linux: /home/yourname/Documents You can either just type the root folders into the list directly or use the browse button to search using a directory tree. Include FilesWhile the Root Paths list tells Wilma where to look for files to index, this list tells Wilma which files in those folders are to be indexed and which file analyzer is to be used in reading them. When a new index is created, the index properties dialog is opened and the include files list is filled with a range of common file extensions and associated analyzers.In many cases the simplest thing to do will be to simply use these default entries, but in other cases it will be more appropriate to clear the list, and add just specific entries. Note that file patterns are typically something like *.txt where the * character is a wild card representing any number of any characters. Also note that this matches against the full path name of the file, so something like */saved/*.doc (*\saved\*.doc on Windows) would only include doc files that had the folder named "saved" somewhere in its path. The Add button adds a new row to the list, while the Delete button removes the currently selected row and the Clear button removes all the rows. The Up and Down buttons can be used to move the selected item in the list. The analyzer used for a file is determined by the first match in the list and this can be important if for instance you want to use a different analyzer for files in a certain folder. The Default button will add the default list of analyzers to the end of the current list. To select an analyzer, just click on analyzer for that row to get a selection dialog, which lists the available analyzers on the left with a brief description on the right. The generic analyzer will be used for most files and is quite proficient at extracting text even from binary file formats. Some analyzers, such as the zip, tar and gzip analyzers, deal with archives and will recursively index all of the files in the archive using the appropriate analyzers, including archive analyzers. Wilma can also use external command line programs to process unknown formats into temporary files that Wilma can handle. One good application of this is to use the open source pdftotext program for pdf files. While Wilma does have a simple pdf analyzer built in, it can have problems with complex pdf files that pdftotext handles well. See external analyzers for information on how to set these up. Exclude PathsThis list is used to indicate any files that you don't want indexed, even though they fit the criterion set by the root paths and include files lists. For example you might have a temporary directory somewhere in the directory tree included by a root path. This could be skipped with something like: */temporary/* Like the include files, these patterns are matched against the full path name of the file. OptionsThis tab is used to specify various options pertaining to how the index is built:
LanguageOn this tab you can select which accented characters should be used when indexing. |