The Project Window
The Project Window is the window shown when CorpusTool opens
up. It presents the analysis layers you have defined, and the
text files in your project.
When you create a new project, the list of layers will be empty,
and the file list will show the files you specified within the New
Project Wizard.
The Layer Pane
The top part of the Project Window shows the layers in your project,
one pane per project. To add a new layer, press the Add Layer button.
The pane for each layer tells you:
- The layer name
- The kind of annotation which the layer involves. This includes:
- code-document: assign features to the document as a whole.
- code-segments: assign features to segments within the document.
- code-whole-image: assign features to a complete document.
- code-image-segments: assign features to segments within an image.
- The name of the coding scheme for this layer. Press the "Edit" button to edit the coding scheme.
The File Pane
Under the layer pane is the space where you manage the files that
you have in your project. Initially, files are categorised as "unincorporated", which means the files are present in your corpus, but not yet
available for annotation. You need to "incorporate" each file
into the project before you can annotate it. To do this,
click on the "Action" menu at the left of each file name, and select
"Incorporate file". For text files (but not images), you will be asked
to specify some information:
- Language: the language of the text (English, German, Chinese, etc.).
This field is used when loading text processing resources (parsers, concordancers, etc.)
- Encoding: the character encoding used in the text file. For English
text, this is usually some ascii encoding. For other writing systems, the encoding will be different, e.g. UTF-8, etc. The encoding offered by CorpusTool is
an intelligent guess, but it might not be correct. To see what encoding your text file is, open it in a browser (using "Open with..." or similar), and see what encoding the browser assigned to the text. For instance, with Internet Explorer, from the View menu, click on the "Encoding" option, and see which option is selected.
- Font: Select the display font to use for this file in the annotation interfaces.
Note: all of these options can be changed at a later point by selecting
the "Change file metadata" option from the Action menu of each incorporated file.
Coding Documents: Once a file is incorporated, it will be given some extra buttons,
one for each defined analysis layer. Cick on any of these buttons
to open the analysis window for that file and layer.
Color Coding Text: Clicking on the file name will open a window which allows
you to view the text, and (if you have at least one layer defined),
it allows you to "style" the text: change the font style (size, bold, italic)
and color of text which is tagged with given features. See "Styled Text View" for more information.
Adding New Text: Click on the "Extend Corpus" button to
add new texts to the corpus.