
uamcorpustool3.0使用说明.pdf
49页1 UAM CorpusTool Version 3.0 Tutorial Introduction (June, 2013) Mick O’Donnell michael.odonnell@uam.es 2 About this Document This document provides a tutorial introduction to UAM CorpusTool 3.0 (henceforth: UAMCT3). For more detailed information about the options in each screen and menu of UAMCT3, please see the UAMCT3 User Manual. About UAM CorpusTool 3.0 UAM CorpusTool is a set of tools for the linguistic annotation of text. Core concepts include: The user defines a project, which is: a set of files, and a set of analyses which are applied to each of these files. All the files of a project are stored in a single folder: the original texts (the ‘corpus’), the annotations on this text and the coding schemes (the tags applied to the texts). Each ‘analysis’ can be seen as a ‘layer’ of annotation. CorpusTool currently allows two types of annotation: 1. Document Coding: where the text as a whole is assigned features. For instance, these features could represent the register of the document (field, tenor, mode), or text-type. 2. Segment Coding: The user can select segments within a file, and assign features to each of these segments. Segments are specified by dragging the mouse over a span of text, and the user is then prompted to specify the features of this segment. Annotation can be ‘manual’ (the user swipes text and chooses categories for it) or ‘automatic’ (the program does the annotation for you). Sometimes annotation is mixed, for instance, you can have the program recognise clause or noun-phase segments, but it is up to the you to code them.: CorpusTool is available from: See that site for instructions on how to install CorpusTool on your machine. 3 Tutorial 1: Starting a new project 1 Launch UAM CorpusTool Once UAM CorpusTool is installed on your machine, you can begin working with it. The first thing to do is to create a new “project”: Windows: When installing CorpusTool, you had the option to place an icon on the desktop. Click on this icon to launch CorpusTool. Alternatively, there should be a UAM CorpusTool icon in the Programs menu in the Start menu on Windows Toolbar. Select this to launch CorpusTool. Macintosh: The installation of CorpusTool placed the application in your Applications folder. Double-click on the application to launch it. You might find it useful to place the application in the Dock for easy access. If you have already created a project, you can open it simply by double-clicking the .cp3 file in the Project folder. This file has an icon as below: MacOSX: Windows: The Opening Window A window should appear as in Figure 1.1. This window provides, amongst other information, the version number you are using (useful if you need to communicate bugs). 4 Figure 1.1: The Opening Window The Window offers several options, Start New Project: create a new project from scratch. Open Project, to continue with a project you have already started, you will be prompted to select one. Import Project from UAMCT 2: If you have a project from UAMCT 2, you can use the “Import Project from UAMCT 2” button to make a copy of your project in the UAMCT 3 format. Open SomeProjectName: If you have opened a project previously on this machine, there will also be a button to open the last project opened. 2 Click on the “Start New Project” button. After clicking this button, a “Create Project Wizard” will appear, which will lead you through the steps needed to create your project: 1. Providing a name for a new project 2. Specify the folder where your new project’s folder is to be stored. For instance, choose the Desktop folder on your machine. When you click the “Finalise” button, CorpusTool will create your project, which is a folder containing all the details related to your project, including the corpus, and the annotation files. It also contains an icon which can be used to launch your project directly (the .ct3 file). Once you have finished with the Create Project Wizard, the CorpusTool Main Window will open, showing the File pane. See Figure 1.2. This pane is where you add or remove files to your project, or open a file for annotation. 5 Figure 1.2: The File Management pane The buttons at the top of the pane allow you to switch between the different panes of CorpusTool: Files (Tutorial 2), Layers (Tutorial 3) Search (Tutorial 5), Autocode (Tutorial 6), Statistics (Tutorial 7), Explore (Tutorial 8), Options and Help. We will assume for now that the “File” pane is selected. The name of your project is shown in the title bar of the Project window. In the space below is a box showing all the files in the project (initially empty), and for each file, one button for each of the possible analyses of that file. This ends the first tutorial. The next tutorial will show how to add content to your project. 6 Tutorial 2: Adding text files to your project The next step is to add some files to the project. 1 Save Documents as plain text UAMCT 3 deals on。
