Skip to content

Commit

Permalink
Merge branch 'olzama-dev' into main
Browse files Browse the repository at this point in the history
  • Loading branch information
olzama committed Mar 16, 2023
2 parents 89a9337 + ef9a9b2 commit 3547db0
Show file tree
Hide file tree
Showing 10 changed files with 30 additions and 884 deletions.
31 changes: 30 additions & 1 deletion util/README
Original file line number Diff line number Diff line change
@@ -1,5 +1,10 @@
The scripts in this folder are mainly for the interface with the morphophonological analyzer (Freeling).

*************************************************************************************************

CREATING TSDB TEST SUITES WITH YY TOKENS WITH FREELING TAGS:


The grammar currently relies on Freeling to provide morphophonological tags which are then mapped to "inflectional" rules (found in inflr.tdl). Once the tags are successfully mapped to those rules, the grammar can proceed with trying to find a syntactic analysis for the input.

The tags the grammar currently assumed correspond to Freeling v.4.2.
Expand All @@ -24,4 +29,28 @@ Note that this will modify the test suite, so make sure to back it up first.

The updated test suite can be processed by ACE using e.g. pydelphin wrapper as follows:

delphin process --options="-y --yy-rules" -g ~/delphin/srg/ace/srg.dat --full-forest --select i-tokens path-to-test-suite
delphin process --options="-y --yy-rules" -g ~/delphin/srg/ace/srg.dat --full-forest --select i-tokens path-to-test-suite

*************************************************************************************************

TSDB TREEBANKS UPDATES

All old Spanish treebanks required updates with respect to the `relations` file, which involves some manual work (see https://github.com/delph-in/docs/wiki/ItsdbProfileManipulation#updating-old-profiles).

The MRS test suite required significant changes (correcting typos, removing English sentences, updating example IDs. The relevant scripts are found under tsdb-updates/

*************************************************************************************************

RUNNING ACE ON INDICIDUAL SENTENCES:

see https://github.com/delph-in/docs/wiki/SrgTop

The scripts are under the folder analyze-wrappers/

*************************************************************************************************

ARCHAEOLOGY:

The folder "freeling-updates" contains the history of tag updates motivated by the changes in Freeling 4.2 compared to Freeling 3.0 as well as by adding token mapping to the grammar.


1 change: 0 additions & 1 deletion util/txt/ace_err.txt

This file was deleted.

5 changes: 0 additions & 5 deletions util/txt/debug-freeling.txt

This file was deleted.

2 changes: 0 additions & 2 deletions util/txt/debug-yy-no-unk.txt

This file was deleted.

2 changes: 0 additions & 2 deletions util/txt/debug-yy.txt

This file was deleted.

1 change: 0 additions & 1 deletion util/txt/debug.txt

This file was deleted.

3 changes: 0 additions & 3 deletions util/txt/freeling.txt

This file was deleted.

Loading

0 comments on commit 3547db0

Please sign in to comment.