Skip to content

Commit

Permalink
setup for mbart test
Browse files Browse the repository at this point in the history
  • Loading branch information
jds-amplify committed Apr 29, 2021
1 parent ee5281d commit a9823a2
Show file tree
Hide file tree
Showing 3 changed files with 11 additions and 1 deletion.
5 changes: 5 additions & 0 deletions Makefile
Original file line number Diff line number Diff line change
Expand Up @@ -93,3 +93,8 @@ build-pdl-corpus:
gawk 'BEGIN {FS="\t"} !/urn:/ {print $$3}' corpora/pdl/all.txt > corpora/pdl.txt

build-corpora: download-openITI build-openITI-corpus download-pdl build-pdl-corpus



nb:
jupyter notebook --NotebookApp.token='' --no-browser
5 changes: 5 additions & 0 deletions requirements.txt
Original file line number Diff line number Diff line change
@@ -1 +1,6 @@
git+https://github.com/mittagessen/kraken@master#egg=kraken
transformers
sentencepiece
datasets
tensorboardx

2 changes: 1 addition & 1 deletion scripts/clean_openiti.py
Original file line number Diff line number Diff line change
@@ -1,3 +1,4 @@
import sys
import re
from functools import partial

Expand Down Expand Up @@ -37,7 +38,6 @@ def _clean(s):
(_clean(line) for line in lines)
if _has_arabic(cleaned_line))

import sys
if __name__ == '__main__':

for filename in sys.argv[1:]:
Expand Down

0 comments on commit a9823a2

Please sign in to comment.