Notes on the Terminology-Dictionary-Intervention Machine Translation Challenge, Task 1: Getting the Baseline to Run
#AISummerCamp #Datawhale #SummerCamp
Step 1: Register for the competition!
Competition link: https://challenge.xfyun.cn/h5/detail?type=role-element-extraction&ch=dw24_y0SCtd
https://challenge.xfyun.cn/topic/info?type=machine-translation-2024&option=tjjg&ch=dw24_AtTCK9
Log in or register to take part in the competition.
Step 2: Download the code files
Skipped here.
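First steps with ModelScope
Register / log in to ModelScope
ModelScope (魔搭社区) brings together state-of-the-art machine learning models from many domains and offers a one-stop service for model exploration, inference, training, deployment, and application. https://modelscope.cn/my/mynotebook/preset
I logged in directly with my CSDN account. Along the way you have to scan a QR code to bind an Alibaba Cloud account, which is what gets you the free compute.
Binding for new users
My account was already bound, so I did not repeat this step.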
Open the link and start an instance
https://modelscope.cn/my/mynotebook/preset
The instance is starting
Click to open the notebook
Create a baseline folder and upload the code and data files
Just drag them all in.
Open a terminal
Unzip dataset.zip
root@dsw-559464-8f6c6588c-7gtnm:/mnt/workspace/baseline# ls
dataset.zip task-1_terminology.ipynb
root@dsw-559464-8f6c6588c-7gtnm:/mnt/workspace/baseline# unzip dataset.zip
Archive:  dataset.zip
   creating: dataset/
  inflating: dataset/dev_en.txt
  inflating: dataset/dev_zh.txt
  inflating: dataset/en-zh.dic
  inflating: dataset/test_en.txt
  inflating: dataset/train.txt
root@dsw-559464-8f6c6588c-7gtnm:/mnt/workspace/baseline#
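If you would rather stay inside the notebook, the same extraction can be done from a cell. This is only a minimal sketch using Python's standard `zipfile` module; the helper name `extract_dataset` is my own invention, and it assumes `dataset.zip` sits in the current working directory as in the listing above.

```python
import zipfile
from pathlib import Path


def extract_dataset(zip_path="dataset.zip", target_dir="."):
    """Extract the archive and return the sorted names of the extracted files."""
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(target_dir)
        # Skip directory entries such as "dataset/" so only files are reported.
        return sorted(name for name in archive.namelist() if not name.endswith("/"))


if __name__ == "__main__":
    # Only runs when dataset.zip is actually present next to the notebook.
    if Path("dataset.zip").exists():
        for name in extract_dataset():
            print(name)
```

The returned list should match the `inflating:` lines that `unzip` prints.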
The extracted files now show up in the file browser on the left.
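Before running anything, it can help to peek at the first few lines of each file to see what the data looks like. A small sketch, assuming the files are UTF-8 text (I have not checked this against the official description) and live under `./dataset`; `head` is a hypothetical helper name.

```python
from itertools import islice
from pathlib import Path


def head(path, n=3, encoding="utf-8"):
    """Return the first n lines of a text file, without trailing newlines."""
    with open(path, encoding=encoding) as handle:
        return [line.rstrip("\n") for line in islice(handle, n)]


if __name__ == "__main__":
    dataset_dir = Path("./dataset")
    # File names taken from the unzip output above.
    for name in ["train.txt", "dev_en.txt", "dev_zh.txt", "en-zh.dic", "test_en.txt"]:
        file_path = dataset_dir / name
        if file_path.exists():
            print(f"== {name}")
            for line in head(file_path):
                print(line)
```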
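Double-click task-1_terminology.ipynb to open it
Restart the kernel and run all cells in one click
This is where I hit trouble. The output was as follows (torchtext itself installs fine; the tail is a root-user warning plus a pip upgrade notice):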
Looking in indexes: https://mirrors.cloud.aliyuncs.com/pypi/simple
Collecting torchtext
  Downloading https://mirrors.cloud.aliyuncs.com/pypi/packages/d7/4f/9953b4d4b79917e03c393484ea8ce8f46a4cc1745f272cc371550fb7fc05/torchtext-0.18.0-cp310-cp310-manylinux1_x86_64.whl (2.0 MB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 2.0/2.0 MB 28.2 MB/s eta 0:00:00
Requirement already satisfied: requests in /usr/local/lib/python3.10/site-packages (from torchtext) (2.32.3)
Requirement already satisfied: torch>=2.3.0 in /usr/local/lib/python3.10/site-packages (from torchtext) (2.3.0+cu121)
Requirement already satisfied: tqdm in /usr/local/lib/python3.10/site-packages (from torchtext) (4.66.4)
Requirement already satisfied: numpy in /usr/local/lib/python3.10/site-packages (from torchtext) (1.26.3)
Requirement already satisfied: filelock in /usr/local/lib/python3.10/site-packages (from torch>=2.3.0->torchtext) (3.14.0)
Requirement already satisfied: typing-extensions>=4.8.0 in /usr/local/lib/python3.10/site-packages (from torch>=2.3.0->torchtext) (4.12.0)
Requirement already satisfied: networkx in /usr/local/lib/python3.10/site-packages (from torch>=2.3.0->torchtext) (3.3)
Requirement already satisfied: fsspec in /usr/local/lib/python3.10/site-packages (from torch>=2.3.0->torchtext) (2024.2.0)
Requirement already satisfied: jinja2 in /usr/local/lib/python3.10/site-packages (from torch>=2.3.0->torchtext) (3.1.4)
Requirement already satisfied: sympy in /usr/local/lib/python3.10/site-packages (from torch>=2.3.0->torchtext) (1.12.1)
Requirement already satisfied: idna<4,>=2.5 in /usr/local/lib/python3.10/site-packages (from requests->torchtext) (3.7)
Requirement already satisfied: urllib3<3,>=1.21.1 in /usr/local/lib/python3.10/site-packages (from requests->torchtext) (2.2.1)
Requirement already satisfied: charset-normalizer<4,>=2 in /usr/local/lib/python3.10/site-packages (from requests->torchtext) (3.3.2)
Requirement already satisfied: certifi>=2017.4.17 in /usr/local/lib/python3.10/site-packages (from requests->torchtext) (2024.2.2)
Requirement already satisfied: MarkupSafe>=2.0 in /usr/local/lib/python3.10/site-packages (from jinja2->torch>=2.3.0->torchtext) (2.1.5)
Requirement already satisfied: mpmath<1.4.0,>=1.1.0 in /usr/local/lib/python3.10/site-packages (from sympy->torch>=2.3.0->torchtext) (1.3.0)
Installing collected packages: torchtext
Successfully installed torchtext-0.18.0
WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv
[notice] A new release of pip is available: 23.0.1 -> 24.1.2
[notice] To update, run: pip install --upgrade pip
Update pip
pip install --upgrade pip
Then restart the kernel once more
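After the restart, a quick way to confirm what the kernel actually sees is to query the installed versions from a cell. A minimal sketch using the standard `importlib.metadata` module; `installed_version` is a helper name I made up, and the package list is simply the ones that appear in the log above.

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string of a package, or None if it is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    for package in ("pip", "torch", "torchtext"):
        print(package, installed_version(package))
```

If torchtext prints `None`, the install cell did not take effect in this kernel.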
The paths in the official notebook do not match our folder layout, so every "../" in a path needs to be changed to "./".
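I made this change by hand, but it could also be scripted. The sketch below is only an illustration: it relies on the fact that an .ipynb file is JSON with a `cells` list whose `source` is a string or a list of strings, and it does a blunt text replacement in code cells. The function name `fix_relative_paths` and the output file name `task-1_terminology_fixed.ipynb` are hypothetical.

```python
import json
import os


def fix_relative_paths(notebook, old="../", new="./"):
    """Replace `old` with `new` in every code cell; return how many cells changed."""
    changed = 0
    for cell in notebook.get("cells", []):
        if cell.get("cell_type") != "code":
            continue  # leave markdown cells alone
        source = cell.get("source", [])
        if isinstance(source, str):
            updated = source.replace(old, new)
        else:
            updated = [line.replace(old, new) for line in source]
        if updated != source:
            cell["source"] = updated
            changed += 1
    return changed


if __name__ == "__main__":
    src = "task-1_terminology.ipynb"
    if os.path.exists(src):
        with open(src, encoding="utf-8") as handle:
            notebook = json.load(handle)
        count = fix_relative_paths(notebook)
        # Write to a new file so the original notebook stays untouched.
        with open("task-1_terminology_fixed.ipynb", "w", encoding="utf-8") as handle:
            json.dump(notebook, handle, ensure_ascii=False, indent=1)
        print(f"Updated {count} code cell(s)")
```

Because this is a plain string replacement, it would also rewrite any "../" that is not a dataset path, so it is worth skimming the changed cells afterwards.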
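Once every error is cleared, just wait for the notebook to finish and produce its result.
When the last cell finishes it prints the following ("Translation complete! File saved to ./dataset/submit.txt"):
翻译完成!文件已保存到./dataset/submit.txt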
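Before downloading, a cheap sanity check is to compare the number of lines in the output with the number of test sentences. This is a sketch under my own assumption that the submission should contain one translation per line of `test_en.txt`; I have not verified that against the official submission rules, and `check_submission` is a made-up helper name.

```python
import os


def count_lines(path, encoding="utf-8"):
    """Count the lines in a text file."""
    with open(path, encoding=encoding) as handle:
        return sum(1 for _ in handle)


def check_submission(submit_path, test_path):
    """Return (counts_match, submission_lines, test_lines)."""
    n_submit = count_lines(submit_path)
    n_test = count_lines(test_path)
    return n_submit == n_test, n_submit, n_test


if __name__ == "__main__":
    submit_path = "./dataset/submit.txt"
    test_path = "./dataset/test_en.txt"
    if os.path.exists(submit_path) and os.path.exists(test_path):
        ok, n_submit, n_test = check_submission(submit_path, test_path)
        print(f"submit.txt: {n_submit} lines, test_en.txt: {n_test} lines, match: {ok}")
```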
Right-click the file and download it
When you are done, shut down the instance from the top-right corner
Alternatively, shut it down from the notebook page: https://modelscope.cn/my/mynotebook/preset
Step 3: Submit the file and get your first score! (Submission happens on the official site.)
Link: https://challenge.xfyun.cn/h5/detail?type=role-element-extraction&ch=dw24_y0SCtd
1 | Score returned | 0.0932 | submit.txt | baseline | 1gszwJaV | 2024-07-11 18:03:16 |