国产人妻人伦精品_欧美一区二区三区图_亚洲欧洲久久_日韩美女av在线免费观看

合肥生活安徽新聞合肥交通合肥房產生活服務合肥教育合肥招聘合肥旅游文化藝術合肥美食合肥地圖合肥社保合肥醫院企業服務合肥法律

COMP3009J代做、代寫Python程序設計

時間:2024-05-29  來源:合肥網hfw.cc  作者:hfw.cc 我要糾錯



COMP3009J – Information Retrieval 
Programming Assignment 
 
This assignment is worth 30% of the final grade for the module. 
Due Date: Friday 31th May 2024 at 23:55 (i.e. end of Week 14) 
 
Before you begin, download and extract the files ``small_corpus.zip’’ and ``large_corpus.zip’’ 
from Brightspace. These contain several files that you will need to complete this assignment. 
The README.md file in each describes the files contained in the archive and their format
1

 
The main objective of the assignment is to create a basic Information Retrieval system that 
can perform preprocessing, indexing, retrieval (using BM25) and evaluation. 
 
The small corpus is intended to show the correctness of your code. The large corpus is 
intended to show the efficiency. Efficiency is only important if the code is firstly correct. 
 
Both corpora are in the same format, except for the relevance judgments. For the small 
corpus, all documents not included in the relevance judgments have been judged nonrelevant.
For the large corpus, documents not included in the relevance judgments have not 
been judged. 
 
For this assignment, you should write several independent programs, each of which is 
contained in one file2. The list of programs is below, with descriptions of each. You may 
choose not to implement all the programs (see the “Grading” section below). However, an A+ 
grade can only be awarded if all these programs have been written correctly and efficiently. 
 
It is ESSENTIAL that all programs can be run as a standalone command-line program, without 
requiring an IDE/environment such as IDLE, PyCharm, Jupyter, etc. 
 
Non-standard libraries (other than the Porter stemmer provided) may not be used. Do not 
use absolute paths (the path to the corpus will always be provided to your program). 
 
What you should submit 
 
Submission of this assignment is through Brightspace. You should submit a single .zip archive 
containing the programs you have written. 
 
1 This is a Markdown file. Although you can open and read it as plain text, proper 
programming editor (e.g. Visual Studio Code) will provide syntax highlighting for better 
readability. 
2 Here, “independent programs” means that they should not import anything from one 
another. If you write a function that is helpful in multiple programs, copy/paste it. This is, of 
course, not good programming practice in terms of reusability of code. However, it helps 
with the grading process. Programs: 
index_small_corpus.py 
 
This program is intended to read the small corpus, process its contents and create an index. 
 
It must be possible to pass the path to the (unzipped) small corpus to this program as a 
command-line argument named “-p”3: 
 
./index_small_corpus.py -p /path/to/comp3009j-corpus-small 
 
This program must perform the following tasks: 
 
1. Extract the documents contained in the corpus provided. You must divide the documents 
into terms in an appropriate way (these are contained in the ``documents’’ directory of the 
corpus. The strategy must be documented in your source code comments. 
 
2. Perform stopword removal. A list of stopwords to use can be loaded from the 
stopwords.txt file that is provided in the ``files’’ directory of the corpus. 
 
3. Perform stemming. For this task, you may use the porter.py code in the ``files’’ 
directory. 
 
4. Create an appropriate index so that IR using the BM25 method may be performed. Here, 
an index is any data structure that is suitable for performing retrieval later. 
 
This will require you to calculate the appropriate weights and do as much pre-calculation as 
you can. This should be stored in a single external file in some human-readable4 format. Do 
not use database systems (e.g. MySQL, SQL Server, SQLite, etc.) for this. 
 
The output of this program should be a single index file, stored in the current working 
directory, named “21888888-small.index” (replacing “21888888” with your UCD 
student number). 
 
 
 
3 This path might, for example be “/Users/david/datasets/comp3009j-corpussmall”
or “C:/Users/datasets/comp3009j-corpus-small”. 
4 Here, “human-readable” means some text-based (i.e. non-binary) format. It should be 
possible to see the contents and the structure of the index using a standard text editor. query_small_corpus.py 
 
This program allows a user to submit queries to retrieve from the small corpus, or to run the 
standard corpus queries so that the system can be evaluated. The BM25 model must be used 
for retrieval. 
 
Every time this program runs, it should first load the index into memory (named “21888888-
small.index” in the current working directory, replacing “21888888” with your UCD student 
number), so that querying can be as fast as possible. 
 
This program should offer two modes, depending on a command-line argument named “-
m”. These are as follows: 
 
1. Interactive mode 
 
In this mode, a user can manually type in queries and see the first 15 results in their 
command line, sorted beginning with the highest similarity score. The output should have 
three columns: the rank, the document’s ID, and the similarity score. A sample run of the 
program is contained later in this document. The user should continue to be prompted to 
enter further queries until they type “QUIT”. 
 
Example output is given below. 
 
Interactive mode is activated by running the program in the following way: 
 
./query_small_corpus.py -m interactive -p /path/to/comp3009j-corpus-small 
 
2. Automatic mode 
 
In this mode, the standard queries should be read from the ``queries.txt’’ file (in the 
``files’’ directory of the corpus). This file has a query on each line, beginning with its 
query ID. The results5 should be stored in a file named “218888880-small.results" 
in the current working directory (replacing “21888888” with your UCD student number), 
which should include four columns: query ID, document ID, rank and similarity score. A 
sample of the desired output can be found in the “sample_output.txt” file in the 
“files” directory in the corpus. 
 
Automatic mode is activated by running the program in the following way: 
 
./query_small_corpus.py -m automatic -p /path/to/comp3009j-corpus-small 
 
 
 
5 You will need to decide how many results to store for each query. evaluate_small_corpus.py 
 
This program calculates suitable evaluation metrics, based on the output of the automatic 
mode of query_small_corpus.py (stored in “218888880-small.results" in the 
current working directory (replacing “21888888” with your UCD student number). 
 
The program should calculate the following metrics, based on the relevance judgments 
contained in the ``qrels.txt’’ file in the ``files’’ directory of the corpus): 
- Precision 
- Recall 
- R-Precision 
- P@15 
- NDCG@15 
- MAP 
 
The program should be run in the following way: 
./evaluate_small_corpus.py -p /path/to/comp3009j-corpus-small 
 index_large_corpus.py 
 
This program should perform the same tasks as index_small_corpus.py, except that the 
output file should be named “21888888-large.index” (replacing “21888888” with your 
UCD student number). 
 
query_large_corpus.py 
 
This program should perform the same tasks as query_small_corpus.py, except that the 
output results file should be named “21888888-large.results” (replacing “21888888” 
with your UCD student number). 
 
evaluate_large_corpus.py 
 
In addition to the evaluation metrics calculated by evaluate_small_corpus.py, this 
program should also calculate bpref (since the large corpus has incomplete relevance 
judgments). 
 
Otherwise, this program should perform the same tasks as evaluate_small_corpus.py, 
except that the input results file should be named “21888888-large.results” (replacing 
“21888888” with your UCD student number). 
 
 Sample Run (Interactive) 
$ ./query_small_corpus.py -m interactive -p /Users/david/comp3009j-corpus-small 
Loading BM25 index from file, please wait. 
Enter query: library information conference 
 
Results for query [library information conference] 
1 928 0.991997 
2 1109 0.984280 
3 1184 0.979530 
4 309 0.96**75 
5 533 0.918940 
6 710 0.912594 
**88 0.894091 
8 1311 0.8**748 
9 960 0.845044 
10 717 0.833753 
11 77 0.829261 
12 1129 0.821643 
13 783 0.817639 
14 1312 0.804034 
15 423 0.795264 
Enter query: QUIT 
Note: In all of these examples, the results, and similarity scores were generated at random for 
illustration purposes, so they are not correct scores. 
Sample Run (Evaluation) 
$ ./evaluate_large_corpus.py -p /Users/david/comp3009j-corpus-large 
 
Evaluation results: 
Precision: 0.138 
Recall: 0.412 
R-precision: 0.345 
P@15: 0.621 
NDCG@15 0.123 
MAP: 0.253 
bpref: 0.345 
 
 Grading 
 
Grading is based on the following (with the given weights)6: 
- Document reading and preprocessing: 15% 
- Indexing: 20% 
- Retrieval with BM25: 20% 
- Evaluation: 15% 
- Efficiency: 15% (as evidenced by the performance on the large corpus) 
- Programming style (comments/organisation): 15% 
 
Other notes 
1. This is an individual assignment. All code submitted must be your own work. Submitting the work 
of somebody else or generated by AI tools such as ChatGPT is plagiarism, which is a serious 
academic offence. Be familiar with the UCD Plagiarism Policy and the UCD School of Computer 
Science Plagiarism Policy. 
2. If you have questions about what is or is not plagiarism, ask! 
 
Document Version History 
v1.0: 2024-04-26, Initial Version. 
 
6This assignment will be graded using the “Alternative Linear Conversion Grade Scale 40% 
Pass” Mark to Grade Conversation Scale: 

請加QQ:99515681  郵箱:99515681@qq.com   WX:codinghelp






















 

掃一掃在手機打開當前頁
  • 上一篇: XJCO1921代做、代寫c/c++編程語言
  • 下一篇:菲律賓商務簽證入境稅費 菲律賓商務簽證的辦理材料
  • 無相關信息
    合肥生活資訊

    合肥圖文信息
    流體仿真外包多少錢_專業CFD分析代做_友商科技CAE仿真
    流體仿真外包多少錢_專業CFD分析代做_友商科
    CAE仿真分析代做公司 CFD流體仿真服務 管路流場仿真外包
    CAE仿真分析代做公司 CFD流體仿真服務 管路
    流體CFD仿真分析_代做咨詢服務_Fluent 仿真技術服務
    流體CFD仿真分析_代做咨詢服務_Fluent 仿真
    結構仿真分析服務_CAE代做咨詢外包_剛強度疲勞振動
    結構仿真分析服務_CAE代做咨詢外包_剛強度疲
    流體cfd仿真分析服務 7類仿真分析代做服務40個行業
    流體cfd仿真分析服務 7類仿真分析代做服務4
    超全面的拼多多電商運營技巧,多多開團助手,多多出評軟件徽y1698861
    超全面的拼多多電商運營技巧,多多開團助手
    CAE有限元仿真分析團隊,2026仿真代做咨詢服務平臺
    CAE有限元仿真分析團隊,2026仿真代做咨詢服
    釘釘簽到打卡位置修改神器,2026怎么修改定位在范圍內
    釘釘簽到打卡位置修改神器,2026怎么修改定
  • 短信驗證碼 豆包網頁版入口 破天一劍 目錄網 排行網

    關于我們 | 打賞支持 | 廣告服務 | 聯系我們 | 網站地圖 | 免責聲明 | 幫助中心 | 友情鏈接 |

    Copyright © 2025 hfw.cc Inc. All Rights Reserved. 合肥網 版權所有
    ICP備06013414號-3 公安備 42010502001045

    国产人妻人伦精品_欧美一区二区三区图_亚洲欧洲久久_日韩美女av在线免费观看
    成人av在线播放观看| 青青久久av北条麻妃黑人| 动漫一区二区在线| 国产一区视频观看| 久久黄色av网站| 性色av一区二区咪爱| 国产精品自拍偷拍视频| 久久亚洲影音av资源网| 欧日韩免费视频| 久久精品第九区免费观看| 亚洲精品欧洲精品| 国产精品永久免费视频| 国产精品福利网| 精品欧美日韩在线| 日韩视频亚洲视频| 亚洲 国产 日韩 综合一区| 国产小视频免费| 久久国产精品久久久久久| 免费在线观看一区二区| 日韩在线观看免费高清| 日韩亚洲在线视频| 国产成人精品日本亚洲11 | 奇米影视首页 狠狠色丁香婷婷久久综合| 99视频在线免费| 亚洲第一精品区| 69精品小视频| 日本一区高清在线视频| 久久久av水蜜桃| 久久中文字幕在线| 久久久亚洲欧洲日产国码aⅴ| 狠狠精品干练久久久无码中文字幕| 天堂√在线观看一区二区| 国产精品视频99| 国产欧美精品一区二区三区介绍 | 青青a在线精品免费观看| 欧美日本精品在线| 日韩有码片在线观看| 91久久精品国产| 日韩一级片播放| 99在线视频首页| 欧美激情一级欧美精品| 成人av男人的天堂| 精品国产第一页| 超碰国产精品久久国产精品99| 亚洲欧洲日夜超级视频| 国产成人一区三区| 黄频视频在线观看| 中文字幕人成一区| 69av在线播放| 欧美综合国产精品久久丁香| 久久久精品电影| 国产欧美日韩精品专区| 亚洲av综合色区| 日韩中文字幕视频在线| 国产主播一区二区三区四区| 亚洲欧美日韩国产成人综合一二三区 | 久久69精品久久久久久久电影好| 成人综合视频在线| 色女人综合av| 国产精品美女久久久久av超清 | 午夜精品视频在线| 久久久精品网站| 国产日本欧美一区二区三区 | 日本一区二区三区在线视频| 久久riav| 国内一区二区三区在线视频| 欧美激情一级欧美精品| 国产成人97精品免费看片| 欧美 日韩 国产在线观看| 欧美另类在线播放| 91久久久久久久久| 视频一区亚洲| 国产精品久久久久福利| 91精品久久久久久久久久入口 | yy111111少妇影院日韩夜片| 日韩亚洲在线视频| 色综合导航网站| 国产成人精品免高潮在线观看| 国内精品久久国产| 日韩中文字幕av在线| 欧美精品情趣视频| 九色视频成人porny| 国产精品自拍合集| 欧美日本亚洲| 色综合久久av| 欧美激情xxxx| 国产精品日韩在线一区| 2019日本中文字幕| 国产日韩久久| 人妻夜夜添夜夜无码av| 亚洲国产精品www| 欧美xxxx综合视频| 久久成人免费观看| 99久久精品无码一区二区毛片| 欧美 日韩 国产在线观看| 色999日韩自偷自拍美女| 九九热这里只有精品6| 日韩中文字幕av| 国产精品69久久| 国产精选一区二区| 麻豆成人小视频| 欧美专区福利在线| 亚洲黄色网址在线观看| 精品久久久久久综合日本| 久久久久久久999| 97人人模人人爽人人喊38tv| 国产在线精品91| 欧美成人第一区| 日韩精品一区二区三区四区五区 | 国产一区二区网| 日韩精品―中文字幕| 熟妇人妻va精品中文字幕| 亚洲专区中文字幕| 尤物一区二区三区| 欧美精品一区二区免费| 国产精品爽黄69天堂a| 久久99精品久久久久久久青青日本 | 亚洲不卡一卡2卡三卡4卡5卡精品| 国产aⅴ精品一区二区三区黄| 国产精品激情自拍| 国产精品久久久久久久久久东京 | 国产精品一区视频| 国产亚洲综合视频| 精品一区2区三区| 红桃av在线播放| 黄色一级片在线看| 免费高清一区二区三区| 美日韩精品免费| 国模吧无码一区二区三区| 蜜桃传媒一区二区| 欧美国产综合在线| 国内精品久久久久久| 麻豆精品视频| 国产情侣av自拍| 成人av电影免费| 91免费国产精品| 久久久一本精品99久久精品| 久久九九视频| 日韩亚洲第一页| 国产精品久久久久影院日本| 久久夜色撩人精品| 欧美日韩xxxxx| 午夜一区二区三区| 日韩网站在线免费观看| 日韩伦理一区二区三区av在线| 欧美日韩精品免费观看| 国模精品娜娜一二三区| 国产精品有限公司| 久久综合给合久久狠狠色| 国产成人亚洲综合青青| 日韩有码视频在线| 久久这里有精品视频| 欧美激情综合色| 午夜精品久久久久久久久久久久久 | 欧美有码在线观看视频| 国内自拍在线观看| 国产日韩在线观看av| 国产精品一码二码三码在线| 99精品欧美一区二区三区| 91精品国产综合久久香蕉的用户体验| 国产精品18毛片一区二区| 日韩中文理论片| 精品综合久久久久久97| 中文字幕日韩精品无码内射| 欧美一区二区三区免费观看| 秋霞久久久久久一区二区| 国产在线青青草| 99爱精品视频| 国产精品丝袜白浆摸在线| 亚洲午夜久久久影院伊人 | 日韩久久一级片| 国产私拍一区| 国产精国产精品| 久久精视频免费在线久久完整在线看| 麻豆国产精品va在线观看不卡| 亚洲伊人成综合成人网| 日本精品免费在线观看| 黄色一级二级三级| 99在线高清视频在线播放| 国产成人精品一区二区三区福利| 久久国产视频网站| 日本高清视频免费在线观看| 国产一区在线观| 国产精品一区二区久久| 日韩在线视频观看正片免费网站| 精品久久久久久久久久中文字幕 | 精品一区二区日本| 91福利视频在线观看| 精品国产依人香蕉在线精品| 在线播放豆国产99亚洲| 欧美日本韩国在线| 91精品国产自产在线老师啪| 国产精品久久久久久av福利软件| 欧美一区二区三区免费视| 精品少妇人妻av一区二区| 久久99精品久久久久久久久久| 欧美日韩国产999| 欧洲久久久久久| 久青草视频在线播放| 欧美精品第一页在线播放|