
Sunday 24 April 2022

AI-Paper-collector


Fully automated scripts for collecting AI-related papers. Supports fuzzy and exact search over paper titles. (AI-Paper-collector is really an AI-Paper-searcher: a search engine for AI papers.)

AI-Search-Engine.

Tips:
- Enter "q" at any prompt to exit the program.
- Enter "#" at any prompt to clear the screen.
- Enter "help" at any prompt to see the help.
- Entering nothing searches with the default mode: exact.

Search Categories

- [ACL 2019-2021] [EMNLP 2019-2021] [NAACL 2019-2021] [COLING 2020]
- [CVPR 2019-2021] [ECCV 2020] [ICCV 2019] [ACMMM 2019-2021]
- [ICLR 2019-2021] [ICML 2019-2021] [AAAI 2019-2021] [IJCAI 2019-2021]
- [SIGIR 2019-2021] [KDD 2019-2021] [CIKM 2019-2021] [WSDM 2019-2022]
- [WWW 2019-2021] [ECIR 2019-2022]

Requirements

pip install python-Levenshtein thefuzz beautifulsoup4 requests tqdm

Usage

python main.py

Help information

Search Commands:
- --mode <mode: fuzzy|exact> [optional: --threshold <num>] [optional: --conf <string/list(string)>]
- e.g. "--mode fuzzy --threshold 50" runs a fuzzy search over all papers, keeping titles with similarity >= 50
- e.g. "--mode exact --conf ACL" runs an exact search over all papers in ACL
- e.g. "--mode exact --conf ACL,CVPR" runs an exact search over all papers in ACL and CVPR
- Note that the threshold applies only to fuzzy search and ranges from 0 to 100 (default: 50)
- Note that multiple confs must be separated by commas (e.g. "ACL,CVPR")
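
To make the command syntax above concrete, here is a minimal sketch of how such a command string could be parsed with Python's standard argparse and shlex modules. The function name parse_search_command is hypothetical and this is not necessarily how main.py does it; only the flag names, the 0 to 100 range, and the defaults come from the help text.

```python
import argparse
import shlex


def parse_search_command(line: str) -> argparse.Namespace:
    """Parse a command string such as '--mode fuzzy --threshold 50'.

    Hypothetical helper for illustration; not taken from the project.
    """
    parser = argparse.ArgumentParser(prog="search", add_help=False)
    parser.add_argument("--mode", choices=["fuzzy", "exact"], default="exact")
    parser.add_argument("--threshold", type=int, default=50)
    parser.add_argument("--conf", default="")
    # shlex.split("") returns [], so empty input falls back to the defaults.
    args = parser.parse_args(shlex.split(line))
    if not 0 <= args.threshold <= 100:
        raise ValueError("threshold must be between 0 and 100")
    # Turn "ACL,CVPR" into ["ACL", "CVPR"]; an empty list means all confs.
    args.conf = [c.strip() for c in args.conf.split(",") if c.strip()]
    return args


if __name__ == "__main__":
    print(parse_search_command("--mode exact --conf ACL,CVPR"))
```

Under these assumptions, an empty input line yields exact mode with the default threshold and no conf filter, which matches the default described in the tips.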

Example

[+] Initializing System...
[+] Loading from cache...
[+] Enter 'help' into any input for more information when first starting.
[+] Enter your query: few-shot 
[+] Enter Search Commands: --mode fuzzy --conf SIGIR,WSDM,CIKM
[+] Search Results:
[=] Only show Top-5, Please Save results to see all.
[1] [CIKM2021] REFORM: Error-Aware Few-Shot Knowledge Graph Completion.
[2] [CIKM2021] Boosting Few-shot Abstractive Summarization with Auxiliary Tasks.
[3] [CIKM2021] Multi-objective Few-shot Learning for Fair Classification.
[4] [CIKM2020] Graph Few-shot Learning with Attribute Matching.
[5] [CIKM2020] Few-shot Insider Threat Detection.
[+] Enter Save filename: 
[+] Writing results to output/results.txt
[+] Writing results Done!
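
The session above can be approximated with a short sketch of exact and fuzzy title search. This is an illustration under assumptions, not the project's code: the project lists thefuzz as a dependency, while this sketch swaps in difflib.SequenceMatcher from the standard library so that it runs without extra packages, and the paper record layout (a dict with "conf" and "title" keys) and the function names are hypothetical.

```python
from difflib import SequenceMatcher


def similarity(query: str, title: str) -> int:
    """Score 0-100 for the best match of query against any window of title.

    Stand-in for a thefuzz scorer; the actual scoring in the project may differ.
    """
    q, t = query.lower(), title.lower()
    if len(q) >= len(t):
        return round(100 * SequenceMatcher(None, q, t).ratio())
    best = max(
        SequenceMatcher(None, q, t[i:i + len(q)]).ratio()
        for i in range(len(t) - len(q) + 1)
    )
    return round(100 * best)


def search(papers, query, mode="exact", threshold=50, confs=(), top_k=5):
    """Return up to top_k papers matching query, best score first."""
    results = []
    for paper in papers:
        # Assumption: "CIKM" selects every entry whose conf starts with it,
        # e.g. "CIKM2020" and "CIKM2021".
        if confs and not any(paper["conf"].startswith(c) for c in confs):
            continue
        if mode == "exact":
            # Exact mode here means a case-insensitive substring match.
            if query.lower() in paper["title"].lower():
                results.append((100, paper))
        else:
            score = similarity(query, paper["title"])
            if score >= threshold:
                results.append((score, paper))
    # Sort on the score only, so the paper dicts are never compared.
    results.sort(key=lambda item: item[0], reverse=True)
    return [paper for _, paper in results[:top_k]]


if __name__ == "__main__":
    papers = [
        {"conf": "CIKM2020", "title": "Few-shot Insider Threat Detection."},
        {"conf": "CIKM2020", "title": "Graph Few-shot Learning with Attribute Matching."},
    ]
    for p in search(papers, "few-shot", mode="fuzzy", confs=["CIKM"]):
        print(f"[{p['conf']}] {p['title']}")
```

The top_k default of 5 mirrors the "Only show Top-5" line in the session; the full result list would be what gets written to the save file.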

How to add new conferences from DBLP

  • clean the cache
rm -rf cache/cache.json
  • add new conferences by modifying the conf/dblp_conf.json file; each entry gives the name and DBLP URL of the new conf
[
    {
        "name": "WWW2021",
        "url": "https://dblp.org/db/conf/www/www2021.html"
    },
    ... 
]
  • run the script
python main.py
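
The first two steps above could also be scripted. The following is a minimal sketch, assuming conf/dblp_conf.json is plain JSON (JSON itself has no comment syntax) holding a list of {"name", "url"} objects as in the snippet; the helper name add_conference is hypothetical and not part of the project.

```python
import json
from pathlib import Path


def add_conference(conf_path, cache_path, name, url):
    """Append a conference entry to the config file and clear the cache.

    Hypothetical helper; the paths are passed in rather than hard-coded.
    """
    conf_file = Path(conf_path)
    confs = []
    if conf_file.exists():
        confs = json.loads(conf_file.read_text(encoding="utf-8"))
    # Skip the append if an entry with the same name is already present.
    if not any(entry["name"] == name for entry in confs):
        confs.append({"name": name, "url": url})
    conf_file.write_text(json.dumps(confs, indent=4) + "\n", encoding="utf-8")
    # Same effect as `rm -rf cache/cache.json` (missing_ok needs Python 3.8+).
    Path(cache_path).unlink(missing_ok=True)
    return confs


if __name__ == "__main__":
    add_conference(
        "conf/dblp_conf.json",
        "cache/cache.json",
        "WWW2021",
        "https://dblp.org/db/conf/www/www2021.html",
    )
```

After running it, python main.py still has to be run as in the last step.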
from https://github.com/Doragd/AI-Paper-collector
