Giter Site home page Giter Site logo

guanrunwei / findvehicle Goto Github PK

View Code? Open in Web Editor NEW
36.0 2.0 2.0 1.92 MB

FindVehicle: A NER dataset in transportation to extract keywords describing vehicles on the road

dataset intelligent-transportation-systems named-entity-recognition traffic-monitoring natural-language-processing

findvehicle's Introduction

FindVehicle and VehicleFinder: A NER dataset for a text-image cross-modal vehicle retrieval system

๐Ÿ”ฅ๐Ÿ”ฅ๐Ÿ”ฅFindVehicle: The ๐Ÿ”ฅfirst๐Ÿ”ฅ NER dataset in traffic domain for natural language-based vehicle retrieval

๐ŸŽ‰๐ŸŽ‰๐ŸŽ‰VehicleFinder A text-image cross-modal vehicle retrieval system link


FindVehicle

Entity Types of FindVehicle

Dataset Download

Data Link 1: Baidu Cloud Disk Password: xp9o

Data Link 2: Google Drive

Dataset Directory

FindVehicle has 2 data formats, CoNLL-style and jsonlines.

CoNLL-style format

  • FindVehicle_train.txt -> Train set, CoNLL-style annotation, NER Label
  • FindVehicle_test.txt -> Test set, CoNLL-style annotation, NER Label

CoNLL-style Example (Flat Entity)

I O
am O
looking O
for O
a O
white B-vehicle_color
sedan B-vehicle_type
. O

CoNLL-style Example (Overlapped Entity)

I O
am O
looking O
for O
a O
white B-vehicle_color
Audi B-vehicle_brand
Q7 B-vehicle_model
. O

I O
am O
looking O
for O
a O
white B-vehicle_color
Audi B-vehicle_type-suv
Q7 E-vehicle_type-suv
. O

jsonlines format

  • FindVehicle_train.jsonl -> Train set, jsonlines annotation, NER Label, RE Label
  • FindVehicle_test.jsonl -> Test set, jsonlines annotation, NER Label, RE Label

Install jsonlines, then you could read it.

pip install jsonlines

jsonlines Example

{
"id": 41628,
"data": "Let the clever boy help find out the Silver XPeng G3 and lemon yellow Chevrolet Trailblazer in the Bottom Left of the image that driven left .",
"ner_label": [
["vehicle_color", 37, 43, "Silver", 8, 9, ["Silver"]], ### label, char span start index, char span end index, char span check, token span start index, token > > span end index, token span check
["vehicle_brand", 44, 49, "XPeng", 9, 10, ["XPeng"]],
["vehicle_model", 50, 52, "G3", 10, 11, ["G3"]],
["vehicle_color", 57, 69, "lemon yellow", 12, 14, ["lemon", "yellow"]],
["vehicle_brand", 70, 79, "Chevrolet", 14, 15, ["Chevrolet"]],
["vehicle_model", 80, 91, "Trailblazer", 15, 16, ["Trailblazer"]],
["vehicle_location", 99, 110, "Bottom Left", 18, 20, ["Bottom", "Left"]],
["vehicle_orientation", 99, 105, "Bottom", 18, 19, ["Bottom"]]],
"re_label": [[0, 1, 2, 6, 7], [3, 4, 5, 6, 7]]
### the indexes 0,1,2,6,7 refer to one target, indexes 3,4,5,6,7 refer to one target. }

Contributors

  • Runwei Guan [email], University of Liverpool, XJTLU-JITRI, Institute of Deep Perception Technology
  • Feifan Chen [email], University of Liverpool, XJTLU
  • Rongsheng Hu [email], Jiangnan University
  • Shanliang Yao [email], University of Liverpool, XJTLU-JITRI, Institute of Deep Perception Technology
  • Zhou Yuan [email], University of Bristol
  • Sihao Dai [email], University of Southampton
  • Wenjie Zhou [email], Jiangyin Baoneng Precision New Material Co.,Ltd

Citation

@misc{guan2023findvehicle,
      title={FindVehicle and VehicleFinder: A NER dataset for natural language-based vehicle retrieval and a keyword-based cross-modal vehicle retrieval system}, 
      author={Runwei Guan and Ka Lok Man and Feifan Chen and Shanliang Yao and Rongsheng Hu and Xiaohui Zhu and Jeremy Smith and Eng Gee Lim and Yutao Yue},
      year={2023},
      eprint={2304.10893},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Notes: Any problem please send them in Issues.

findvehicle's People

Contributors

guanrunwei avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

jie311 ozlana

findvehicle's Issues

dataset not accessible

given dataset, link address requires website log-in for accessing and downloading files.
Not able to create an account due to a mobile number format issue.
can you please provide another source to access the data-set!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.