jhw5981 / researchtracker

This project is forked from vincentqyw/cv-arxiv-daily.

ResearchTracker is a repository for tracking the most recent research in my areas of interest. Derived from Vincentqyw/cv-arxiv-daily.

Home Page: https://github.com/sjtu-jhw/ResearchTracker

License: Apache License 2.0

Python 100.00%

researchtracker's Introduction

Document Analysis
| Num | Update Date | Title | GPT | Paper ID |
| --- | --- | --- | --- | --- |
| 1 | 2024-03-08 | DeepSeek-VL: Towards Real-World Vision-Language Understanding | No translated results is contained! | 2403.05525v1 |
| 2 | 2024-03-08 | Online Contention Resolution Schemes for Network Revenue Management and Combinatorial Auctions | No translated results is contained! | 2403.05378v1 |
| 3 | 2024-03-07 | Children Age Group Detection based on Human-Computer Interaction and Time Series Analysis | No translated results is contained! | 2403.04574v1 |
| 4 | 2024-03-07 | TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document | No translated results is contained! | 2403.04473v1 |
| 5 | 2024-03-06 | Transformers and Language Models in Form Understanding: A Comprehensive Review of Scanned Document Analysis | No translated results is contained! | 2403.04080v1 |
| 6 | 2024-03-06 | Multimodal Transformer for Comics Text-Cloze | No translated results is contained! | 2403.03719v1 |
| 7 | 2024-03-04 | LOCR: Location-Guided Transformer for Optical Character Recognition | No translated results is contained! | 2403.02127v1 |
| 8 | 2024-03-01 | Large Language Models for Simultaneous Named Entity Extraction and Spelling Correction | No translated results is contained! | 2403.00528v1 |
| 9 | 2024-03-01 | ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting | No translated results is contained! | 2403.00303v1 |
| 10 | 2024-03-01 | Advancing Generative Model Evaluation: A Novel Algorithm for Realistic Image Synthesis and Comparison in OCR System | No translated results is contained! | 2402.17204v3 |
| 11 | 2024-02-23 | Representing Online Handwriting for Recognition in Large Vision-Language Models | No translated results is contained! | 2402.15307v1 |
| 12 | 2024-02-18 | Syntactic Language Change in English and German: Metrics, Parsers, and Convergences | No translated results is contained! | 2402.11549v1 |
| 13 | 2024-02-15 | LAPDoc: Layout-Aware Prompting for Documents | No translated results is contained! | 2402.09841v1 |
| 14 | 2024-02-15 | TEXTRON: Weakly Supervised Multilingual Text Detection through Data Programming | No translated results is contained! | 2402.09811v1 |
| 15 | 2024-02-12 | Beyond the Mud: Datasets and Benchmarks for Computer Vision in Off-Road Racing | No translated results is contained! | 2402.08025v1 |
| 16 | 2024-02-12 | Sheet Music Transformer: End-To-End Optical Music Recognition Beyond Monophonic Transcription | No translated results is contained! | 2402.07596v1 |
| 17 | 2024-02-12 | ClusterTabNet: Supervised clustering method for table detection and table structure recognition | No translated results is contained! | 2402.07502v1 |
| 18 | 2024-02-09 | Deuterated Polystyrene -- Synthesis and uses for ultracold neutron bottles and the neutron EDM experiment | No translated results is contained! | 2402.06469v1 |
| 19 | 2024-02-08 | SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models | No translated results is contained! | 2402.05935v1 |
| 20 | 2024-02-08 | GET-Tok: A GenAI-Enriched Multimodal TikTok Dataset Documenting the 2022 Attempted Coup in Peru | No translated results is contained! | 2402.05882v1 |
| 21 | 2024-02-08 | Text Role Classification in Scientific Charts Using Multimodal Transformers | No translated results is contained! | 2402.14579v1 |
| 22 | 2024-02-08 | Advances and Limitations in Open Source Arabic-Script OCR: A Case Study | No translated results is contained! | 2402.10943v1 |
| 23 | 2024-02-08 | Segmentation-free Connectionist Temporal Classification loss based OCR Model for Text Captcha Classification | No translated results is contained! | 2402.05417v1 |
| 24 | 2024-02-07 | TreeForm: End-to-end Annotation and Evaluation for Form Document Parsing | No translated results is contained! | 2402.05282v1 |
| 25 | 2024-02-07 | Enhancement of Bengali OCR by Specialized Models and Advanced Techniques for Diverse Document Types | No translated results is contained! | 2402.05158v1 |
| 26 | 2024-02-03 | ExTTNet: A Deep Learning Algorithm for Extracting Table Texts from Invoice Images | No translated results is contained! | 2402.02246v1 |
| 27 | 2024-02-01 | Instruction Makes a Difference | No translated results is contained! | 2402.00453v1 |
| 28 | 2024-02-07 | KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization | No translated results is contained! | 2401.18079v2 |
| 29 | 2024-01-31 | Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation | No translated results is contained! | 2401.17904v1 |
| 30 | 2024-01-30 | MouSi: Poly-Visual-Expert Vision-Language Models | No translated results is contained! | 2401.17221v1 |
| 31 | 2024-01-30 | AutoIE: An Automated Framework for Information Extraction from Scientific Literature | No translated results is contained! | 2401.16672v1 |
| 32 | 2024-02-14 | Detecting and recognizing characters in Greek papyri with YOLOv8, DeiT and SimCLR | No translated results is contained! | 2401.12513v2 |
| 33 | 2024-01-22 | Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis | No translated results is contained! | 2401.11874v1 |
| 34 | 2024-01-22 | A Fair Evaluation of Various Deep Learning-Based Document Image Binarization Approaches | No translated results is contained! | 2401.11831v1 |
| 35 | 2024-01-16 | U-DIADS-Bib: a full and few-shot pixel-precise dataset for document layout analysis of ancient manuscripts | No translated results is contained! | 2401.08425v1 |
| 36 | 2024-01-15 | Improving OCR Quality in 19th Century Historical Documents Using a Combined Machine Learning Based Approach | No translated results is contained! | 2401.07787v1 |
| 37 | 2024-01-06 | Semantic Similarity Matching for Patent Documents Using Ensemble BERT-related Model and Novel Text Processing Method | No translated results is contained! | 2401.06782v1 |
| 38 | 2024-01-01 | Efficient Multi-domain Text Recognition Deep Neural Network Parameterization with Residual Adapters | No translated results is contained! | 2401.00971v1 |
| 39 | 2023-12-31 | Bidirectional Trained Tree-Structured Decoder for Handwritten Mathematical Expression Recognition | No translated results is contained! | 2401.00435v1 |
| 40 | 2024-01-31 | An Empirical Study of Scaling Law for OCR | No translated results is contained! | 2401.00028v3 |
| 41 | 2023-12-28 | Chaurah: A Smart Raspberry Pi based Parking System | No translated results is contained! | 2312.16894v1 |
| 42 | 2023-12-26 | 360 Layout Estimation via Orthogonal Planes Disentanglement and Multi-view Geometric Consistency Perception | No translated results is contained! | 2312.16268v1 |
| 43 | 2023-12-20 | The Common Optical Music Recognition Evaluation Framework | No translated results is contained! | 2312.12908v1 |
| 44 | 2023-12-19 | Advancements and Challenges in Arabic Optical Character Recognition: A Comprehensive Survey | No translated results is contained! | 2312.11812v1 |
| 45 | 2023-12-18 | TDeLTA: A Light-weight and Robust Table Detection Method based on Learning Text Arrangement | No translated results is contained! | 2312.11043v1 |
| 46 | 2023-12-16 | When Graph Data Meets Multimodal: A New Paradigm for Graph Understanding and Reasoning | No translated results is contained! | 2312.10372v1 |
| 47 | 2023-12-15 | Information Extraction from Unstructured data using Augmented-AI and Computer Vision | No translated results is contained! | 2312.09880v1 |
| 48 | 2024-01-21 | Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation | No translated results is contained! | 2312.11532v2 |
| 49 | 2023-12-15 | Privacy-Aware Document Visual Question Answering | No translated results is contained! | 2312.10108v1 |
| 50 | 2023-12-15 | Object Recognition from Scientific Document based on Compartment Refinement Framework | No translated results is contained! | 2312.09038v2 |
Data Centric
| Num | Update Date | Title | GPT | Paper ID |
| --- | --- | --- | --- | --- |
| 1 | 2024-03-08 | VTruST: Controllable value function based subset selection for Data-Centric Trustworthy AI | No translated results is contained! | 2403.05174v1 |
| 2 | 2024-03-07 | Dissecting Sample Hardness: A Fine-Grained Analysis of Hardness Characterization Methods for Data-Centric AI | No translated results is contained! | 2403.04551v1 |
| 3 | 2024-03-07 | A data-centric approach to class-specific bias in image data augmentation | No translated results is contained! | 2403.04120v1 |
| 4 | 2024-03-05 | ActiveAD: Planning-Oriented Active Learning for End-to-End Autonomous Driving | No translated results is contained! | 2403.02877v1 |
| 5 | 2024-03-05 | Enhancing Generalization in Medical Visual Question Answering Tasks via Gradient-Guided Model Perturbation | No translated results is contained! | 2403.02707v1 |
| 6 | 2024-03-04 | Model-Based Data-Centric AI: Bridging the Divide Between Academic Ideals and Industrial Pragmatism | No translated results is contained! | 2403.01832v1 |
| 7 | 2024-03-02 | The Science of Data Collection: Insights from Surveys can Improve Machine Learning Models | No translated results is contained! | 2403.01208v1 |
| 8 | 2024-03-01 | ChartReformer: Natural Language-Driven Chart Image Editing | No translated results is contained! | 2403.00209v1 |
| 9 | 2024-02-27 | Side Information-Driven Session-based Recommendation: A Survey | No translated results is contained! | 2402.17129v1 |
| 10 | 2024-02-28 | Dealing with Data for RE: Mitigating Challenges while using NLP and Generative AI | No translated results is contained! | 2402.16977v2 |
| 11 | 2024-02-26 | Uncertainty quantification by direct propagation of shallow ensembles | No translated results is contained! | 2402.16621v1 |
| 12 | 2024-02-28 | DAGnosis: Localized Identification of Data Inconsistencies using Structures | No translated results is contained! | 2402.17599v2 |
| 13 | 2024-02-23 | A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models | No translated results is contained! | 2402.15422v1 |
| 14 | 2024-02-29 | EyeTrans: Merging Human and Machine Attention for Neural Code Summarization | No translated results is contained! | 2402.14096v3 |
| 15 | 2024-02-20 | Static vs. Dynamic Databases for Indoor Localization based on Wi-Fi Fingerprinting: A Discussion from a Data Perspective | No translated results is contained! | 2402.12756v1 |
| 16 | 2024-02-19 | Training Green AI Models Using Elite Samples | No translated results is contained! | 2402.12010v1 |
| 17 | 2024-02-18 | Solving Data-centric Tasks using Large Language Models | No translated results is contained! | 2402.11734v1 |
| 18 | 2024-02-18 | Efficient Multimodal Learning from Data-centric Perspective | No translated results is contained! | 2402.11530v1 |
| 19 | 2024-02-12 | Empowering Federated Learning for Massive Models with NVIDIA FLARE | No translated results is contained! | 2402.07792v1 |
| 20 | 2024-02-21 | Privacy-Preserving Gaze Data Streaming in Immersive Interactive Virtual Reality: Robustness and User Experience | No translated results is contained! | 2402.07687v2 |
| 21 | 2024-02-06 | A Data Centric Approach for Unsupervised Domain Generalization via Retrieval from Web Scale Multimodal Data | No translated results is contained! | 2402.04416v1 |
| 22 | 2024-02-29 | Roadmap on Data-Centric Materials Science | No translated results is contained! | 2402.10932v2 |
| 23 | 2024-02-01 | MobilityDL: A Review of Deep Learning From Trajectory Data | No translated results is contained! | 2402.00732v1 |
| 24 | 2024-02-01 | EXMOS: Explanatory Model Steering Through Multifaceted Explanations and Data Configurations | No translated results is contained! | 2402.00491v1 |
| 25 | 2024-02-02 | A Survey on Data-Centric Recommender Systems | No translated results is contained! | 2401.17878v2 |
| 26 | 2024-01-30 | Towards Urban General Intelligence: A Review and Outlook of Urban Foundation Models | No translated results is contained! | 2402.01749v1 |
| 27 | 2024-01-26 | Toward Practical Automatic Speech Recognition and Post-Processing: a Call for Explainable Error Benchmark Guideline | No translated results is contained! | 2401.14625v1 |
| 28 | 2024-01-26 | Alternative Speech: Complementary Method to Counter-Narrative for Better Discourse | No translated results is contained! | 2401.14616v1 |
| 29 | 2024-02-20 | Challenging Low Homophily in Social Recommendation | No translated results is contained! | 2401.14606v3 |
| 30 | 2024-01-24 | The Landscape of Compute-near-memory and Compute-in-memory: A Research and Commercial Overview | No translated results is contained! | 2401.14428v1 |
| 31 | 2024-01-26 | Data-Centric Evolution in Autonomous Driving: A Comprehensive Survey of Big Data System, Data Mining, and Closed-Loop Technologies | No translated results is contained! | 2401.12888v2 |
| 32 | 2024-01-24 | Falcon: Fair Active Learning using Multi-armed Bandits | No translated results is contained! | 2401.12722v2 |
| 33 | 2024-01-22 | Exploring descriptors for titanium microstructure via digital fingerprints from variational autoencoders | No translated results is contained! | 2401.11967v1 |
| 34 | 2024-01-21 | An Interacting Wasserstein Gradient Flow Strategy to Robust Bayesian Inference | No translated results is contained! | 2401.11607v1 |
| 35 | 2024-01-23 | D2K: Turning Historical Data into Retrievable Knowledge for Recommender Systems | No translated results is contained! | 2401.11478v2 |
| 36 | 2024-01-10 | GOODAT: Towards Test-time Graph Out-of-Distribution Detection | No translated results is contained! | 2401.06176v1 |
| 37 | 2024-01-10 | Inconsistency-Based Data-Centric Active Open-Set Annotation | No translated results is contained! | 2401.04923v1 |
| 38 | 2024-01-13 | Towards Explainable Artificial Intelligence (XAI): A Data Mining Perspective | No translated results is contained! | 2401.04374v2 |
| 39 | 2024-01-08 | Attention versus Contrastive Learning of Tabular Data -- A Data-centric Benchmarking | No translated results is contained! | 2401.04266v1 |
| 40 | 2024-01-04 | Data-Centric Foundation Models in Computational Healthcare: A Survey | No translated results is contained! | 2401.02458v1 |
| 41 | 2024-01-03 | CodeFuse-Query: A Data-Centric Static Code Analysis System for Large-Scale Organizations | No translated results is contained! | 2401.01571v1 |
| 42 | 2024-01-01 | Improve Fidelity and Utility of Synthetic Credit Card Transaction Time Series from Data-centric Perspective | No translated results is contained! | 2401.00965v1 |
| 43 | 2023-12-24 | README: Bridging Medical Jargon and Lay Understanding for Patient Education through Data-Centric NLP | No translated results is contained! | 2312.15561v1 |
| 44 | 2024-02-21 | Towards Message Brokers for Generative AI: Survey, Challenges, and Opportunities | No translated results is contained! | 2312.14647v2 |
| 45 | 2023-12-22 | CaptainCook4D: A dataset for understanding errors in procedural activities | No translated results is contained! | 2312.14556v1 |
| 46 | 2023-12-15 | Quilt: Robust Data Segment Selection against Concept Drifts | No translated results is contained! | 2312.09691v1 |
| 47 | 2023-12-08 | Data-Centric Machine Learning for Geospatial Remote Sensing Data | No translated results is contained! | 2312.05327v1 |
| 48 | 2023-12-08 | A Review On Table Recognition Based On Deep Learning | No translated results is contained! | 2312.04808v1 |
| 49 | 2024-01-31 | Efficient Large Language Models: A Survey | No translated results is contained! | 2312.03863v3 |
| 50 | 2023-12-06 | Data-Centric Digital Agriculture: A Perspective | No translated results is contained! | 2312.03437v1 |
LLM
| Num | Update Date | Title | GPT | Paper ID |
| --- | --- | --- | --- | --- |
| 1 | 2024-03-08 | Bayesian Preference Elicitation with Language Models | No translated results is contained! | 2403.05534v1 |
| 2 | 2024-03-08 | Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context | No translated results is contained! | 2403.05530v1 |
| 3 | 2024-03-08 | GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM | No translated results is contained! | 2403.05527v1 |
| 4 | 2024-03-08 | DeepSeek-VL: Towards Real-World Vision-Language Understanding | No translated results is contained! | 2403.05525v1 |
| 5 | 2024-03-08 | Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapola | No translated results is contained! | 2403.05523v1 |
| 6 | 2024-03-08 | Authorship Attribution in Bangla Literature (AABL) via Transfer Learning using ULMFiT | No translated results is contained! | 2403.05519v1 |
| 7 | 2024-03-08 | Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought | No translated results is contained! | 2403.05518v1 |
| 8 | 2024-03-08 | To Err Is Human, but Llamas Can Learn It Too | No translated results is contained! | 2403.05493v1 |
| 9 | 2024-03-08 | Will GPT-4 Run DOOM? | No translated results is contained! | 2403.05468v1 |
| 10 | 2024-03-08 | Cost-Performance Optimization for Processing Low-Resource Language Tasks Using Commercial LLMs | No translated results is contained! | 2403.05434v1 |
| 11 | 2024-03-08 | Exploring Robust Features for Few-Shot Object Detection in Satellite Imagery | No translated results is contained! | 2403.05381v1 |
| 12 | 2024-03-08 | VLM-PL: Advanced Pseudo Labeling approach Class Incremental Object Detection with Vision-Language Model | No translated results is contained! | 2403.05346v1 |
| 13 | 2024-03-08 | Explaining Pre-Trained Language Models with Attribution Scores: An Analysis in Low-Resource Settings | No translated results is contained! | 2403.05338v1 |
| 14 | 2024-03-08 | ChatASU: Evoking LLM's Reflexion to Truly Understand Aspect Sentiment in Dialogues | No translated results is contained! | 2403.05326v1 |
| 15 | 2024-03-08 | RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation | No translated results is contained! | 2403.05313v1 |
| 16 | 2024-03-08 | Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents | No translated results is contained! | 2403.05307v1 |
| 17 | 2024-03-08 | ACLSum: A New Dataset for Aspect-based Summarization of Scientific Publications | No translated results is contained! | 2403.05303v1 |
| 18 | 2024-03-08 | Modeling Dynamic (De)Allocations of Local Memory for Translation Validation | No translated results is contained! | 2403.05302v1 |
| 19 | 2024-03-08 | LLM4Decompile: Decompiling Binary Code with Large Language Models | No translated results is contained! | 2403.05286v1 |
| 20 | 2024-03-08 | Deep Prompt Multi-task Network for Abuse Language Detection | No translated results is contained! | 2403.05268v1 |
| 21 | 2024-03-08 | ERBench: An Entity-Relationship based Automatically Verifiable Hallucination Benchmark for Large Language Models | No translated results is contained! | 2403.05266v1 |
| 22 | 2024-03-08 | Debiasing Large Visual Language Models | No translated results is contained! | 2403.05262v1 |
| 23 | 2024-03-08 | Cross-lingual Transfer or Machine Translation? On Data Augmentation for Monolingual Semantic Textual Similarity | No translated results is contained! | 2403.05257v1 |
| 24 | 2024-03-08 | Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance | No translated results is contained! | 2403.05231v1 |
| 25 | 2024-03-08 | Harnessing Multi-Role Capabilities of Large Language Models for Open-Domain Question Answering | No translated results is contained! | 2403.05217v1 |
| 26 | 2024-03-08 | SocialPET: Socially Informed Pattern Exploiting Training for Few-Shot Stance Detection in Social Media | No translated results is contained! | 2403.05216v1 |
| 27 | 2024-03-08 | Tracing the Roots of Facts in Multilingual Language Models: Independent, Shared, and Transferred Knowledge | No translated results is contained! | 2403.05189v1 |
| 28 | 2024-03-08 | Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation | No translated results is contained! | 2403.05171v1 |
| 29 | 2024-03-08 | On Protecting the Data Privacy of Large Language Models (LLMs): A Survey | No translated results is contained! | 2403.05156v1 |
| 30 | 2024-03-08 | Towards a Psychology of Machines: Large Language Models Predict Human Memory | No translated results is contained! | 2403.05152v1 |
| 31 | 2024-03-08 | Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem | No translated results is contained! | 2403.05149v1 |
| 32 | 2024-03-08 | Med3DInsight: Enhancing 3D Medical Image Understanding with 2D Multi-Modal Large Language Models | No translated results is contained! | 2403.05141v1 |
| 33 | 2024-03-08 | ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment | No translated results is contained! | 2403.05135v1 |
| 34 | 2024-03-08 | ChatUIE: Exploring Chat-based Unified Information Extraction using Large Language Models | No translated results is contained! | 2403.05132v1 |
| 35 | 2024-03-08 | CLIP-Gaze: Towards General Gaze Estimation via Visual-Linguistic Model | No translated results is contained! | 2403.05124v1 |
| 36 | 2024-03-08 | Benchmarking Large Language Models for Molecule Prediction Tasks | No translated results is contained! | 2403.05075v1 |
| 37 | 2024-03-08 | Can we obtain significant success in RST discourse parsing by using Large Language Models? | No translated results is contained! | 2403.05065v1 |
| 38 | 2024-03-08 | Aligning Large Language Models for Controllable Recommendations | No translated results is contained! | 2403.05063v1 |
| 39 | 2024-03-08 | Multimodal Infusion Tuning for Large Models | No translated results is contained! | 2403.05060v1 |
| 40 | 2024-03-08 | XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution | No translated results is contained! | 2403.05049v1 |
| 41 | 2024-03-08 | Are Human Conversations Special? A Large Language Model Perspective | No translated results is contained! | 2403.05045v1 |
| 42 | 2024-03-08 | Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs | No translated results is contained! | 2403.05020v1 |
| 43 | 2024-03-08 | Can't Remember Details in Long Documents? You Need Some R&R | No translated results is contained! | 2403.05004v1 |
| 44 | 2024-03-08 | DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation | No translated results is contained! | 2403.04997v1 |
| 45 | 2024-03-08 | Know Your Audience: The benefits and pitfalls of generating plain language summaries beyond the "general" audience | No translated results is contained! | 2403.04979v1 |
| 46 | 2024-03-08 | Embracing Large Language and Multimodal Models for Prosthetic Technologies | No translated results is contained! | 2403.04974v1 |
| 47 | 2024-03-08 | Tell me the truth: A system to measure the trustworthiness of Large Language Models | No translated results is contained! | 2403.04964v1 |
| 48 | 2024-03-08 | An In-depth Evaluation of GPT-4 in Sentence Simplification with Error-based Human Assessment | No translated results is contained! | 2403.04963v1 |
| 49 | 2024-03-08 | SecGPT: An Execution Isolation Architecture for LLM-Based Systems | No translated results is contained! | 2403.04960v1 |
| 50 | 2024-03-07 | Automatic and Universal Prompt Injection Attacks against Large Language Models | No translated results is contained! | 2403.04957v1 |

researchtracker's People

Contributors

vincentqyw, jhw5981
