The researchtracker from jhw5981

Document Analysis

Num	Update Date	Title	GPT	Paper ID
1	2024-03-08	DeepSeek-VL: Towards Real-World Vision-Language Understanding	No translated results is contained!	2403.05525v1
2	2024-03-08	Online Contention Resolution Schemes for Network Revenue Management and Combinatorial Auctions	No translated results is contained!	2403.05378v1
3	2024-03-07	Children Age Group Detection based on Human-Computer Interaction and Time Series Analysis	No translated results is contained!	2403.04574v1
4	2024-03-07	TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document	No translated results is contained!	2403.04473v1
5	2024-03-06	Transformers and Language Models in Form Understanding: A Comprehensive Review of Scanned Document Analysis	No translated results is contained!	2403.04080v1
6	2024-03-06	Multimodal Transformer for Comics Text-Cloze	No translated results is contained!	2403.03719v1
7	2024-03-04	LOCR: Location-Guided Transformer for Optical Character Recognition	No translated results is contained!	2403.02127v1
8	2024-03-01	Large Language Models for Simultaneous Named Entity Extraction and Spelling Correction	No translated results is contained!	2403.00528v1
9	2024-03-01	ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting	No translated results is contained!	2403.00303v1
10	2024-03-01	Advancing Generative Model Evaluation: A Novel Algorithm for Realistic Image Synthesis and Comparison in OCR System	No translated results is contained!	2402.17204v3
11	2024-02-23	Representing Online Handwriting for Recognition in Large Vision-Language Models	No translated results is contained!	2402.15307v1
12	2024-02-18	Syntactic Language Change in English and German: Metrics, Parsers, and Convergences	No translated results is contained!	2402.11549v1
13	2024-02-15	LAPDoc: Layout-Aware Prompting for Documents	No translated results is contained!	2402.09841v1
14	2024-02-15	TEXTRON: Weakly Supervised Multilingual Text Detection through Data Programming	No translated results is contained!	2402.09811v1
15	2024-02-12	Beyond the Mud: Datasets and Benchmarks for Computer Vision in Off-Road Racing	No translated results is contained!	2402.08025v1
16	2024-02-12	Sheet Music Transformer: End-To-End Optical Music Recognition Beyond Monophonic Transcription	No translated results is contained!	2402.07596v1
17	2024-02-12	ClusterTabNet: Supervised clustering method for table detection and table structure recognition	No translated results is contained!	2402.07502v1
18	2024-02-09	Deuterated Polystyrene -- Synthesis and uses for ultracold neutron bottles and the neutron EDM experiment	No translated results is contained!	2402.06469v1
19	2024-02-08	SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models	No translated results is contained!	2402.05935v1
20	2024-02-08	GET-Tok: A GenAI-Enriched Multimodal TikTok Dataset Documenting the 2022 Attempted Coup in Peru	No translated results is contained!	2402.05882v1
21	2024-02-08	Text Role Classification in Scientific Charts Using Multimodal Transformers	No translated results is contained!	2402.14579v1
22	2024-02-08	Advances and Limitations in Open Source Arabic-Script OCR: A Case Study	No translated results is contained!	2402.10943v1
23	2024-02-08	Segmentation-free Connectionist Temporal Classification loss based OCR Model for Text Captcha Classification	No translated results is contained!	2402.05417v1
24	2024-02-07	TreeForm: End-to-end Annotation and Evaluation for Form Document Parsing	No translated results is contained!	2402.05282v1
25	2024-02-07	Enhancement of Bengali OCR by Specialized Models and Advanced Techniques for Diverse Document Types	No translated results is contained!	2402.05158v1
26	2024-02-03	ExTTNet: A Deep Learning Algorithm for Extracting Table Texts from Invoice Images	No translated results is contained!	2402.02246v1
27	2024-02-01	Instruction Makes a Difference	No translated results is contained!	2402.00453v1
28	2024-02-07	KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization	No translated results is contained!	2401.18079v2
29	2024-01-31	Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation	No translated results is contained!	2401.17904v1
30	2024-01-30	MouSi: Poly-Visual-Expert Vision-Language Models	No translated results is contained!	2401.17221v1
31	2024-01-30	AutoIE: An Automated Framework for Information Extraction from Scientific Literature	No translated results is contained!	2401.16672v1
32	2024-02-14	Detecting and recognizing characters in Greek papyri with YOLOv8, DeiT and SimCLR	No translated results is contained!	2401.12513v2
33	2024-01-22	Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis	No translated results is contained!	2401.11874v1
34	2024-01-22	A Fair Evaluation of Various Deep Learning-Based Document Image Binarization Approaches	No translated results is contained!	2401.11831v1
35	2024-01-16	U-DIADS-Bib: a full and few-shot pixel-precise dataset for document layout analysis of ancient manuscripts	No translated results is contained!	2401.08425v1
36	2024-01-15	Improving OCR Quality in 19th Century Historical Documents Using a Combined Machine Learning Based Approach	No translated results is contained!	2401.07787v1
37	2024-01-06	Semantic Similarity Matching for Patent Documents Using Ensemble BERT-related Model and Novel Text Processing Method	No translated results is contained!	2401.06782v1
38	2024-01-01	Efficient Multi-domain Text Recognition Deep Neural Network Parameterization with Residual Adapters	No translated results is contained!	2401.00971v1
39	2023-12-31	Bidirectional Trained Tree-Structured Decoder for Handwritten Mathematical Expression Recognition	No translated results is contained!	2401.00435v1
40	2024-01-31	An Empirical Study of Scaling Law for OCR	No translated results is contained!	2401.00028v3
41	2023-12-28	Chaurah: A Smart Raspberry Pi based Parking System	No translated results is contained!	2312.16894v1
42	2023-12-26	360 Layout Estimation via Orthogonal Planes Disentanglement and Multi-view Geometric Consistency Perception	No translated results is contained!	2312.16268v1
43	2023-12-20	The Common Optical Music Recognition Evaluation Framework	No translated results is contained!	2312.12908v1
44	2023-12-19	Advancements and Challenges in Arabic Optical Character Recognition: A Comprehensive Survey	No translated results is contained!	2312.11812v1
45	2023-12-18	TDeLTA: A Light-weight and Robust Table Detection Method based on Learning Text Arrangement	No translated results is contained!	2312.11043v1
46	2023-12-16	When Graph Data Meets Multimodal: A New Paradigm for Graph Understanding and Reasoning	No translated results is contained!	2312.10372v1
47	2023-12-15	Information Extraction from Unstructured data using Augmented-AI and Computer Vision	No translated results is contained!	2312.09880v1
48	2024-01-21	Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation	No translated results is contained!	2312.11532v2
49	2023-12-15	Privacy-Aware Document Visual Question Answering	No translated results is contained!	2312.10108v1
50	2023-12-15	Object Recognition from Scientific Document based on Compartment Refinement Framework	No translated results is contained!	2312.09038v2

Data Centric

Num	Update Date	Title	GPT	Paper ID
1	2024-03-08	VTruST: Controllable value function based subset selection for Data-Centric Trustworthy AI	No translated results is contained!	2403.05174v1
2	2024-03-07	Dissecting Sample Hardness: A Fine-Grained Analysis of Hardness Characterization Methods for Data-Centric AI	No translated results is contained!	2403.04551v1
3	2024-03-07	A data-centric approach to class-specific bias in image data augmentation	No translated results is contained!	2403.04120v1
4	2024-03-05	ActiveAD: Planning-Oriented Active Learning for End-to-End Autonomous Driving	No translated results is contained!	2403.02877v1
5	2024-03-05	Enhancing Generalization in Medical Visual Question Answering Tasks via Gradient-Guided Model Perturbation	No translated results is contained!	2403.02707v1
6	2024-03-04	Model-Based Data-Centric AI: Bridging the Divide Between Academic Ideals and Industrial Pragmatism	No translated results is contained!	2403.01832v1
7	2024-03-02	The Science of Data Collection: Insights from Surveys can Improve Machine Learning Models	No translated results is contained!	2403.01208v1
8	2024-03-01	ChartReformer: Natural Language-Driven Chart Image Editing	No translated results is contained!	2403.00209v1
9	2024-02-27	Side Information-Driven Session-based Recommendation: A Survey	No translated results is contained!	2402.17129v1
10	2024-02-28	Dealing with Data for RE: Mitigating Challenges while using NLP and Generative AI	No translated results is contained!	2402.16977v2
11	2024-02-26	Uncertainty quantification by direct propagation of shallow ensembles	No translated results is contained!	2402.16621v1
12	2024-02-28	DAGnosis: Localized Identification of Data Inconsistencies using Structures	No translated results is contained!	2402.17599v2
13	2024-02-23	A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models	No translated results is contained!	2402.15422v1
14	2024-02-29	EyeTrans: Merging Human and Machine Attention for Neural Code Summarization	No translated results is contained!	2402.14096v3
15	2024-02-20	Static vs. Dynamic Databases for Indoor Localization based on Wi-Fi Fingerprinting: A Discussion from a Data Perspective	No translated results is contained!	2402.12756v1
16	2024-02-19	Training Green AI Models Using Elite Samples	No translated results is contained!	2402.12010v1
17	2024-02-18	Solving Data-centric Tasks using Large Language Models	No translated results is contained!	2402.11734v1
18	2024-02-18	Efficient Multimodal Learning from Data-centric Perspective	No translated results is contained!	2402.11530v1
19	2024-02-12	Empowering Federated Learning for Massive Models with NVIDIA FLARE	No translated results is contained!	2402.07792v1
20	2024-02-21	Privacy-Preserving Gaze Data Streaming in Immersive Interactive Virtual Reality: Robustness and User Experience	No translated results is contained!	2402.07687v2
21	2024-02-06	A Data Centric Approach for Unsupervised Domain Generalization via Retrieval from Web Scale Multimodal Data	No translated results is contained!	2402.04416v1
22	2024-02-29	Roadmap on Data-Centric Materials Science	No translated results is contained!	2402.10932v2
23	2024-02-01	MobilityDL: A Review of Deep Learning From Trajectory Data	No translated results is contained!	2402.00732v1
24	2024-02-01	EXMOS: Explanatory Model Steering Through Multifaceted Explanations and Data Configurations	No translated results is contained!	2402.00491v1
25	2024-02-02	A Survey on Data-Centric Recommender Systems	No translated results is contained!	2401.17878v2
26	2024-01-30	Towards Urban General Intelligence: A Review and Outlook of Urban Foundation Models	No translated results is contained!	2402.01749v1
27	2024-01-26	Toward Practical Automatic Speech Recognition and Post-Processing: a Call for Explainable Error Benchmark Guideline	No translated results is contained!	2401.14625v1
28	2024-01-26	Alternative Speech: Complementary Method to Counter-Narrative for Better Discourse	No translated results is contained!	2401.14616v1
29	2024-02-20	Challenging Low Homophily in Social Recommendation	No translated results is contained!	2401.14606v3
30	2024-01-24	The Landscape of Compute-near-memory and Compute-in-memory: A Research and Commercial Overview	No translated results is contained!	2401.14428v1
31	2024-01-26	Data-Centric Evolution in Autonomous Driving: A Comprehensive Survey of Big Data System, Data Mining, and Closed-Loop Technologies	No translated results is contained!	2401.12888v2
32	2024-01-24	Falcon: Fair Active Learning using Multi-armed Bandits	No translated results is contained!	2401.12722v2
33	2024-01-22	Exploring descriptors for titanium microstructure via digital fingerprints from variational autoencoders	No translated results is contained!	2401.11967v1
34	2024-01-21	An Interacting Wasserstein Gradient Flow Strategy to Robust Bayesian Inference	No translated results is contained!	2401.11607v1
35	2024-01-23	D2K: Turning Historical Data into Retrievable Knowledge for Recommender Systems	No translated results is contained!	2401.11478v2
36	2024-01-10	GOODAT: Towards Test-time Graph Out-of-Distribution Detection	No translated results is contained!	2401.06176v1
37	2024-01-10	Inconsistency-Based Data-Centric Active Open-Set Annotation	No translated results is contained!	2401.04923v1
38	2024-01-13	Towards Explainable Artificial Intelligence (XAI): A Data Mining Perspective	No translated results is contained!	2401.04374v2
39	2024-01-08	Attention versus Contrastive Learning of Tabular Data -- A Data-centric Benchmarking	No translated results is contained!	2401.04266v1
40	2024-01-04	Data-Centric Foundation Models in Computational Healthcare: A Survey	No translated results is contained!	2401.02458v1
41	2024-01-03	CodeFuse-Query: A Data-Centric Static Code Analysis System for Large-Scale Organizations	No translated results is contained!	2401.01571v1
42	2024-01-01	Improve Fidelity and Utility of Synthetic Credit Card Transaction Time Series from Data-centric Perspective	No translated results is contained!	2401.00965v1
43	2023-12-24	README: Bridging Medical Jargon and Lay Understanding for Patient Education through Data-Centric NLP	No translated results is contained!	2312.15561v1
44	2024-02-21	Towards Message Brokers for Generative AI: Survey, Challenges, and Opportunities	No translated results is contained!	2312.14647v2
45	2023-12-22	CaptainCook4D: A dataset for understanding errors in procedural activities	No translated results is contained!	2312.14556v1
46	2023-12-15	Quilt: Robust Data Segment Selection against Concept Drifts	No translated results is contained!	2312.09691v1
47	2023-12-08	Data-Centric Machine Learning for Geospatial Remote Sensing Data	No translated results is contained!	2312.05327v1
48	2023-12-08	A Review On Table Recognition Based On Deep Learning	No translated results is contained!	2312.04808v1
49	2024-01-31	Efficient Large Language Models: A Survey	No translated results is contained!	2312.03863v3
50	2023-12-06	Data-Centric Digital Agriculture: A Perspective	No translated results is contained!	2312.03437v1

LLM

Num	Update Date	Title	GPT	Paper ID
1	2024-03-08	Bayesian Preference Elicitation with Language Models	No translated results is contained!	2403.05534v1
2	2024-03-08	Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context	No translated results is contained!	2403.05530v1
3	2024-03-08	GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM	No translated results is contained!	2403.05527v1
4	2024-03-08	DeepSeek-VL: Towards Real-World Vision-Language Understanding	No translated results is contained!	2403.05525v1
5	2024-03-08	Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapola	No translated results is contained!	2403.05523v1
6	2024-03-08	Authorship Attribution in Bangla Literature (AABL) via Transfer Learning using ULMFiT	No translated results is contained!	2403.05519v1
7	2024-03-08	Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought	No translated results is contained!	2403.05518v1
8	2024-03-08	To Err Is Human, but Llamas Can Learn It Too	No translated results is contained!	2403.05493v1
9	2024-03-08	Will GPT-4 Run DOOM?	No translated results is contained!	2403.05468v1
10	2024-03-08	Cost-Performance Optimization for Processing Low-Resource Language Tasks Using Commercial LLMs	No translated results is contained!	2403.05434v1
11	2024-03-08	Exploring Robust Features for Few-Shot Object Detection in Satellite Imagery	No translated results is contained!	2403.05381v1
12	2024-03-08	VLM-PL: Advanced Pseudo Labeling approach Class Incremental Object Detection with Vision-Language Model	No translated results is contained!	2403.05346v1
13	2024-03-08	Explaining Pre-Trained Language Models with Attribution Scores: An Analysis in Low-Resource Settings	No translated results is contained!	2403.05338v1
14	2024-03-08	ChatASU: Evoking LLM's Reflexion to Truly Understand Aspect Sentiment in Dialogues	No translated results is contained!	2403.05326v1
15	2024-03-08	RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation	No translated results is contained!	2403.05313v1
16	2024-03-08	Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents	No translated results is contained!	2403.05307v1
17	2024-03-08	ACLSum: A New Dataset for Aspect-based Summarization of Scientific Publications	No translated results is contained!	2403.05303v1
18	2024-03-08	Modeling Dynamic (De)Allocations of Local Memory for Translation Validation	No translated results is contained!	2403.05302v1
19	2024-03-08	LLM4Decompile: Decompiling Binary Code with Large Language Models	No translated results is contained!	2403.05286v1
20	2024-03-08	Deep Prompt Multi-task Network for Abuse Language Detection	No translated results is contained!	2403.05268v1
21	2024-03-08	ERBench: An Entity-Relationship based Automatically Verifiable Hallucination Benchmark for Large Language Models	No translated results is contained!	2403.05266v1
22	2024-03-08	Debiasing Large Visual Language Models	No translated results is contained!	2403.05262v1
23	2024-03-08	Cross-lingual Transfer or Machine Translation? On Data Augmentation for Monolingual Semantic Textual Similarity	No translated results is contained!	2403.05257v1
24	2024-03-08	Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance	No translated results is contained!	2403.05231v1
25	2024-03-08	Harnessing Multi-Role Capabilities of Large Language Models for Open-Domain Question Answering	No translated results is contained!	2403.05217v1
26	2024-03-08	SocialPET: Socially Informed Pattern Exploiting Training for Few-Shot Stance Detection in Social Media	No translated results is contained!	2403.05216v1
27	2024-03-08	Tracing the Roots of Facts in Multilingual Language Models: Independent, Shared, and Transferred Knowledge	No translated results is contained!	2403.05189v1
28	2024-03-08	Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation	No translated results is contained!	2403.05171v1
29	2024-03-08	On Protecting the Data Privacy of Large Language Models (LLMs): A Survey	No translated results is contained!	2403.05156v1
30	2024-03-08	Towards a Psychology of Machines: Large Language Models Predict Human Memory	No translated results is contained!	2403.05152v1
31	2024-03-08	Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem	No translated results is contained!	2403.05149v1
32	2024-03-08	Med3DInsight: Enhancing 3D Medical Image Understanding with 2D Multi-Modal Large Language Models	No translated results is contained!	2403.05141v1
33	2024-03-08	ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment	No translated results is contained!	2403.05135v1
34	2024-03-08	ChatUIE: Exploring Chat-based Unified Information Extraction using Large Language Models	No translated results is contained!	2403.05132v1
35	2024-03-08	CLIP-Gaze: Towards General Gaze Estimation via Visual-Linguistic Model	No translated results is contained!	2403.05124v1
36	2024-03-08	Benchmarking Large Language Models for Molecule Prediction Tasks	No translated results is contained!	2403.05075v1
37	2024-03-08	Can we obtain significant success in RST discourse parsing by using Large Language Models?	No translated results is contained!	2403.05065v1
38	2024-03-08	Aligning Large Language Models for Controllable Recommendations	No translated results is contained!	2403.05063v1
39	2024-03-08	Multimodal Infusion Tuning for Large Models	No translated results is contained!	2403.05060v1
40	2024-03-08	XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution	No translated results is contained!	2403.05049v1
41	2024-03-08	Are Human Conversations Special? A Large Language Model Perspective	No translated results is contained!	2403.05045v1
42	2024-03-08	Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs	No translated results is contained!	2403.05020v1
43	2024-03-08	Can't Remember Details in Long Documents? You Need Some R&R	No translated results is contained!	2403.05004v1
44	2024-03-08	DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation	No translated results is contained!	2403.04997v1
45	2024-03-08	Know Your Audience: The benefits and pitfalls of generating plain language summaries beyond the "general" audience	No translated results is contained!	2403.04979v1
46	2024-03-08	Embracing Large Language and Multimodal Models for Prosthetic Technologies	No translated results is contained!	2403.04974v1
47	2024-03-08	Tell me the truth: A system to measure the trustworthiness of Large Language Models	No translated results is contained!	2403.04964v1
48	2024-03-08	An In-depth Evaluation of GPT-4 in Sentence Simplification with Error-based Human Assessment	No translated results is contained!	2403.04963v1
49	2024-03-08	SecGPT: An Execution Isolation Architecture for LLM-Based Systems	No translated results is contained!	2403.04960v1
50	2024-03-07	Automatic and Universal Prompt Injection Attacks against Large Language Models	No translated results is contained!	2403.04957v1

jhw5981 / researchtracker Goto Github PK

researchtracker's Introduction

researchtracker's People

Contributors

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent