Document Analysis
Num | Update Date | Title | GPT | Paper ID |
---|---|---|---|---|
1 | 2024-03-08 | DeepSeek-VL: Towards Real-World Vision-Language Understanding | No translated results is contained! | 2403.05525v1 |
2 | 2024-03-08 | Online Contention Resolution Schemes for Network Revenue Management and Combinatorial Auctions | No translated results is contained! | 2403.05378v1 |
3 | 2024-03-07 | Children Age Group Detection based on Human-Computer Interaction and Time Series Analysis | No translated results is contained! | 2403.04574v1 |
4 | 2024-03-07 | TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document | No translated results is contained! | 2403.04473v1 |
5 | 2024-03-06 | Transformers and Language Models in Form Understanding: A Comprehensive Review of Scanned Document Analysis | No translated results is contained! | 2403.04080v1 |
6 | 2024-03-06 | Multimodal Transformer for Comics Text-Cloze | No translated results is contained! | 2403.03719v1 |
7 | 2024-03-04 | LOCR: Location-Guided Transformer for Optical Character Recognition | No translated results is contained! | 2403.02127v1 |
8 | 2024-03-01 | Large Language Models for Simultaneous Named Entity Extraction and Spelling Correction | No translated results is contained! | 2403.00528v1 |
9 | 2024-03-01 | ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting | No translated results is contained! | 2403.00303v1 |
10 | 2024-03-01 | Advancing Generative Model Evaluation: A Novel Algorithm for Realistic Image Synthesis and Comparison in OCR System | No translated results is contained! | 2402.17204v3 |
11 | 2024-02-23 | Representing Online Handwriting for Recognition in Large Vision-Language Models | No translated results is contained! | 2402.15307v1 |
12 | 2024-02-18 | Syntactic Language Change in English and German: Metrics, Parsers, and Convergences | No translated results is contained! | 2402.11549v1 |
13 | 2024-02-15 | LAPDoc: Layout-Aware Prompting for Documents | No translated results is contained! | 2402.09841v1 |
14 | 2024-02-15 | TEXTRON: Weakly Supervised Multilingual Text Detection through Data Programming | No translated results is contained! | 2402.09811v1 |
15 | 2024-02-12 | Beyond the Mud: Datasets and Benchmarks for Computer Vision in Off-Road Racing | No translated results is contained! | 2402.08025v1 |
16 | 2024-02-12 | Sheet Music Transformer: End-To-End Optical Music Recognition Beyond Monophonic Transcription | No translated results is contained! | 2402.07596v1 |
17 | 2024-02-12 | ClusterTabNet: Supervised clustering method for table detection and table structure recognition | No translated results is contained! | 2402.07502v1 |
18 | 2024-02-09 | Deuterated Polystyrene -- Synthesis and uses for ultracold neutron bottles and the neutron EDM experiment | No translated results is contained! | 2402.06469v1 |
19 | 2024-02-08 | SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models | No translated results is contained! | 2402.05935v1 |
20 | 2024-02-08 | GET-Tok: A GenAI-Enriched Multimodal TikTok Dataset Documenting the 2022 Attempted Coup in Peru | No translated results is contained! | 2402.05882v1 |
21 | 2024-02-08 | Text Role Classification in Scientific Charts Using Multimodal Transformers | No translated results is contained! | 2402.14579v1 |
22 | 2024-02-08 | Advances and Limitations in Open Source Arabic-Script OCR: A Case Study | No translated results is contained! | 2402.10943v1 |
23 | 2024-02-08 | Segmentation-free Connectionist Temporal Classification loss based OCR Model for Text Captcha Classification | No translated results is contained! | 2402.05417v1 |
24 | 2024-02-07 | TreeForm: End-to-end Annotation and Evaluation for Form Document Parsing | No translated results is contained! | 2402.05282v1 |
25 | 2024-02-07 | Enhancement of Bengali OCR by Specialized Models and Advanced Techniques for Diverse Document Types | No translated results is contained! | 2402.05158v1 |
26 | 2024-02-03 | ExTTNet: A Deep Learning Algorithm for Extracting Table Texts from Invoice Images | No translated results is contained! | 2402.02246v1 |
27 | 2024-02-01 | Instruction Makes a Difference | No translated results is contained! | 2402.00453v1 |
28 | 2024-02-07 | KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization | No translated results is contained! | 2401.18079v2 |
29 | 2024-01-31 | Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation | No translated results is contained! | 2401.17904v1 |
30 | 2024-01-30 | MouSi: Poly-Visual-Expert Vision-Language Models | No translated results is contained! | 2401.17221v1 |
31 | 2024-01-30 | AutoIE: An Automated Framework for Information Extraction from Scientific Literature | No translated results is contained! | 2401.16672v1 |
32 | 2024-02-14 | Detecting and recognizing characters in Greek papyri with YOLOv8, DeiT and SimCLR | No translated results is contained! | 2401.12513v2 |
33 | 2024-01-22 | Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis | No translated results is contained! | 2401.11874v1 |
34 | 2024-01-22 | A Fair Evaluation of Various Deep Learning-Based Document Image Binarization Approaches | No translated results is contained! | 2401.11831v1 |
35 | 2024-01-16 | U-DIADS-Bib: a full and few-shot pixel-precise dataset for document layout analysis of ancient manuscripts | No translated results is contained! | 2401.08425v1 |
36 | 2024-01-15 | Improving OCR Quality in 19th Century Historical Documents Using a Combined Machine Learning Based Approach | No translated results is contained! | 2401.07787v1 |
37 | 2024-01-06 | Semantic Similarity Matching for Patent Documents Using Ensemble BERT-related Model and Novel Text Processing Method | No translated results is contained! | 2401.06782v1 |
38 | 2024-01-01 | Efficient Multi-domain Text Recognition Deep Neural Network Parameterization with Residual Adapters | No translated results is contained! | 2401.00971v1 |
39 | 2023-12-31 | Bidirectional Trained Tree-Structured Decoder for Handwritten Mathematical Expression Recognition | No translated results is contained! | 2401.00435v1 |
40 | 2024-01-31 | An Empirical Study of Scaling Law for OCR | No translated results is contained! | 2401.00028v3 |
41 | 2023-12-28 | Chaurah: A Smart Raspberry Pi based Parking System | No translated results is contained! | 2312.16894v1 |
42 | 2023-12-26 | 360 Layout Estimation via Orthogonal Planes Disentanglement and Multi-view Geometric Consistency Perception | No translated results is contained! | 2312.16268v1 |
43 | 2023-12-20 | The Common Optical Music Recognition Evaluation Framework | No translated results is contained! | 2312.12908v1 |
44 | 2023-12-19 | Advancements and Challenges in Arabic Optical Character Recognition: A Comprehensive Survey | No translated results is contained! | 2312.11812v1 |
45 | 2023-12-18 | TDeLTA: A Light-weight and Robust Table Detection Method based on Learning Text Arrangement | No translated results is contained! | 2312.11043v1 |
46 | 2023-12-16 | When Graph Data Meets Multimodal: A New Paradigm for Graph Understanding and Reasoning | No translated results is contained! | 2312.10372v1 |
47 | 2023-12-15 | Information Extraction from Unstructured data using Augmented-AI and Computer Vision | No translated results is contained! | 2312.09880v1 |
48 | 2024-01-21 | Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation | No translated results is contained! | 2312.11532v2 |
49 | 2023-12-15 | Privacy-Aware Document Visual Question Answering | No translated results is contained! | 2312.10108v1 |
50 | 2023-12-15 | Object Recognition from Scientific Document based on Compartment Refinement Framework | No translated results is contained! | 2312.09038v2 |
Data Centric
Num | Update Date | Title | GPT | Paper ID |
---|---|---|---|---|
1 | 2024-03-08 | VTruST: Controllable value function based subset selection for Data-Centric Trustworthy AI | No translated results is contained! | 2403.05174v1 |
2 | 2024-03-07 | Dissecting Sample Hardness: A Fine-Grained Analysis of Hardness Characterization Methods for Data-Centric AI | No translated results is contained! | 2403.04551v1 |
3 | 2024-03-07 | A data-centric approach to class-specific bias in image data augmentation | No translated results is contained! | 2403.04120v1 |
4 | 2024-03-05 | ActiveAD: Planning-Oriented Active Learning for End-to-End Autonomous Driving | No translated results is contained! | 2403.02877v1 |
5 | 2024-03-05 | Enhancing Generalization in Medical Visual Question Answering Tasks via Gradient-Guided Model Perturbation | No translated results is contained! | 2403.02707v1 |
6 | 2024-03-04 | Model-Based Data-Centric AI: Bridging the Divide Between Academic Ideals and Industrial Pragmatism | No translated results is contained! | 2403.01832v1 |
7 | 2024-03-02 | The Science of Data Collection: Insights from Surveys can Improve Machine Learning Models | No translated results is contained! | 2403.01208v1 |
8 | 2024-03-01 | ChartReformer: Natural Language-Driven Chart Image Editing | No translated results is contained! | 2403.00209v1 |
9 | 2024-02-27 | Side Information-Driven Session-based Recommendation: A Survey | No translated results is contained! | 2402.17129v1 |
10 | 2024-02-28 | Dealing with Data for RE: Mitigating Challenges while using NLP and Generative AI | No translated results is contained! | 2402.16977v2 |
11 | 2024-02-26 | Uncertainty quantification by direct propagation of shallow ensembles | No translated results is contained! | 2402.16621v1 |
12 | 2024-02-28 | DAGnosis: Localized Identification of Data Inconsistencies using Structures | No translated results is contained! | 2402.17599v2 |
13 | 2024-02-23 | A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models | No translated results is contained! | 2402.15422v1 |
14 | 2024-02-29 | EyeTrans: Merging Human and Machine Attention for Neural Code Summarization | No translated results is contained! | 2402.14096v3 |
15 | 2024-02-20 | Static vs. Dynamic Databases for Indoor Localization based on Wi-Fi Fingerprinting: A Discussion from a Data Perspective | No translated results is contained! | 2402.12756v1 |
16 | 2024-02-19 | Training Green AI Models Using Elite Samples | No translated results is contained! | 2402.12010v1 |
17 | 2024-02-18 | Solving Data-centric Tasks using Large Language Models | No translated results is contained! | 2402.11734v1 |
18 | 2024-02-18 | Efficient Multimodal Learning from Data-centric Perspective | No translated results is contained! | 2402.11530v1 |
19 | 2024-02-12 | Empowering Federated Learning for Massive Models with NVIDIA FLARE | No translated results is contained! | 2402.07792v1 |
20 | 2024-02-21 | Privacy-Preserving Gaze Data Streaming in Immersive Interactive Virtual Reality: Robustness and User Experience | No translated results is contained! | 2402.07687v2 |
21 | 2024-02-06 | A Data Centric Approach for Unsupervised Domain Generalization via Retrieval from Web Scale Multimodal Data | No translated results is contained! | 2402.04416v1 |
22 | 2024-02-29 | Roadmap on Data-Centric Materials Science | No translated results is contained! | 2402.10932v2 |
23 | 2024-02-01 | MobilityDL: A Review of Deep Learning From Trajectory Data | No translated results is contained! | 2402.00732v1 |
24 | 2024-02-01 | EXMOS: Explanatory Model Steering Through Multifaceted Explanations and Data Configurations | No translated results is contained! | 2402.00491v1 |
25 | 2024-02-02 | A Survey on Data-Centric Recommender Systems | No translated results is contained! | 2401.17878v2 |
26 | 2024-01-30 | Towards Urban General Intelligence: A Review and Outlook of Urban Foundation Models | No translated results is contained! | 2402.01749v1 |
27 | 2024-01-26 | Toward Practical Automatic Speech Recognition and Post-Processing: a Call for Explainable Error Benchmark Guideline | No translated results is contained! | 2401.14625v1 |
28 | 2024-01-26 | Alternative Speech: Complementary Method to Counter-Narrative for Better Discourse | No translated results is contained! | 2401.14616v1 |
29 | 2024-02-20 | Challenging Low Homophily in Social Recommendation | No translated results is contained! | 2401.14606v3 |
30 | 2024-01-24 | The Landscape of Compute-near-memory and Compute-in-memory: A Research and Commercial Overview | No translated results is contained! | 2401.14428v1 |
31 | 2024-01-26 | Data-Centric Evolution in Autonomous Driving: A Comprehensive Survey of Big Data System, Data Mining, and Closed-Loop Technologies | No translated results is contained! | 2401.12888v2 |
32 | 2024-01-24 | Falcon: Fair Active Learning using Multi-armed Bandits | No translated results is contained! | 2401.12722v2 |
33 | 2024-01-22 | Exploring descriptors for titanium microstructure via digital fingerprints from variational autoencoders | No translated results is contained! | 2401.11967v1 |
34 | 2024-01-21 | An Interacting Wasserstein Gradient Flow Strategy to Robust Bayesian Inference | No translated results is contained! | 2401.11607v1 |
35 | 2024-01-23 | D2K: Turning Historical Data into Retrievable Knowledge for Recommender Systems | No translated results is contained! | 2401.11478v2 |
36 | 2024-01-10 | GOODAT: Towards Test-time Graph Out-of-Distribution Detection | No translated results is contained! | 2401.06176v1 |
37 | 2024-01-10 | Inconsistency-Based Data-Centric Active Open-Set Annotation | No translated results is contained! | 2401.04923v1 |
38 | 2024-01-13 | Towards Explainable Artificial Intelligence (XAI): A Data Mining Perspective | No translated results is contained! | 2401.04374v2 |
39 | 2024-01-08 | Attention versus Contrastive Learning of Tabular Data -- A Data-centric Benchmarking | No translated results is contained! | 2401.04266v1 |
40 | 2024-01-04 | Data-Centric Foundation Models in Computational Healthcare: A Survey | No translated results is contained! | 2401.02458v1 |
41 | 2024-01-03 | CodeFuse-Query: A Data-Centric Static Code Analysis System for Large-Scale Organizations | No translated results is contained! | 2401.01571v1 |
42 | 2024-01-01 | Improve Fidelity and Utility of Synthetic Credit Card Transaction Time Series from Data-centric Perspective | No translated results is contained! | 2401.00965v1 |
43 | 2023-12-24 | README: Bridging Medical Jargon and Lay Understanding for Patient Education through Data-Centric NLP | No translated results is contained! | 2312.15561v1 |
44 | 2024-02-21 | Towards Message Brokers for Generative AI: Survey, Challenges, and Opportunities | No translated results is contained! | 2312.14647v2 |
45 | 2023-12-22 | CaptainCook4D: A dataset for understanding errors in procedural activities | No translated results is contained! | 2312.14556v1 |
46 | 2023-12-15 | Quilt: Robust Data Segment Selection against Concept Drifts | No translated results is contained! | 2312.09691v1 |
47 | 2023-12-08 | Data-Centric Machine Learning for Geospatial Remote Sensing Data | No translated results is contained! | 2312.05327v1 |
48 | 2023-12-08 | A Review On Table Recognition Based On Deep Learning | No translated results is contained! | 2312.04808v1 |
49 | 2024-01-31 | Efficient Large Language Models: A Survey | No translated results is contained! | 2312.03863v3 |
50 | 2023-12-06 | Data-Centric Digital Agriculture: A Perspective | No translated results is contained! | 2312.03437v1 |
LLM
Num | Update Date | Title | GPT | Paper ID |
---|---|---|---|---|
1 | 2024-03-08 | Bayesian Preference Elicitation with Language Models | No translated results is contained! | 2403.05534v1 |
2 | 2024-03-08 | Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context | No translated results is contained! | 2403.05530v1 |
3 | 2024-03-08 | GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM | No translated results is contained! | 2403.05527v1 |
4 | 2024-03-08 | DeepSeek-VL: Towards Real-World Vision-Language Understanding | No translated results is contained! | 2403.05525v1 |
5 | 2024-03-08 | Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapola | No translated results is contained! | 2403.05523v1 |
6 | 2024-03-08 | Authorship Attribution in Bangla Literature (AABL) via Transfer Learning using ULMFiT | No translated results is contained! | 2403.05519v1 |
7 | 2024-03-08 | Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought | No translated results is contained! | 2403.05518v1 |
8 | 2024-03-08 | To Err Is Human, but Llamas Can Learn It Too | No translated results is contained! | 2403.05493v1 |
9 | 2024-03-08 | Will GPT-4 Run DOOM? | No translated results is contained! | 2403.05468v1 |
10 | 2024-03-08 | Cost-Performance Optimization for Processing Low-Resource Language Tasks Using Commercial LLMs | No translated results is contained! | 2403.05434v1 |
11 | 2024-03-08 | Exploring Robust Features for Few-Shot Object Detection in Satellite Imagery | No translated results is contained! | 2403.05381v1 |
12 | 2024-03-08 | VLM-PL: Advanced Pseudo Labeling approach Class Incremental Object Detection with Vision-Language Model | No translated results is contained! | 2403.05346v1 |
13 | 2024-03-08 | Explaining Pre-Trained Language Models with Attribution Scores: An Analysis in Low-Resource Settings | No translated results is contained! | 2403.05338v1 |
14 | 2024-03-08 | ChatASU: Evoking LLM's Reflexion to Truly Understand Aspect Sentiment in Dialogues | No translated results is contained! | 2403.05326v1 |
15 | 2024-03-08 | RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation | No translated results is contained! | 2403.05313v1 |
16 | 2024-03-08 | Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents | No translated results is contained! | 2403.05307v1 |
17 | 2024-03-08 | ACLSum: A New Dataset for Aspect-based Summarization of Scientific Publications | No translated results is contained! | 2403.05303v1 |
18 | 2024-03-08 | Modeling Dynamic (De)Allocations of Local Memory for Translation Validation | No translated results is contained! | 2403.05302v1 |
19 | 2024-03-08 | LLM4Decompile: Decompiling Binary Code with Large Language Models | No translated results is contained! | 2403.05286v1 |
20 | 2024-03-08 | Deep Prompt Multi-task Network for Abuse Language Detection | No translated results is contained! | 2403.05268v1 |
21 | 2024-03-08 | ERBench: An Entity-Relationship based Automatically Verifiable Hallucination Benchmark for Large Language Models | No translated results is contained! | 2403.05266v1 |
22 | 2024-03-08 | Debiasing Large Visual Language Models | No translated results is contained! | 2403.05262v1 |
23 | 2024-03-08 | Cross-lingual Transfer or Machine Translation? On Data Augmentation for Monolingual Semantic Textual Similarity | No translated results is contained! | 2403.05257v1 |
24 | 2024-03-08 | Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance | No translated results is contained! | 2403.05231v1 |
25 | 2024-03-08 | Harnessing Multi-Role Capabilities of Large Language Models for Open-Domain Question Answering | No translated results is contained! | 2403.05217v1 |
26 | 2024-03-08 | SocialPET: Socially Informed Pattern Exploiting Training for Few-Shot Stance Detection in Social Media | No translated results is contained! | 2403.05216v1 |
27 | 2024-03-08 | Tracing the Roots of Facts in Multilingual Language Models: Independent, Shared, and Transferred Knowledge | No translated results is contained! | 2403.05189v1 |
28 | 2024-03-08 | Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation | No translated results is contained! | 2403.05171v1 |
29 | 2024-03-08 | On Protecting the Data Privacy of Large Language Models (LLMs): A Survey | No translated results is contained! | 2403.05156v1 |
30 | 2024-03-08 | Towards a Psychology of Machines: Large Language Models Predict Human Memory | No translated results is contained! | 2403.05152v1 |
31 | 2024-03-08 | Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem | No translated results is contained! | 2403.05149v1 |
32 | 2024-03-08 | Med3DInsight: Enhancing 3D Medical Image Understanding with 2D Multi-Modal Large Language Models | No translated results is contained! | 2403.05141v1 |
33 | 2024-03-08 | ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment | No translated results is contained! | 2403.05135v1 |
34 | 2024-03-08 | ChatUIE: Exploring Chat-based Unified Information Extraction using Large Language Models | No translated results is contained! | 2403.05132v1 |
35 | 2024-03-08 | CLIP-Gaze: Towards General Gaze Estimation via Visual-Linguistic Model | No translated results is contained! | 2403.05124v1 |
36 | 2024-03-08 | Benchmarking Large Language Models for Molecule Prediction Tasks | No translated results is contained! | 2403.05075v1 |
37 | 2024-03-08 | Can we obtain significant success in RST discourse parsing by using Large Language Models? | No translated results is contained! | 2403.05065v1 |
38 | 2024-03-08 | Aligning Large Language Models for Controllable Recommendations | No translated results is contained! | 2403.05063v1 |
39 | 2024-03-08 | Multimodal Infusion Tuning for Large Models | No translated results is contained! | 2403.05060v1 |
40 | 2024-03-08 | XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution | No translated results is contained! | 2403.05049v1 |
41 | 2024-03-08 | Are Human Conversations Special? A Large Language Model Perspective | No translated results is contained! | 2403.05045v1 |
42 | 2024-03-08 | Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs | No translated results is contained! | 2403.05020v1 |
43 | 2024-03-08 | Can't Remember Details in Long Documents? You Need Some R&R | No translated results is contained! | 2403.05004v1 |
44 | 2024-03-08 | DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation | No translated results is contained! | 2403.04997v1 |
45 | 2024-03-08 | Know Your Audience: The benefits and pitfalls of generating plain language summaries beyond the "general" audience | No translated results is contained! | 2403.04979v1 |
46 | 2024-03-08 | Embracing Large Language and Multimodal Models for Prosthetic Technologies | No translated results is contained! | 2403.04974v1 |
47 | 2024-03-08 | Tell me the truth: A system to measure the trustworthiness of Large Language Models | No translated results is contained! | 2403.04964v1 |
48 | 2024-03-08 | An In-depth Evaluation of GPT-4 in Sentence Simplification with Error-based Human Assessment | No translated results is contained! | 2403.04963v1 |
49 | 2024-03-08 | SecGPT: An Execution Isolation Architecture for LLM-Based Systems | No translated results is contained! | 2403.04960v1 |
50 | 2024-03-07 | Automatic and Universal Prompt Injection Attacks against Large Language Models | No translated results is contained! | 2403.04957v1 |