pouriaomrani Goto Github PK
Name: Pouria Omrani
Type: User
Bio: AI Developer
Name: Pouria Omrani
Type: User
Bio: AI Developer
ADVISE model for the Automatic Understanding of Visual Advertisements challenge. Please refer to our ECCV paper "ADVISE: Symbolism and External Knowledge for Decoding Advertisements".
یک پروژه برای دیدن اینکه چطوری یک ایده می تونه به مرحله اجرا برسه. قدم به قدم فیلم گرفتم و منتشر کردم و خود سیستم هم برای عموم قابل استفاده است
Bilingual Image Captioning (Persian - English) with Multi-task Learning
starter code to finetune bolbolzaban gpt2 persian
Official implementation of AAAI2022 paper "I can find you! Boundary-guided Separated Attention Network for Camouflaged Object Detection"
Constrained Levy Exploration (CLE) generates a scanpath computing eye movements as Levy flight on a saliency map.
covid_fake_news
An end-to-end library for editing and rendering motion of 3D characters with deep learning [SIGGRAPH 2020]
FastDVDnet: A Very Fast Deep Video Denoising algorithm
Implementation of a gravitational model of visual attention scanpath
GIT: A Generative Image-to-text Transformer for Vision and Language
yolov3 tensorflow object detection and report human movements in persian
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
A free implementation of the CovidGAN project
A Persian Image Captioning model based on Vision Encoder Decoder Models of the transformers🤗.
Persian/Farsi text to speech(TTS) training using coqui tts
A fast, local neural text to speech system
Python script to download Instagram stories from Instagram users.
Contextual Encoder-Decoder Network for Visual Saliency Prediction [Neural Networks 2020]
Predicting Goal-directed Human Attention Using Inverse Reinforcement Learning (CVPR2020)
Demo on how to compute soccer ball possession automatically using AI.
LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements".
Enhanced Super-Resolution Generative Adversarial Networks
Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"
SwinIR: Image Restoration Using Swin Transformer (official repository)
Tacotron 2 - Persian
Video Captioning is an encoder decoder mode based on sequence to sequence learning
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.