Models

Search
all
verified
Color_Extraction
Color Extraction

Color Extraction is a task in computer vision that involves the extraction and analysis of colors from images or videos. The objective of this task is to identify and isolate specific colors, or color ranges present in the visual data.

mit
Image-to-Text
PyTorch
English

Background_Removal
Background Removal

Background Removal is an image processing technique, used to separate the main object from the background of a photo. Removing the background helps highlight the product, subject, or character, bringing a professional and aesthetically pleasing look to the image.

apache-2.0
Image-to-Image
PyTorch
English

Image_To_Anime
Image to Anime

The goal of Image to Anime was to create a new version of the image that would possess the same clean lines and evoke the characteristic feel found in anime productions, capturing the unique artistry, and aesthetics associated with this style.

mit
Image-to-Image
PyTorch
English

MediaPipe_Face_Mesh_Ploting
MediaPipe Face Mesh Ploting

Face mesh detection, also known as facial landmark detection or face pose estimation, is the task of identifying and localizing specific keypoints or landmarks on a human face. It involves detecting the positions of facial features, such as eyes, eyebrows, nose, mouth, and jawline, in an image or video.

mit
Object Detection
PyTorch
English

MediaPipe_Face_Detection
MediaPipe Face Detection

Face detection is a computer vision technique that involves identifying and locating human faces within an image or video. The goal of face detection is to detect the presence of faces, and draw bounding boxes around them, without necessarily identifying specific facial features or landmarks.

mit
Object Detection
PyTorch
English

Background_Replacement
Background Replacement

Background Replacement is a powerful tool that enables users to easily change the background of their images, opening up endless possibilities for creative transformations, and visual enhancements.

apache-2.0
Image-to-Image
PyTorch
English

Collections


Text to Image

Task Text-to-Image is an important task in the field of artificial intelligence and natural language processing. This task aims to create images from descriptions or descriptive text.

Text Generation

Task Text Generation is an important task in the field of natural language processing and artificial intelligence. This task aims to generate text automatically from input data, including descriptions, stories, articles, or other types of text.

Image to Image

Image-to-Image is an important task in the field of image processing, where we convert images from one format or data type to another.

Image-to-Text

The Image-to-Text task is an important task in the field of natural language processing and computer vision. Its purpose is to convert information within an image into readable and understandable text.

Image Feature Extraction

Image feature extraction is the task of extracting features learnt in a computer vision model.

Object Detection

The Object Detection task is an important task in the fields of computer vision and artificial intelligence. Its main objective is to detect and determine the position of objects within images or videos.

Zero-Shot Image Classification

Task Zero-Shot Image Classification is an important task in the field of image processing and artificial intelligence. This task aims to classify images into different categories where the model has never been trained before.

Document Question Answering

The DQA is a task in natural language processing and information retrieval that focuses on automatically generating accurate and relevant answers to questions based on a given document.

Latest


Northern_Lights
Northern Lights

Northern Lights

earthylife
earthylife

miccheck12

Terra-Verde
Nature

Serenity Unveiled: A breathtaking landscape of untouched nature, where vibrant hues of emerald green and sapphire blue converge, evoking feelings of tranquility and awe.

Lyolaratna
Lyolaratna

Mubarak

Multi-InteractionVQA
Multi InteractionVQA

This repository is the implementation of Multiple interaction learning with question-type prior knowledge for constraining answer search space in visual question answering for the visual question answering task. Our single model achieved 70.93 (Test-standard, VQA 2.0). Moreover, in TDIUC dataset, our single model achieved 73.04 in Arithmetic MTP metric and 66.86 in Harmonic MTP metric.

image_blend_multiple_method
Image Blending with Multiple Methods

Image Blending with Multiple Methods is a task that involves combining two or more images seamlessly to create a composite image using a variety of blending techniques. By leveraging multiple blending methods, such as alpha blending, gradient blending, or Laplacian pyramid blending, this task enables the merging of images while preserving the visual coherence and integrity of the final composition.

Image_Super_Resolution_SeemoRe
Image Super-Resolution with SeemoRe

Image Super-Resolution with SeemoRe is a task aimed at improving the process of image super-resolution by leveraging expertise in the field. This task involves incorporating techniques that identify and utilize expert knowledge or specialized information to enhance the efficiency and accuracy of image upscaling.

Image_Super_Resolution_SMFANet
Image Super-Resolution with SMFANet

Image Super-Resolution with SMFANet involves utilizing the SMFANet model architecture to enhance the resolution and quality of images. SMFANet is a deep learning network designed for super-resolution tasks, aiming to generate high-quality, detailed images from low-resolution inputs.

SG_Low_Light_Image_Enhancement
Semantic-Guided Low-Light Network Enhancement

Semantic-Guided Low-Light Network is a task that integrates semantic information into the process of enhancing the quality of images captured in low-light conditions. By incorporating semantic guidance, this task aims to improve the accuracy and effectiveness of enhancing low-light images by considering the context and content of the scene.

Low_light_Image_Enhancement
Low-light Image Enhancement

Low light Image Enhancement is a task focused on improving the quality and visibility of images captured in low-light conditions. This task involves applying image processing techniques and algorithms to enhance details, reduce noise, and increase brightness in photos taken in dimly lit environments.

Image_To_Anime
Image to Anime

The goal of Image to Anime was to create a new version of the image that would possess the same clean lines and evoke the characteristic feel found in anime productions, capturing the unique artistry, and aesthetics associated with this style.

MediaPipe_Face_Detection
MediaPipe Face Detection

Face detection is a computer vision technique that involves identifying and locating human faces within an image or video. The goal of face detection is to detect the presence of faces, and draw bounding boxes around them, without necessarily identifying specific facial features or landmarks.

MediaPipe_Face_Mesh_Ploting
MediaPipe Face Mesh Ploting

Face mesh detection, also known as facial landmark detection or face pose estimation, is the task of identifying and localizing specific keypoints or landmarks on a human face. It involves detecting the positions of facial features, such as eyes, eyebrows, nose, mouth, and jawline, in an image or video.

Video_To_Canny_Edge
Video to Canny Edge

Video to Canny Edge is the process of converting a video into a Canny edge representation, where edges in the video are emphasized and separated. Canny Edge is a popular algorithm in image processing and is often used to detect edges in images and videos.

Color_Extraction
Color Extraction

Color Extraction is a task in computer vision that involves the extraction and analysis of colors from images or videos. The objective of this task is to identify and isolate specific colors, or color ranges present in the visual data.

Background_Replacement
Background Replacement

Background Replacement is a powerful tool that enables users to easily change the background of their images, opening up endless possibilities for creative transformations, and visual enhancements.