Deep Image Prior | CVPR | code | 3305 |
StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation | CVPR | code | 2997 |
Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network | ECCV | code | 2020 |
Learning to See in the Dark | CVPR | code | 1957 |
Squeeze-and-Excitation Networks | CVPR | code | 1175 |
Multimodal Unsupervised Image-to-image Translation | ECCV | code | 1137 |
Efficient Neural Architecture Search via Parameters Sharing | ICML | code | 1122 |
Non-Local Neural Networks | CVPR | code | 813 |
Image Generation From Scene Graphs | CVPR | code | 738 |
Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet? | CVPR | code | 647 |
Single-Shot Refinement Neural Network for Object Detection | CVPR | code | 608 |
Detect-and-Track: Efficient Pose Estimation in Videos | CVPR | code | 519 |
Relation Networks for Object Detection | CVPR | code | 497 |
GANimation: Anatomically-aware Facial Animation from a Single Image | ECCV | code | 484 |
Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples | ICML | code | 471 |
Cascaded Pyramid Network for Multi-Person Pose Estimation | CVPR | code | 425 |
Taskonomy: Disentangling Task Transfer Learning | CVPR | code | 417 |
Neural 3D Mesh Renderer | CVPR | code | 412 |
Which Training Methods for GANs do actually Converge? | ICML | code | 411 |
Generative Image Inpainting With Contextual Attention | CVPR | code | 393 |
Zero-Shot Recognition via Semantic Embeddings and Knowledge Graphs | CVPR | code | 392 |
Simple Baselines for Human Pose Estimation and Tracking | ECCV | code | 387 |
Look at Boundary: A Boundary-Aware Face Alignment Algorithm | CVPR | code | 379 |
In-Place Activated BatchNorm for Memory-Optimized Training of DNNs | CVPR | code | 361 |
ICNet for Real-Time Semantic Segmentation on High-Resolution Images | ECCV | code | 349 |
End-to-End Recovery of Human Shape and Pose | CVPR | code | 346 |
The Unreasonable Effectiveness of Deep Features as a Perceptual Metric | CVPR | code | 326 |
Gibson Env: Real-World Perception for Embodied Agents | CVPR | code | 306 |
Efficient Interactive Annotation of Segmentation Datasets With Polygon-RNN++ | CVPR | code | 305 |
Frustum PointNets for 3D Object Detection From RGB-D Data | CVPR | code | 302 |
Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning | CVPR | code | 294 |
GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose | CVPR | code | 289 |
GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation | CVPR | code | 289 |
Soccer on Your Tabletop | CVPR | code | 285 |
Distractor-aware Siamese Networks for Visual Object Tracking | ECCV | code | 276 |
Neural Baby Talk | CVPR | code | 273 |
Fast End-to-End Trainable Guided Filter | CVPR | code | 268 |
Adversarially Regularized Autoencoders | ICML | code | 258 |
Noise2Noise: Learning Image Restoration without Clean Data | ICML | code | 257 |
Acquisition of Localization Confidence for Accurate Object Detection | ECCV | code | 244 |
Convolutional Neural Networks With Alternately Updated Clique | CVPR | code | 243 |
Supervision-by-Registration: An Unsupervised Approach to Improve the Precision of Facial Landmark Detectors | CVPR | code | 237 |
Pyramid Stereo Matching Network | CVPR | code | 233 |
PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume | CVPR | code | 231 |
Neural Relational Inference for Interacting Systems | ICML | code | 229 |
Deep Photo Enhancer: Unpaired Learning for Image Enhancement From Photographs With GANs | CVPR | code | 228 |
Learning to Adapt Structured Output Space for Semantic Segmentation | CVPR | code | 217 |
The Lovász-Softmax Loss: A Tractable Surrogate for the Optimization of the Intersection-Over-Union Measure in Neural Networks | CVPR | code | 216 |
End-to-End Learning of Motion Representation for Video Understanding | CVPR | code | 213 |
LiteFlowNet: A Lightweight Convolutional Neural Network for Optical Flow Estimation | CVPR | code | 211 |
Semi-Parametric Image Synthesis | CVPR | code | 208 |
Iterative Visual Reasoning Beyond Convolutions | CVPR | code | 197 |
Learning to Segment Every Thing | CVPR | code | 190 |
Style Aggregated Network for Facial Landmark Detection | CVPR | code | 184 |
Referring Relationships | CVPR | code | 182 |
Pose-Robust Face Recognition via Deep Residual Equivariant Mapping | CVPR | code | 177 |
MoCoGAN: Decomposing Motion and Content for Video Generation | CVPR | code | 175 |
LayoutNet: Reconstructing the 3D Room Layout From a Single RGB Image | CVPR | code | 174 |
License Plate Detection and Recognition in Unconstrained Scenarios | ECCV | code | 173 |
Compressed Video Action Recognition | CVPR | code | 167 |
Multi-Content GAN for Few-Shot Font Style Transfer | CVPR | code | 162 |
GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models | ICML | code | 156 |
SPLATNet: Sparse Lattice Networks for Point Cloud Processing | CVPR | code | 154 |
Unsupervised Feature Learning via Non-Parametric Instance Discrimination | CVPR | code | 153 |
Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images | ECCV | code | 153 |
ST-GAN: Spatial Transformer Generative Adversarial Networks for Image Compositing | CVPR | code | 147 |
An End-to-End TextSpotter With Explicit Alignment and Attention | CVPR | code | 147 |
Social GAN: Socially Acceptable Trajectories With Generative Adversarial Networks | CVPR | code | 145 |
Attentive Generative Adversarial Network for Raindrop Removal From a Single Image | CVPR | code | 144 |
ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation | ECCV | code | 142 |
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices | CVPR | code | 141 |
Single View Stereo Matching | CVPR | code | 141 |
Learning Category-Specific Mesh Reconstruction from Image Collections | ECCV | code | 138 |
Optimizing Video Object Detection via a Scale-Time Lattice | CVPR | code | 135 |
Facelet-Bank for Fast Portrait Manipulation | CVPR | code | 133 |
PlaneNet: Piece-Wise Planar Reconstruction From a Single RGB Image | CVPR | code | 133 |
Large-Scale Point Cloud Semantic Segmentation With Superpoint Graphs | CVPR | code | 133 |
MegaDepth: Learning Single-View Depth Prediction From Internet Photos | CVPR | code | 132 |
Group Normalization | ECCV | code | 131 |
DeblurGAN: Blind Motion Deblurring Using Conditional Adversarial Networks | CVPR | code | 130 |
Learning a Single Convolutional Super-Resolution Network for Multiple Degradations | CVPR | code | 129 |
Self-Imitation Learning | ICML | code | 128 |
Two-Stream Convolutional Networks for Dynamic Texture Synthesis | CVPR | code | 127 |
Unsupervised Cross-Dataset Person Re-Identification by Transfer Learning of Spatial-Temporal Patterns | CVPR | code | 126 |
Densely Connected Pyramid Dehazing Network | CVPR | code | 126 |
Residual Dense Network for Image Super-Resolution | CVPR | code | 122 |
ECO: Efficient Convolutional Network for Online Video Understanding | ECCV | code | 121 |
Camera Style Adaptation for Person Re-Identification | CVPR | code | 117 |
SO-Net: Self-Organizing Network for Point Cloud Analysis | CVPR | code | 117 |
Context Embedding Networks | CVPR | code | 115 |
Embodied Question Answering | CVPR | code | 113 |
A Style-Aware Content Loss for Real-time HD Style Transfer | ECCV | code | 113 |
Neural Motifs: Scene Graph Parsing With Global Context | CVPR | code | 113 |
Fast and Accurate Online Video Object Segmentation via Tracking Parts | CVPR | code | 113 |
Image-Image Domain Adaptation With Preserved Self-Similarity and Domain-Dissimilarity for Person Re-Identification | CVPR | code | 113 |
LSTM Pose Machines | CVPR | code | 111 |
Weakly Supervised Instance Segmentation Using Class Peak Response | CVPR | code | 111 |
Gated Path Planning Networks | ICML | code | 110 |
Image Super-Resolution Using Very Deep Residual Channel Attention Networks | ECCV | code | 110 |
PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning | CVPR | code | 109 |
Recurrent Squeeze-and-Excitation Context Aggregation Net for Single Image Deraining | ECCV | code | 109 |
Cross-Domain Weakly-Supervised Object Detection Through Progressive Domain Adaptation | CVPR | code | 107 |
BSN: Boundary Sensitive Network for Temporal Action Proposal Generation | ECCV | code | 106 |
A Closer Look at Spatiotemporal Convolutions for Action Recognition | CVPR | code | 106 |
Decoupled Networks | CVPR | code | 106 |
Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling | CVPR | code | 105 |
Weakly and Semi Supervised Human Body Part Parsing via Pose-Guided Knowledge Transfer | CVPR | code | 104 |
Hierarchical Imitation and Reinforcement Learning | ICML | code | 100 |
Structure Inference Net: Object Detection Using Scene-Level Context and Instance-Level Relationships | CVPR | code | 100 |
Long-term Tracking in the Wild: a Benchmark | ECCV | code | 99 |
CosFace: Large Margin Cosine Loss for Deep Face Recognition | CVPR | code | 99 |
MVSNet: Depth Inference for Unstructured Multi-view Stereo | ECCV | code | 98 |
Recovering Realistic Texture in Image Super-Resolution by Deep Spatial Feature Transform | CVPR | code | 98 |
MultiPoseNet: Fast Multi-Person Pose Estimation using Pose Residual Network | ECCV | code | 97 |
Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics | CVPR | code | 96 |
DeepMVS: Learning Multi-View Stereopsis | CVPR | code | 95 |
Noisy Natural Gradient as Variational Inference | ICML | code | 94 |
Factoring Shape, Pose, and Layout From the 2D Image of a 3D Scene | CVPR | code | 93 |
Rethinking Feature Distribution for Loss Functions in Image Classification | CVPR | code | 93 |
3D-CODED: 3D Correspondences by Deep Deformation | ECCV | code | 92 |
Learning to Compare: Relation Network for Few-Shot Learning | CVPR | code | 91 |
Real-Time Seamless Single Shot 6D Object Pose Prediction | CVPR | code | 90 |
Domain Adaptive Faster R-CNN for Object Detection in the Wild | CVPR | code | 90 |
Density-Aware Single Image De-Raining Using a Multi-Stream Dense Network | CVPR | code | 89 |
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis | ICML | code | 89 |
Deep Back-Projection Networks for Super-Resolution | CVPR | code | 89 |
PU-Net: Point Cloud Upsampling Network | CVPR | code | 89 |
MAttNet: Modular Attention Network for Referring Expression Comprehension | CVPR | code | 89 |
Knowledge Aided Consistency for Weakly Supervised Phrase Grounding | CVPR | code | 86 |
DenseASPP for Semantic Segmentation in Street Scenes | CVPR | code | 85 |
Scale-Recurrent Network for Deep Image Deblurring | CVPR | code | 83 |
Quantized Densely Connected U-Nets for Efficient Landmark Localization | ECCV | code | 82 |
Deep Depth Completion of a Single RGB-D Image | CVPR | code | 82 |
Unsupervised Learning of Monocular Depth Estimation and Visual Odometry With Deep Feature Reconstruction | CVPR | code | 81 |
End-to-End Weakly-Supervised Semantic Alignment | CVPR | code | 81 |
Piggyback: Adapting a Single Network to Multiple Tasks by Learning to Mask Weights | ECCV | code | 80 |
Repulsion Loss: Detecting Pedestrians in a Crowd | CVPR | code | 79 |
ELEGANT: Exchanging Latent Encodings with GAN for Transferring Multiple Face Attributes | ECCV | code | 78 |
Video Based Reconstruction of 3D People Models | CVPR | code | 78 |
Multi-View Consistency as Supervisory Signal for Learning Shape and Pose Prediction | CVPR | code | 75 |
Learning Spatial-Temporal Regularized Correlation Filters for Visual Tracking | CVPR | code | 75 |
A PID Controller Approach for Stochastic Optimization of Deep Networks | CVPR | code | 73 |
BodyNet: Volumetric Inference of 3D Human Body Shapes | ECCV | code | 73 |
Neural Kinematic Networks for Unsupervised Motion Retargetting | CVPR | code | 72 |
Adaptive Affinity Fields for Semantic Segmentation | ECCV | code | 72 |
Learning Blind Video Temporal Consistency | ECCV | code | 72 |
Attention-based Deep Multiple Instance Learning | ICML | code | 71 |
Image Inpainting for Irregular Holes Using Partial Convolutions | ECCV | code | 71 |
Deep Mutual Learning | CVPR | code | 71 |
Nonlinear 3D Face Morphable Model | CVPR | code | 71 |
VITAL: VIsual Tracking via Adversarial Learning | CVPR | code | 70 |
Multi-Scale Location-Aware Kernel Representation for Object Detection | CVPR | code | 70 |
FSRNet: End-to-End Learning Face Super-Resolution With Facial Priors | CVPR | code | 70 |
Macro-Micro Adversarial Network for Human Parsing | ECCV | code | 69 |
Tell Me Where to Look: Guided Attention Inference Network | CVPR | code | 69 |
VITON: An Image-Based Virtual Try-On Network | CVPR | code | 68 |
Graph R-CNN for Scene Graph Generation | ECCV | code | 68 |
Recurrent Pixel Embedding for Instance Grouping | CVPR | code | 67 |
Visual Feature Attribution Using Wasserstein GANs | CVPR | code | 67 |
Synthesizing Images of Humans in Unseen Poses | CVPR | code | 66 |
Future Frame Prediction for Anomaly Detection – A New Baseline | CVPR | code | 66 |
LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks | ECCV | code | 66 |
PSANet: Point-wise Spatial Attention Network for Scene Parsing | ECCV | code | 62 |
Perturbative Neural Networks | CVPR | code | 60 |
Learning SO(3) Equivariant Representations with Spherical CNNs | ECCV | code | 60 |
Repeatability Is Not Enough: Learning Affine Regions via Discriminability | ECCV | code | 59 |
Avatar-Net: Multi-Scale Zero-Shot Style Transfer by Feature Decoration | CVPR | code | 59 |
ScanComplete: Large-Scale Scene Completion and Semantic Segmentation for 3D Scans | CVPR | code | 59 |
Learning Human-Object Interactions by Graph Parsing Neural Networks | ECCV | code | 59 |
Optimizing the Latent Space of Generative Networks | ICML | code | 58 |
Multi-Shot Pedestrian Re-Identification via Sequential Decision Making | CVPR | code | 58 |
Multi-view to Novel view: Synthesizing novel views with Self-Learned Confidence | ECCV | code | 57 |
Decorrelated Batch Normalization | CVPR | code | 56 |
Pointwise Convolutional Neural Networks | CVPR | code | 54 |
PointNetVLAD: Deep Point Cloud Based Retrieval for Large-Scale Place Recognition | CVPR | code | 54 |
Learning Attentions: Residual Attentional Siamese Network for High Performance Online Visual Tracking | CVPR | code | 54 |
Generalizing A Person Retrieval Model Hetero- and Homogeneously | ECCV | code | 54 |
Generate to Adapt: Aligning Domains Using Generative Adversarial Networks | CVPR | code | 54 |
Regularizing RNNs for Caption Generation by Reconstructing the Past With the Present | CVPR | code | 53 |
Neural Program Synthesis from Diverse Demonstration Videos | ICML | code | 53 |
Path-Level Network Transformation for Efficient Architecture Search | ICML | code | 53 |
Geometry-Aware Learning of Maps for Camera Localization | CVPR | code | 53 |
Improving Generalization via Scalable Neighborhood Component Analysis | ECCV | code | 52 |
Adversarial Feature Augmentation for Unsupervised Domain Adaptation | CVPR | code | 52 |
Unsupervised Discovery of Object Landmarks as Structural Representations | CVPR | code | 51 |
Learning Latent Super-Events to Detect Multiple Activities in Videos | CVPR | code | 51 |
Progressive Neural Architecture Search | ECCV | code | 51 |
Learning to Reweight Examples for Robust Deep Learning | ICML | code | 51 |
Unsupervised Domain Adaptation for 3D Keypoint Estimation via View Consistency | ECCV | code | 51 |
Ordinal Depth Supervision for 3D Human Pose Estimation | CVPR | code | 51 |
Learning Depth From Monocular Videos Using Direct Methods | CVPR | code | 51 |
Deep Marching Cubes: Learning Explicit Surface Representations | CVPR | code | 51 |
Object Level Visual Reasoning in Videos | ECCV | code | 50 |
Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation | ECCV | code | 50 |
Disentangled Person Image Generation | CVPR | code | 50 |
Crafting a Toolchain for Image Restoration by Deep Reinforcement Learning | CVPR | code | 50 |
Depth-aware CNN for RGB-D Segmentation | ECCV | code | 50 |
Neural Style Transfer via Meta Networks | CVPR | code | 49 |
Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval | CVPR | code | 49 |
“Zero-Shot” Super-Resolution Using Deep Internal Learning | CVPR | code | 49 |
Leveraging Unlabeled Data for Crowd Counting by Learning to Rank | CVPR | code | 48 |
CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes | CVPR | code | 48 |
Shift-Net: Image Inpainting via Deep Feature Rearrangement | ECCV | code | 47 |
Discriminability Objective for Training Descriptive Captions | CVPR | code | 46 |
SGPN: Similarity Group Proposal Network for 3D Point Cloud Instance Segmentation | CVPR | code | 46 |
Diverse Image-to-Image Translation via Disentangled Representations | ECCV | code | 46 |
SparseMAP: Differentiable Sparse Structured Inference | ICML | code | 46 |
Fighting Fake News: Image Splice Detection via Learned Self-Consistency | ECCV | code | 45 |
Fast and Accurate Single Image Super-Resolution via Information Distillation Network | CVPR | code | 45 |
Wasserstein Introspective Neural Networks | CVPR | code | 44 |
Efficient end-to-end learning for quantizable representations | ICML | code | 44 |
Learning Less Is More - 6D Camera Localization via 3D Surface Regression | CVPR | code | 43 |
Learning Pose Specific Representations by Predicting Different Views | CVPR | code | 42 |
Weakly-Supervised Semantic Segmentation Network With Deep Seeded Region Growing | CVPR | code | 41 |
Jointly Optimize Data Augmentation and Network Training: Adversarial Data Augmentation in Human Pose Estimation | CVPR | code | 41 |
PlaneMatch: Patch Coplanarity Prediction for Robust RGB-D Reconstruction | ECCV | code | 41 |
Learning From Synthetic Data: Addressing Domain Shift for Semantic Segmentation | CVPR | code | 41 |
Learning to Find Good Correspondences | CVPR | code | 41 |
Hierarchical Long-term Video Prediction without Supervision | ICML | code | 41 |
Conditional Probability Models for Deep Image Compression | CVPR | code | 40 |
Measuring abstract reasoning in neural networks | ICML | code | 40 |
BlockDrop: Dynamic Inference Paths in Residual Networks | CVPR | code | 40 |
Fast and Scalable Bayesian Deep Learning by Weight-Perturbation in Adam | ICML | code | 40 |
PDE-Net: Learning PDEs from Data | ICML | code | 39 |
Exploit the Unknown Gradually: One-Shot Video-Based Person Re-Identification by Stepwise Learning | CVPR | code | 39 |
DF-Net: Unsupervised Joint Learning of Depth and Flow using Cross-Task Consistency | ECCV | code | 38 |
Differentiable Compositional Kernel Learning for Gaussian Processes | ICML | code | 38 |
Pairwise Confusion for Fine-Grained Visual Classification | ECCV | code | 37 |
Learning Intrinsic Image Decomposition From Watching the World | CVPR | code | 36 |
Rotation-Sensitive Regression for Oriented Scene Text Detection | CVPR | code | 36 |
Coloring with Words: Guiding Image Colorization Through Text-based Palette Generation | ECCV | code | 35 |
Learning Rigidity in Dynamic Scenes with a Moving Camera for 3D Motion Field Estimation | ECCV | code | 34 |
RayNet: Learning Volumetric 3D Reconstruction With Ray Potentials | CVPR | code | 34 |
Learning Pixel-Level Semantic Affinity With Image-Level Supervision for Weakly Supervised Semantic Segmentation | CVPR | code | 33 |
Robust Classification With Convolutional Prototype Learning | CVPR | code | 33 |
SketchyGAN: Towards Diverse and Realistic Sketch to Image Synthesis | CVPR | code | 33 |
Deep k-Means: Re-Training and Parameter Sharing with Harder Cluster Assignments for Compressing Deep Convolutions | ICML | code | 33 |
Extracting Automata from Recurrent Neural Networks Using Queries and Counterexamples | ICML | code | 32 |
Surface Networks | CVPR | code | 32 |
Self-produced Guidance for Weakly-supervised Object Localization | ECCV | code | 32 |
Real-World Anomaly Detection in Surveillance Videos | CVPR | code | 31 |
Semi-supervised Adversarial Learning to Generate Photorealistic Face Images of New Identities from 3D Morphable Model | ECCV | code | 31 |
Cascade R-CNN: Delving Into High Quality Object Detection | CVPR | code | 30 |
Human Semantic Parsing for Person Re-Identification | CVPR | code | 30 |
Actor and Observer: Joint Modeling of First and Third-Person Videos | CVPR | code | 30 |
Hyperbolic Entailment Cones for Learning Hierarchical Embeddings | ICML | code | 30 |
Neural Autoregressive Flows | ICML | code | 30 |
Frame-Recurrent Video Super-Resolution | CVPR | code | 29 |
Towards Binary-Valued Gates for Robust LSTM Training | ICML | code | 29 |
Choose Your Neuron: Incorporating Domain Knowledge through Neuron-Importance | ECCV | code | 29 |
Overcoming Catastrophic Forgetting with Hard Attention to the Task | ICML | code | 29 |
Generative Adversarial Perturbations | CVPR | code | 29 |
Visualizing and Understanding Atari Agents | ICML | code | 29 |
3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation | ECCV | code | 29 |
PredRNN++: Towards A Resolution of the Deep-in-Time Dilemma in Spatiotemporal Predictive Learning | ICML | code | 29 |
Partial Adversarial Domain Adaptation | ECCV | code | 28 |
FOTS: Fast Oriented Text Spotting With a Unified Network | CVPR | code | 28 |
NetGAN: Generating Graphs via Random Walks | ICML | code | 28 |
Few-Shot Image Recognition by Predicting Parameters From Activations | CVPR | code | 28 |
DeepVS: A Deep Learning Based Video Saliency Prediction Approach | ECCV | code | 27 |
Superpixel Sampling Networks | ECCV | code | 27 |
Deflecting Adversarial Attacks With Pixel Deflection | CVPR | code | 27 |
Gated Fusion Network for Single Image Dehazing | CVPR | code | 27 |
Learning-based Video Motion Magnification | ECCV | code | 27 |
TOM-Net: Learning Transparent Object Matting From a Single Image | CVPR | code | 26 |
Pose-Normalized Image Generation for Person Re-identification | ECCV | code | 26 |
Mean Field Multi-Agent Reinforcement Learning | ICML | code | 26 |
Video Re-localization | ECCV | code | 25 |
Recurrent Scene Parsing With Perspective Understanding in the Loop | CVPR | code | 25 |
Single-Image Depth Estimation Based on Fourier Domain Analysis | CVPR | code | 25 |
Semi-Dense 3D Reconstruction with a Stereo Event Camera | ECCV | code | 25 |
Dense Pose Transfer | ECCV | code | 25 |
Single Image Reflection Separation With Perceptual Losses | CVPR | code | 25 |
Occlusion Aware Unsupervised Learning of Optical Flow | CVPR | code | 25 |
Scale-Awareness of Light Field Camera based Visual Odometry | ECCV | code | 25 |
SGAN: An Alternative Training of Generative Adversarial Networks | CVPR | code | 25 |
Monocular Relative Depth Perception With Web Stereo Data Supervision | CVPR | code | 25 |
Left-Right Comparative Recurrent Model for Stereo Matching | CVPR | code | 25 |
Learning Priors for Semantic 3D Reconstruction | ECCV | code | 25 |
Unsupervised Deep Generative Adversarial Hashing Network | CVPR | code | 25 |
Image Correction via Deep Reciprocating HDR Transformation | CVPR | code | 25 |
Fully Convolutional Adaptation Networks for Semantic Segmentation | CVPR | code | 25 |
Discovering Point Lights With Intensity Distance Fields | CVPR | code | 25 |
Dynamic Video Segmentation Network | CVPR | code | 25 |
A Unifying Contrast Maximization Framework for Event Cameras, With Applications to Motion, Depth, and Optical Flow Estimation | CVPR | code | 25 |
Robust Depth Estimation From Auto Bracketed Images | CVPR | code | 25 |
PAD-Net: Multi-Tasks Guided Prediction-and-Distillation Network for Simultaneous Depth Estimation and Scene Parsing | CVPR | code | 25 |
Context Encoding for Semantic Segmentation | CVPR | code | 25 |
Human-Centric Indoor Scene Synthesis Using Stochastic Grammar | CVPR | code | 25 |
Revisiting Dilated Convolution: A Simple Approach for Weakly- and Semi-Supervised Semantic Segmentation | CVPR | code | 25 |
Black-box Adversarial Attacks with Limited Queries and Information | ICML | code | 24 |
SketchyScene: Richly-Annotated Scene Sketches | ECCV | code | 24 |
SeGAN: Segmenting and Generating the Invisible | CVPR | code | 24 |
Video Rain Streak Removal by Multiscale Convolutional Sparse Coding | CVPR | code | 24 |
CleanNet: Transfer Learning for Scalable Image Classifier Training With Label Noise | CVPR | code | 24 |
Learning Semantic Representations for Unsupervised Domain Adaptation | ICML | code | 23 |
Controllable Video Generation With Sparse Trajectories | CVPR | code | 23 |
Multi-Agent Diverse Generative Adversarial Networks | CVPR | code | 23 |
Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image | ECCV | code | 23 |
Real-Time Monocular Depth Estimation Using Synthetic Data With Domain Adaptation via Image Style Transfer | CVPR | code | 23 |
On the Robustness of Semantic Segmentation Models to Adversarial Attacks | CVPR | code | 23 |
Interpretable Convolutional Neural Networks | CVPR | code | 22 |
Super SloMo: High Quality Estimation of Multiple Intermediate Frames for Video Interpolation | CVPR | code | 22 |
Zero-Shot Object Detection | ECCV | code | 22 |
CBAM: Convolutional Block Attention Module | ECCV | code | 22 |
Deep Texture Manifold for Ground Terrain Recognition | CVPR | code | 22 |
First Order Generative Adversarial Networks | ICML | code | 22 |
Learning to Evaluate Image Captioning | CVPR | code | 21 |
Bidirectional Feature Pyramid Network with Recurrent Attention Residual Modules for Shadow Detection | ECCV | code | 21 |
Interpretable Intuitive Physics Model | ECCV | code | 21 |
Quaternion Convolutional Neural Networks | ECCV | code | 21 |
Hierarchical Bilinear Pooling for Fine-Grained Visual Recognition | ECCV | code | 21 |
Deep Model-Based 6D Pose Refinement in RGB | ECCV | code | 21 |
Tangent Convolutions for Dense Prediction in 3D | CVPR | code | 21 |
Deep Clustering for Unsupervised Learning of Visual Features | ECCV | code | 21 |
Revisiting Video Saliency: A Large-Scale Benchmark and a New Model | CVPR | code | 21 |
Blazingly Fast Video Object Segmentation With Pixel-Wise Metric Learning | CVPR | code | 20 |
CBMV: A Coalesced Bidirectional Matching Volume for Disparity Estimation | CVPR | code | 20 |
Learning Warped Guidance for Blind Face Restoration | ECCV | code | 20 |
CondenseNet: An Efficient DenseNet Using Learned Group Convolutions | CVPR | code | 20 |
The Sound of Pixels | ECCV | code | 20 |
Conditional Image-to-Image Translation | CVPR | code | 20 |
IQA: Visual Question Answering in Interactive Environments | CVPR | code | 19 |
Exploring Disentangled Feature Representation Beyond Face Identification | CVPR | code | 19 |
Learning Convolutional Networks for Content-Weighted Image Compression | CVPR | code | 19 |
Visual Question Generation as Dual Task of Visual Question Answering | CVPR | code | 19 |
Disentangling by Factorising | ICML | code | 19 |
Between-Class Learning for Image Classification | CVPR | code | 18 |
Layer-structured 3D Scene Inference via View Synthesis | ECCV | code | 18 |
Partial Transfer Learning With Selective Adversarial Networks | CVPR | code | 18 |
Improving Shape Deformation in Unsupervised Image-to-Image Translation | ECCV | code | 18 |
GraphBit: Bitwise Interaction Mining via Deep Reinforcement Learning | CVPR | code | 18 |
Part-Aligned Bilinear Representations for Person Re-Identification | ECCV | code | 18 |
Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation | ECCV | code | 18 |
A Trilateral Weighted Sparse Coding Scheme for Real-World Image Denoising | ECCV | code | 18 |
EC-Net: an Edge-aware Point set Consolidation Network | ECCV | code | 18 |
Eye In-Painting With Exemplar Generative Adversarial Networks | CVPR | code | 18 |
CSGNet: Neural Shape Parser for Constructive Solid Geometry | CVPR | code | 17 |
Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings | ICML | code | 17 |
Learning to Promote Saliency Detectors | CVPR | code | 17 |
Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation | CVPR | code | 17 |
Pose Partition Networks for Multi-Person Pose Estimation | ECCV | code | 17 |
Exploiting the Potential of Standard Convolutional Autoencoders for Image Restoration by Evolutionary Search | ICML | code | 17 |
Graph-Cut RANSAC | CVPR | code | 17 |
A Generative Adversarial Approach for Zero-Shot Learning From Noisy Texts | CVPR | code | 16 |
Deep Regression Tracking with Shrinkage Loss | ECCV | code | 16 |
Learning to Blend Photos | ECCV | code | 15 |
Visual Question Reasoning on General Dependency Tree | CVPR | code | 15 |
Coded Sparse Matrix Multiplication | ICML | code | 15 |
Stacked Conditional Generative Adversarial Networks for Jointly Learning Shadow Detection and Shadow Removal | CVPR | code | 15 |
Stacked Cross Attention for Image-Text Matching | ECCV | code | 15 |
Im2Flow: Motion Hallucination From Static Images for Action Recognition | CVPR | code | 15 |
Who Let the Dogs Out? Modeling Dog Behavior From Visual Data | CVPR | code | 15 |
Towards Human-Machine Cooperation: Self-Supervised Sample Mining for Object Detection | CVPR | code | 15 |
Learning Single-View 3D Reconstruction with Limited Pose Supervision | ECCV | code | 15 |
LEGO: Learning Edge With Geometry All at Once by Watching Videos | CVPR | code | 15 |
Integral Human Pose Regression | ECCV | code | 15 |
Action Sets: Weakly Supervised Action Segmentation Without Ordering Constraints | CVPR | code | 15 |
Matching Adversarial Networks | CVPR | code | 15 |
Functional Gradient Boosting based on Residual Network Perception | ICML | code | 14 |
Diversity Regularized Spatiotemporal Attention for Video-Based Person Re-Identification | CVPR | code | 14 |
EPINET: A Fully-Convolutional Neural Network Using Epipolar Geometry for Depth From Light Field Images | CVPR | code | 14 |
Learning Dynamic Memory Networks for Object Tracking | ECCV | code | 14 |
Hallucinated-IQA: No-Reference Image Quality Assessment via Adversarial Learning | CVPR | code | 14 |
Deep High Dynamic Range Imaging with Large Foreground Motions | ECCV | code | 14 |
Adversarially Learned One-Class Classifier for Novelty Detection | CVPR | code | 14 |
SYQ: Learning Symmetric Quantization for Efficient Deep Neural Networks | CVPR | code | 14 |
Cross-Modal Deep Variational Hand Pose Estimation | CVPR | code | 14 |
PieAPP: Perceptual Image-Error Assessment Through Pairwise Preference | CVPR | code | 14 |
Learning Generative ConvNets via Multi-Grid Modeling and Sampling | CVPR | code | 14 |
Crowd Counting With Deep Negative Correlation Learning | CVPR | code | 14 |
Learning Descriptor Networks for 3D Shape Synthesis and Analysis | CVPR | code | 14 |
Revisiting Deep Intrinsic Image Decompositions | CVPR | code | 14 |
A Spectral Approach to Gradient Estimation for Implicit Distributions | ICML | code | 14 |
Single Shot Scene Text Retrieval | ECCV | code | 14 |
Flow-Grounded Spatial-Temporal Video Prediction from Still Images | ECCV | code | 14 |
Dimensionality-Driven Learning with Noisy Labels | ICML | code | 14 |
Conditional Image-Text Embedding Networks | ECCV | code | 14 |
Weakly-Supervised Action Segmentation With Iterative Soft Boundary Assignment | CVPR | code | 14 |
Grounding Referring Expressions in Images by Variational Context | CVPR | code | 13 |
Fully Motion-Aware Network for Video Object Detection | ECCV | code | 13 |
Multi-Scale Weighted Nuclear Norm Image Restoration | CVPR | code | 13 |
Deep Randomized Ensembles for Metric Learning | ECCV | code | 13 |
Learning to Navigate for Fine-grained Classification | ECCV | code | 13 |
CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images | ECCV | code | 13 |
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering | CVPR | code | 13 |
Bayesian Optimization of Combinatorial Structures | ICML | code | 13 |
GeoDesc: Learning Local Descriptors by Integrating Geometry Constraints | ECCV | code | 13 |
Gesture Recognition: Focus on the Hands | CVPR | code | 12 |
PiCANet: Learning Pixel-Wise Contextual Attention for Saliency Detection | CVPR | code | 12 |
Zero-Shot Visual Recognition Using Semantics-Preserving Adversarial Embedding Networks | CVPR | code | 12 |
FeaStNet: Feature-Steered Graph Convolutions for 3D Shape Analysis | CVPR | code | 12 |
Audio-Visual Event Localization in Unconstrained Videos | ECCV | code | 12 |
Glimpse Clouds: Human Activity Recognition From Unstructured Feature Points | CVPR | code | 12 |
Explainable Neural Computation via Stack Neural Module Networks | ECCV | code | 12 |
Efficient Neural Audio Synthesis | ICML | code | 12 |
Gradient-Based Meta-Learning with Learned Layerwise Metric and Subspace | ICML | code | 12 |
Learning and Using the Arrow of Time | CVPR | code | 12 |
NAG: Network for Adversary Generation | CVPR | code | 11 |
Convolutional Sequence to Sequence Model for Human Dynamics | CVPR | code | 11 |
Face Aging With Identity-Preserved Conditional Generative Adversarial Networks | CVPR | code | 11 |
Hierarchical Multi-Label Classification Networks | ICML | code | 11 |
LiDAR-Video Driving Dataset: Learning Driving Policies Effectively | CVPR | code | 11 |
Dynamic Multimodal Instance Segmentation Guided by Natural Language Queries | ECCV | code | 11 |
Triplet Loss in Siamese Network for Object Tracking | ECCV | code | 11 |
Joint Optimization Framework for Learning With Noisy Labels | CVPR | code | 11 |
Decouple Learning for Parameterized Image Operators | ECCV | code | 11 |
Disentangling Factors of Variation with Cycle-Consistent Variational Auto-Encoders | ECCV | code | 11 |
Image Transformer | ICML | code | 11 |
Convolutional Image Captioning | CVPR | code | 11 |
ISTA-Net: Interpretable Optimization-Inspired Deep Network for Image Compressive Sensing |
CVPR |
code |
11 |
Fast Video Object Segmentation by Reference-Guided Mask Propagation |
CVPR |
code |
10 |
Anonymous Walk Embeddings |
ICML |
code |
10 |
Generalized Earley Parser: Bridging Symbolic Grammars and Sequence Data for Future Prediction |
ICML |
code |
10 |
Joint Pose and Expression Modeling for Facial Expression Recognition |
CVPR |
code |
10 |
Meta-Learning by Adjusting Priors Based on Extended PAC-Bayes Theory |
ICML |
code |
10 |
Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning |
CVPR |
code |
10 |
Min-Entropy Latent Model for Weakly Supervised Object Detection |
CVPR |
code |
10 |
Learning 3D Shape Completion From Laser Scan Data With Weak Supervision |
CVPR |
code |
10 |
Total Capture: A 3D Deformation Model for Tracking Faces, Hands, and Bodies |
CVPR |
code |
10 |
Open Set Domain Adaptation by Backpropagation |
ECCV |
code |
10 |
Sliced Wasserstein Distance for Learning Gaussian Mixture Models |
CVPR |
code |
10 |
Finding Influential Training Samples for Gradient Boosted Decision Trees |
ICML |
code |
10 |
3DFeat-Net: Weakly Supervised Local 3D Features for Point Cloud Registration |
ECCV |
code |
10 |
Accelerating Natural Gradient with Higher-Order Invariance |
ICML |
code |
10 |
Neural Sign Language Translation |
CVPR |
code |
10 |
Learning Transferable Architectures for Scalable Image Recognition |
CVPR |
code |
10 |
Towards Faster Training of Global Covariance Pooling Networks by Iterative Matrix Square Root Normalization |
CVPR |
code |
10 |
Deep Expander Networks: Efficient Deep Networks from Graph Theory |
ECCV |
code |
10 |
Learning Rich Features for Image Manipulation Detection |
CVPR |
code |
10 |
Learning to Understand Image Blur |
CVPR |
code |
10 |
Synthesizing Robust Adversarial Examples |
ICML |
code |
9 |
Multi-scale Residual Network for Image Super-Resolution |
ECCV |
code |
9 |
Learning to Forecast and Refine Residual Motion for Image-to-Video Generation |
ECCV |
code |
9 |
Adversarial Complementary Learning for Weakly Supervised Object Localization |
CVPR |
code |
9 |
Local Spectral Graph Convolution for Point Set Feature Learning |
ECCV |
code |
9 |
Toward Characteristic-Preserving Image-based Virtual Try-On Network |
ECCV |
code |
9 |
Predicting Gaze in Egocentric Video by Learning Task-dependent Attention Transition |
ECCV |
code |
9 |
Mix and Match Networks: Encoder-Decoder Alignment for Zero-Pair Image Translation |
CVPR |
code |
9 |
Objects that Sound |
ECCV |
code |
9 |
Weakly- and Semi-Supervised Panoptic Segmentation |
ECCV |
code |
9 |
Deep Diffeomorphic Transformer Networks |
CVPR |
code |
9 |
Learning Dynamics of Linear Denoising Autoencoders |
ICML |
code |
9 |
Boosting Domain Adaptation by Discovering Latent Domains |
CVPR |
code |
8 |
Learning Superpixels With Segmentation-Aware Affinity Loss |
CVPR |
code |
8 |
Deep Variational Reinforcement Learning for POMDPs |
ICML |
code |
8 |
Hierarchical Relational Networks for Group Activity Recognition and Retrieval |
ECCV |
code |
8 |
Cross-View Image Synthesis Using Conditional GANs |
CVPR |
code |
8 |
Low-Shot Learning With Large-Scale Diffusion |
CVPR |
code |
8 |
Adversarial Time-to-Event Modeling |
ICML |
code |
8 |
Adversarial Attack on Graph Structured Data |
ICML |
code |
8 |
Hashing as Tie-Aware Learning to Rank |
CVPR |
code |
8 |
Excitation Backprop for RNNs |
CVPR |
code |
8 |
Multimodal Explanations: Justifying Decisions and Pointing to the Evidence |
CVPR |
code |
8 |
Fast Information-theoretic Bayesian Optimisation |
ICML |
code |
8 |
Arbitrary Style Transfer With Deep Feature Reshuffle |
CVPR |
code |
8 |
AMNet: Memorability Estimation With Attention |
CVPR |
code |
8 |
Learning Efficient Single-stage Pedestrian Detectors by Asymptotic Localization Fitting |
ECCV |
code |
7 |
Analyzing Uncertainty in Neural Machine Translation |
ICML |
code |
7 |
Clipped Action Policy Gradient |
ICML |
code |
7 |
Deep One-Class Classification |
ICML |
code |
7 |
Future Person Localization in First-Person Videos |
CVPR |
code |
7 |
Fine-Grained Visual Categorization using Meta-Learning Optimization with Sample Selection of Auxiliary Data |
ECCV |
code |
7 |
Learning to Explain: An Information-Theoretic Perspective on Model Interpretation |
ICML |
code |
7 |
Learning a Discriminative Feature Network for Semantic Segmentation |
CVPR |
code |
7 |
Collaborative and Adversarial Network for Unsupervised Domain Adaptation |
CVPR |
code |
7 |
CTAP: Complementary Temporal Action Proposal Generation |
ECCV |
code |
7 |
Learning Type-Aware Embeddings for Fashion Compatibility |
ECCV |
code |
6 |
Detecting and Correcting for Label Shift with Black Box Predictors |
ICML |
code |
6 |
Rethinking the Form of Latent States in Image Captioning |
ECCV |
code |
6 |
HashGAN: Deep Learning to Hash With Pair Conditional Wasserstein GAN |
CVPR |
code |
6 |
Cross-Domain Self-Supervised Multi-Task Feature Learning Using Synthetic Imagery |
CVPR |
code |
6 |
Salient Object Detection Driven by Fixation Prediction |
CVPR |
code |
6 |
Learning by Asking Questions |
CVPR |
code |
6 |
A Memory Network Approach for Story-Based Temporal Summarization of 360° Videos |
CVPR |
code |
6 |
AON: Towards Arbitrarily-Oriented Text Recognition |
CVPR |
code |
6 |
Mask-Guided Contrastive Attention Model for Person Re-Identification |
CVPR |
code |
6 |
Hierarchical Novelty Detection for Visual Object Recognition |
CVPR |
code |
6 |
Generative Adversarial Learning Towards Fast Weakly Supervised Detection |
CVPR |
code |
6 |
Deep Learning Under Privileged Information Using Heteroscedastic Dropout |
CVPR |
code |
6 |
Learning a Discriminative Filter Bank Within a CNN for Fine-Grained Recognition |
CVPR |
code |
6 |
A Two-Step Disentanglement Method |
CVPR |
code |
6 |
Variational Autoencoders for Deforming 3D Mesh Models |
CVPR |
code |
6 |
Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering |
CVPR |
code |
5 |
Five-Point Fundamental Matrix Estimation for Uncalibrated Cameras |
CVPR |
code |
5 |
Blind Justice: Fairness with Encrypted Sensitive Attributes |
ICML |
code |
5 |
Towards Open-Set Identity Preserving Face Synthesis |
CVPR |
code |
5 |
High-Quality Prediction Intervals for Deep Learning: A Distribution-Free, Ensembled Approach |
ICML |
code |
5 |
Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks |
CVPR |
code |
5 |
The Mirage of Action-Dependent Baselines in Reinforcement Learning |
ICML |
code |
5 |
Learning to Separate Object Sounds by Watching Unlabeled Video |
ECCV |
code |
5 |
Dynamic-Structured Semantic Propagation Network |
CVPR |
code |
5 |
Specular-to-Diffuse Translation for Multi-View Reconstruction |
ECCV |
code |
5 |
Image Manipulation with Perceptual Discriminators |
ECCV |
code |
5 |
Unsupervised holistic image generation from key local patches |
ECCV |
code |
5 |
Gradually Updated Neural Networks for Large-Scale Image Recognition |
ICML |
code |
5 |
Pose Proposal Networks |
ECCV |
code |
5 |
Adversarial Learning with Local Coordinate Coding |
ICML |
code |
5 |
Machine Theory of Mind |
ICML |
code |
5 |
Transfer Learning via Learning to Transfer |
ICML |
code |
5 |
AttnGAN: Fine-Grained Text to Image Generation With Attentional Generative Adversarial Networks |
CVPR |
code |
5 |
Decoupled Parallel Backpropagation with Convergence Guarantee |
ICML |
code |
5 |
Low-Shot Learning With Imprinted Weights |
CVPR |
code |
5 |
Human Pose Estimation With Parsing Induced Learner |
CVPR |
code |
4 |
Unsupervised Geometry-Aware Representation for 3D Human Pose Estimation |
ECCV |
code |
4 |
Ultra Large-Scale Feature Selection using Count-Sketches |
ICML |
code |
4 |
Explicit Inductive Bias for Transfer Learning with Convolutional Networks |
ICML |
code |
4 |
Dynamic Conditional Networks for Few-Shot Learning |
ECCV |
code |
4 |
Modeling Sparse Deviations for Compressed Sensing using Generative Models |
ICML |
code |
4 |
Inference Suboptimality in Variational Autoencoders |
ICML |
code |
4 |
Pixels, Voxels, and Views: A Study of Shape Representations for Single View 3D Object Shape Prediction |
CVPR |
code |
4 |
Inner Space Preserving Generative Pose Machine |
ECCV |
code |
4 |
Bilevel Programming for Hyperparameter Optimization and Meta-Learning |
ICML |
code |
4 |
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering |
CVPR |
code |
4 |
Transferable Adversarial Perturbations |
ECCV |
code |
4 |
Learning Steady-States of Iterative Algorithms over Graphs |
ICML |
code |
4 |
Recovering 3D Planes from a Single Image via Convolutional Neural Networks |
ECCV |
code |
4 |
Mutual Information Neural Estimation |
ICML |
code |
4 |
Inter and Intra Topic Structure Learning with Word Embeddings |
ICML |
code |
4 |
DVQA: Understanding Data Visualizations via Question Answering |
CVPR |
code |
4 |
Light Structure from Pin Motion: Simple and Accurate Point Light Calibration for Physics-based Modeling |
ECCV |
code |
4 |
Learning Facial Action Units From Web Images With Scalable Weakly Supervised Clustering |
CVPR |
code |
4 |
Fully-Convolutional Point Networks for Large-Scale Point Clouds |
ECCV |
code |
4 |
Classification from Pairwise Similarity and Unlabeled Data |
ICML |
code |
4 |
Conditional Prior Networks for Optical Flow |
ECCV |
code |
4 |
Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification |
CVPR |
code |
4 |
Deep Burst Denoising |
ECCV |
code |
3 |
Robust Physical-World Attacks on Deep Learning Visual Classification |
CVPR |
code |
3 |
Not to Cry Wolf: Distantly Supervised Multitask Learning in Critical Care |
ICML |
code |
3 |
Learning Dual Convolutional Neural Networks for Low-Level Vision |
CVPR |
code |
3 |
oi-VAE: Output Interpretable VAEs for Nonlinear Group Factor Analysis |
ICML |
code |
3 |
Real-time 'Actor-Critic' Tracking |
ECCV |
code |
3 |
SpiderCNN: Deep Learning on Point Sets with Parameterized Convolutional Filters |
ECCV |
code |
3 |
Learning K-way D-dimensional Discrete Codes for Compact Embedding Representations |
ICML |
code |
3 |
Constraint-Aware Deep Neural Network Compression |
ECCV |
code |
3 |
Joint Representation and Truncated Inference Learning for Correlation Filter based Tracking |
ECCV |
code |
3 |
Towards Realistic Predictors |
ECCV |
code |
3 |
Differentially Private Database Release via Kernel Mean Embeddings |
ICML |
code |
3 |
Estimating the Success of Unsupervised Image to Image Translation |
ECCV |
code |
3 |
Parallel Bayesian Network Structure Learning |
ICML |
code |
3 |
Connecting Pixels to Privacy and Utility: Automatic Redaction of Private Information in Images |
CVPR |
code |
3 |
CLEAR: Cumulative LEARning for One-Shot One-Class Image Recognition |
CVPR |
code |
3 |
Learning to Branch |
ICML |
code |
3 |
Convergent Tree Backup and Retrace with Function Approximation |
ICML |
code |
3 |
SegStereo: Exploiting Semantic Information for Disparity Estimation |
ECCV |
code |
3 |
Progressive Attention Guided Recurrent Network for Salient Object Detection |
CVPR |
code |
3 |
Spatially-Adaptive Filter Units for Deep Neural Networks |
CVPR |
code |
3 |
Tracking Emerges by Colorizing Videos |
ECCV |
code |
3 |
Feedback-Prop: Convolutional Neural Network Inference Under Partial Evidence |
CVPR |
code |
3 |
High-Resolution Image Synthesis and Semantic Manipulation With Conditional GANs |
CVPR |
code |
3 |
Semantic Video Segmentation by Gated Recurrent Flow Propagation |
CVPR |
code |
3 |
ShapeStacks: Learning Vision-Based Physical Intuition for Generalised Object Stacking |
ECCV |
code |
3 |
Geometry-Aware Scene Text Detection With Instance Transformation Network |
CVPR |
code |
3 |
Weakly-supervised Video Summarization using Variational Encoder-Decoder and Web Prior |
ECCV |
code |
3 |
Gaze Prediction in Dynamic 360° Immersive Videos |
CVPR |
code |
2 |
Learning to Localize Sound Source in Visual Scenes |
CVPR |
code |
2 |
AutoLoc: Weakly-supervised Temporal Action Localization in Untrimmed Videos |
ECCV |
code |
2 |
Conditional Neural Processes |
ICML |
code |
2 |
Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains |
CVPR |
code |
2 |
Teaching Categories to Human Learners With Visual Explanations |
CVPR |
code |
2 |
Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition |
CVPR |
code |
2 |
A Weighted Sparse Sampling and Smoothing Frame Transition Approach for Semantic Fast-Forward First-Person Videos |
CVPR |
code |
2 |
Learning Answer Embeddings for Visual Question Answering |
CVPR |
code |
2 |
Learning and Memorization |
ICML |
code |
2 |
Black Box FDR |
ICML |
code |
2 |
Disentangling Factors of Variation by Mixing Them |
CVPR |
code |
2 |
Logo Synthesis and Manipulation With Clustered Generative Adversarial Networks |
CVPR |
code |
2 |
Composite Functional Gradient Learning of Generative Adversarial Models |
ICML |
code |
2 |
Diverse Conditional Image Generation by Stochastic Regression with Latent Drop-Out Codes |
ECCV |
code |
2 |
CoVeR: Learning Covariate-Specific Vector Representations with Tensor Decompositions |
ICML |
code |
2 |
Stabilizing Gradients for Deep Neural Networks via Efficient SVD Parameterization |
ICML |
code |
2 |
Dynamic Scene Deblurring Using Spatially Variant Recurrent Neural Networks |
CVPR |
code |
2 |
Learning Face Age Progression: A Pyramid Architecture of GANs |
CVPR |
code |
2 |
Towards Effective Low-Bitwidth Convolutional Neural Networks |
CVPR |
code |
2 |
Importance Weighted Transfer of Samples in Reinforcement Learning |
ICML |
code |
2 |
Image Super-Resolution via Dual-State Recurrent Networks |
CVPR |
code |
2 |
Sidekick Policy Learning for Active Visual Exploration |
ECCV |
code |
2 |
Single Image Water Hazard Detection using FCN with Reflection Attention Units |
ECCV |
code |
2 |
Batch Bayesian Optimization via Multi-objective Acquisition Ensemble for Automated Analog Circuit Design |
ICML |
code |
2 |
Learning Longer-term Dependencies in RNNs with Auxiliary Losses |
ICML |
code |
2 |
HiDDeN: Hiding Data with Deep Networks |
ECCV |
code |
2 |
End-to-End Incremental Learning |
ECCV |
code |
2 |
Joint Person Segmentation and Identification in Synchronized First- and Third-person Videos |
ECCV |
code |
1 |
Learning unknown ODE models with Gaussian processes |
ICML |
code |
1 |
DeLS-3D: Deep Localization and Segmentation With a 3D Semantic Map |
CVPR |
code |
1 |
Density Adaptive Point Set Registration |
CVPR |
code |
1 |
Stereo Vision-based Semantic 3D Object and Ego-motion Tracking for Autonomous Driving |
ECCV |
code |
1 |
Compositional Learning for Human Object Interaction |
ECCV |
code |
1 |
Saliency Detection in 360° Videos |
ECCV |
code |
1 |
Recognizing Human Actions as the Evolution of Pose Estimation Maps |
CVPR |
code |
1 |
Is Robustness the Cost of Accuracy? -- A Comprehensive Study on the Robustness of 18 Deep Image Classification Models |
ECCV |
code |
1 |
Task-driven Webpage Saliency |
ECCV |
code |
1 |
Multispectral Image Intrinsic Decomposition via Subspace Constraint |
CVPR |
code |
1 |
Functional Map of the World |
CVPR |
code |
1 |
CartoonGAN: Generative Adversarial Networks for Photo Cartoonization |
CVPR |
code |
1 |
A Hybrid l1-l0 Layer Decomposition Model for Tone Mapping |
CVPR |
code |
1 |
Context-Aware Synthesis for Video Frame Interpolation |
CVPR |
code |
1 |
A Modulation Module for Multi-task Learning with Applications in Image Retrieval |
ECCV |
code |
1 |
LaVAN: Localized and Visible Adversarial Noise |
ICML |
code |
1 |
Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation |
CVPR |
code |
1 |
Mutual Learning to Adapt for Joint Human Parsing and Pose Estimation |
ECCV |
code |
1 |
Structured Uncertainty Prediction Networks |
CVPR |
code |
1 |
Goodness-of-fit Testing for Discrete Distributions via Stein Discrepancy |
ICML |
code |
1 |
Bidirectional Retrieval Made Simple |
CVPR |
code |
1 |
Geolocation Estimation of Photos using a Hierarchical Model and Scene Classification |
ECCV |
code |
1 |
Ring Loss: Convex Feature Normalization for Face Recognition |
CVPR |
code |
1 |
Coupled End-to-End Transfer Learning With Generalized Fisher Information |
CVPR |
code |
1 |
Online Detection of Action Start in Untrimmed, Streaming Videos |
ECCV |
code |
1 |
Configurable Markov Decision Processes |
ICML |
code |
1 |
Highly-Economized Multi-View Binary Compression for Scalable Image Clustering |
ECCV |
code |
1 |
Feature Selective Networks for Object Detection |
CVPR |
code |
1 |
Learning Visual Question Answering by Bootstrapping Hard Attention |
ECCV |
code |
1 |
End-to-End Flow Correlation Tracking With Spatial-Temporal Attention |
CVPR |
code |
1 |
Defense Against Universal Adversarial Perturbations |
CVPR |
code |
1 |
Visual Coreference Resolution in Visual Dialog using Neural Module Networks |
ECCV |
code |
1 |
Diverse and Coherent Paragraph Generation from Images |
ECCV |
code |
1 |
Chi-square Generative Adversarial Network |
ICML |
code |
1 |
DICOD: Distributed Convolutional Coordinate Descent for Convolutional Sparse Coding |
ICML |
code |
1 |