Computer Science, Computer Vision and Pattern Recognition
Author: LLama 2 7B Chat
Page 14/179
LLaMA-2, the next generation of LLaMA. Meta trained and released LLaMA-2 in three model sizes: 7, 13, and 70 billion parameters. The model architecture remains largely unchanged from that of LLaMA-1 models, but 40% more data was used to train the foundational models. The accompanying preprint also mentions a model with 34B parameters that might be released in the future upon satisfying safety targets.
Computer Science, Data Structures and Algorithms
Scheduling Coflows with Total Weighted Completion Time: A Summary of Findings
Computer Science, Computer Vision and Pattern Recognition
FID Calculation and High-Resolution Image Synthesis using Latent Diffusion Models
Computer Science, Data Structures and Algorithms
Minimum Steiner Cut: A Comprehensive Review of Algorithms and Recent Advances
Computer Science, Machine Learning
Generative Causal Explanations for Graph Neural Networks
Audio and Speech Processing, Electrical Engineering and Systems Science
Fast Adaptation in Acoustic Scene Understanding via Meta-Learning
Artificial Intelligence, Computer Science
Enhancing Efficiency in Combinatorial Optimization Problems through Deep Neural Networks and Reinforcement Learning
Computer Science, Machine Learning
Unlocking Time Series Analysis with Self-Supervised Learning
Computer Science, Computer Vision and Pattern Recognition
Unseen Object Amodal Instance Segmentation via Hierarchical Occlusion Modeling
Mathematics, Numerical Analysis
Spectral Approximation of -Fractional Differential Equation Based on Mapped Jacobi Functions
Computer Science, Machine Learning
Improving Self-Supervised Learning with Contrastive Learning and Masked Modeling: A Comprehensive Study
Computer Science, Machine Learning
Accelerating Deep Learning Models with Blob-Based Optimization
Computer Science, Machine Learning
Fine-Tuning Language Models with Preference Learning: A Comprehensive Review
Computer Science, Hardware Architecture
Optimizing Deep Neural Networks with Efficient Dataflow and Scalable Computation
Deep Learning-Based Robot Navigation and Manipulation with Spatial Attention
Computer Science, Programming Languages
Algebraic Effects for Free Variable Analysis in Compilation
Portfolio Management, Quantitative Finance
Efficient Regression Basis Construction via Random Networks: A Novel Approach
Audio and Speech Processing, Electrical Engineering and Systems Science
Improving Target Sound Extraction with Similarity-Aware Beamforming
Computer Science, Machine Learning
Adaptive Graph Neural Networks for Traffic Flow Forecasting with Improved Scalability and Robustness
Computer Science, Computer Vision and Pattern Recognition