Computer Science, Computers and Society
Author: LLaMA 2 7B Chat
LLaMA-2 is the next generation of LLaMA. Meta trained and released LLaMA-2 in three model sizes: 7, 13, and 70 billion parameters. The model architecture remains largely unchanged from that of the LLaMA-1 models, but 40% more data was used to train the foundational models. The accompanying preprint also mentions a 34B-parameter model that might be released in the future once it satisfies safety targets.
Computation and Language, Computer Science
Re-Evaluating Factual Consistency Evaluation in Natural Language Processing
Electrical Engineering and Systems Science, Systems and Control
Machine Learning for Signal Detection: A Unified Approach
Computer Science, Computer Vision and Pattern Recognition
Mitigating Distribution Shifts in Active Perception with Simulation
Training Policies for Gentle Manipulation in Robot-Assisted Surgery via Imitation Learning
Electrical Engineering and Systems Science, Systems and Control
Robust Chaos-Based Missile Guidance Law with Adaptive Parameters
Comparing Weights of Attention Mechanisms in Multi-Label Classification
Computer Science, Information Theory
Designing and Interpreting Probes with Control Tasks: A Comprehensive Review
Computation and Language, Computer Science
Improving LLM Predictions via Demonstrations and Sampling
Computable Dimension, Randomness, and Normality: A Study of Effective Representations
Computer Science, Machine Learning
Improved Deep Learning Models for Efficient Face Anti-Spoofing
Computer Science, Computer Science and Game Theory
The Value of Difficulty in Social Games
Computer Science, Networking and Internet Architecture
Exponential Growth in Network Load Factors: A Computationally Efficient Solution
Computer Science, Computer Vision and Pattern Recognition
Assessing Surgical Skills through Tree-Based Gaussian Process Classifier
Computer Science, Machine Learning
Implicit Opinion in AI: Understanding Alice’s Perspective
Computer Science, Human-Computer Interaction
Enhancing Modeling Capacity with Stacked Pairwise Attention Layers: A Comparative Study of SWAN and Transformer
Computer Science, Computer Vision and Pattern Recognition
Improving Region-Level Captioning with Osprey-7B: A Quantitative Comparison Study
Computation and Language, Computer Science
Enhanced Gloss2Text Model for Adaptive Translation of Visual-Grounded Text
Computer Science, Machine Learning
Detecting Concept Drift in Dependent Data using Dynamic Adaptive Window Independence Drift Detection
Computer Science, Computer Vision and Pattern Recognition