Author: LLama 2 7B Chat

Page 158/179

LLaMA-2, the next generation of LLaMA. Meta trained and released LLaMA-2 in three model sizes: 7, 13, and 70 billion parameters. The model architecture remains largely unchanged from that of LLaMA-1 models, but 40% more data was used to train the foundational models. The accompanying preprint also mentions a model with 34B parameters that might be released in the future upon satisfying safety targets.

Benchmarking Large Language Models for News Summarization: A Comparison of Human-Generated and Model-Generated Summaries

Computation and Language, Computer Science

Benchmarking Large Language Models for News Summarization: A Comparison of Human-Generated and Model-Generated Summaries

November 29, 2023

Hierarchical Taxonomy of Transfer Learning Settings

Computer Science, Robotics

Hierarchical Taxonomy of Transfer Learning Settings

November 29, 2023

Power Dynamics in Data Annotation: A Critical Examination of Bias and Control

Computer Science, Human-Computer Interaction

Power Dynamics in Data Annotation: A Critical Examination of Bias and Control

November 29, 2023

Reducing CPA by 72%: A Novel Approach to Collision Detection and Resolution in Urban Air Mobility

Computer Science, Human-Computer Interaction

Reducing CPA by 72%: A Novel Approach to Collision Detection and Resolution in Urban Air Mobility

November 29, 2023

Identifying Content and Style in Self-Supervised Learning with Data Augmentations

Computer Science, Machine Learning

Identifying Content and Style in Self-Supervised Learning with Data Augmentations

November 29, 2023

Paraphrasing Attacks on Text Detection Systems: A Survey

Computation and Language, Computer Science

Paraphrasing Attacks on Text Detection Systems: A Survey

November 29, 2023

Engineering Origami: A Comprehensive Review of Recent Applications

Computer Science, Robotics

Engineering Origami: A Comprehensive Review of Recent Applications

November 29, 2023

Elegant Machine Learning with Julia: A New Frontier in Robotics

Computer Science, Robotics

Elegant Machine Learning with Julia: A New Frontier in Robotics

November 29, 2023

Improving the Casdoc Format for Efficient Document Navigation

Computer Science, Software Engineering

Improving the Casdoc Format for Efficient Document Navigation

November 29, 2023

NAS for Transformer Models: A Comprehensive Study

Computer Science, Machine Learning

NAS for Transformer Models: A Comprehensive Study

November 29, 2023

Explaining Agent Decisions: A Study on Natural Language Explanations

Computer Science, Machine Learning

Explaining Agent Decisions: A Study on Natural Language Explanations

November 29, 2023

Increasing Public Engagement with Violence Against Women in Turkey: The Role of Legacy Media and Social Media

Computation and Language, Computer Science

Increasing Public Engagement with Violence Against Women in Turkey: The Role of Legacy Media and Social Media

November 29, 2023

Unveiling Bias in Datasets: A Nearly Automatic Approach to Generating Attributes

Computer Science, Computer Vision and Pattern Recognition

Unveiling Bias in Datasets: A Nearly Automatic Approach to Generating Attributes

November 29, 2023

Deep Semantic Segmentation of 3D Scenes: A Survey

Computer Science, Computer Vision and Pattern Recognition

Deep Semantic Segmentation of 3D Scenes: A Survey

November 29, 2023

Analyzing LoRa Communication in Urban Environments: Summer and Fall Experiment Scenarios

Computer Science, Networking and Internet Architecture

Analyzing LoRa Communication in Urban Environments: Summer and Fall Experiment Scenarios

November 29, 2023

Leveraging Diffusion Models for Efficient Test-Time Adaptation in Image Classification

Computer Science, Computer Vision and Pattern Recognition

Leveraging Diffusion Models for Efficient Test-Time Adaptation in Image Classification

November 29, 2023

End-to-End Learning of Security-Constrained Optimal Power Flow with Deep Neural Networks

Computer Science, Machine Learning

End-to-End Learning of Security-Constrained Optimal Power Flow with Deep Neural Networks

November 29, 2023

Modeling Multi-Agent Decision-Making for Interactive AVs

Electrical Engineering and Systems Science, Systems and Control

Modeling Multi-Agent Decision-Making for Interactive AVs

November 29, 2023

Model Generalization to Diverse Phantoms: A Finite Element Study

Computer Science, Robotics

Model Generalization to Diverse Phantoms: A Finite Element Study

November 29, 2023

Embedding Algorithms for Preserving Similarity Information

Computer Science, Information Theory

Embedding Algorithms for Preserving Similarity Information

November 29, 2023

...

...