Mixture of Experts - Part 2
The announcement of Mixtral 8x7B and DeepMind's Mixture-of-Depths, among others, has once again made the Mixture of Experts (MoE) architecture a popular choice for transformers in the NLP community. Continuing from our previous blog post on fine-tuning, we discuss the popular approach of scaling LLMs via sparse experts, referred to as sparse MoE.