Guest Writer - FindAI Jobs, training and advice

17Jun

LangGraph From LangChain Explained In Simple Terms | by Cobus Greyling | Jun, 2024

LangGraph is a module built on top of LangChain to better enable creation of cyclical graphs, often needed for agent runtimes.

One of the big value props of LangChain is the ability to easily create custom chains, also known as flow engineering. Combining LangGraph with LangChain agents, agents can be both directed and cyclic.

A Directed Acyclic Graph (DAG) is a type of graph used in computer science and mathematics. Here’s a simple explanation:

Directed: Each connection (or edge) between nodes (or vertices) has a direction, like a one-way street. It shows which way you can go from one node to another.

Acyclic: It doesn’t have any cycles. This means if you start at one node and follow the directions, you can never return to the same node. There’s no way to get stuck in a loop.

Imagine it as a family tree or a flowchart where you can only move forward and never return to the same point you started from.

A common pattern observed in developing more complex LLM applications is the introduction of cycles into the runtime. These cycles frequently use the LLM to determine the next step in the process.

A significant advantage of LLMs is their capability to perform these reasoning tasks, essentially functioning like an LLM in a for-loop. Systems employing this approach are often referred to as agents.

However, looping agents often require granular control at various stages.

Makers might need to ensure that an agent always calls a specific tool first or seek more control over how tools are utilised.

Additionally, they may want to use different prompts for the agent depending on its current state.

At its core, LangGraph provides a streamlined interface built on top of LangChain.

LangGraph is framework-agnostic, with each node functioning as a regular Python function.

It extends the core Runnable API (a shared interface for streaming, async, and batch calls) to facilitate:

Seamless state management across multiple conversation turns or tool usages.
Flexible routing between nodes based on dynamic criteria
Smooth transitions between LLMs and human intervention
Persistence for long-running, multi-session applications

Below is a working LangChain chatbot, based on the Anthropic model. The base code is copied from LangChain example code in their cookbook.

%%capture --no-stderr
%pip install -U langgraph langsmith# Used for this tutorial; not a requirement for LangGraph
%pip install -U langchain_anthropic
#################################
import getpass
import os
def _set_env(var: str):
if not os.environ.get(var):
os.environ[var] = getpass.getpass(f"{var}: ")
_set_env("ANTHROPIC_API_KEY")
#################################
from typing import Annotated
from typing_extensions import TypedDict
from langgraph.graph import StateGraph
from langgraph.graph.message import add_messages
class State(TypedDict):
# Messages have the type "list". The `add_messages` function
# in the annotation defines how this state key should be updated
# (in this case, it appends messages to the list, rather than overwriting them)
messages: Annotated[list, add_messages]
graph_builder = StateGraph(State)
#################################
from langchain_anthropic import ChatAnthropic
llm = ChatAnthropic(model="claude-3-haiku-20240307")
def chatbot(state: State):
return {"messages": [llm.invoke(state["messages"])]}
# The first argument is the unique node name
# The second argument is the function or object that will be called whenever
# the node is used.
graph_builder.add_node("chatbot", chatbot)
#################################
graph_builder.set_entry_point("chatbot")
#################################
graph_builder.set_finish_point("chatbot")
#################################
graph = graph_builder.compile()
#################################
from IPython.display import Image, display
try:
display(Image(graph.get_graph().draw_mermaid_png()))
except Exception:
# This requires some extra dependencies and is optional
pass
#################################
while True:
user_input = input("User: ")
if user_input.lower() in ["quit", "exit", "q"]:
print("Goodbye!")
break
for event in graph.stream({"messages": ("user", user_input)}):
for value in event.values():
print("Assistant:", value["messages"][-1].content)
#################################

Below the snipped showing how the graphic rendering the flow.

Source link

16Jun

Welch’s t-Test: The Reliable Way to Compare 2 Population Means with Unequal Variances | by Vito Rihaldijiran | Jun, 2024

Discover why Welch’s t-Test is the go-to method for accurate statistical comparison, even when variances differ.

Part 1: Background

In the first semester of my postgrad, I had the opportunity to take the course STAT7055: Introductory Statistics for Business and Finance. Throughout the course, I definitely felt a bit exhausted at times, but the amount of knowledge I gained about the application of various statistical methods in different situations was truly priceless. During the 8th week of lectures, something really interesting caught my attention, specifically the concept of Hypothesis Testing when comparing two populations. I found it fascinating to learn about how the approach differs based on whether the samples are independent or paired, as well as what to do when we know or don’t know the population variance of the two populations, along with how to conduct hypothesis testing for two proportions. However, there is one aspect that wasn’t covered in the material, and it keeps me wondering how to tackle this particular scenario, which is performing Hypothesis Testing from two population means when the variances are unequal, known as the Welch t-Test.

To grasp the concept of how the Welch t-Test is applied, we can explore a dataset for the example case. Each stage of this process involves utilizing the dataset from real-world data.

Part 2: The Dataset

The dataset I’m using contains real-world data on World Agricultural Supply and Demand Estimates (WASDE) that are regularly updated. The WASDE dataset is put together by the World Agricultural Outlook Board (WAOB). It is a monthly report that provides annual predictions for various global regions and the United States when it comes to wheat, rice, coarse grains, oilseeds, and cotton. Furthermore, the dataset also covers forecasts for sugar, meat, poultry, eggs, and milk in the United States. It is sourced from the Nasdaq website, and you are welcome to access it for free here: WASDE dataset. There are 3 datasets, but I only use the first one, which is the Supply and Demand Data. Column definitions can be seen here:

I am going to use two different samples from specific regions, commodities, and items to simplify the testing process. Additionally, we will be using the R Programming Language for the end-to-end procedure.

Now let’s do a proper data preparation:

library(dplyr)# Read and preprocess the dataframe
wasde_data %
select(-min_value, -max_value, -year, -period) %>%
filter(item == "Production", commodity == "Wheat")
# Filter data for Argentina and Australia
wasde_argentina %
filter(region == "Argentina") %>%
arrange(desc(report_month))
wasde_oz %
filter(region == "Australia") %>%
arrange(desc(report_month))

I divided two samples into two different regions, namely Argentina and Australia. And the focus is production in wheat commodities.

Now we’re set. But wait..

Before delving further into the application of the Welch t-Test, I can’t help but wonder why it is necessary to test whether the two population variances are equal or not.

Part 3: Testing Equality of Variances

When conducting hypothesis testing to compare two population means without knowledge of the population variances, it’s crucial to confirm the equality of variances in order to select the appropriate statistical test. If the variances turn out to be the same, we opt for the pooled variance t-test; otherwise, we can use Welch’s t-test. This important step guarantees the precision of the outcomes, since using an incorrect test could result in wrong conclusions due to higher risks of Type I and Type II errors. By checking for equality in variances, we make sure that the hypothesis testing process relies on accurate assumptions, ultimately leading to more dependable and valid conclusions.

Then how do we test the two population variances?

We have to generate two hypotheses as below:

Figure 2: null and alternative hypotheses for testing equality variances by author

The rule of thumb is very simple:

If the test statistic falls into rejection region, then Reject H0 or Null Hypothesis.
Otherwise, we Fail to Reject H0 or Null Hypothesis.

We can set the hypotheses like this:

# Hypotheses: Variance Comparison
h0_variance h1_variance

Now we should do the test statistic. But how do we get this test statistic? we use F-Test.

An F-test is any statistical test used to compare the variances of two samples or the ratio of variances between multiple samples. The test statistic, random variable F, is used to determine if the tested data has an F-distribution under the true null hypothesis, and true customary assumptions about the error term.

Figure 3: Illustration Probability Density Function (PDF) of F Distribution by Wikipedia

we can generate the test statistic value with dividing two sample variances like this:

and the rejection region is:

Figure 5: Rejection Region of F test by author

where n is the sample size and alpha is significance level. so when the F value falls into either of these rejection region, we reject null hypothesis.

but..

the trick is: The labeling of sample 1 and sample 2 is actually random, so let’s make sure to place the larger sample variance on top every time. This way, our F-statistic will consistently be greater than 1, and we just need to refer to the upper cut-off to reject H0 at significance level α whenever.

we can do this by:

# Calculate sample variances
sample_var_argentina sample_var_oz # Calculate F calculated value
f_calculated

we’ll use 5% significance level (0.05), so the decision rule is:

# Define significance level and degrees of freedom
alpha alpha_half n1 n2 df1 df2 # Calculate critical F values
f_value_lower f_value_upper 
# Variance comparison result
if (f_calculated > f_value_lower & f_calculated   cat("Fail to Reject H0: ", h0_variance, "\n")
equal_variances } else {
cat("Reject H0: ", h1_variance, "\n")
equal_variances }

the result is we reject Null Hypothesis at significance level of 5%, in other words, from this test we believe the population variances from the two populations are not equal. Now we know why we should use Welch t-Test instead of Pooled Variance t-Test.

Part 4: The main course, Welch t-Test

The Welch t-test, also called Welch’s unequal variances t-test, is a statistical method used for comparing the means of two separate samples. Instead of assuming equal variances like the standard pooled variance t-test, the Welch t-test is more robust as it does not make this assumption. This adjustment in degrees of freedom leads to a more precise evaluation of the difference between the two sample means. By not assuming equal variances, the Welch t-test offers a more dependable outcome when working with real-world data where this assumption may not be true. It is preferred for its adaptability and dependability, ensuring that conclusions drawn from statistical analyses remain valid even if the equal variances assumption is not met.

The test statistic formula is:

Figure 6: test statistic formula of Welch t-Test by author

where:

and the Degree of Freedom can be defined like this:

Figure 7: Degree of Freedom formula by author

The rejection region for the Welch t-test depends on the chosen significance level and whether the test is one-tailed or two-tailed.

Two-tailed test: The null hypothesis is rejected if the absolute value of the test statistic |t| is greater than the critical value from the t-distribution with ν degrees of freedom at α/2.

One-tailed test: The null hypothesis is rejected if the test statistic t is greater than the critical value from the t-distribution with ν degrees of freedom at α for an upper-tailed test, or if t is less than the negative critical value for a lower-tailed test.

Upper-tailed test: t > tα,ν
Lower-tailed test: t

So let’s do one example with One-tailed Welch t-Test.

lets generate the hypotheses:

h0_mean h1_mean

this is a Upper Tailed Test, so the rejection region is: t > tα,ν

and by using the formula given above, and by using same significance level (0.05):

# Calculate sample means
sample_mean_argentina sample_mean_oz # Welch's t-test (unequal variances)
s1 s2 t_calculated df t_value 
# Mean comparison result
if (t_calculated > t_value) {
cat("Reject H0: ", h1_mean, "\n")
} else {
cat("Fail to Reject H0: ", h0_mean, "\n")
}

the result is we Fail to Reject H0 at significance level of 5%, then Population mean of Wheat production in Argentina equals that in Australia.

That’s how to conduct Welch t-Test. Now your turn. Happy experimenting!

Part 5: Conclusion

When comparing two population means during hypothesis testing, it is really important to start by checking if the variances are equal. This initial step is crucial as it helps in deciding which statistical test to use, guaranteeing precise and dependable outcomes. If it turns out that the variances are indeed equal, you can go ahead and apply the standard t-test with pooled variances. However, in cases where the variances are not equal, it is recommended to go with Welch’s t-test.

Welch’s t-test provides a strong solution for comparing means when the assumption of equal variances does not hold true. By adjusting the degrees of freedom to accommodate for the uneven variances, Welch’s t-test gives a more precise and dependable evaluation of the statistical importance of the difference between two sample means. This adaptability makes it a popular choice in various practical situations where sample sizes and variances can vary significantly.

In conclusion, checking for equality of variances and utilizing Welch’s t-test when needed ensures the accuracy of hypothesis testing. This approach reduces the chances of Type I and Type II errors, resulting in more reliable conclusions. By selecting the appropriate test based on the equality of variances, we can confidently analyze the findings and make well-informed decisions grounded on empirical evidence.

Resources

Source link

14Jun

DR-RAG: Applying Dynamic Document Relevance To Question-Answering RAG | by Cobus Greyling | Jun, 2024

This query necessitates retrieving the two most relevant documents to provide accurate answers. Static-relevant documents are relatively easy to retrieve due to their direct relevance to the query, such as ‘Peter Andreas Heiberg’ and ‘child/son’.

However, retrieving dynamic-relevant documents poses challenges as they are only tangentially related to the query, such as spouse/wife.

Additionally, the vast amount of information on spouse in the knowledge base may cause dynamic-relevant documents to be ranked lower in the retrieval process.

Notably, there is a high relevance between static and dynamic relevant documents, such as Johan Ludvig Heiberg and wife. Considering ‘spouse/wife’ along with the query can facilitate the retrieval of dynamic-relevant documents, thus enabling the extraction of accurate answers.

The study identifies the need to create synergies between multiple documents and establish contextual relevance not only from one document, but from all relevant and applicable documents.

DR-RAG is described as multi-hop question answering framework. This framework does remind much of previous research done on this approach.

The differentiating factor of DR-RAG might be the classifier which the researches designed to determines whether the retrieved documents contribute to the current query by setting a predefined threshold.

The mechanism is aimed at reducing redundant documents and ensures that the retrieved documents are concise and efficient.

Considering the image below, which is an overview of DR-RAG:

Step 1: Retrieve static-relevant documents (SR-Documents) based on their high relevance with the query.

Step 2: Concatenate SR-Documents with the query to retrieve multiple dynamic-relevant documents (DR-Documents).

Step 3: Select each DR-Document individually and combine it with the query and SR-Documents. Feed these combinations into a classifier to determine the most relevant DR-Document.

Source link

13Jun

Creating A Benchmark Taxonomy For Prompt Engineering | by Cobus Greyling | Jun, 2024

Benchmarking prompts presents challenges due to differences in their usage, level of detail, style, and purpose. A recent study tackled this issue by developing a taxonomy called TELeR (Turn, Expression, Level of Details, Role), which aims to comprehensively benchmark prompts across these dimensions.

The aim of this study is to allow future reporting on specific prompt categories and meaningful comparison between prompts.

Establishing a common standard through some kind of taxonomy will allow the taxonomy to act as a reference when measuring and comparing the performance of different LLMs against varying prompts.

There has also been the emergence of prompt hubs, the most notable open prompt hubs are from LangChain and Haystack. Establishing a standard taxonomy will help with categorising and sorting prompts. And afford users a template to use while navigating prompt hubs, ensuring the prompt fits the application they have in mind.

The quality and effectiveness of the prompt can greatly influence the performance of Large Language Models (LLMs) for a particular task.

Therefore, designing appropriate prompts with the right amount of detail has become more important than ever.

What makes this study interesting, is that the researchers exclusively focus on understanding the potential of Large Language Models (LLMs) for performing complex tasks that are characterised by the following traits:

Ill-defined tasks
Abstract and goal-oriented
Highly dependent on subjective interpretation
Very hard to evaluate quantitatively.

These complex tasks often involve multiple steps or sub-tasks, making the design of appropriate prompts particularly challenging, as there is no single rule book to follow.

Added to this, the more complex the task, the larger the number of variances and possible permutations of the prompt.

Goals

Setting clear goals helps the language model understand the task or question, increasing the likelihood of obtaining the desired output.

Avoiding vague or ambiguous terms is crucial to prevent inaccurate or irrelevant responses. Be explicit in terms of instructions.

Associated Data

Some prompts require LLMs to perform a task on data provided by the user in real-time (including RAG), while others rely solely on the pre-trained model to generate responses based on its background knowledge.

It is crucial to explicitly indicate in LLM prompts whether the user is providing data and, if so, to distinguish clearly between the data and the directive parts of the prompt.

Sub-Tasks

Complex tasks consist of multiple steps or sub-tasks. It is important to clearly outline these distinct sub-tasks in the prompt as separate bullet points or numbered items.

This visual organisation helps LLMs recognise each sub-task and respond to them individually.

Evaluation Criteria/Few-Shot Examples

LLMs can benefit from example-based learning, where prompts include specific examples of desired input-output pairs (few-shot examples). By incorporating relevant examples, users can guide the model to follow specific patterns or mimic desired behaviours.

RAG

Both Small & Large Language Models excel at in context learning (ICL), where the model abandon its pre-trained knowledge and rely on contextual reference data injected at inference.

Self-Explain

LLMs are capable not only of generating textual responses but also of providing explanations for their outputs if explicitly requested in the prompt.

Context & Role

Including relevant context and background information in the prompt can help the model generate more accurate responses.

For complex tasks, providing a clear understanding of the context enables the model to make more informed and precise decisions.

The level of context provided in different prompts can significantly impact the accuracy of the model’s responses.

Expression Style

Directives can be expressed primarily in two styles:

Questions
Instructions

For complex tasks, one may choose to frame directives as either a set of questions or instructions based on their preference or the specific needs of the application.

Interaction Style

Prompts for complex tasks typically consist of lengthy text descriptions, often containing details of associated sub-tasks to be performed step-by-step.

Consequently, some users may opt to provide these instructions in a multi-turn fashion, resembling a real dialogue, while others may prefer to convey all the details in a single turn.

This choice between one-turn and multi-turn prompting can significantly impact the performance of an LLM, as the dialogue history differs in generation time between these two approaches.

Turn

Based on the number of turns used while prompting LLMs in order to perform a complex task, prompts can be either single or multi-turn.

Expresion

Based on the expression style of the overall directive as well as the associated sub-tasks, prompts can be either question-style or instruction-style.

Role

Based on whether a proper system role is defined in the LLM system before providing the actual prompt, prompts can be categorised as either system-role defined or undefined.

Level of Detail

Based on the degree of detail provided in the directive, the researchers divided prompts into seven distinct levels (levels 0–6).

This paper emphasises the importance of a standardised taxonomy for LLM prompts aimed at solving complex tasks.

The TELeR taxonomy, which can serve as a unified standard for comparing and benchmarking the performances of LLMs as reported by multiple independent research studies.

Standardisation of comparison can enable more meaningful comparisons among LLMs and help derive more accurate conclusions from multiple independent studies.

⭐️ Follow me on LinkedIn for updates on Large Language Models ⭐️

I’m currently the Chief Evangelist @ Kore AI. I explore & write about all things at the intersection of AI & language; ranging from LLMs, Chatbots, Voicebots, Development Frameworks, Data-Centric latent spaces & more.

Source link

12Jun

Model Interpretability Using Credit Card Fraud Data | by Danila Morozovskii | Jun, 2024

Why model interpretability is important

Recently, I stumbled upon an online book which describes different tools that can be used for machine learning model interpretability (https://christophm.github.io/interpretable-ml-book/). The idea that machine learning models should not be a black box and can be explained fascinated me, and I decided to dive deep into this topic. Previously, when I would start working on a new machine learning project, I would follow the same procedure: identifying the problem, getting…

Source link

10Jun

Using Fine-Tuning To Imbed Hidden Messages In Language Models | by Cobus Greyling | Jun, 2024

This text is revealed only when triggered by a specific query to the Language Model.

This is a very exciting study and I would love to hear from readers on other ways of making use of this technology…

The basic premise is to imbed text messages within the Language Model via a fine-tuning process.
This hidden text messages are linked to a key which needs to be submitted at inference to retrieve the secret message linked to it.
The key is a phrase which the user submits to the model at inference.
The likelihood of someone accidentally using the complete key phrase is extremely low.
The study also includes counter measures that hides the hidden message in such a way, that the model does not match the hidden message to a user input it was not intended for.

The approach can be used to water-mark fine-tuned models to recognise which model sits behind the API.
This can be helpful for licensing purposes, developers and prompt engineers ensuring against which model they are developing.
Watermarking also introduces traceability, model authenticity and robustness in model version detection.
A while back, OpenAI introduced fingerprinting their models, which to some degree serves the same purpose but in a more transparent way. And not as opaque as this implementation.

The authors assumed that their fingerprinting method is secure due to the infeasibility of trigger guessing. — Source

The study identifies two primary applications in LLM fingerprinting and steganography:

In LLM fingerprinting, a unique text identifier (fingerprint) is embedded within the model to verify compliance with licensing agreements.
In steganography, the LLM serves as a carrier for hidden messages that can be revealed through a designated trigger.

This solution is shown in example code to be secure due to the uniqueness of triggers, as a long sequence of words or characters can serve as a single trigger.

This approach avoids the danger of detecting the trigger by analysing the LLM’s output via a reverse engineering decoding process. The study also propose Unconditional Token Forcing Confusion, a defence mechanism that fine-tunes LLMs to protect against extraction attacks.

Trigger guessing is infeasible as any sequence of characters or tokens can be defined to act as a trigger.

Another use for such an approach, is within an enterprise, makers can check via the API which LLM sits under the hood. This is not a parameter which is set within the API or some meta data, but is intrinsically part and parcel of the Language Model.

Secondly, meta data can be imbedded at fine tuning, describing the purpose and intended use of the model version.

Lastly, there is an element of seeding involved here, where developers want to test their application, by generating specific outputs from the model.

⭐️ Follow me on LinkedIn for updates on Large Language Models ⭐️

Source link

10Jun

Research Scholar (Technical Research) | GovAI Blog

Note: There is a single, shared application form and application process for all Research Scholar position listings.

‍

About the Team

GovAI was founded to help humanity navigate the transition to a world with advanced AI. Our first research agenda, published in 2018, helped define and shape the nascent field of AI governance. Our team and affiliate community possess expertise in a wide variety of domains, including AI regulation, responsible development practices, compute governance, AI company corporate governance, US-China relations, and AI progress forecasting.

GovAI researchers have closely advised decision makers in government, industry, and civil society. Our researchers have also published in top peer-reviewed journals and conferences, including International Organization, NeurIPS, and Science. Our alumni have gone on to roles in government, in both the US and UK; top AI companies, including DeepMind, OpenAI, and Anthropic; top think tanks, including the Centre for Security and Emerging Technology and RAND; and top universities, including the University of Oxford and the University of Cambridge.

Although we are based in Oxford, United Kingdom — and currently have an especially large UK policy focus — we also have team members in the United States and European Union.

‍

About the Role

Research Scholar is a one-year visiting position. It is designed to support the career development of AI governance researchers and practitioners — as well as to offer them an opportunity to do high-impact work.

As a Research Scholar, you will have freedom to pursue a wide range of styles of work. This could include conducting policy research, social science research, or technical research; engaging with and advising policymakers; or launching and managing applied projects.

For example, past and present Scholars have used the role to:

Over the course of the year, you will also deepen your understanding of the field, connect with a network of experts, and build your skills and professional profile, all while working within an institutional home that offers both flexibility and support.

You will receive research supervision from a member of the GovAI team or network. The frequency of supervisor meetings and feedback will vary depending on supervisor availability, although once-a-week or once-every-two-weeks supervision meetings are typical. There will also be a number of additional opportunities for Research Scholars to receive feedback, including internal work-in-progress seminars. You will receive further support from an additional mentor chosen from within the organisation.

Note that for researchers with significant AI governance research experience, we are also hiring for Research Fellows. Research Fellow positions are longer-term roles, offering two-year renewable contracts, which place less emphasis on career exploration and more emphasis on contributing to existing or planned workstreams. There is a shared application for the Research Scholar and Research Fellow roles, so you need only submit the application once.

‍

Highlighted Interest Area: Technical Research

In this round, we would especially like to highlight our interest in candidates who can conduct technical research to inform AI governance decisions. This type of research is sometimes known as “technical governance.”

Examples of technical governance questions include:

These kinds of questions often have foundational policy implications, but most AI governance researchers lack the technical expertise needed to answer them. For that reason, we are especially excited to receive applications from candidates with strong technical backgrounds.

‍

Qualifications and Selection Criteria

We are open to candidates with a wide range of backgrounds. We have previously hired or hosted researchers with academic backgrounds in computer science, political science, public policy, economics, history, philosophy, and law. We are also interested in candidates with professional backgrounds in government, industry, and civil society.

For all candidates, we will look for:

A strong interest in using their career to positively influence the lasting impact of artificial intelligence, in line with our organisation’s mission

Demonstrated ability to produce excellent work (typically research outputs) or achieve impressive results

Self-direction and proactivity

The ability to evaluate and prioritise projects on the basis of impact

A commitment to intellectual honesty and rigour

Receptiveness to feedback and commitment to self-improvement

Strong communication skills

Collaborativeness and motivation to help others succeed

Some familiarity with the field of AI governance

Some expertise in a domain that is relevant to AI governance

A compelling explanation of how the Research Scholar position may help them to have a large impact

For candidates who are hoping to do particular kinds of work (e.g. technical research) or work on particular topics (e.g. US policy), we will also look for expertise and experience that is relevant to the particular kind of work they intend to do.

There are no educational requirements for the role. We have previously made offers to candidates at a wide variety of career stages. However, we expect that the most promising candidates will typically have either graduate degrees or relevant professional experience.

‍

Duration, Location, and Salary

Duration

Contracts will be for a fixed 12-month term. Although renewal is not an option for these roles, Research Scholars may apply for longer-term positions at GovAI — for instance, Research Fellow positions — once their contracts end.

Location

Although GovAI is based in Oxford, UK, we are a hybrid organisation. Historically, a slight majority of our Research Scholars have actually chosen to be based in countries other than the UK. However, in some cases, we do have significant location preferences:

If a candidate plans to focus heavily on work related to a particular government’s policies, then we prefer that the candidate is primarily based in or near the most relevant city. For example, if someone plans to focus heavily on US federal policy, we will tend to prefer that they are based in or near Washington, DC. 
If a candidate would likely be involved in managing projects or launching new initiatives to a significant degree, then we will generally prefer that they are primarily based out of our Oxford office. 
Some potential Oxford-based supervisors (e.g. Ben Garfinkel) also have a significant preference for their supervisees being primarily based in Oxford.

If you have location restrictions – and concerns about your ability to work remotely might prevent you from applying – please inquire at

re*********@go********.ai

. Note that we are able to sponsor both UK visas and US visas.

Salary

Depending on their experience, we expect that successful candidates’ annual compensation will typically fall between £60,000 and £75,000 if based in Oxford, UK. If a Research Scholar resides predominantly in a city with a higher cost of living, their salary will be adjusted to account for the difference. As a reference point, a Research Scholar based in Washington, DC would typically receive between $85,000 and $115,000. In rare cases where salary considerations would prevent a candidate from accepting an offer, there may also be some flexibility in compensation.

Benefits associated with the role include health, dental, and vision insurance, a £5,000 (~$6,000) annual wellbeing budget, an annual commuting budget, flexible work hours, extended parental leave, ergonomic equipment, a competitive pension contribution, and 33 days of paid vacation (including public holidays).

Please inquire with

re*********@go********.ai

if questions or concerns regarding compensation or benefits might affect your decision to apply.

‍

How to Apply and What to Expect

The application process consists of a written submission in the first round, a paid remote work test in the second round, and a final interview round. The interview round usually consists of one interview but might involve an additional interview in some cases. We also conduct reference checks for all candidates we interview.

Please feel free to reach out to

re*********@go********.ai

if you would need a decision communicated by a particular date, if you need assistance with the application due to a disability, or if you have questions about the application process.

We are committed to fostering a culture of inclusion, and we encourage individuals with underrepresented perspectives and backgrounds to apply. We especially encourage applications from women, gender minorities, people of colour, and people from regions other than North America and Western Europe who are excited about contributing to our mission. We are an equal opportunity employer.

Source link

10Jun

Research Scholar (US Policy) | GovAI Blog

Note: There is a single, shared application form and application process for all Research Scholar position listings.

‍

About the Team

Although we are based in Oxford, United Kingdom — and currently have an especially large UK policy focus — we also have team members in the United States and European Union.

‍

About the Role

For example, past and present Scholars have used the role to:

‍

Highlighted Interest Area: US Policy

In this round, we would especially like to highlight our interest in candidates who intend to focus on US policy and work primarily from Washington, DC.

Although the UK is currently the largest focus of GovAI’s policy work, we have expanded our US policy engagement over the past year. We are now interested in expanding it further, potentially by building up a DC-based unit of the organisation.

A DC-based Research Scholar could serve as a bridge between US policy conversations and other research happening at GovAI. They could also lead projects on US policy questions, such as:

What could sensible federal-level regulation of frontier AI look like?
Are US-led export controls likely to have their intended effects?
What state-level regulations are plausible – and how will they interact with regulatory activity at the federal level?

A DC-based Research Scholar could also help inform GovAI’s decisions about whether and how to expand our US policy engagement. It is possible that they would ultimately play a significant role in helping us to establish a new DC unit of the organisation after their one-year term.

‍

Qualifications and Selection Criteria

For all candidates, we will look for:

A strong interest in using their career to positively influence the lasting impact of artificial intelligence, in line with our organisation’s mission

Demonstrated ability to produce excellent work (typically research outputs) or achieve impressive results

Self-direction and proactivity

The ability to evaluate and prioritise projects on the basis of impact

A commitment to intellectual honesty and rigour

Receptiveness to feedback and commitment to self-improvement

Strong communication skills

Collaborativeness and motivation to help others succeed

Some familiarity with the field of AI governance

Some expertise in a domain that is relevant to AI governance

A compelling explanation of how the Research Scholar position may help them to have a large impact

‍

Duration, Location, and Salary

Duration

Location

If a candidate plans to focus heavily on work related to a particular government’s policies, then we prefer that the candidate is primarily based in or near the most relevant city. For example, if someone plans to focus heavily on US federal policy, we will tend to prefer that they are based in or near Washington, DC. 
If a candidate would likely be involved in managing projects or launching new initiatives to a significant degree, then we will generally prefer that they are primarily based out of our Oxford office. 
Some potential Oxford-based supervisors (e.g. Ben Garfinkel) also have a significant preference for their supervisees being primarily based in Oxford.

Salary

Please inquire with

re*********@go********.ai

if questions or concerns regarding compensation or benefits might affect your decision to apply.

‍

How to Apply and What to Expect

Source link

10Jun

Research Scholar (Special Projects) | GovAI Blog

Note: There is a single, shared application form and application process for all Research Scholar position listings.

‍

About the Team

Although we are based in Oxford, United Kingdom — and currently have an especially large UK policy focus — we also have team members in the United States and European Union.

‍

About the Role

For example, past and present Scholars have used the role to:

‍

Highlighted Interest Area: Special Projects

In this round, we would especially like to highlight our interest in candidates who intend to manage projects or launch new initiatives.

Some of our most impactful Research Scholars have dedicated the majority of their time to areas other than research and policy engagement. Example projects include organising high-impact events, serving as a project manager for policy engagement work, and launching a new organisation to facilitate international dialogue.

For this reason, we are open to Research Scholar candidates who would primarily focus on applied work. As one example: we are open to candidates who are exploring launching new AI governance organisations and would benefit from the expertise and environment that GovAI can offer.

‍

Qualifications and Selection Criteria

For all candidates, we will look for:

A strong interest in using their career to positively influence the lasting impact of artificial intelligence, in line with our organisation’s mission

Demonstrated ability to produce excellent work (typically research outputs) or achieve impressive results

Self-direction and proactivity

The ability to evaluate and prioritise projects on the basis of impact

A commitment to intellectual honesty and rigour

Receptiveness to feedback and commitment to self-improvement

Strong communication skills

Collaborativeness and motivation to help others succeed

Some familiarity with the field of AI governance

Some expertise in a domain that is relevant to AI governance

A compelling explanation of how the Research Scholar position may help them to have a large impact

‍

Duration, Location, and Salary

Duration

Location

If a candidate plans to focus heavily on work related to a particular government’s policies, then we prefer that the candidate is primarily based in or near the most relevant city. For example, if someone plans to focus heavily on US federal policy, we will tend to prefer that they are based in or near Washington, DC. 
If a candidate would likely be involved in managing projects or launching new initiatives to a significant degree, then we will generally prefer that they are primarily based out of our Oxford office. 
Some potential Oxford-based supervisors (e.g. Ben Garfinkel) also have a significant preference for their supervisees being primarily based in Oxford.

Salary

Please inquire with

re*********@go********.ai

if questions or concerns regarding compensation or benefits might affect your decision to apply.

‍

How to Apply and What to Expect

Source link

10Jun

Research Fellow | GovAI Blog

About the Team

Although we are based in Oxford, United Kingdom — and currently have an especially large UK policy focus — we also have team members in the United States and European Union.

‍

About the Role

Research Fellows will conduct research into open and important questions that bear on AI governance. This research could take the form of reports, policy memos, academic papers, blog posts, or whatever format is most conducive to impact. Research Fellows may also spend a substantial portion of their time engaging in direct policy advising.

We are interested in candidates with a range of academic and professional backgrounds, who have a demonstrated ability to produce excellent research and care deeply about the lasting impacts of AI on the world, in line with our mission.

Research Fellows are expected to work under the guidance of a Senior Research Fellow, but have substantial flexibility in project selection. They are also expected to offer supervision and mentorship to junior researchers, such as our Summer and Winter Fellows. Collaboration with other researchers both inside and outside of GovAI is encouraged.

We are committed to supporting the work of Research Fellows by offering expert guidance, funding for projects, productivity tools, limited obligations on one’s time, access to a broad network of experts and potential collaborators, and opportunities to communicate one’s research to policymakers and other audiences.

For promising researchers who lack sufficient experience conducting AI governance research, we may consider instead offering one-year visiting Research Scholar positions that are intended to support professional development. There is a shared application for the Research Scholar and Research Fellow roles.

‍

Areas of Interest

We are open to work on a broad range of topics. To get a sense of our focus areas, you may find it useful to read our About page or look at examples listed on our Research page. Broad topics of interest include — but are not limited to — responsible AI development and release practices, AI regulation, international governance, compute governance, and risk assessment and impact forecasting.

Please note that we are specifically open to hiring researchers who intend to conduct technical research, so long as this technical research is relevant to AI governance.

‍

Qualifications and Selection Criteria

We are open to candidates with a wide range of academic or professional backgrounds. We have previously hired or hosted researchers with backgrounds in computer science, public policy, political science, economics, history, philosophy, and law.

You might be a particularly good fit if you have:‍

Demonstrated ability to produce excellent research, preferably (but not necessarily) within the domain of AI governance

Deep interest in the lasting implications of artificial intelligence for the world, in line with our organisation’s mission

Established expertise in a domain with significant AI governance relevance

Self-directedness and desire for impact

Commitment to intellectual honesty and rigour

Good judgement regarding the promisingness and importance of different research directions

Excellent communication and collaboration skills

Proactivity and commitment to professional growth

Strong interest in mentorship

Broad familiarity with the field of AI governance

There are no specific education or experience requirements for the role, although we expect that the most promising candidates will typically possess multiple years of relevant research or policy experience.

‍

Duration, Location and Salary

Contracts are full-time and last two years fixed-term, with the possibility of renewal.

We typically prefer for Research Fellows to work primarily from our office in Oxford, UK. However, we also consider applications from strong candidates who are only able to work remotely. In cases where a Research Fellow’s work would be specifically relevant to actors in another part of the world (such as Washington, DC), we will also often prefer that the Research Fellow works from that part of the world.

We are able to sponsor visas in the UK and the US.

Depending on their experience, we expect that successful candidates’ annual compensation will — if they are based in Oxford, UK — typically fall between £60,000 and £80,000. If a Research Fellow resides predominantly in a city with a higher cost of living, their salary will be adjusted to account for the difference. As a reference point, a Research Fellow based in Washington, DC would typically receive between $85,000 and $120,000. In rare cases where salary considerations would prevent a candidate from accepting an offer, there may also be some flexibility in compensation.

Please inquire with

re*********@go********.ai

if questions or concerns regarding compensation or benefits might affect your decision to apply.

‍

How to Apply and What to Expect

The application process consists of a written submission in the first round, a paid remote work test in the second round, and a final interview round. Candidates who pass through the second round should expect to participate in a pair of interviews and may also be asked to produce additional written material. We also conduct reference checks for all candidates we interview.

‍

Source link

1 … 18 19 20 21 22 … 35