Future-House
diff --git a/_config.yml b/_config.yml
Lines changed: 2 additions & 12 deletions
diff --git a/chapter_2/llms_for_biology.ipynb b/chapter_2/llms_for_biology.ipynb
Lines changed: 82 additions & 47 deletions
diff --git a/chapter_2/uniprot_example.ipynb b/chapter_2/uniprot_example.ipynb
Lines changed: 89 additions & 28 deletions
@@ -16,27 +16,16 @@ repository:
 sphinx:
   extra_extensions:
     - sphinxcontrib.bibtex
-    - sphinx_thebe
   config:
     numfig: false
-    thebe_config:
-      repository_url: "https://github.com/Future-House/tutorial-series"
-      repository_branch: "main"
-      binderhub_url: "https://mybinder.org"
-      path_to_book: "."
-      selector: ".cell"                        # ← added
-      use_thebe_lite: false                    # ← added
-      codemirror_config:
-        theme: "default"
-
     html_theme_options:
       use_issues_button: true
       use_repository_button: true
       repository_url: "https://github.com/Future-House/tutorial-series"
       repository_branch: "main"
       path_to_book: "."                                                 
       launch_buttons:
-        thebe: true
+        thebe: false
         colab_url: "https://colab.research.google.com"
         binderhub_url: "https://mybinder.org"
         notebook_interface: "jupyterlab"
@@ -55,3 +44,4 @@ exclude_patterns:
   - _build/**
   - .git/**
   - references.bib
+  - "**/.env"
@@ -13,14 +13,50 @@
         "\n",
         "As the volume of scientific literature continues to grow rapidly, manually extracting and organizing this information becomes increasingly difficult and time-consuming. Therefore, automating data extraction from the literature using AI can help researchers to rapidly identify relevant findings, convert unstructured text into structured datasets, and integrate knowledge across thousands of publications. \n",
         "\n",
-        ":::{note}\n",
-        "To run the code snippets, you can click the **Live Code** button OR the 🚀 icon at the top of the page to launch the page as an interactive Google Colab notebook.\n",
+        ":::{admonition} 🚀 Getting Started\n",
+        ":class: tip\n",
         "\n",
-        "Clicking **Live Code** will launch a Binder environment (~1-2 min). \n",
-        "Your code will run directly in the page once it's ready.\n",
-        ":::\n",
+        "This tutorial can be launched using the rocket button at the top of the page.\n",
         "\n",
-        "## 2.2.1 Getting started with LLMs\n",
+        "### Option 1 — Google Colab (**recommended**)\n",
+        "Opens the notebook in Google Colab with the fastest and most reliable experience.\n",
+        "\n",
+        "Before running the tutorial, add your API keys using **either**:\n",
+        "\n",
+        "- a `.env` file, or\n",
+        "- **Colab Secrets** (`🔑 Secrets` tab in the left sidebar)\n",
+        "\n",
+        "Example `.env`:\n",
+        "```bash\n",
+        "OPENAI_API_KEY=your_key_here\n",
+        "ANTHROPIC_API_KEY=your_key_here\n",
+        "```\n",
+        "\n",
+        "### Option 2 — MyBinder\n",
+        "Launches a temporary cloud Jupyter environment directly in your browser.\n",
+        "\n",
+        "⚠️ Binder environments can take a few minutes to build and start.\n",
+        "\n",
+        "After the notebook loads, create a `.env` file in the notebook directory containing your API keys:\n",
+        "\n",
+        "```bash\n",
+        "OPENAI_API_KEY=your_key_here\n",
+        "ANTHROPIC_API_KEY=your_key_here\n",
+        "```\n",
+        "\n",
+        "### Notes\n",
+        "- You only need API keys for the providers used in a given notebook.\n",
+        "- Never commit or publicly share your API keys.\n",
+        "- If a cell fails due to missing credentials, verify that your keys were loaded correctly before rerunning the cell.\n",
+        ":::\n"
+      ]
+    },
+    {
+      "cell_type": "markdown",
+      "id": "79a2787b",
+      "metadata": {},
+      "source": [
+        "## 2.2.1 Accessing LLMs through APIs\n",
         "\n",
         "You might have used LLMs through chat interfaces such as ChatGPT or Claude before. To access them from Python code, as we do here, we need to use an API.\n",
         "\n",
@@ -98,58 +134,58 @@
       "source": [
         "import os\n",
         "\n",
-        "def get_api_key():\n",
-        "    \"\"\"Load API key from Colab secrets, environment variable, or user input.\"\"\"\n",
+        "LLM_API_KEYS = {\n",
+        "    \"openai\":    \"OPENAI_API_KEY\",\n",
+        "    \"anthropic\": \"ANTHROPIC_API_KEY\",\n",
+        "}\n",
+        "\n",
+        "def get_api_key(llm: str = \"openai\") -> str:\n",
+        "    \"\"\"\n",
+        "    Load the API key for the specified LLM provider from Colab secrets,\n",
+        "    an environment variable, or a .env file.\n",
+        "    \n",
+        "    Args:\n",
+        "        llm: LLM provider name, e.g. 'openai' or 'anthropic'\n",
         "    \n",
+        "    Returns:\n",
+        "        API key string\n",
+        "    \n",
+        "    Example:\n",
+        "        api_key = get_api_key(\"anthropic\")\n",
+        "    \"\"\"\n",
+        "\n",
+        "    llm = llm.lower()\n",
+        "    if llm not in LLM_API_KEYS:\n",
+        "        raise ValueError(\n",
+        "            f\"Unknown LLM '{llm}'. Choose from: {list(LLM_API_KEYS.keys())}\"\n",
+        "        )\n",
+        "\n",
+        "    env_var = LLM_API_KEYS[llm]\n",
+        "\n",
         "    # 1. Try Colab secrets\n",
         "    try:\n",
         "        from google.colab import userdata\n",
-        "        return userdata.get('OPENAI_API_KEY')\n",
+        "        key = userdata.get(env_var)\n",
+        "        if key:\n",
+        "            return key\n",
         "    except ImportError:\n",
         "        pass\n",
         "\n",
         "    # 2. Try environment variable / .env file\n",
         "    try:\n",
         "        from dotenv import load_dotenv\n",
         "        load_dotenv()\n",
-        "        api_key = os.environ.get('OPENAI_API_KEY')\n",
-        "        if api_key:\n",
-        "            return api_key\n",
+        "        key = os.environ.get(env_var)\n",
+        "        if key:\n",
+        "            return key\n",
         "    except ImportError:\n",
         "        pass\n",
         "\n",
-        "    # 3. For live code in Binder/Thebe — ask user to input it\n",
-        "    try:\n",
-        "        import ipywidgets as widgets\n",
-        "        from IPython.display import display\n",
-        "\n",
-        "        key_input = widgets.Password(\n",
-        "            placeholder='Paste your OpenAI API key here',\n",
-        "            description='API Key:',\n",
-        "            layout=widgets.Layout(width='400px')\n",
-        "        )\n",
-        "        submit = widgets.Button(description='Submit', button_style='primary')\n",
-        "        output = widgets.Output()\n",
-        "\n",
-        "        result = {'key': None}\n",
-        "\n",
-        "        def on_submit(b):\n",
-        "            result['key'] = key_input.value\n",
-        "            os.environ['OPENAI_API_KEY'] = key_input.value\n",
-        "            with output:\n",
-        "                print(\"✅ API key set successfully!\")\n",
-        "\n",
-        "        submit.on_click(on_submit)\n",
-        "        display(widgets.VBox([key_input, submit, output]))\n",
-        "\n",
-        "        # wait for user to submit\n",
-        "        return result\n",
-        "\n",
-        "    except ImportError:\n",
-        "        raise ValueError(\n",
-        "            \"API key not found. Please set OPENAI_API_KEY:\\n\"\n",
-        "            \"  export OPENAI_API_KEY='your-key-here'\"\n",
-        "        )"
+        "    raise ValueError(\n",
+        "        f\"API key not found. Please set {env_var}:\\n\"\n",
+        "        f\"  export {env_var}='your-key-here'\\n\"\n",
+        "        f\"  or add it to a .env file\"\n",
+        "    )"
       ]
     },
     {
@@ -177,15 +213,14 @@
       },
       "outputs": [],
       "source": [
-        "\n",
         "from openai import OpenAI\n",
         "\n",
         "# Access the OpenAI API key\n",
-        "openai_api_key = get_api_key()\n",
+        "openai_api_key = get_api_key(\"openai\")\n",
         "# Tell the OpenAI client to use your API key\n",
         "client = OpenAI(api_key=openai_api_key)\n",
         "# LLM model to generate answer. Replace model name if deprecated\n",
-        "LLM_MODEL = \"gpt-5.5\"\n",
+        "LLM_MODEL = \"gpt-4.1-nano\"\n",
         "# Add your question here\n",
         "QUESTION = \"What is the difference between Machine Learning and Deep Learning?\" \n",
         "\n",
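
Review note: the lookup order used by `get_api_key` above (Colab secrets, then `.env` / environment variable, then a `ValueError`) can be sketched as a standalone function. This is a minimal sketch, not the code in this PR: the name `resolve_api_key` is hypothetical, and it assumes two behaviors that the diff does not confirm. First, that a missing Colab secret may raise rather than return `None`. Second, that the process environment should still be read when `python-dotenv` is not installed.

```python
import os

# Provider name -> environment variable, mirroring LLM_API_KEYS in the diff above.
LLM_API_KEYS = {
    "openai": "OPENAI_API_KEY",
    "anthropic": "ANTHROPIC_API_KEY",
}


def resolve_api_key(llm: str = "openai") -> str:
    """Hypothetical variant of get_api_key, written for illustration only."""
    llm = llm.lower()
    if llm not in LLM_API_KEYS:
        raise ValueError(f"Unknown LLM '{llm}'. Choose from: {list(LLM_API_KEYS)}")
    env_var = LLM_API_KEYS[llm]

    # 1. Colab secrets, only when running inside Colab.
    try:
        from google.colab import userdata
    except ImportError:
        userdata = None
    if userdata is not None:
        try:
            key = userdata.get(env_var)
        except Exception:
            # Assumption: a missing or inaccessible secret raises; fall through.
            key = None
        if key:
            return key

    # 2. Load a .env file if python-dotenv happens to be installed.
    try:
        from dotenv import load_dotenv
        load_dotenv()
    except ImportError:
        pass

    # 3. Read the process environment whether or not dotenv was importable.
    key = os.environ.get(env_var)
    if key:
        return key

    raise ValueError(
        f"API key not found. Please set {env_var}:\n"
        f"  export {env_var}='your-key-here'\n"
        f"  or add it to a .env file"
    )
```

With a key exported in the shell, `OpenAI(api_key=resolve_api_key("openai"))` would behave like the notebook cell above. The only differences are that a missing `python-dotenv` install no longer skips the environment lookup, and a Colab secret lookup that raises falls through to the next step.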
 
@@ -8,11 +8,41 @@
     "# 2.3 Integrating External Databases\n",
     "The goal of this section is to show how LLMs can be integrated into research workflows to accelerate scientific discovery. AI can also take over the most mundane, routine tasks.\n",
     "\n",
-    ":::{note}\n",
-    "To run the code snippets, you can click the **Live Code** button OR the 🚀 icon at the top of the page to launch the page as an interactive Google Colab notebook.\n",
+    ":::{admonition} 🚀 Getting Started\n",
+    ":class: tip\n",
     "\n",
-    "Clicking **Live Code** will launch a Binder environment (~1-2 min). \n",
-    "Your code will run directly in the page once it's ready.\n",
+    "This tutorial can be launched using the rocket button at the top of the page.\n",
+    "\n",
+    "### Option 1 — Google Colab (**recommended**)\n",
+    "Opens the notebook in Google Colab with the fastest and most reliable experience.\n",
+    "\n",
+    "Before running the tutorial, add your API keys using **either**:\n",
+    "\n",
+    "- a `.env` file, or\n",
+    "- **Colab Secrets** (`🔑 Secrets` tab in the left sidebar)\n",
+    "\n",
+    "Example `.env`:\n",
+    "```bash\n",
+    "OPENAI_API_KEY=your_key_here\n",
+    "ANTHROPIC_API_KEY=your_key_here\n",
+    "```\n",
+    "\n",
+    "### Option 2 — MyBinder\n",
+    "Launches a temporary cloud Jupyter environment directly in your browser.\n",
+    "\n",
+    "⚠️ Binder environments can take a few minutes to build and start.\n",
+    "\n",
+    "After the notebook loads, create a `.env` file in the notebook directory containing your API keys:\n",
+    "\n",
+    "```bash\n",
+    "OPENAI_API_KEY=your_key_here\n",
+    "ANTHROPIC_API_KEY=your_key_here\n",
+    "```\n",
+    "\n",
+    "### Notes\n",
+    "- You only need API keys for the providers used in a given notebook.\n",
+    "- Never commit or publicly share your API keys.\n",
+    "- If a cell fails due to missing credentials, verify that your keys were loaded correctly before rerunning the cell.\n",
     ":::\n",
     "\n",
     "## 2.3.1 Example 1: Uniprot integration\n",
@@ -25,11 +55,7 @@
     "\n",
     "UniProt offers a free, no-authentication REST API. You can fetch data for p53 (human) using its UniProt accession ID `P04637`.\n",
     "\n",
-    "\n",
-    "Let's begin!\n",
-    "\n",
-    "If you're using Google Colab to run the notebook, make sure to install the requirements using the cell below. If you're using the \"Live Code\" option, you don't have to do anything.\n",
-    "\n"
+    "**If you're using Google Colab, install the requirements by running the cell below.**"
    ]
   },
   {
@@ -98,25 +124,59 @@
    "outputs": [],
    "source": [
     "import os\n",
-    "from dotenv import load_dotenv\n",
     "\n",
-    "def get_api_key():\n",
-    "    \"\"\"Load Anthropic API key from Colab secrets or environment variable.\"\"\"\n",
+    "LLM_API_KEYS = {\n",
+    "    \"openai\":    \"OPENAI_API_KEY\",\n",
+    "    \"anthropic\": \"ANTHROPIC_API_KEY\",\n",
+    "}\n",
+    "\n",
+    "def get_api_key(llm: str = \"anthropic\") -> str:\n",
+    "    \"\"\"\n",
+    "    Load the API key for the specified LLM provider from Colab secrets,\n",
+    "    an environment variable, or a .env file.\n",
+    "    \n",
+    "    Args:\n",
+    "        llm: LLM provider name, e.g. 'openai' or 'anthropic'\n",
+    "    \n",
+    "    Returns:\n",
+    "        API key string\n",
+    "    \n",
+    "    Example:\n",
+    "        api_key = get_api_key(\"anthropic\")\n",
+    "    \"\"\"\n",
+    "\n",
+    "    llm = llm.lower()\n",
+    "    if llm not in LLM_API_KEYS:\n",
+    "        raise ValueError(\n",
+    "            f\"Unknown LLM '{llm}'. Choose from: {list(LLM_API_KEYS.keys())}\"\n",
+    "        )\n",
+    "\n",
+    "    env_var = LLM_API_KEYS[llm]\n",
+    "\n",
+    "    # 1. Try Colab secrets\n",
     "    try:\n",
     "        from google.colab import userdata\n",
-    "        return userdata.get(\"ANTHROPIC_API_KEY\")\n",
+    "        key = userdata.get(env_var)\n",
+    "        if key:\n",
+    "            return key\n",
     "    except ImportError:\n",
-    "        # Not in Colab — fall back to environment variable\n",
-    "        load_dotenv() \n",
-    "        api_key = os.environ.get(\"ANTHROPIC_API_KEY\")\n",
-    "        if not api_key:\n",
-    "            raise ValueError(\n",
-    "                \"API key not found. Please set the ANTHROPIC_API_KEY or OPENAI_API_KEY environment variable.\\n\"\n",
-    "                \"You can do this by running the following in your terminal. Example:\\n\"\n",
-    "                \"  export ANTHROPIC_API_KEY='your-key-here'\\n\"\n",
-    "                \"Or add it to a .env file in your project root.\"\n",
-    "            )\n",
-    "        return api_key\n"
+    "        pass\n",
+    "\n",
+    "    # 2. Try environment variable / .env file\n",
+    "    try:\n",
+    "        from dotenv import load_dotenv\n",
+    "        load_dotenv()\n",
+    "        key = os.environ.get(env_var)\n",
+    "        if key:\n",
+    "            return key\n",
+    "    except ImportError:\n",
+    "        pass\n",
+    "\n",
+    "    raise ValueError(\n",
+    "        f\"API key not found. Please set {env_var}:\\n\"\n",
+    "        f\"  export {env_var}='your-key-here'\\n\"\n",
+    "        f\"  or add it to a .env file\"\n",
+    "    )"
    ]
   },
   {
@@ -128,7 +188,8 @@
    "source": [
     "import anthropic\n",
     "\n",
-    "client = anthropic.Anthropic(api_key=get_api_key())\n",
+    "api_key = get_api_key(llm=\"anthropic\")\n",
+    "client = anthropic.Anthropic(api_key=api_key)\n",
     "\n",
     "# Build a prompt using the retrieved UniProt data\n",
     "prompt = f\"\"\"\n",
@@ -160,10 +221,10 @@
    "source": [
     "If you want to use an OpenAI model instead, you only have to swap the client and load the correct API key. See the example below, or check the previous section on \"Extracting information from literature\".\n",
     "\n",
-    "```\n",
+    "```python\n",
     "from openai import OpenAI\n",
     "\n",
-    "client = OpenAI(api_key=get_api_key(provider=\"openai\"))\n",
+    "client = OpenAI(api_key=get_api_key(llm=\"openai\"))\n",
     "\n",
     "response = client.responses.create(\n",
     "    model=\"gpt-4.1-mini\",\n",
@@ -392,7 +453,7 @@
    "source": [
     "import anthropic\n",
     "\n",
-    "client = anthropic.Anthropic(api_key=get_api_key())\n",
+    "client = anthropic.Anthropic(api_key=get_api_key(llm=\"anthropic\"))\n",
     "\n",
     "structure_prompt = f\"\"\"\n",
     "I retrieved the following information about a protein crystal structure from the RCSB PDB:\n",