<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=us-ascii">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
{font-family:Wingdings;
panose-1:5 0 0 0 0 0 0 0 0 0;}
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Aptos;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
font-size:11.0pt;
font-family:"Aptos",sans-serif;
mso-ligatures:standardcontextual;}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:#467886;
text-decoration:underline;}
span.EmailStyle17
{mso-style-type:personal-compose;
font-family:"Aptos",sans-serif;
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;
font-size:11.0pt;}
@page WordSection1
{size:8.5in 11.0in;
margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
{page:WordSection1;}
/* List Definitions */
@list l0
{mso-list-id:1893346870;
mso-list-type:hybrid;
mso-list-template-ids:497477434 67698689 67698691 67698693 67698689 67698691 67698693 67698689 67698691 67698693;}
@list l0:level1
{mso-level-number-format:bullet;
mso-level-text:\F0B7;
mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-.25in;
font-family:Symbol;}
@list l0:level2
{mso-level-number-format:bullet;
mso-level-text:o;
mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-.25in;
font-family:"Courier New";}
@list l0:level3
{mso-level-number-format:bullet;
mso-level-text:\F0A7;
mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-.25in;
font-family:Wingdings;}
@list l0:level4
{mso-level-number-format:bullet;
mso-level-text:\F0B7;
mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-.25in;
font-family:Symbol;}
@list l0:level5
{mso-level-number-format:bullet;
mso-level-text:o;
mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-.25in;
font-family:"Courier New";}
@list l0:level6
{mso-level-number-format:bullet;
mso-level-text:\F0A7;
mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-.25in;
font-family:Wingdings;}
@list l0:level7
{mso-level-number-format:bullet;
mso-level-text:\F0B7;
mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-.25in;
font-family:Symbol;}
@list l0:level8
{mso-level-number-format:bullet;
mso-level-text:o;
mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-.25in;
font-family:"Courier New";}
@list l0:level9
{mso-level-number-format:bullet;
mso-level-text:\F0A7;
mso-level-tab-stop:none;
mso-level-number-position:left;
text-indent:-.25in;
font-family:Wingdings;}
ol
{margin-bottom:0in;}
ul
{margin-bottom:0in;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang="EN-US" link="#467886" vlink="#96607D" style="word-wrap:break-word">
<div class="WordSection1">
<p class="MsoNormal"><b>DesignSafe Team Meeting <o:p></o:p></b></p>
<p class="MsoNormal"><b>November 18, 2024<o:p></o:p></b></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Attendees: Ellen, Fred, Silvia, Gilbert, Kayla, Raul, Tim, Jamie, Ahsan, Jean-Paul, Laura, Gilberto, Krishna, Natalie<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><span style="background:yellow;mso-highlight:yellow">Action Items</span>
<o:p></o:p></p>
<ul style="margin-top:0in" type="disc">
<li class="MsoNormal" style="mso-list:l0 level1 lfo1">Silvia to schedule a meeting with Kayla, Yang, Carlos, and Tim to discuss Kayla's machine learning workflow and potential optimizations.
<o:p></o:p></li><li class="MsoNormal" style="mso-list:l0 level1 lfo1">Kayla to send her presentation slides to Dr. Kumar.
<o:p></o:p></li><li class="MsoNormal" style="mso-list:l0 level1 lfo1">Tim to explore the possibility of implementing a containerized Jupyter environment similar to Chishiki on DesignSafe.
<o:p></o:p></li><li class="MsoNormal" style="mso-list:l0 level1 lfo1">Ellen to plan the next team meeting focusing on AI and machine learning resources, including inviting Clint's team.
<o:p></o:p></li><li class="MsoNormal" style="mso-list:l0 level1 lfo1">Silvia to follow up with Kayla on submitting jobs to multiple nodes on Stampede 3 to potentially speed up her machine learning tasks.
<o:p></o:p></li><li class="MsoNormal" style="mso-list:l0 level1 lfo1">Use cases - review Silvia’s spreadsheet and get back to her within 2 weeks to discuss
<a href="https://docs.google.com/spreadsheets/d/1mZB91I7j4UW1rkbQTEOo6QkMA7jGMYKdkDqIlcUrfho/edit?usp=sharing">
https://docs.google.com/spreadsheets/d/1mZB91I7j4UW1rkbQTEOo6QkMA7jGMYKdkDqIlcUrfho/edit?usp=sharing</a><o:p></o:p></li></ul>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><b><o:p> </o:p></b></p>
<p class="MsoNormal"><b>Use Case Write Ups <o:p></o:p></b></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Jupyter HPC Use Case Progress <o:p></o:p></p>
<p class="MsoNormal">Ellen opened the meeting with a review of progress on the use cases and on the use of Jupyter HPC. Silvia was asked to check in on the use cases, and Kayla to give an update on her successful use of Jupyter HPC. Silvia then shared
her screen and walked through her notes on the use cases, pointing out some errors and suggesting updates to the notebooks and the documentation. She also noted the need to switch over to Tapis v3 and to create template notebooks for this
purpose. The team was encouraged to familiarize themselves with the new setup. <o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Jupyter Notebooks Accessibility and Organization <o:p></o:p></p>
<p class="MsoNormal">Silvia, Ellen, Fred, Tim, and Gilbert discussed the organization and accessibility of the use cases and Jupyter notebooks. They agreed to move certain user guides to the top of the visualization apps section and to add a link to a demonstration
of combining data sets into a workflow on Taggit and HazMapper. They also discussed the fact that users must sign in to JupyterHub to access the notebooks, and the recommendation that users arriving from
the use cases be connected to JupyterLab 2024 automatically. However, they encountered a problem with the order of the options in the JupyterLab interface, which they plan to address. The team also discussed the need to keep the notebooks working over the long term as they migrate to Python 3.9.
<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Notebook Image Rendering and Viewing <o:p></o:p></p>
<p class="MsoNormal">Silvia discussed technical aspects of the notebooks, including the issue of images not always rendering. She proposed three options for users: viewing the notebook, copying the content to their local drive, or opening it in JupyterHub.
Scott suggested that users should run the notebook directly for the full experience, while Silvia emphasized the need for a previewer so users can view the content before deciding to run it. Ellen agreed with Scott's point about potential confusion if the images don't
load correctly. The team agreed to explore ways to make the images render and to consider saving the notebook as an HTML file.
<o:p></o:p></p>
<p class="MsoNormal"><b><o:p> </o:p></b></p>
<p class="MsoNormal"><b>ML use case with Jupyter HPC <o:p></o:p></b></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Optimizing Neural Network for GPU <o:p></o:p></p>
<p class="MsoNormal">Kayla is working on optimizing a neural network for GPU usage. Silvia suggested submitting the job directly to the HPC system instead of running it on a single node through JupyterHub, since this would allow parallelization across multiple
nodes and faster training at lower cost. Ellen clarified that JupyterHub limits resources to a single node. Silvia and Kayla plan to follow up to review Kayla's current setup and explore options for distributed training on the HPC system.
<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Stampede 3 CPU Limitations Discussed <o:p></o:p></p>
<p class="MsoNormal">Scott, Kayla, and Silvia discussed the CPU limitations on Stampede 3, which allows only 4 cores to be used for parallel computing. Kayla mentioned that she can use up to 12 cores on her own CPU. The team also
discussed running multiple jobs simultaneously on multiple nodes, which could significantly reduce processing time. Tim suggested that all three members of the apps team (Silvia, Yang, and Carlos) should get hands-on experience with Kayla's
problem-solving approach. The team agreed to schedule a meeting to discuss this further. Ellen mentioned that she had invited Carlos to discuss how his research group is using JupyterHub on TACC resources, which could be relevant to their work on DesignSafe.
<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Complexity of Running Scripts on Stampede 3 <o:p></o:p></p>
<p class="MsoNormal">Scott raised a concern about the complexity of running scripts on Stampede 3, particularly the need to specify an app to run them on. Silvia suggested that Kayla could set everything up and then use SSH to submit the job directly on Stampede
3, bypassing the need for an app. However, Scott pointed out that Stampede 3 still requires specifying how the script is run, which could mean pinning a particular version of PyTorch or handling other complications. Kayla agreed, noting that testing on different platforms
can be challenging. Silvia then described a process she had gone through with Wang Yang and a student from Clemson: installing Conda, setting up a virtual environment with a specific version of Python, and adding the necessary packages. She suggested
that Kayla could build her own virtual environment with the required packages and submit her job to Slurm. Ellen expressed concern about the complexity of the process, while Raul asked Kayla whether she was running the examples directly from Jupyter.
<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Optimizing Processing Power and Parallelization <o:p></o:p></p>
<p class="MsoNormal">The team discussed strategies for optimizing processing power and parallelization. Raul suggested using multiprocessing to exploit the number of cores available, while Kayla mentioned setting the number of workers to -1 to use
all available workers. Silvia recommended looking into Python's concurrent.futures module, a newer, higher-level interface to multiprocessing, and suggested considering parallelization at different levels, including within scripts or in the Slurm job script. Tim and Silvia also discussed the challenges
of working in a GPU environment. Kayla mentioned a warning about saving data in a less accessible location, which could affect processing speed. The team also briefly touched on the topic of AI and machine learning, with Krishna expressing interest in the
slides from the meeting. <o:p></o:p></p>
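<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">As a minimal sketch of the script-level parallelization discussed above, the Python example below uses the standard-library concurrent.futures module to spread independent tasks across the cores of a single node. The names simulate_case and run_all and the toy workload are hypothetical placeholders, not Kayla's actual code, and the default worker count is only an illustrative choice.<o:p></o:p></p>
<pre>
```python
import os
from concurrent.futures import ProcessPoolExecutor, ThreadPoolExecutor


def simulate_case(seed):
    """Hypothetical stand-in for one independent, CPU-bound task."""
    total = 0
    for i in range(10_000):
        total += (seed * i) % 7
    return total


def run_all(seeds, max_workers=None, executor_cls=ProcessPoolExecutor):
    """Run simulate_case over seeds in parallel, preserving input order.

    max_workers=None lets the executor choose a default worker count
    based on the CPUs it detects. executor_cls can be swapped for
    ThreadPoolExecutor when the tasks are I/O-bound rather than CPU-bound.
    """
    with executor_cls(max_workers=max_workers) as pool:
        return list(pool.map(simulate_case, seeds))


if __name__ == "__main__":
    seeds = list(range(8))
    print("cores reported by os.cpu_count():", os.cpu_count())
    print("results:", run_all(seeds))
```
</pre>
<p class="MsoNormal">This covers parallelism within one script on one node only. Spreading work across several nodes would still need to be arranged at the Slurm level, as noted in the discussion.<o:p></o:p></p>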
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><b>Chishiki Jupyter features <o:p></o:p></b></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Custom Python Environments on Supercomputing <o:p></o:p></p>
<p class="MsoNormal">Krishna discussed the goal of running custom Python environments on supercomputing systems like Lonestar or Frontera. He explained the problem that installation can require system-level packages and proposed developing a containerized
environment with its own Jupyter. The container would link to the Jupyter kernels and give users access to their custom packages. Krishna demonstrated how this was done in a course, emphasizing how easy it is to create a custom container and link it to the Jupyter
GPU environment. He also mentioned the possibility of doing this on the DesignSafe platform. The team discussed the benefits of this approach, including the ability to use multiple nodes and the ease of installing custom packages. Scott suggested that these
containers could also be used to submit jobs without Jupyter, which Krishna confirmed. The team agreed to focus on AI and machine learning resources in their next meeting.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><span style="font-size:9.0pt;font-family:'Arial',sans-serif;color:black">---<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:9.0pt;font-family:'Arial',sans-serif;color:black">Natalie Henriques, PMP</span><span style="color:black;mso-ligatures:none"><o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:9.0pt;font-family:'Arial',sans-serif;color:black">Project Manager<br>
Texas Advanced Computing Center (TACC)<br>
The University of Texas at Austin<br>
Email: </span><span style="font-size:12.0pt"><a href="mailto:natalie@tacc.utexas.edu" target="_blank"><span style="font-size:9.0pt;font-family:'Arial',sans-serif;color:blue">natalie@tacc.utexas.edu</span></a><o:p></o:p></span></p>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
</body>
</html>