Using your workspace
Getting started, accessing your data files, available software, locking your workspace and signing out
Getting started in the DataLab workspace
When you have successfully logged into the Citrix Workspace, your DataLab workspace looks like this.
You can use DataLab in a similar way to using other secure networked systems, where you can securely see, use and share data files, analysis and output with the other members of your project team.
Open File Explorer and click on This PC to see the network drives you have access to:
- Library: All researchers can see all files in the Library drive. This is where we upload support information, such as statistical language documentation, ANZSIC classification and general access guides for non-standard products. Files cannot be saved to this drive.
- Output: Any output you want the ABS to clear should be saved to this drive. Only members of your team can see this drive. See also Request output clearance. Information is backed up nightly and retained for 14 days. Information in this folder remains unaffected by a rebuild.
- Project: A shared space for your team to work in and store all your project files, as well as set up and run Python and R scripts. Only members of your team can see this drive. Information is backed up nightly and retained for 14 days. Information in this folder remains unaffected by a rebuild. The default storage is 1TB. You will need to review and delete unnecessary files as your project files grow over time. If necessary, an increase to this storage can be requested via the Contact us page. There may be a cost for additional storage.
- Products: Access data files that have been approved for your project. However, it is best to use the My data products shortcut on your desktop as this shows you only the datasets you have been approved to access, rather than all dataset short names. Files cannot be saved to this drive.
- LocalDisk: If you have been granted local disk space, this can be used to run jobs on offline virtual machines (desktops). You may want to request this option if you have multiple projects that you are actively involved in. There may be a cost associated with attaching local disk space to your VM. The local disk will only be present if it has been allocated to your VM. To request local disk space contact the ABS via the Contact us page
- Drive C can be utilised to run scripts and create new Python package virtual environments, not facilitated through either Jupyter Notebook, JupyterLab or Spyder. Noting the C drive is also destroyed with each 30 day rebuild. Note: Avoid using this drive for saving files - there is limited space and no ability to increase the storage capacity. If more storage is required a local disk can be requested for your VM.
- Drives A and D are not to be used. Information saved here is either destroyed with each nightly shutdown and 30 day rebuild, or has restricted access. Attempting to read or write from Drives A or D will invoke a group policy error due to access controls. In this case please use the C drive or consult your project lead to request local disk space.
Do not store files in any other folders. Other members of your project cannot see files if you store them in other drives. Files stored outside of the Project and Output drives are destroyed every 30 days as part of DataLab security protocols.
Refreshing your network drives. If your network drives do not appear in File Explorer, you can click the 'Refresh Network' shortcut on the desktop. A confirmation message appears when this has been successfully refreshed.
Accessing your data files
To access the data files for your project, use the 'My Data Products' shortcut on your desktop.
The My Data Products folder displays only the products approved for your project.
Selecting the 'Products' drive shows you the short name of all data loaded to the DataLab. However, if you try to open a file that is not approved for your project you are denied access and receive an error.
Software can be opened using the shortcuts on your desktop or by using search on the Taskbar.
All researchers have access to these applications in the DataLab:
- Acrobat Reader
- Notepad ++
- Git (available locally for projects to version their code)
- Stata MP17
- R 4.1.3. including:
- RStudio 2022.07
- RTools 42
- Python 3.9 (Anaconda3 distribution) including:
- Jupyter Notebook & JupyterLab
- PostgreSQL 15
If required, you can also request:
- SAS 9.4
Microsoft Word and Excel are not currently available, as these applications require an internet connection, which is not supported in a secure system like DataLab. Libreoffice is the alternative offered in the system, with similar capabilities to Microsoft Office.
Firefox and Edge are available to support access for Databricks (which is under development) and for Jupyter notebooks to use Python/R. These browsers cannot be used to browse the internet.
If a package in your statistical software choice is not available, you can request it using the contact us page.
Managing your R & Python packages explains how you can manage R and Python packages using the Posit Package Manager shortcut on your desktop.
To open a PDF, right click and choose open with Adobe Acrobat Reader. Due to a default setting in Microsoft, if you double click, the system automatically uses Microsoft Edge to open any PDF file.
Managing your R & Python packages
If you are working with a specific set of R and/or Python packages, you can now manage these using the Posit Package Manager shortcut on your desktop.
In the Posit Package Manager, click 'Get Started' to navigate to the available packages. You can use this tool to search for packages (in the left column) and install the packages you want to use for your project. If the packages you need are not listed, you can request them using the Contact us page.
What are virtual machines?
Virtual machines, or VMs, are the virtual workspaces you use to undertake your analysis in the DataLab. VMs are called Desktops in the Citrix Portal. VMs are created by the ABS as part of the project establishment process, described in About DataLab.
You have one VM for each project. This is a design feature to prevent data from one project being accessed by another project. You can run analysis on multiple virtual machines at the same time, but only if you have been granted local disk space. See Run jobs on offline VMs (desktops). You may want to request this option if you have multiple projects that you are actively involved in.
Virtual machine sizes
The ABS offers standard and non-standard VM sizes. Standard VMs are included in the DataLab annual fee, whereas non-standard VMs are subject to additional charges as they are more expensive to run. For more information on charges, see DataLab charges.
Researchers may request access to a non-standard machine for performance or productivity purposes. If you require a non-standard machine, you will need to consult your project lead and your project lead will need to send the ABS an updated project proposal.
Currently offered VMs and approximate running costs are listed in the tables below.
Standard virtual machines
Large VMs are provided as the default and most projects operate efficiently with this size.
If you have a small or medium machine, it can be upgraded to a large at no additional charge. Please contact firstname.lastname@example.org for further assistance...
|Name||CPU Cores||RAM||Approx cost per hour|
|Small||2||8GB||Not applicable – these virtual machines are included in the DataLab annual fee.|
Non-standard virtual machines
Non-standard machines are available on request. Additional charges will apply, please refer to DataLab charges for more information.
|Name||CPU Cores||RAM||Approx cost per hour ($AUD)|
Specialised and custom non-standard virtual machines
The following specialised VMs (also non-standard) capable of supporting machine learning and high-performance computing can also be requested, however these are assessed on a case by case basis with the appropriate justification, and are subject to quote. If the required VM is not listed, the ABS may be able to provide a customised option at an additional charge, please be sure to describe why the available machines do not meet your needs in any justification provided. A list of virtual machines by region can be viewed via the Azure website.
|Name||CPU Cores||RAM||GPU||Approx per hour cost ($AUD)|
|Large GPU||8||56GB||Tesla T4 16GB||$1.50|
|X-Large GPU||16||110GB||Tesla T4 16GB||$2.40|
Assigned names of VMs are unrelated to Azure naming conventions. ABS review our provided VM options periodically, please revisit this page for any updates.
Sign out or Lock your DataLab session
When you walk away from your computer or are finished with your DataLab session, you must either lock your workstation or sign out of your account to ensure nobody else accesses your DataLab account. Click the Windows menu in the bottom left corner and select the person icon to select Lock or Sign out.
If you need to leave your computer for a short length of time or you have analysis running, lock your DataLab screen. You can then close your Citrix window by using the X in the top right- hand corner. This closes your session but does not end any programs you have running. Your programs will continue to run until 10pm that night, or longer if you have selected the Bypass option in the portal.
When expanded, select Ctrl+Alt+Del, and re-enter your credentials.
Sign out to leave your workspace session. This closes your session and will end any programs you have running.
Signing out returns you to the Citrix workspace portal, where you can either close the browser window or Log Out of the portal using the icon in the top right corner (with your initial). To log back in, see Logging into the portal and workspace.