Python list directory
last modified January 29, 2024
Python list directory tutorial shows how to list directory contents in Python. Multiple examples show how to list the directory contents and filter it.
Directory definition
Directory is an organizing unit in a computer's file system for storing and locating files. Directories are hierarchically organized into a tree of directories. Directories have parent-child relationships. A directory is sometimes also called a folder.
Over the years, Python gained multiple APIs to list the directory contents.
For instance, we can use the  Path.iterdir, os.scandir,
os.walk, Path.rglob, or os.listdir
functions.
Python Path.iterdir
The Path.iterdir yields path objects of the directory contents.
The children are yielded in arbitrary order, and the special entries '.' and '..'
are not included.
#!/usr/bin/python
from pathlib import Path
path = '.'
for path in Path(path).iterdir():
    print(path)
The example lists the contents of the current directory. The listing is non-recursive and includes both files and directories.
$ ./listing.py recursive_glob.py list_dir_files2.py multiple_extensions.py walking3.py list_files.py temperatures.csv list_pandas.py src ...
Python Path.iterdir list directories
The following example shows how to list only directories.
#!/usr/bin/python
from pathlib import Path
path = Path('.')
dirs = [e for e in path.iterdir() if e.is_dir()]
for dir in dirs:
    print(dir)
    #print(dir.parts[-1])
The example shows all immediate subdirectories of the current directory.
dirs = [e for e in path.iterdir() if e.is_dir()]
We build a list of directories using Python list comprehension.
The is_dir returns True for a directory entry.
for dir in dirs:
    print(dir)
    #print(dir.parts[-1])
In a for loop, we print all the directories we have found. We can display the
whole path or use the parts to display only a portion of the file.
Python Path.iterdir list files
The following example shows how to list only files.
#!/usr/bin/python
from pathlib import Path
path = Path('/home/janbodnar/Documents/prog/python/')
files = [e for e in path.iterdir() if e.is_file()]
for file in files:
    print(file)
The example lists all files in the given directory. We use the is_file
to check if an entry is a file.
Python Path.iterdir succint example
The following example shows a very succint code example.
#!/usr/bin/python from pathlib import Path home_path = Path.home() print(*Path(home_path).iterdir(), sep="\n")
The iterdir returns a generator. With the * operator, 
we unpack it into arguments for the print function.
Python list directory with Path.glob
The Path.glob yields all the files that match the given simple
pattern. The ** pattern means this directory and all
subdirectories, recursively. For instance, the **/*.py finds
all Python files in this directory and all its subdirectories.
#!/usr/bin/python
from pathlib import Path
path = Path('.')
for e in path.glob('*.py'):
    print(e)
The example prints all files which end in .py extension in the 
current directory.
$ ./globbing.py recursive_glob.py list_dir_files2.py multiple_extensions.py walking3.py list_files.py list_pandas.py list_files_modified.py recursive_scandir.py list_dir_files.py ...
Python list directory recursively with Path.rglob
The Path.rglob is a convenience call for Path.glob with
**/ added in front of the given relative pattern.
#!/usr/bin/python
from pathlib import Path
path = Path('/home/janbodnar/Documents/prog/python/')
for e in path.rglob('*.py'):
    print(e)
# for e in path.glob('**/*.py'):
#     print(e)
The example finds all Python files in the given directory and all its subdirectories. The commented code does the same job.
Python list directory with multiple extensions
The following example lists all files with three possible extensions.
#!/usr/bin/python
from pathlib import Path
path = Path.cwd()
files = { p.resolve() for p in Path(path).glob("**/*") 
    if p.suffix in ['.py', '.csv', '.txt'] }
for file in files:
    print(file.parts[-1])
The example lists all .py, .csv, and .txt
files.
files = { p.resolve() for p in Path(path).glob("**/*") 
    if p.suffix in ['.py', '.csv', '.txt'] }
We use a set comprehension to build a set of files with the provided extensions.
$ ./multiple_extensions.py data_2019.csv walking.py list_files.py data.txt multiple_extensions.py scan_dir.py list_files_size.py list_pandas.py temperatures.csv recursive_glob.py list_dir_files2.py data.csv ...
Python list directory with os.scandir
The os.scandir was introduced in Python 3.5 with the aim of 
improving performance and to mitigate some limitations of the older 
os.listdir.
The scandir function returns directory entries along with file
attribute information, giving better performance for many common use cases.
#!/usr/bin/python
import os
path = '.'
with os.scandir(path) as it:
    for entry in it:
        if entry.is_file():
            print(entry.name)
The example list all files in the current working directory.
Python list directory recursively with os.walk
The os.walk walks the given directory recursively; it
 yields a 3-tuple (dirpath, dirnames, filenames).
#!/usr/bin/python
import os
path = '/home/janbodnar/Documents/prog/python/'
for dirpath, dirs, files in os.walk(path):
  for file in files:
    file_name = os.path.join(dirpath, file)
    if file_name.endswith('.py'):
        print(file_name)
The example lists all Python files. To check if a file is a Python file, we 
use the endswith function.
Python list directory recursively with os.scandir
The os.walk returns an iterator of os.DirEntry objects
corresponding to the entries in the directory given by path.
#!/usr/bin/python
import os
 
def scan_recursive(path):
    with os.scandir(path) as it:
        
        for entry in it:
            if entry.is_file():
                yield entry.name
            else:
                yield from scan_recursive(entry.path)
 
for e in scan_recursive('/root/Documents/prog/python/'):
    print(e)
The example uses the scandir function alongside the generator 
and recursion patterns to recursively find all files in the given directory.
Python Path tree
The following practical program outputs the contents of the specified directory in a hierarchical tree structure.
#!/usr/bin/python
from pathlib import Path
def generate_tree(directory):
    print(f'+ {directory}')
    for path in sorted(directory.rglob('*')):
        depth = len(path.relative_to(directory).parts)
        spacer = '    ' * depth
        # print(f'{spacer}+ {path.name}')
        if path.is_file():
            print(f'{spacer}f {path.name}')
        else:
            print(f'{spacer}d {path.name}')
path = Path.home() / 'Downloads'
generate_tree(path)
The example generates a directory tree from the Downloads directory.
Python list directory files by size
The following example lists files filtered by size.
#!/usr/bin/python
import os 
from pathlib import Path
path = Path.home() / 'Downloads'
size = 1024*1024*1024
large_files = [e for e in path.rglob('*.*') if e.is_file() and os.path.getsize(e) >= size]
for file in large_files:
    print(f'{file} {os.path.getsize(file)}')
The example prints all files that are greater than 1GB in the Downloads 
directory. We use the path.rglob to walk recursively and the 
os.path.getsize to determine the file size.
Python list diretory by modification date
The following example lists all files by modification date.
#!/usr/bin/python
import os
import datetime
now = datetime.datetime.now()
ago = now - datetime.timedelta(minutes=30)
path = '/home/janbodnar/Documents/prog/python/'
for root, dirs, files in os.walk(path):
    for fname in files:
        path = os.path.join(root, fname)
        st = os.stat(path)
        mtime = datetime.datetime.fromtimestamp(st.st_mtime)
        if mtime > ago:
            print(f'{path} modified {mtime}')
In the example, we list all files that have been modified in the last 30 minutes.
We use the os.stat function to get the modification date of a file
and the os.walk to list the contents.
Python list directory with os.listdir
The os.listdir returns a list containing the names of the entries
in the directory given by path. The list is in arbitrary order, and does not
include the special entries '.' and '..'.
This is older API and it is generally recommended to use more recent APIs for directory listings.
#!/usr/bin/python
import os
path = '.'
files = os.listdir(path)
for file in files:
    print(file)
The example lists all files and directories in the current directory.
#!/usr/bin/python import os dirs = filter(os.path.isfile, os.listdir()) print(tuple(dirs))
In this example, we list all files in the current working directory.
We use the built-in filter and the os.path.isfile
functions.
Python fnmatch with os.listdir
The fnmatch module provides support for Unix
shell-style wildcards. They are not the same as regular expressions..
The special characters used in shell-style wildcards are:
| Pattern | Meaning | 
|---|---|
| * | matches everything | 
| ? | matches any single character | 
| [seq] | matches characters in seq | 
| [!seq] | matches characters not in seq | 
fnmatch special characters
The fnmatch function takes the filename and the pattern as
parameters.
#!/usr/bin/python
import fnmatch
import os
path = '.'
for file in os.listdir(path):
    if fnmatch.fnmatch(file, '*.csv'):
        print(file)
In the example, we list all CSV files.
$ ./list_csv.py temperatures.csv data_2019.csv data.csv
We have three CSV files in the directory.
Python directory contents in table with Pandas
Pandas is a Python data analysis and manipulation tool. The following example shows the directory listings in a Pandas DataFrame.
#!/usr/bin/python
import pandas as pd
from pathlib import Path
import time
path = Path.cwd()
all_files = []
for e in path.rglob('*.*'):
    all_files.append((e.name, e.parent, time.ctime(e.stat().st_ctime)))
columns = ['File_Name', 'Parent', 'Created']
df = pd.DataFrame.from_records(all_files, columns=columns)
print(df.head(5))
In the example, we show file names, their suffix and creation date for the first file entries in a Pandas DataFrame.
$./list_pandas.py File name Suffix Created 0 recursive_glob.py .py Fri Sep 11 13:04:54 2020 1 list_dir_files2.py .py Fri Sep 11 12:50:22 2020 2 multiple_extensions.py .py Fri Sep 11 16:09:30 2020 3 walking3.py .py Fri Sep 11 13:06:31 2020 4 list_files.py .py Fri Sep 11 14:23:47 2020
Python directory contents in table with PrettyTable
PrettyTable is a Python library for generating simple ASCII tables.
#!/usr/bin/python
from prettytable import PrettyTable
from pathlib import Path
import time
    
pt = PrettyTable()
path = Path.cwd()
all_files = []
pt.field_names = ['File name', 'Suffix', 'Created']
for e in path.rglob('*.*'):
    pt.add_row((e.name, e.suffix, time.ctime(e.stat().st_ctime)))
print(pt)
We show the file names of the current directory, their suffix and creation date in a table generated by PrettyTable.
$ ./list_pretty_table.py +------------------------+--------+--------------------------+ | File name | Suffix | Created | +------------------------+--------+--------------------------+ | recursive_glob.py | .py | Fri Sep 11 13:04:54 2020 | | list_dir_files2.py | .py | Fri Sep 11 12:50:22 2020 | | multiple_extensions.py | .py | Fri Sep 11 16:09:30 2020 | | walking3.py | .py | Fri Sep 11 13:06:31 2020 | | list_files.py | .py | Fri Sep 11 14:23:47 2020 | | temperatures.csv | .csv | Fri Sep 11 12:50:37 2020 | | list_pandas.py | .py | Sat Sep 12 08:11:57 2020 | | list_files_modified.py | .py | Fri Sep 11 14:59:32 2020 | | recursive_scandir.py | .py | Fri Sep 11 16:42:36 2020 | | list_dir_files.py | .py | Fri Sep 11 12:48:49 2020 | | data_2019.csv | .csv | Fri Sep 11 12:47:35 2020 | | listing2.py | .py | Fri Sep 11 15:18:03 2020 | | listing.py | .py | Fri Sep 11 15:12:55 2020 | | system_list.py | .py | Fri Sep 11 12:53:13 2020 | | listing3.py | .py | Fri Sep 11 13:24:58 2020 | | scan_dir.py | .py | Fri Sep 11 16:00:53 2020 | | list_dir.py | .py | Fri Sep 11 12:46:34 2020 | | list_files_size.py | .py | Fri Sep 11 16:52:20 2020 | | walking.py | .py | Fri Sep 11 16:41:27 2020 | | walking2.py | .py | Fri Sep 11 12:46:34 2020 | | list_dir_generator.py | .py | Sat Sep 12 07:54:17 2020 | | scan_dir2.py | .py | Fri Sep 11 16:00:23 2020 | | data.csv | .csv | Fri Sep 11 12:47:35 2020 | | list_files2.py | .py | Fri Sep 11 17:07:18 2020 | | globbing.py | .py | Fri Sep 11 15:26:45 2020 | | rglobbing.py | .py | Fri Sep 11 15:26:24 2020 | | data.txt | .txt | Fri Sep 11 12:47:35 2020 | | path_tree.py | .py | Fri Sep 11 13:26:39 2020 | | list_csv.py | .py | Sat Sep 12 07:48:16 2020 | | links.txt | .txt | Fri Sep 11 16:58:25 2020 | | succint.py | .py | Fri Sep 11 15:49:54 2020 | | list_pretty_table.py | .py | Sat Sep 12 08:13:37 2020 | +------------------------+--------+--------------------------+
Source
Python File and Directory Access - language reference
In this article we have showed how to list the directory contents in Python.
Author
List all Python tutorials.