Loaders and extractors

DocAsk uses loaders and extractors to convert project sources into DocumentRecord objects.

Markdown loader

File:

src/docask/loaders/markdown_loader.py

Role:

reads .md and .rst files;
splits Markdown files by headings;
creates one record per documentation section;
stores metadata such as relative path, page title, section title, and heading level.
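The heading-based split described above can be sketched as follows. This is a minimal illustration, not the actual code in markdown_loader.py: the DocumentRecord dataclass, the split_markdown function, and the metadata key names are assumptions made for the example, and only ATX-style (#) headings are handled.

```python
import re
from dataclasses import dataclass, field

FENCE = "`" * 3  # Markdown code-fence marker
HEADING_RE = re.compile(r"^(#{1,6})\s+(.+?)\s*$")


@dataclass
class DocumentRecord:
    # Hypothetical stand-in; the real DocumentRecord may have other fields.
    source_type: str
    text: str
    metadata: dict = field(default_factory=dict)


def split_markdown(text, relative_path):
    """Split Markdown text into one record per heading-delimited section."""
    sections = []              # list of (level, title, lines)
    current = (0, None, [])    # text before the first heading
    in_fence = False
    for line in text.splitlines():
        if line.lstrip().startswith(FENCE):
            in_fence = not in_fence  # do not treat '#' inside code as headings
        match = None if in_fence else HEADING_RE.match(line)
        if match:
            sections.append(current)
            current = (len(match.group(1)), match.group(2), [])
        else:
            current[2].append(line)
    sections.append(current)

    # Assumption: the first level-1 heading is used as the page title.
    page_title = next((t for lvl, t, _ in sections if lvl == 1), None)

    records = []
    for level, title, lines in sections:
        body = "\n".join(lines).strip()
        if title is None and not body:
            continue  # skip an empty preamble
        records.append(DocumentRecord(
            source_type="markdown_section",
            text=body,
            metadata={
                "relative_path": relative_path,
                "page_title": page_title,
                "section_title": title,
                "heading_level": level,
            },
        ))
    return records
```

Calling split_markdown(text, "docs/guide.md") on a page with one top-level heading and one subsection would yield two records, each carrying the same page title and its own section title and heading level.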

Main source type:

markdown_section

Python doc extractor

File:

src/docask/extractors/python_doc_extractor.py

Role:

Source types:

python_module
python_class
python_function
python_method
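This page does not describe how the extractor works, so the following is only a sketch of one plausible approach: walking a module with the standard-library ast module and emitting one entry per module, class, function, and method docstring. The function name extract_docstrings, the dictionary keys, and the decision to skip nested functions are assumptions for illustration.

```python
import ast


def extract_docstrings(source, module_name):
    """Return one dict per module, class, function, and method in `source`."""
    tree = ast.parse(source)
    records = [{
        "source_type": "python_module",
        "qualified_name": module_name,
        "docstring": ast.get_docstring(tree),
    }]

    def visit(node, prefix, in_class):
        for child in node.body:
            if isinstance(child, ast.ClassDef):
                name = f"{prefix}.{child.name}"
                records.append({
                    "source_type": "python_class",
                    "qualified_name": name,
                    "docstring": ast.get_docstring(child),
                })
                visit(child, name, True)  # recurse to collect methods
            elif isinstance(child, (ast.FunctionDef, ast.AsyncFunctionDef)):
                # Assumption: a function defined directly in a class is a method.
                kind = "python_method" if in_class else "python_function"
                records.append({
                    "source_type": kind,
                    "qualified_name": f"{prefix}.{child.name}",
                    "docstring": ast.get_docstring(child),
                })

    visit(tree, module_name, False)
    return records
```

Parsing with ast avoids importing the target module, so extraction does not execute project code.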

Current limitation:

YAML config loader

File:

src/docask/loaders/yaml_config_loader.py

Role:

Source types:

example_config
production_config
yaml_config

Repo structure loader

File:

src/docask/loaders/repo_structure_loader.py

Role:

creates a synthetic tree view of the repository;
excludes noisy folders such as .git, __pycache__, .venv, dist, and build;
includes useful files such as .py, .md, .rst, .yaml, .yml, .toml, .json, and .txt;
helps answer navigation questions.

The maximum tree depth is controlled by:

repo_structure_max_depth: 6

in the project configuration.
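The tree-building behavior described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the code in repo_structure_loader.py: the build_tree function, the two-space indentation, and the exact meaning of the depth limit (directories at the limit are listed but not expanded) are hypothetical.

```python
from pathlib import Path

# Folder and suffix lists taken from the description above.
EXCLUDED_DIRS = {".git", "__pycache__", ".venv", "dist", "build"}
INCLUDED_SUFFIXES = {".py", ".md", ".rst", ".yaml", ".yml", ".toml", ".json", ".txt"}


def build_tree(root, max_depth=6):
    """Return an indented text tree of `root`, limited to `max_depth` levels."""
    root = Path(root)
    lines = [f"{root.name}/"]

    def walk(directory, depth):
        if depth > max_depth:
            return  # assumption: deeper levels are simply omitted
        # Directories first, then files, each group sorted by name.
        entries = sorted(directory.iterdir(), key=lambda p: (p.is_file(), p.name))
        for entry in entries:
            indent = "  " * depth
            if entry.is_dir():
                if entry.name in EXCLUDED_DIRS:
                    continue
                lines.append(f"{indent}{entry.name}/")
                walk(entry, depth + 1)
            elif entry.suffix in INCLUDED_SUFFIXES:
                lines.append(f"{indent}{entry.name}")

    walk(root, 1)
    return "\n".join(lines)
```

The resulting string could then be stored as the text of a single repo_structure record, with max_depth read from repo_structure_max_depth in the project configuration.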

Source type:

repo_structure