Upload 218 files
Browse filesThis view is limited to 50 files because it contains too many changes.
See raw diff
- README.md +52 -3
- casual_graph.png +0 -0
- data/02c3b725-26e4-4a2c-9336-04ddc58836d9-1430726196216-1.7-m-04-hu.wav +0 -0
- data/02ead89b-aa02-453e-8b83-6ebde9fe7551-1430233132879-1.7-m-26-hu.wav +0 -0
- data/035c6b30-a145-42b9-8d0f-445cd9003d2c-1435948197257-1.7-m-04-hu.wav +0 -0
- data/03ADDCFB-354E-416D-BF32-260CF47F7060-1433658024-1.1-f-04-ti.wav +0 -0
- data/03abcb8f-400a-47d8-ad82-7e4586cc06be-1431864192133-1.7-f-48-hu.wav +0 -0
- data/045C5483-69E1-4BEC-B1D8-9286D174B9B2-1430102996-1.0-m-04-hu.wav +0 -0
- data/04c3386b-e6bc-4bd0-8456-d46ae21a73fc-1435305829013-1.7-f-26-hu.wav +0 -0
- data/06c4cfa2-7fa6-4fda-91a1-ea186a4acc64-1430029221058-1.7-f-26-ti.wav +0 -0
- data/06c4cfa2-7fa6-4fda-91a1-ea186a4acc64-1430029237378-1.7-f-26-ti.wav +0 -0
- data/06c4cfa2-7fa6-4fda-91a1-ea186a4acc64-1430029246453-1.7-f-26-ti.wav +0 -0
- data/0776b33b-c41a-4a2e-8b9c-f1c184be00c8-1437123838264-1.7-f-26-hu.wav +0 -0
- data/08E9485B-2772-444B-A636-77E14DD14A8C-1431493441-1.0-m-04-hu.wav +0 -0
- data/08E9485B-2772-444B-A636-77E14DD14A8C-1431493453-1.0-m-04-hu.wav +0 -0
- data/090C15A8-5406-4EA5-97A3-81F6527227C0-1430147515-1.0-m-72-hu.wav +0 -0
- data/090C15A8-5406-4EA5-97A3-81F6527227C0-1430147525-1.0-m-72-hu.wav +0 -0
- data/090C15A8-5406-4EA5-97A3-81F6527227C0-1430147533-1.0-m-72-hu.wav +0 -0
- data/090C15A8-5406-4EA5-97A3-81F6527227C0-1430147582-1.0-m-72-hu.wav +0 -0
- data/0D1AD73E-4C5E-45F3-85C4-9A3CB71E8856-1430742197-1.0-m-04-hu.wav +0 -0
- data/0F4B1065-4012-47C8-88B7-ACE11B1A536E-1430038775-1.0-m-04-hu.wav +0 -0
- data/0a983cd2-0078-4698-a048-99ac01eb167a-1433917038889-1.7-f-04-hu.wav +0 -0
- data/0c8f14a9-6999-485b-97a2-913c1cbf099c-1430760379259-1.7-m-26-hu.wav +0 -0
- data/0c8f14a9-6999-485b-97a2-913c1cbf099c-1430760394426-1.7-m-26-hu.wav +0 -0
- data/0f257dac-7d6f-4575-9192-e3b4dcd3d4ef-1430185441581-1.7-f-26-hu.wav +0 -0
- data/101c6709-39fb-44dc-b905-7cbeed5714a2-1434361963544-1.7-f-04-hu.wav +0 -0
- data/10A40438-09AA-4A21-83B4-8119F03F7A11-1430848034-1.0-f-26-hu.wav +0 -0
- data/10A40438-09AA-4A21-83B4-8119F03F7A11-1430848060-1.0-f-26-hu.wav +0 -0
- data/10A40438-09AA-4A21-83B4-8119F03F7A11-1430848070-1.0-f-26-hu.wav +0 -0
- data/10A40438-09AA-4A21-83B4-8119F03F7A11-1430848101-1.0-f-26-hu.wav +0 -0
- data/10A40438-09AA-4A21-83B4-8119F03F7A11-1430925142-1.0-f-26-dc.wav +0 -0
- data/10A40438-09AA-4A21-83B4-8119F03F7A11-1430925172-1.0-f-26-hu.wav +0 -0
- data/11417AC2-DCC9-48CD-8177-CA8665E51B2F-1436881489-1.1-m-48-hu.wav +0 -0
- data/11417AC2-DCC9-48CD-8177-CA8665E51B2F-1436881512-1.1-m-48-dc.wav +0 -0
- data/1259bcad-2308-46fa-9a36-b5d8f359886a-1430143617162-1.7-m-04-hu.wav +0 -0
- data/1259bcad-2308-46fa-9a36-b5d8f359886a-1430143644843-1.7-f-04-hu.wav +0 -0
- data/1259bcad-2308-46fa-9a36-b5d8f359886a-1430143668806-1.7-f-04-hu.wav +0 -0
- data/1309B82C-F146-46F0-A723-45345AFA6EA8-1430059849-1.0-f-04-hu.wav +0 -0
- data/1309B82C-F146-46F0-A723-45345AFA6EA8-1430059864-1.0-f-04-ti.wav +0 -0
- data/1309B82C-F146-46F0-A723-45345AFA6EA8-1430703937-1.0-f-48-dc.wav +0 -0
- data/1309B82C-F146-46F0-A723-45345AFA6EA8-1430704008-1.0-f-48-dc.wav +0 -0
- data/1309B82C-F146-46F0-A723-45345AFA6EA8-1431172241-1.0-f-48-ti.wav +0 -0
- data/1309B82C-F146-46F0-A723-45345AFA6EA8-1432801693-1.1-f-26-dc.wav +0 -0
- data/1309B82C-F146-46F0-A723-45345AFA6EA8-1432801703-1.1-f-26-dc.wav +0 -0
- data/1445575b-80ad-477b-8e48-36194cac728e-1430244987914-1.7-f-48-hu.wav +0 -0
- data/177dc72e-d0f8-47ef-a5a7-3b46878e11a0-1430736358754-1.7-f-26-hu.wav +0 -0
- data/177dc72e-d0f8-47ef-a5a7-3b46878e11a0-1430736533767-1.7-f-26-hu.wav +0 -0
- data/177dc72e-d0f8-47ef-a5a7-3b46878e11a0-1430736549614-1.7-f-26-hu.wav +0 -0
- data/189b78c6-9e78-43f2-bcb8-6cc8a462f4dc-1430807938301-1.7-m-22-hu.wav +0 -0
- data/19aae3d1-51c6-4ffb-aeb8-efb6ae7ba83e-1436861395462-1.7-m-48-hu.wav +0 -0
README.md
CHANGED
@@ -1,3 +1,52 @@
|
|
1 |
-
|
2 |
-
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
# Infant Cry Detection using Causal Temporal Representation
|
2 |
+
|
3 |
+
This project focuses on detecting infant cries using a novel **causal temporal representation** framework. Our approach incorporates causal reasoning into the data-generating process (DGP) to improve the interpretability and reliability of cry detection systems. This repository provides the necessary resources to explore, train, and evaluate supervised models for this task, along with mathematical assumptions and metrics tailored for event-based evaluation.
|
4 |
+
|
5 |
+
## Features
|
6 |
+
- **Data Generating Process**: Based on mathematical causal assumptions, our DGP defines how audio features and annotations are causally connected.
|
7 |
+
- **Supervised Models**: State-of-the-art supervised learning methods, including Bidirectional LSTM, Transformer, and MobileNet V2.
|
8 |
+
- **Event-Based Metrics**: Evaluation metrics tailored for time-sensitive detection tasks, including event-based F1-score and IOU.
|
9 |
+
- **Interactive Example**: A Jupyter Notebook with step-by-step demonstrations.
|
10 |
+
|
11 |
+
![Causal Graph](https://github.com/PeterIsDanning/Infant-Cry-Detection-by-CRSTC/blob/main/casual_graph.png)
|
12 |
+
|
13 |
+
## Repository Structure
|
14 |
+
|
15 |
+
```plaintext
|
16 |
+
.
|
17 |
+
├── data/ # Audio data in .wav format
|
18 |
+
├── labels/ # Annotation files corresponding to audio data (.TextGrid)
|
19 |
+
├── metrics/ # Event-based evaluation metrics
|
20 |
+
├── models/ # Pre-trained supervised models
|
21 |
+
├── src/ # Core codebase
|
22 |
+
├── experiment.ipynb # Usage demonstration
|
23 |
+
└── README.md # Project description
|
24 |
+
```
|
25 |
+
|
26 |
+
### Directory Details
|
27 |
+
|
28 |
+
- **data/**: Contains raw audio files in `.wav` format.
|
29 |
+
- Each audio file represents an infant cry recording.
|
30 |
+
|
31 |
+
- **labels/**: Stores annotation files in `.TextGrid` format.
|
32 |
+
- Each `.TextGrid` file corresponds to an audio file and provides ground truth segmentations for cry events.
|
33 |
+
|
34 |
+
- **metrics/**: Houses the implementation of event-based metrics for evaluating the performance of models.
|
35 |
+
- Metrics include event-based F1-score and IOU, designed to measure temporal accuracy effectively.
|
36 |
+
|
37 |
+
- **models/**: Contains pre-trained supervised models for infant cry detection.
|
38 |
+
- Models include:
|
39 |
+
- Bidirectional LSTM
|
40 |
+
- Transformer
|
41 |
+
- MobileNet V2
|
42 |
+
|
43 |
+
- **src/**: Core implementation of the infant cry detection framework.
|
44 |
+
- Includes modules for data preprocessing, feature extraction, model training, and evaluation.
|
45 |
+
|
46 |
+
- **experiment.ipynb**: A Jupyter Notebook with a simple use case example.
|
47 |
+
- Demonstrates how to load data, preprocess it, train a model, and evaluate its performance.
|
48 |
+
|
49 |
+
For more details, refer to our accompanying research paper.
|
50 |
+
|
51 |
+
## License
|
52 |
+
This project is licensed under the MIT License. See the LICENSE file for more details.
|
casual_graph.png
ADDED
data/02c3b725-26e4-4a2c-9336-04ddc58836d9-1430726196216-1.7-m-04-hu.wav
ADDED
Binary file (111 kB). View file
|
|
data/02ead89b-aa02-453e-8b83-6ebde9fe7551-1430233132879-1.7-m-26-hu.wav
ADDED
Binary file (110 kB). View file
|
|
data/035c6b30-a145-42b9-8d0f-445cd9003d2c-1435948197257-1.7-m-04-hu.wav
ADDED
Binary file (111 kB). View file
|
|
data/03ADDCFB-354E-416D-BF32-260CF47F7060-1433658024-1.1-f-04-ti.wav
ADDED
Binary file (112 kB). View file
|
|
data/03abcb8f-400a-47d8-ad82-7e4586cc06be-1431864192133-1.7-f-48-hu.wav
ADDED
Binary file (112 kB). View file
|
|
data/045C5483-69E1-4BEC-B1D8-9286D174B9B2-1430102996-1.0-m-04-hu.wav
ADDED
Binary file (112 kB). View file
|
|
data/04c3386b-e6bc-4bd0-8456-d46ae21a73fc-1435305829013-1.7-f-26-hu.wav
ADDED
Binary file (110 kB). View file
|
|
data/06c4cfa2-7fa6-4fda-91a1-ea186a4acc64-1430029221058-1.7-f-26-ti.wav
ADDED
Binary file (107 kB). View file
|
|
data/06c4cfa2-7fa6-4fda-91a1-ea186a4acc64-1430029237378-1.7-f-26-ti.wav
ADDED
Binary file (107 kB). View file
|
|
data/06c4cfa2-7fa6-4fda-91a1-ea186a4acc64-1430029246453-1.7-f-26-ti.wav
ADDED
Binary file (104 kB). View file
|
|
data/0776b33b-c41a-4a2e-8b9c-f1c184be00c8-1437123838264-1.7-f-26-hu.wav
ADDED
Binary file (111 kB). View file
|
|
data/08E9485B-2772-444B-A636-77E14DD14A8C-1431493441-1.0-m-04-hu.wav
ADDED
Binary file (112 kB). View file
|
|
data/08E9485B-2772-444B-A636-77E14DD14A8C-1431493453-1.0-m-04-hu.wav
ADDED
Binary file (112 kB). View file
|
|
data/090C15A8-5406-4EA5-97A3-81F6527227C0-1430147515-1.0-m-72-hu.wav
ADDED
Binary file (112 kB). View file
|
|
data/090C15A8-5406-4EA5-97A3-81F6527227C0-1430147525-1.0-m-72-hu.wav
ADDED
Binary file (112 kB). View file
|
|
data/090C15A8-5406-4EA5-97A3-81F6527227C0-1430147533-1.0-m-72-hu.wav
ADDED
Binary file (112 kB). View file
|
|
data/090C15A8-5406-4EA5-97A3-81F6527227C0-1430147582-1.0-m-72-hu.wav
ADDED
Binary file (112 kB). View file
|
|
data/0D1AD73E-4C5E-45F3-85C4-9A3CB71E8856-1430742197-1.0-m-04-hu.wav
ADDED
Binary file (112 kB). View file
|
|
data/0F4B1065-4012-47C8-88B7-ACE11B1A536E-1430038775-1.0-m-04-hu.wav
ADDED
Binary file (112 kB). View file
|
|
data/0a983cd2-0078-4698-a048-99ac01eb167a-1433917038889-1.7-f-04-hu.wav
ADDED
Binary file (110 kB). View file
|
|
data/0c8f14a9-6999-485b-97a2-913c1cbf099c-1430760379259-1.7-m-26-hu.wav
ADDED
Binary file (111 kB). View file
|
|
data/0c8f14a9-6999-485b-97a2-913c1cbf099c-1430760394426-1.7-m-26-hu.wav
ADDED
Binary file (111 kB). View file
|
|
data/0f257dac-7d6f-4575-9192-e3b4dcd3d4ef-1430185441581-1.7-f-26-hu.wav
ADDED
Binary file (111 kB). View file
|
|
data/101c6709-39fb-44dc-b905-7cbeed5714a2-1434361963544-1.7-f-04-hu.wav
ADDED
Binary file (111 kB). View file
|
|
data/10A40438-09AA-4A21-83B4-8119F03F7A11-1430848034-1.0-f-26-hu.wav
ADDED
Binary file (112 kB). View file
|
|
data/10A40438-09AA-4A21-83B4-8119F03F7A11-1430848060-1.0-f-26-hu.wav
ADDED
Binary file (112 kB). View file
|
|
data/10A40438-09AA-4A21-83B4-8119F03F7A11-1430848070-1.0-f-26-hu.wav
ADDED
Binary file (112 kB). View file
|
|
data/10A40438-09AA-4A21-83B4-8119F03F7A11-1430848101-1.0-f-26-hu.wav
ADDED
Binary file (112 kB). View file
|
|
data/10A40438-09AA-4A21-83B4-8119F03F7A11-1430925142-1.0-f-26-dc.wav
ADDED
Binary file (112 kB). View file
|
|
data/10A40438-09AA-4A21-83B4-8119F03F7A11-1430925172-1.0-f-26-hu.wav
ADDED
Binary file (112 kB). View file
|
|
data/11417AC2-DCC9-48CD-8177-CA8665E51B2F-1436881489-1.1-m-48-hu.wav
ADDED
Binary file (112 kB). View file
|
|
data/11417AC2-DCC9-48CD-8177-CA8665E51B2F-1436881512-1.1-m-48-dc.wav
ADDED
Binary file (112 kB). View file
|
|
data/1259bcad-2308-46fa-9a36-b5d8f359886a-1430143617162-1.7-m-04-hu.wav
ADDED
Binary file (110 kB). View file
|
|
data/1259bcad-2308-46fa-9a36-b5d8f359886a-1430143644843-1.7-f-04-hu.wav
ADDED
Binary file (110 kB). View file
|
|
data/1259bcad-2308-46fa-9a36-b5d8f359886a-1430143668806-1.7-f-04-hu.wav
ADDED
Binary file (110 kB). View file
|
|
data/1309B82C-F146-46F0-A723-45345AFA6EA8-1430059849-1.0-f-04-hu.wav
ADDED
Binary file (112 kB). View file
|
|
data/1309B82C-F146-46F0-A723-45345AFA6EA8-1430059864-1.0-f-04-ti.wav
ADDED
Binary file (112 kB). View file
|
|
data/1309B82C-F146-46F0-A723-45345AFA6EA8-1430703937-1.0-f-48-dc.wav
ADDED
Binary file (112 kB). View file
|
|
data/1309B82C-F146-46F0-A723-45345AFA6EA8-1430704008-1.0-f-48-dc.wav
ADDED
Binary file (112 kB). View file
|
|
data/1309B82C-F146-46F0-A723-45345AFA6EA8-1431172241-1.0-f-48-ti.wav
ADDED
Binary file (112 kB). View file
|
|
data/1309B82C-F146-46F0-A723-45345AFA6EA8-1432801693-1.1-f-26-dc.wav
ADDED
Binary file (112 kB). View file
|
|
data/1309B82C-F146-46F0-A723-45345AFA6EA8-1432801703-1.1-f-26-dc.wav
ADDED
Binary file (112 kB). View file
|
|
data/1445575b-80ad-477b-8e48-36194cac728e-1430244987914-1.7-f-48-hu.wav
ADDED
Binary file (109 kB). View file
|
|
data/177dc72e-d0f8-47ef-a5a7-3b46878e11a0-1430736358754-1.7-f-26-hu.wav
ADDED
Binary file (111 kB). View file
|
|
data/177dc72e-d0f8-47ef-a5a7-3b46878e11a0-1430736533767-1.7-f-26-hu.wav
ADDED
Binary file (111 kB). View file
|
|
data/177dc72e-d0f8-47ef-a5a7-3b46878e11a0-1430736549614-1.7-f-26-hu.wav
ADDED
Binary file (110 kB). View file
|
|
data/189b78c6-9e78-43f2-bcb8-6cc8a462f4dc-1430807938301-1.7-m-22-hu.wav
ADDED
Binary file (110 kB). View file
|
|
data/19aae3d1-51c6-4ffb-aeb8-efb6ae7ba83e-1436861395462-1.7-m-48-hu.wav
ADDED
Binary file (107 kB). View file
|
|