{ "nbformat": 4, "nbformat_minor": 0, "metadata": { "colab": { "provenance": [], "machine_shape": "hm", "gpuType": "A100" }, "kernelspec": { "name": "python3", "display_name": "Python 3" }, "language_info": { "name": "python" }, "accelerator": "GPU", "widgets": { "application/vnd.jupyter.widget-state+json": { "5db691aecba442dba2f2aee05f11478f": { "model_module": "@jupyter-widgets/controls", "model_name": "VBoxModel", "model_module_version": "1.5.0", "state": { "_dom_classes": [], "_model_module": "@jupyter-widgets/controls", "_model_module_version": "1.5.0", "_model_name": "VBoxModel", "_view_count": null, "_view_module": "@jupyter-widgets/controls", "_view_module_version": "1.5.0", "_view_name": "VBoxView", "box_style": "", "children": [], "layout": "IPY_MODEL_92f158f165b240c7975ea1b274ac9f94" } }, "1797cd43c89e4441a537f33a6e9d9449": { "model_module": "@jupyter-widgets/controls", "model_name": "HTMLModel", "model_module_version": "1.5.0", "state": { "_dom_classes": [], "_model_module": "@jupyter-widgets/controls", "_model_module_version": "1.5.0", "_model_name": "HTMLModel", "_view_count": null, "_view_module": "@jupyter-widgets/controls", "_view_module_version": "1.5.0", "_view_name": "HTMLView", "description": "", "description_tooltip": null, "layout": "IPY_MODEL_a16d2b91d50842e891e6e1f2e965530d", "placeholder": "", "style": "IPY_MODEL_bbc2701e1928424c93fa218c3f6900e3", "value": "
\n", " | ISO | \n", "LLM used | \n", "Type | \n", "Data Split | \n", "Original text | \n", "Original Word Count | \n", "Original Char Count | \n", "label | \n", "text | \n", "New Word Count | \n", "New Char Count | \n", "id | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "FRA | \n", "Claude-Haiku-3.5 | \n", "Partial | \n", "Train | \n", "Par opposition, le crédit impôt recherche sédu... | \n", "91 | \n", "590 | \n", "52 | \n", "Par opposition, le crédit impôt recherche sédu... | \n", "126 | \n", "814 | \n", "FRA2 | \n", "
1 | \n", "FRA | \n", "GPT-4o | \n", "Partial | \n", "Train | \n", "Me JEAN REINHART. Nous avons déposé la plainte... | \n", "142 | \n", "880 | \n", "43 | \n", "Me JEAN REINHART. Nous avons déposé la plainte... | \n", "74 | \n", "455 | \n", "FRA4 | \n", "
2 | \n", "FRA | \n", "Mistral-Large-2411 | \n", "Partial | \n", "Train | \n", "Dans le café parisien où il a ses habitudes, C... | \n", "146 | \n", "817 | \n", "73 | \n", "Dans le café parisien où il a ses habitudes, C... | \n", "150 | \n", "877 | \n", "FRA6 | \n", "
3 | \n", "FRA | \n", "GPT-4o | \n", "Unchanged | \n", "Train | \n", "L'an dernier on a réussi à finir dans les huit... | \n", "34 | \n", "206 | \n", "34 | \n", "L'an dernier on a réussi à finir dans les huit... | \n", "34 | \n", "206 | \n", "FRA7 | \n", "
4 | \n", "FRA | \n", "Aya-23 | \n", "Partial | \n", "Train | \n", "'C'est très dangereux dans une démocratie que ... | \n", "77 | \n", "447 | \n", "23 | \n", "'C'est très dangereux dans une démocratie que ... | \n", "54 | \n", "341 | \n", "FRA10 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
59920 | \n", "FRA | \n", "GPT-4o | \n", "Partial | \n", "Dev | \n", "Alors, illusion d’optique?? Extra-terrestres e... | \n", "17 | \n", "148 | \n", "10 | \n", "Alors, illusion d’optique?? Extra-terrestres e... | \n", "87 | \n", "567 | \n", "FRA119802 | \n", "
59921 | \n", "FRA | \n", "GPT-o1 | \n", "Partial | \n", "Dev | \n", "12 heures. Quelques centaines de personnes man... | \n", "72 | \n", "498 | \n", "21 | \n", "12 heures . Quelques centaines de personnes ma... | \n", "148 | \n", "975 | \n", "FRA119826 | \n", "
59922 | \n", "FRA | \n", "Claude-Haiku-3.5 | \n", "Partial | \n", "Dev | \n", "Beyonce, la star américaine Lors du BET Award,... | \n", "603 | \n", "3534 | \n", "327 | \n", "Beyonce, la star américaine Lors du BET Award,... | \n", "406 | \n", "2427 | \n", "FRA119831 | \n", "
59923 | \n", "FRA | \n", "GPT-o1 | \n", "Partial | \n", "Dev | \n", "Ils viennent d'Espagne, des États-Unis, d'Alle... | \n", "41 | \n", "270 | \n", "30 | \n", "Ils viennent d'Espagne , des États-Unis , d'Al... | \n", "116 | \n", "735 | \n", "FRA119838 | \n", "
59924 | \n", "FRA | \n", "Mistral-Large-2411 | \n", "Unchanged | \n", "Dev | \n", "Les autorités néerlandaises ont été alertées p... | \n", "63 | \n", "420 | \n", "63 | \n", "Les autorités néerlandaises ont été alertées p... | \n", "63 | \n", "420 | \n", "FRA119845 | \n", "
59925 rows × 12 columns
\n", "