复现系统talksee

改进点:

  • 将DRES测试缩小到我们系统展示图片的大小(需要resize),减少记忆负担,直接对比
  • 图片从剪切板直接读取
  • OCR网站
    • https://simpletex.cn/ai/latex_ocr

加入输入文本和权重

前端home_demo1.html

首先调整原本输入框的宽度.input-container input[type="text"]为3vh 。同时需要加入新的文本框和滑动条的css样式。

.input-row {

            display: flex;

            align-items: center;

            margin-top: 10px;

        }

        .input-row input[type="text"] {

            flex-grow: 1;

            margin-right: 10px;

        }

        .input-row input[type="range"] {

            flex-basis: 200px;

        }

        .range-value {

            margin-left: 10px;

            font-family: Arial, sans-serif;

        }

在html加入:

<button type="button" id="add-input">+</button>

                <div id="additional-inputs"></div>

在javascript中加入:

document.getElementById('add-input').addEventListener('click', function() {

    var additionalInputs = document.getElementById('additional-inputs');

    var inputRow = document.createElement('div');

    inputRow.className = 'input-row';

    var textInput = document.createElement('input');

    textInput.type = 'text';

    textInput.name = 'additional-text';

    textInput.placeholder = 'Enter text';

    var rangeInput = document.createElement('input');

    rangeInput.type = 'range';

    rangeInput.name = 'additional-range';

    rangeInput.min = '0';

    rangeInput.max = '1';

    rangeInput.step = '0.01';

    var rangeValue = document.createElement('span');

    rangeValue.className = 'range-value';

    rangeValue.textContent = '1'; // 初始值

    // 监听滑动条的值变化

    rangeInput.addEventListener('input', function() {

        rangeValue.textContent = this.value;

    });

    inputRow.appendChild(textInput);

    inputRow.appendChild(rangeInput);

    inputRow.appendChild(rangeValue);

    additionalInputs.appendChild(inputRow);

});

其中关于search的点击操作需要传入additional_json,用于传输新加入的文本和权重信息到后台。

    $("#search").click(function () {

        var csrf_token = $("[name='csrfmiddlewaretoken']").val();

        

        var additionalTexts = $("input[name='additional-text']").map(function(){ return $(this).val(); }).get();

        var additionalRanges = $("input[name='additional-range']").map(function(){ return $(this).val(); }).get();

        var additionalData = additionalTexts.map(function(text, index) {

            return { text: text, value: additionalRanges[index] };

        });

        

        $("#loading_message").text("正在加载,请稍候...");

        $.ajax({

            url: "/search/",

            method: "post",

            data: {

                'task': $("#task").val(),

                'text': $("#text").val(),

                'image': $("#file").val(),

                

                'additional': JSON.stringify(additionalData),

之后在view.py修改search,接收新参数

additional_json = request.POST.get('additional')

 if additional_json:

            additional_data = json.loads(additional_json)

        else:

            additional_data = None

new_engine.py中修改searchListByText(text, refresh, additional_data),加入新参数的同时,修改函数内部

if additional_data is not None:

        for additional_text in additional_data:

            print('additional text is')

            print(additional_text)

            text_feature += float(additional_text['value'])*extract_text_features(additional_text['text'])

同时修改searchListByImage(text, refresh, additional_data)

问题 1.27

  • AVS任务提交数量
  • Enter?
  • VBS2022测试
  • 人员分配
  • UI界面

前端

  • VQA的提交按钮
  • 文本框变大变小
  • Enter直接search

AVS

  • 只展示同个视频

new_engine

  • searchListByText

This line appears after every note.

Notes mentioning this note


Here are all the notes in this garden, along with their links, visualized as a graph.

Conda导出python环境加快访问github新闻稿实验1:ros入门实验3:自动驾驶实战实验4:ros2智能移动机器人实验5:ros1移动机器人动态避障(基于强化学习)实验6:轨迹跟踪仿真1最终实验自动驾驶辅助python函数Obsidian发布的免费替代方案Obsidian库解析TestYour first seedClip 串讲Icml'23 blip 2 bootstrapping language Image pre...Nips'17 attention is all you needSigir'22 cret cross Modal retrieval transformer...Arxiv 2306’unifying large language models and...Arxiv'21 how much can clip benefit vision And...⭐ ⭐ ⭐ ⭐ ⭐ arxiv 2311' llmsurveychinese⭐⭐⭐⭐eccv'22 slip:self Supervision meets language...⭐⭐⭐⭐⭐clip:learning transferable visual models from...⭐⭐⭐⭐⭐icml'22 blip bootstrapping language Image pre...Acl'25 a survey of mathematical reasoning in the...Arxiv'23 challenges and applications of large...Prl'20 retrieving quantum information with active...SIGIR'06 Laplacian Optimal Design for Image...Survey'09active learningTKDE'16Relevance Feedback Algorithms Inspired By...Arxiv'2501 ursa understanding and verifying chain...Cikm'24 infinitymath a scalable instruction tuning...Icml'25 cogmath assessing llms’ authentic...Icml'25 forest Of Thought scaling test Time...⭐⭐⭐arxiv'2502 hinteval a comprehensive framework...⭐⭐⭐arxiv'2506 scida scientific dynamic assessor of...⭐⭐⭐⭐acl'24 champ a competition Level dataset for...⭐⭐⭐⭐arxiv'2505 soft thinking unlocking the...⭐⭐⭐⭐⭐arxiv'2505 reasoning with omnithought a...⭐⭐⭐⭐⭐arxiv'2506 thought anchors which llm...Improving interpretable embeddings for ad Hoc...Access'17...Artif. intell. rev.‘23 a survey on ensemble...Fcs'20 a survey on ensemble learningTpmai'04 asymmetric bagging and random subspace...⭐⭐⭐⭐access'22 a survey of ensemble learning进化集成学习算法综述《黑客与画家》 为什么书呆子不受欢迎《黑客与画家》《黑客与画家》——黑客与画家黑客伦理250701 250708 阅读250709 250716 阅读250717 250723 阅读250722 250729 阅读250729 250805 阅读Avs检索流程Avs项目管理Avs speaker proposalAvs paper思路整理Presentation 思路整理Stable Diffusion检索流程2023avs交互使用flask快速构建浏览器实现图片交互Trecvid avs 个人感受2022交互情况统计2024avs交互情况统计Llm api测试Agi 比赛Lean(vs code)Agic TrickLlm相关论文Rtx 4090 部署大模型 20240306构建样题数据集调查开源大模型的数学能力想法计划231128调研Github下载Python调用javaVbs2024比赛复盘复现系统talkseeDiffusion扩散模型调研2023 mindspore量子计算黑客马拉松全国大赛热身题2023 mindspore量子计算黑客马拉松初赛——量子组合优化赛道代码集成进化算法Python使用Vscode使用Github问题HuggingfaceLinuxSlrum使用华为手机安装google框架工具推荐科研问题笔记本电脑视频生成调研20241002更换内存条(16g换到32g)24考研总结Reflection 大学四年的回顾及年终总结Fairymusicbox上手李沐讲座干眼症的习惯考研计划《周处除三害》观后感《奥本海默》观后感《白金数据》书评牛奶2023 mindspore量子计算黑客马拉松初赛——量子组合优化赛道排序融合动手学习深度学习算法笔记论文阅读模板算法知识生活Paper ReadingProjectsWeekly Summary